Is this data going to be historical in nature?
- For your chosen business (the business of your client) and the industry he/she is in, determine if it is advisable to plan this new data analytics function and database in a manner where it will be established at a cloud service provider (CSP)? Explain why. Find similar cases elsewhere.
- Where is this Big Data found?
- What is the format and type of the database going to be?
- How will the data get from wherever it is into this database? Supply a data flow diagram (DFD).
- Will you store unformatted data? If so what application will format the data when you read it for analysis?
- Will you store formatted data in a Data Warehouse? If so supply the schema diagram.
- Is this data going to be historical in nature?
- Is this data going to include a real-time component? If so this greatly complicates the scenario and you need to address the impact of outages on data loss and probably need to mention the need for a Helpdesk to support the real-time function. Any real-time component will significantly impact your future networking recommendations (is assignment 4).
- Will you be recommending some form of data warehouse? If so, will you use ETL formatting or something else?
- Will you be recommending a Hadoop structure? If so where will this be hosted?
- Create a workflow diagram (WFD) to show the activities from data generation, to data capture, to analysis of data, to report generation.