Big Data

Student Name:
Student ID:

Table of Contents
Introduction
Big Data Fault-Tolerant Mechanisms Using Distributed Systems
Fault-Tolerant Systems Put into Place at Amazon
Amounts/Volumes of Data on Integrity and "Five 9s Uptime" Requirements
Distributed Systems in the Organization in Terms of Its Big Data Processes
Discuss What Changes Need to Be Made
Conclusion
References

Introduction

The big data era is changing how communities, governments, e-commerce firms, and health organizations operate. Big data research brings both challenges and opportunities, particularly where organizations can leverage domain-specific analytics; on social commerce sites, for instance, platforms such as BigML are used to study how online information influences consumer purchase decisions. The case examined here concerns Amazon Web Services (AWS), with a focus on computing, storage, and networking. AWS platforms are designed to operate with minimal human interaction and little up-front financial investment, which is why IT managers increasingly look to cloud deployments for highly available, reliable, fault-tolerant systems. Understanding customer purchase decisions, in turn, calls for a big data analysis framework that connects big data with business intelligence (Hernandez et al., 2017).

Big Data Fault-Tolerant Mechanisms Using Distributed Systems

Amazon has grown alongside the social commerce sites on which online social information has become a market force. Fault tolerance is the ability of a system to keep operating correctly when some of its components fail. AWS is well suited to building fault-tolerant software systems: Amazon Elastic Compute Cloud (Amazon EC2) supplies the computing resources, namely server instances, on which software systems are hosted, and fault tolerance is achieved by running multiple EC2 instances together with ancillary services such as Auto Scaling and Elastic Load Balancing (Araújo et al., 2019). EC2 instances have IP addresses and can be reached through the usual methods for interacting with remote machines, and an Amazon Machine Image (AMI) packages the operating system and applications that run as a virtual server in the cloud. For Amazon, cloud computing brings the advantages of cost efficiency, effectively unlimited storage, and seamless access.

The first step in building fault-tolerant applications on AWS is to create AMIs with the system already configured, covering the launch and deployment steps and the data the application needs. How much of this configuration is made static depends on how frequently the application changes: more static configuration shortens launch time, while more dynamic configuration preserves flexibility (Fault-Tolerant Components on AWS, 2019). Because the application configuration is captured in the AMI, an instance failure is addressed by launching a replacement instance from the same AMI (a minimal sketch of this pattern follows). To guard against degradation of instance storage, Amazon Elastic Block Store (EBS) provides block storage volumes that attach to an instance yet persist independently of it.
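The following is a minimal sketch of the replace-from-the-same-AMI pattern using the boto3 Python SDK; the AMI ID, instance type, and region are hypothetical placeholders, not values from any real deployment.

    import boto3

    # Hypothetical values; substitute your own AMI, instance type, and region.
    AMI_ID = "ami-0123456789abcdef0"
    INSTANCE_TYPE = "t3.micro"

    ec2 = boto3.client("ec2", region_name="us-east-1")

    def launch_replacement(ami_id: str = AMI_ID) -> str:
        """Launch a fresh instance from the same AMI the failed one used."""
        response = ec2.run_instances(
            ImageId=ami_id,
            InstanceType=INSTANCE_TYPE,
            MinCount=1,
            MaxCount=1,
        )
        instance_id = response["Instances"][0]["InstanceId"]
        # Block until the instance is running before returning it to service.
        ec2.get_waiter("instance_running").wait(InstanceIds=[instance_id])
        return instance_id

    if __name__ == "__main__":
        print("Replacement instance:", launch_replacement())

Because the AMI already carries the static configuration, nothing beyond the image ID is needed to bring the replacement to the same state as the instance it replaces.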
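In the same hedged spirit, this sketch shows an EBS volume created and attached with boto3 so that data persists independently of the instance; the availability zone, instance ID, and device name are placeholders.

    import boto3

    ec2 = boto3.client("ec2", region_name="us-east-1")

    # Create a 100 GiB volume in the same availability zone as the instance.
    volume = ec2.create_volume(
        AvailabilityZone="us-east-1a",  # placeholder AZ
        Size=100,                       # GiB
        VolumeType="gp3",
    )
    ec2.get_waiter("volume_available").wait(VolumeIds=[volume["VolumeId"]])

    # Attach the volume. It outlives the instance and can be re-attached
    # to a replacement instance after a failure.
    ec2.attach_volume(
        VolumeId=volume["VolumeId"],
        InstanceId="i-0123456789abcdef0",  # placeholder instance ID
        Device="/dev/sdf",
    )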
Fault-Tolerant Systems Put into Place at Amazon

Amazon EBS volumes are built for high reliability, which mitigates the possibility of failure. Building a fault-tolerant application comes down to duplicating the original components, so that no single failure causes a major loss of data or functionality. The same idea operates at the data-processing level: distributed file systems in the tradition of the Google File System (GFS) provide data persistence, while the MapReduce model drives the execution of the entire operation and handles load balancing efficiently (a minimal illustration of the map/reduce pattern appears at the end of this section). On the theoretical side, business intelligence (BI) programs are framed as collaborative efforts that are linked to business strategy, embedded into organizational processes, and used to enable action at the right time; the research accordingly examines the factors embedded in organizational processes together with their knowledge component (Neto et al., 2018).

The fault-tolerant components on AWS begin with the computing resources themselves: server instances engineered for high reliability and fault tolerance. EC2 instances run familiar operating systems such as Linux and Windows, so existing software can be accommodated, and each instance is reachable at an IP address through the usual methods for interacting with remote machines, with an Amazon Machine Image defining the software configuration used at launch. Auto Scaling then treats a collection of EC2 instances as a logical group so that scaling and management can be automated: the fleet is held at the required capacity by monitoring its instances and launching new ones as and when needed (Pittandavida, 2019); a sketch of such a group also follows this section. Whether this fault-tolerant approach pays off depends on how strongly the business depends on IT resources and expertise. As the technologies converge, data quality becomes the limiting factor, and the analysis suggests streamlining analytics with downstream data-visualization solutions to ensure that the data used for business analytics can be trusted (Wamba et al., 2015).

Amounts/Volumes of Data on Integrity and "Five 9s Uptime" Requirements

Cloud providers state their reliability commitments as service level agreements (SLAs), and the IT staff operating Amazon Web Services work to the specific uptime figures those SLAs promise. Some providers go beyond uptime SLAs to throughput and consistency SLAs, with service credits drawn from reserved resources when an SLA is violated; a latency SLA, for example, may commit to delivering reads in under 10 ms and writes in under 15 ms for a 1 KB document payload, again with credits on violation. Google and Microsoft both offer five-9s (99.999%) uptime SLAs for multi-region instances, backed by a maximum service credit of 50%; elsewhere the uptime SLA is four 9s, with the exception of read-only uptime, and the credit maximum is 25%. Amazon lacks an official public SLA in this category (worked arithmetic for what "five 9s" permits in downtime also appears below). When judging whether such targets are realistic, the following failure modes are worth considering:
- Software leaking memory and other resources; this covers the software one writes, but also the application frameworks and device drivers it depends on.
- File systems fragmenting over time, which degrades performance as well.
- Hardware devices degrading over time.
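As a minimal, self-contained illustration of the map/reduce pattern referenced above, the following pure-Python word count stands in for what a real cluster framework would distribute across machines; every name in it is illustrative.

    from __future__ import annotations

    from collections import defaultdict
    from typing import Iterable, Iterator

    def map_phase(document: str) -> Iterator[tuple[str, int]]:
        """Map: emit a (word, 1) pair for every word in the document."""
        for word in document.split():
            yield (word.lower(), 1)

    def reduce_phase(pairs: Iterable[tuple[str, int]]) -> dict[str, int]:
        """Reduce: sum the counts for each word across all mapped pairs."""
        counts: dict[str, int] = defaultdict(int)
        for word, count in pairs:
            counts[word] += count
        return dict(counts)

    documents = ["big data needs fault tolerance", "big data on aws"]
    pairs = (pair for doc in documents for pair in map_phase(doc))
    print(reduce_phase(pairs))  # {'big': 2, 'data': 2, 'needs': 1, ...}

In a cluster setting the framework, not the programmer, partitions the mapped pairs across workers, which is where the efficient load balancing described above comes from.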
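The Auto Scaling grouping can be expressed with boto3 roughly as follows; the group name, launch template, and availability zones are hypothetical, and the group is deliberately spread across two zones to anticipate the multi-AZ design discussed later.

    import boto3

    autoscaling = boto3.client("autoscaling", region_name="us-east-1")

    # Keep between 2 and 6 instances running; the service monitors the
    # fleet and launches replacements when instances fail health checks.
    autoscaling.create_auto_scaling_group(
        AutoScalingGroupName="web-fleet",                       # hypothetical
        LaunchTemplate={"LaunchTemplateName": "web-template"},  # hypothetical
        MinSize=2,
        MaxSize=6,
        DesiredCapacity=2,
        AvailabilityZones=["us-east-1a", "us-east-1b"],  # spans two AZs
        HealthCheckType="EC2",
    )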
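To make the "five 9s" requirement concrete, the arithmetic below converts an availability percentage into permitted downtime per year; this is standard availability arithmetic rather than any provider's formula.

    SECONDS_PER_YEAR = 365 * 24 * 60 * 60  # ignoring leap years

    def downtime_seconds_per_year(availability_pct: float) -> float:
        """Seconds of downtime per year allowed at a given availability."""
        return SECONDS_PER_YEAR * (1 - availability_pct / 100)

    for pct in (99.9, 99.99, 99.999):
        minutes = downtime_seconds_per_year(pct) / 60
        print(f"{pct}% uptime allows ~{minutes:.1f} minutes of downtime/year")

    # 99.9%   -> ~525.6 minutes/year
    # 99.99%  -> ~52.6 minutes/year
    # 99.999% -> ~5.3 minutes/year ("five 9s")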
On the analytics side, the advantage of an easy-to-use analytical platform is that real-time event-stream-processing applications can be created and changed dynamically. Oracle Data Integrator, for instance, brings portability across big data platforms together with the ability to handle the required transformations. Data drawn from social systems demonstrates how people connect dynamically across channels and locations, revealing unexpected patterns in global behaviour (Tian et al., 2017), and the same data bears on long-standing questions about human mobility.

Distributed Systems in the Organization in Terms of Its Big Data Processes

Fault tolerance for a busy e-commerce operation rests on the same building blocks it uses for computing, storage, networking, and databases: Amazon EC2 together with Elastic Block Store and Auto Scaling. EC2 instances accommodate the chosen operating systems, and the Amazon Machine Image lets the developer decide, according to how frequently the application changes, how much configuration to bake in; the more static configuration the AMI carries, the less work remains at boot and the shorter the launch time. Elastic Block Store supplies the persistent block storage volumes those EC2 instances use. On the modelling side, the approach relies on dimensionality-reduction techniques to cope with a large number of covariates before the specific regression models are fitted; the strength of each association is judged by its p-value, with care given to the study design and to how the predictors and parameters are defined (a hedged sketch of this step appears before the conclusion). At the infrastructure level, AWS Availability Zones are connected by private fiber-optic networking, which lets architects build applications that fail over between AZs without interruption. Applications and databases can therefore be designed and operated for a level of availability and scalability that a traditional single data center, or even multiple data centers, would struggle to match.

Discuss What Changes Need to Be Made

For the distributed system, attention needs to shift to specific user types, including users holding multiple accounts, since these accounts generate much of the big data. The sharpest violation of the ideal-user assumption is deliberate manipulation, in which the platform is used in unintended ways; such cases have been reported, and platforms have even been used for extra-governmental political mobilizations and protests. For Amazon, then, the technology requirements must be matched to the new data-savvy generation of users, and the opportunities presented by domain-specific analytics must be leveraged for emerging applications with wider business reach. A comprehensive approach would build a model of the business that integrates the theory of business rules, so that machine learning models can be adjusted against it, with algorithms for pattern analysis working across the organization's different big data and knowledge management systems (Lazer et al., 2017).
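As a hedged sketch of the dimensionality-reduction-then-regression step described above, the following uses scikit-learn on synthetic data; the dataset, component count, and model choice are all illustrative assumptions rather than anything taken from the study itself.

    from sklearn.datasets import make_classification
    from sklearn.decomposition import PCA
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import cross_val_score
    from sklearn.pipeline import make_pipeline

    # Synthetic stand-in for a wide table of covariates (e.g. purchase features).
    X, y = make_classification(n_samples=500, n_features=100,
                               n_informative=10, random_state=0)

    # Reduce 100 covariates to 10 principal components, then fit a
    # regression model; cross-validation stands in for the validation
    # and selection step emphasized in the conclusion.
    model = make_pipeline(PCA(n_components=10),
                          LogisticRegression(max_iter=1000))
    scores = cross_val_score(model, X, y, cv=5)
    print(f"mean cross-validated accuracy: {scores.mean():.3f}")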
Conclusion

The evidence-based documentation covers current and emerging applications, and the emphasis falls on the knowledge-based view, in which intangible resources drive competitive advantage and organizational success. Model building is directed at the validation and selection of the specifics that relate to high predictive power, including covariates determined through advanced techniques such as neural networks.

References

Araújo Neto, J. P., Pianto, D. M., & Ralha, C. G. (2019). Towards increasing reliability of Amazon EC2 spot instances with a fault-tolerant multi-agent architecture. Multiagent and Grid Systems, 15(3), 259-287.

Fault-Tolerant Components on AWS (2019). AWS whitepaper. https://docs.aws.amazon.com/whitepapers/latest/fault-tolerant-components/fault-tolerant-components.pdf

Hernandez, I., & Zhang, Y. (2017). Using predictive analytics and big data to optimize pharmaceutical outcomes. American Journal of Health-System Pharmacy, 74(18), 1494-1500.

Lazer, D., & Radford, J. (2017). Data ex machina: Introduction to big data. Annual Review of Sociology, 43, 19-39.

Neto, J. P. A., Pianto, D. M., & Ralha, C. G. (2018, October). An agent-based fog computing architecture for resilience on Amazon EC2 spot instances. In 2018 7th Brazilian Conference on Intelligent Systems (BRACIS) (pp. 360-365). IEEE.

Pittandavida, S. (2019). Auto-recovery and continuous disaster tolerance in Amazon Web Services instance using Autodeployer script automation tool (Doctoral dissertation, National College of Ireland, Dublin).

Tian, X., & Liu, L. (2017). Does big data mean big knowledge? Integration of big data analysis and conceptual model for social commerce research. Electronic Commerce Research, 17(1), 169-183.

Wamba, S. F., Akter, S., Edwards, A., Chopin, G., & Gnanzou, D. (2015). How 'big data' can make big impact: Findings from a systematic review and a longitudinal case study. International Journal of Production Economics, 165, 234-246.
