Design and develop a prototype application with core Python (version 3.7 - 3.10) in Anaconda environment using a Jupyter notebook that demonstrates how data from the given data set (3 csv files – one is a mapping file with 10 rows & other 2 files have data = 150k rows each) can be formatted, reshaped and used to generate specific outputs. The application needs to be a single programme that provides the required functionality.
Based on interest & project finalization, further details would be shared. Would require deliverable with original thoughts & structure without any use of AI/Chat GPT or other AI generated content (if it’s AI generated content the project would be cancelled).
Require following 2 deliverables - 1st prototype deliverable by 12 December'24 & second deliverable by 15 December'24.
Deliverable: 1. Prototype program High level functionality The application should provide the basic functionality to load the initial data set (CSV file, translate it into a suitable format – preferably JSON or XML, JSON with means to back up the format using either files or a database (use an SQL or relational database - set up and populate database via a local host connection), keeping the current state of the data when the program is closed and make it available when the program is reopened. The file will require some cleaning & transformations.
A GUI (Python with Tkinter on a Jupyter notebook) to interact with the data set(s) that enables the user to: o Load CSV file o Data transformation o Load transformed data o Manipulate data for desired output
2. Report: Section 1: 1200 words Section 2: 1200 words Section 3: 600 words
Specific instructions: use NumPy, Pandas, Seaborn, Matplotlib only + appropriate Python core libraries to access/query a database & the application and its parts must not run concurrently, and must NOT use Python threads.