SQLite data analysis and extension

Cerrado Publicado hace 6 meses Pagado a la entrega
Cerrado Pagado a la entrega

Requirement:

You have a "categories" table with three columns: "id," "name," and "main_category_id," containing 118,118 rows. The "id" field represents the category's unique identifier, the "name" field contains the category's English name, and the "main_category_id" is currently empty (NULL), serving as a foreign key referencing the yet-to-be-populated "main_categories" table.

Here's an example of the "categories" table content:

id name main_category_id

1 attributed-no-source NULL

2 best NULL

6 out-of-control NULL

8 worst NULL

9 dance NULL

Notably, the "name" column contains whole words or hyphenated words.

Additionally, there is an empty "main_categories" table with two columns: "id" and "name."

Task:

Your task is to cluster the categories, find meaningful cluster names that reflect the grouped categories, populate the "main_categories" table with these cluster names, and establish foreign key relationships between the "main_categories" and "categories" tables using the "id" column of the "main_categories" table and the "main_category_id" column of the "categories" table. To maintain consistency, the cluster names should be in lowercase, whether they consist of whole words or hyphenated words.

Additional Points:

1. It is crucial that the main categories you create accurately reflect the content and meaning of the categories contained within each cluster. This ensures that the clustering is intuitive and beneficial.

2. While creating the main categories, please aim to keep the number of main categories reasonable. We want to maintain a balance between granularity and simplicity.

Acceptance Criteria (AC):

AC1: The "main_categories" table must contain the primary category clusters. For example:

id name

1 love

2 life

3 friendship

4 inspiration

5 wisdom

6 relationships

... ...

AC2: The "main_category_id" column in the "categories" table should contain the ID of the cluster (acting as a foreign key referencing the "id" of the "main_categories" table). This should be applied to all 118,118 rows in the "categories" table. For example:

id name main_category_id

3 life 2

4 love 1

7 truth 5

19 philosophy 5

20 friendship 3

23 marriage 6

33 friends 3

50 desire 1

52 honesty 5

55 passion 1

56 reality 2

57 relationships 6

... ... ...

SQLite Python Procesamiento de datos Programación de bases de datos Administración de bases de datos

Nº del proyecto: #37390506

Sobre el proyecto

23 propuestas Proyecto remoto Activo hace 5 meses

23 freelancers están ofertando un promedio de $139 por este trabajo

schoudhary1553

Top 1% in Freelancer.com Hi, Greetings! ✅checked your project details: ✅Completed Time: In project deadline We have worked on 900 + Projects. I have 6 + years of the experience in same kind of projects. If you are look Más

$180 USD en 3 días
(452 comentarios)
8.4
ExpertSoul

Hi, I will efficiently cluster and populate the "main_categories" table with meaningful cluster names, and establish foreign key relationships in the "categories" table for all 118,118 rows, ensuring data consistency a Más

$100 USD en 1 día
(32 comentarios)
5.9
prakash2813

⭐⭐⭐⭐⭐ Hi there, I am full stack developer with 7+ years of experience in website and desktop app development with Oracle, MySQL, MariaDB, SQL Server, PostgreSQL, MongoDB and more. I have strong expertise in Database d Más

$230 USD en 3 días
(40 comentarios)
5.9
nycer847

Hello Dear, I hope you are doing great and are in good health. I see that you need a statistical data analyst. I am an expert data analyst and research consultant with a deep understanding of several financial models Más

$155 USD en 8 días
(1 comentario)
4.2
MQamar123

Dear Client, my name is Muhammad, and I am a tech enthusiast with over 7 years of experience in software development. From my extensive background, you can expect me to have the expertise required to help you with your Más

$30 USD en 7 días
(5 comentarios)
4.3
Ghazik

Hello Ready to SQLite data analysis and extension. Please send me a message to discuss further requirements of this project in chat. Thanks

$200 USD en 4 días
(8 comentarios)
3.8
ahmedsud

Hi, i can do the task, please contact me to discuss more. Can do PoC on DB and show you the results on given data. Or can be done through remote access directly on your provided workstation.

$111 USD en 3 días
(4 comentarios)
2.7
keremozsa

Hello there! My name is Kerem and I'm a Python enthusiast with 8+ years of experience. I understand that you need help clustering the categories in your "categories" table and populating the "main_categories" table wit Más

$140 USD en 7 días
(1 comentario)
1.9
rajeevnewnetlink

Hi, We went through your project description and it seems like our team is a great fit for this job. We are an expert team which have many years of experience on Python, Data Processing, Database Administration, Data Más

$110 USD en 7 días
(4 comentarios)
1.4
danwrights605

SOFTWARE ARCHITECTURE EXPERT GOOD IN PHP, PYTHON, JAVA SCRIPT, C PROGRAMMING, SQL AND CUDA I have gone through your project details and requirements keenly. I am very convinced to deliver the project within your expect Más

$120 USD en 5 días
(1 comentario)
0.4
msaqibshah1990

Hello, We specialize in data processing, database administration, database programming, Python and more. We have over 10+ years of experience working with clients to deliver accurate products that meet their needs. I Más

$110 USD en 1 día
(1 comentario)
0.0
crispuswanjihia6

DATA BASE PROGRAMMER DEAR EMPLOYER, I’ve completed the exact same projects before successfully. Awarding me will be the fastest way to complete your task with the best rates possible. I CAN ASSURE YOU 100% THAT WE ARE Más

$140 USD en 7 días
(0 comentarios)
0.0
pulkitupadhyay92

"Hello , I trust you're doing well. I wanted to reach out to express my sincere interest in your project. Our experience and portfolio speak volumes about our ability to meet your project's demands. Our track record Más

$140 USD en 7 días
(0 comentarios)
0.0
siamak2099

Hello there! Thank you for considering me for this project. I am an experienced software developer and data analyst who has a strong background in software engineering and data analysis. I would love to help you analyz Más

$100 USD en 2 días
(0 comentarios)
0.0
rcrz1986

I understand your project requirement and would love to help you with your needs. As a dedicated full-stack developer, I have a deep understanding of Database administration, SQLite, DML and DDL. And these allow me to Más

$100 USD en 1 día
(0 comentarios)
0.0
johnwanjehia

To achieve the task, i will follow these steps: Data Preprocessing: Check the data for any inconsistencies or issues. Make sure that the "categories" table is clean and consistent. Clustering: Use a suitable cluster Más

$100 USD en 3 días
(0 comentarios)
0.0
chenguoqiangsg

Hello Boss, I am a college student who is about to graduate from the University of Wollongong in Australia. My undergraduate major is business information systems. During my undergraduate period, I got excellent result Más

$140 USD en 7 días
(0 comentarios)
0.0
vinod150987

Hi, Hope you well ! I have 12 + years experience in .NET with C++, C#, VB, JAVA, SpringBoot, ASP.NET MVC, ASP.NET Core, Angular, IIS, Web Service, WCF, JavaScript, jQuery, SQL, PL/SQL, SSIS, SSRS, Crystal Report, XML , Más

$230 USD en 1 día
(0 comentarios)
0.0
MohamedFathyB

I am a proficient SQL expert with extensive experience in data engineering at Vodafone. I excel in database management and have a proven track record in handling complex tasks. Leveraging my expertise, I will efficient Más

$120 USD en 7 días
(0 comentarios)
0.0
ahmedinmek1

I know that your demand is to manage your dataset by classification mechanism of unsupervised machine learning technique. Give me chance and let me do it.

$140 USD en 7 días
(0 comentarios)
0.0