forked from b2l-chennai/project-ideas
-
Notifications
You must be signed in to change notification settings - Fork 0
Query Collector
Dorai Thodla edited this page Nov 22, 2020
·
1 revision
Goal:
Collect SQL queries from open source projects on github to study them
Input:
- Programming Language (like Python)
- Number of projects to scan (n)
Output:
- SQL queries in the project (note that some projects may be using ORMs like SQLAlchemy or internal ORMs like Django Projects)
Process:
- Search for projects developed in the specified (user requested) programming language
- Pick the top n projects (n=100 by default)
- Analyze the repositories (that latest commits)
- Find whether they use sql databases (a scan of import statements may be a good place to start)
- Find the SQL strings and extract them (they may be buried in the code (which is bad) or in separate files or stored procedures)