Learn from Curated Repository Collection #22

@alirezamshi

Description

Learn from ~2k curated repositories in ML, data, and quant domains. Extract knowledge and serve via skills.leeroo.com.

Data Source

Repository list: https://api.leeroo.com/storage/v1/object/public/avatars/f47ac10b-58cc-4372-a567-0e02b2c3d479/file_list.json

Domains covered:

  • Machine Learning
  • Data Engineering
  • Quantitative Finance

Goal

  1. Run the repo learner (issue #7, Implement Repository Learner) on all repos in the list
  2. Extract patterns, abstractions, and best practices
  3. Index into knowledge graph
  4. Serve at https://skills.leeroo.com

Considerations

  • Batch processing for 2k repos
  • Rate limiting for GitHub API
  • Incremental updates (new repos, repo changes)
  • Quality filtering (stars, activity, code quality)
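A minimal sketch of how the considerations above might fit together, not the repo learner's actual implementation. The `Repo` record, its field names, the threshold defaults, and the `seen_shas` map are all assumptions made for illustration; the real schema of `file_list.json` is not described in this issue. Code-quality scoring is left out because no metric is specified here.

```python
from dataclasses import dataclass
from typing import Dict, Iterator, List, Sequence


@dataclass(frozen=True)
class Repo:
    # Hypothetical record; field names are assumptions, not the real schema.
    full_name: str
    stars: int
    days_since_last_push: int
    head_sha: str


def passes_quality_filter(repo: Repo, min_stars: int = 50,
                          max_idle_days: int = 365) -> bool:
    """Keep repos with enough stars and recent activity (thresholds assumed)."""
    return repo.stars >= min_stars and repo.days_since_last_push <= max_idle_days


def needs_processing(repo: Repo, seen_shas: Dict[str, str]) -> bool:
    """Incremental update check: process new repos and repos whose HEAD changed."""
    return seen_shas.get(repo.full_name) != repo.head_sha


def batched(items: Sequence[Repo], batch_size: int) -> Iterator[List[Repo]]:
    """Yield consecutive slices of at most batch_size items."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    for start in range(0, len(items), batch_size):
        yield list(items[start:start + batch_size])


def min_seconds_between_requests(requests_per_hour: int) -> float:
    """Even pacing that stays within an hourly API request budget."""
    if requests_per_hour <= 0:
        raise ValueError("requests_per_hour must be positive")
    return 3600.0 / requests_per_hour


def plan_batches(repos: Sequence[Repo], seen_shas: Dict[str, str],
                 batch_size: int = 100) -> List[List[Repo]]:
    """Filter by quality, skip unchanged repos, then split the rest into batches."""
    todo = [r for r in repos
            if passes_quality_filter(r) and needs_processing(r, seen_shas)]
    return list(batched(todo, batch_size))
```

A worker could call `time.sleep(min_seconds_between_requests(budget))` between GitHub API calls, where `budget` is whatever hourly limit applies to the token in use, and write each processed repo's `head_sha` back into `seen_shas` so the next run only touches new or changed repos.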

Labels

P0 (Highest Priority)
