Skip to content
@stanford-futuredata

Future Data Systems

We are a CS research group building data-intensive systems

Popular repositories Loading

  1. ColBERT ColBERT Public

    ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

    Python 3.3k 404

  2. macrobase macrobase Public

    MacroBase: A Search Engine for Fast Data

    Java 664 126

  3. ARES ARES Public

    Automated Evaluation of RAG Systems

    Python 548 57

  4. noscope noscope Public

    Accelerating network inference over video

    Python 435 122

  5. sparser sparser Public

    Sparser: Raw Filtering for Faster Analytics over Raw Data

    C 430 54

  6. dawn-bench-entries dawn-bench-entries Public

    DAWNBench: An End-to-End Deep Learning Benchmark and Competition

    Python 260 73

Repositories

Showing 10 of 70 repositories
  • FrugalGPT Public

    FrugalGPT: better quality and lower cost for LLM applications

    stanford-futuredata/FrugalGPT’s past year of commit activity
    Jupyter Notebook 200 Apache-2.0 25 3 0 Updated Feb 10, 2025
  • colbert-serve Public
    stanford-futuredata/colbert-serve’s past year of commit activity
    Python 2 0 0 0 Updated Jan 16, 2025
  • ColBERT Public

    ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

    stanford-futuredata/ColBERT’s past year of commit activity
    Python 3,250 MIT 404 82 20 Updated Nov 18, 2024
  • ARES Public

    Automated Evaluation of RAG Systems

    stanford-futuredata/ARES’s past year of commit activity
    Python 548 Apache-2.0 57 13 1 Updated Nov 4, 2024
  • stk Public
    stanford-futuredata/stk’s past year of commit activity
    Python 100 Apache-2.0 19 2 0 Updated Aug 26, 2024
  • gavel Public

    Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020

    stanford-futuredata/gavel’s past year of commit activity
    Jupyter Notebook 126 MIT 32 8 2 Updated Jul 25, 2024
  • InQuest Public

    Accelerating Aggregation Queries on Unstructured Streams of Data

    stanford-futuredata/InQuest’s past year of commit activity
    Python 7 2 1 0 Updated Apr 18, 2024
  • Megatron-LM Public Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    stanford-futuredata/Megatron-LM’s past year of commit activity
    Python 34 2,638 0 2 Updated Jan 19, 2024
  • tasti Public

    Semantic Indexes for Machine Learning-based Queries over Unstructured Data (SIGMOD 2022)

    stanford-futuredata/tasti’s past year of commit activity
    Python 15 5 0 0 Updated Jan 17, 2024
  • omg Public
    stanford-futuredata/omg’s past year of commit activity
    Python 21 Apache-2.0 3 0 0 Updated Sep 20, 2023

Top languages

Loading…

Most used topics

Loading…