Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Pricing

  • Log In
  • Sign Up
Aneta Melisa Stal's picture
6 5 3

Aneta Melisa Stal

melisa
YuntaoChen's profile picture 21world's profile picture cmhungsteve's profile picture
·

AI & ML interests

NLP

Organizations

Writer's profile picture BigCode's profile picture

Collections 1

Daily Papers
  • Simple linear attention language models balance the recall-throughput tradeoff

    Paper • 2402.18668 • Published Feb 28 • 17
  • Linear Transformers with Learnable Kernel Functions are Better In-Context Models

    Paper • 2402.10644 • Published Feb 16 • 73
  • Repeat After Me: Transformers are Better than State Space Models at Copying

    Paper • 2402.01032 • Published Feb 1 • 22
  • Zoology: Measuring and Improving Recall in Efficient Language Models

    Paper • 2312.04927 • Published Dec 8, 2023 • 2

models 3

melisa/short-mistral-7b-with-healing2-math-2000

Updated 19 days ago

melisa/taco-decoder

Updated Jul 1, 2022

melisa/taco-tagger

Updated Jun 29, 2022

datasets

None public yet
Company
© Hugging Face
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs