🦨 Alpha's Tech Garden

    NLP Benchmarks

    Oct 22, 2025 · 1 min read

    • nlp
    • benchmark
    • ai
    • ml
    • index

    • MTEB
    • (Super)GLUE
    • BIG-bench
    • SemEval
    • USEB
    • BEIR
    • MIRACL
