Skip to main content
Version: 2.0.0

Ankiren Linguistic API

API for managing NLP datasets in the Linguistic microservice.

Features

  • Dataset Catalog: Browse and manage NLP dataset metadata
  • Token Search: Search through dataset tokens with mixed token/tag sequence matching
  • Tag Statistics: View POS tag distributions and counts
  • Export: Download filtered search results as TSV files

Authentication

  • Public endpoints (GET) require no authentication
  • Admin endpoints (POST, PUT, DELETE) require JWT with admin role
  • JWT can be passed via:
    • Authorization: Bearer <token> header
    • authjs.session-token cookie

Data Storage

  • Dataset metadata is stored in Cloudflare D1 (SQLite)
  • Token data is stored in Cloudflare R2 (object storage) in VLSP format

Authentication

JWT token from shell app authentication

Security Scheme Type:

http

HTTP Authorization Scheme:

bearer

Bearer format:

JWT

Contact

Ankiren Team:

URL: https://ankiren.com