Parcourir la source

Deploy fine-tuned GRPO model as default query expansion (#67)

* Add query expansion model finetuning infrastructure

- Training scripts for Qwen3-0.6B and 1.7B models
- Dataset generation from s-emanuilov/query-expansion
- Evaluation scripts comparing finetuned vs baseline models
- GRPO RL training script (optional improvement)
- Export script for GGUF conversion

Results:
- 0.6B finetuned: 95% format compliance (lex/vec/hyde)
- Baseline: 0% format compliance
- Dataset: 5,157 examples on HuggingFace Hub

Models available at:
- tobil/qmd-query-expansion-0.6B (recommended)
- tobil/qmd-query-expansion-train (dataset)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Fix GRPO training script for TRL API compatibility

- Use max_completion_length instead of max_new_tokens
- Use processing_class instead of tokenizer
- Use args instead of config for GRPOTrainer
- Add __name__ attribute to reward function class
- Accept **kwargs in reward function for extra TRL args
- Add new LoRA adapter after merging SFT weights

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Update README with final evaluation results

- 0.6B SFT: 95% format compliance (best)
- 0.6B GRPO: 0% (catastrophic forgetting from RL)
- 1.7B v2: training completed, evaluation pending
- Added GRPO evaluation results

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Add comprehensive scoring system for query expansion

New scoring criteria (0-100 points):
- Format (30): Must have lex: and vec: prefixes
- Diversity (30): Multiple types, no echoing query, diverse expansions
- Hyde (20): Optional, concise, no newlines, no word repetition
- Quality (20): Lex=keywords, vec=natural language

See SCORING.md for full documentation.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Add HuggingFace login and comprehensive scoring to GRPO v2 training

- Add explicit HF_TOKEN login before training
- Use SCORING.md criteria as RL reward function
- Conservative training: LR 1e-6, LoRA rank 4
- Reward scores: good=0.94, bad=0.38

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Refactor finetune folder: train/rl scripts with YAML configs

Major changes:
- train.py: Generic SFT training script using YAML config
- rl.py: Generic GRPO training script using YAML config
- configs/: YAML configs per training run (sft_v4.yaml, grpo_v4.yaml)
- dataset/: Data preparation scripts moved here
- tui.py: Interactive model testing interface

Training results:
- SFT v4: 98.8% avg score (all Excellent)
- GRPO v4: 0% (failed - model drifted to verbose explanations)

Removed per-model scripts (train_0.6B.py, train_1.7B.py, etc)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Add named entity extraction to GRPO reward function

Key changes:
- Extract named entities (acronyms, proper nouns, technical terms)
- Heavy penalty (-30) when lex queries miss named entities
- Penalty (-15) for generic filler phrases like "find information about"
- Compound entity detection (TDS motorsports -> both words)
- Update GRPO config with KL regularization (beta=0.04)
- Lower learning rate (5e-7) and add max_steps (200)

Test results:
- "who is TDS motorsports" good: 1.00, bad: 0.30 (was 0.75)
- "how to use React hooks" good: 0.87, bad: 0.45 (was 0.75)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Add chat template leakage detection to reward function

Zero reward for outputs containing:
- <|im_start|>, <|im_end|> tokens
- <think>, </think> tags (Qwen3 thinking mode)
- Role markers like \nassistant\n, \nuser\n
- <|endoftext|> token

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Strict format validation: every line must be lex:/vec:/hyde:

Any line that doesn't start with a valid prefix now returns 0.0
instead of just counting as a penalty. This prevents any prose,
explanations, bullet points, or other invalid content.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Clean up evaluation files

- Remove old versioned evaluation files (0.6B, 1.7B, baseline)
- Rename evaluation_v4.json -> evaluation_sft.json
- Rename evaluation_v4_grpo.json -> evaluation_grpo_failed.json

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Refactor evals into separate run and score scripts

New structure:
- evals/run.py: Generate model outputs to JSONL
- evals/score.py: Score outputs with detailed breakdown
- evals/queries.txt: Test queries (26 total)

Features:
- Supports both HF Hub and local model paths
- Named entity preservation scoring
- Chat template leakage detection
- Strict format validation (every line must be lex:/vec:/hyde:)
- Generic phrase detection

Usage:
  uv run evals/run.py --model tobil/qmd-query-expansion-0.6B-v4
  uv run evals/score.py evals/results_*.jsonl

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Fix GRPO model loading to use SFT base first

The GRPO adapter was trained on merged SFT weights, so loading it
directly on the base model results in 0% score. Added --sft-model
parameter to evals/run.py to load SFT first, then apply GRPO adapter.

With correct loading: GRPO scores 89.7% (all 26 queries Excellent).

Updated README with correct GRPO score and loading instructions.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Fix TUI to load GRPO models with SFT base first

GRPO adapters were trained on merged SFT weights, so they need SFT
loaded and merged first before applying the GRPO adapter.

Updated MODELS config to include sft_base path for GRPO models,
and load_model() now handles the SFT -> merge -> GRPO flow.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Update README for unified model repository structure

All models (0.6B, 1.7B, 4B) with SFT and GRPO variants now go into
a single HuggingFace repo (tobil/qmd-query-expansion) with subfolders
for each size and training method.

Updated loading examples to show subfolder-based model loading.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Update README with separate model repos

Changed from subfolder approach to separate repos per model since
trainer.push_to_hub() doesn't support subfolder argument.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Add 1.7B and 4B GRPO training and GGUF conversion scripts

Training scripts for GRPO fine-tuning:
- train_1.7B_grpo.py: GRPO training for Qwen3-1.7B
- train_4B_grpo.py: GRPO training for Qwen3-4B

GGUF conversion scripts:
- convert_1.7B_gguf.py: Merge SFT+GRPO adapters and convert to GGUF
- convert_4B_gguf.py: Merge SFT+GRPO adapters and convert to GGUF

All scripts use PEP 723 inline dependencies for HuggingFace Jobs.

Models published:
- tobil/qmd-query-expansion-1.7B-sft
- tobil/qmd-query-expansion-1.7B-grpo
- tobil/qmd-query-expansion-1.7B-gguf
- tobil/qmd-query-expansion-4B-sft
- tobil/qmd-query-expansion-4B-grpo
- tobil/qmd-query-expansion-4B-gguf

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Remove beads issue tracking

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Remove beads reference from CLAUDE.md

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Fix GRPO reward function to handle think blocks and end tokens

- Strip <|im_end|> token from completions (model output includes it)
- Change think_penalty to skipped_think bonus (+20 for not using think)
- Adjust max_possible to account for bonus (120/140)
- Fix typo in chat template artifact check

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Make TUI model list dynamic from HuggingFace Hub

- Fetch available qmd-query-expansion models from tobil/ on Hub
- Auto-detect model size (0.6B, 1.7B, 4B) and use correct base model
- Group models by type (SFT vs GRPO) in menu
- Skip GGUF repos in model listing

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Fix GRPO training: apply chat template to prompts

The SFT model was trained with chat template format but GRPO was
passing raw prompts. Now prompts are formatted with tokenizer.apply_chat_template()
so the model sees the same format it learned during SFT.

Also update extract_query_from_prompt to strip chat template artifacts.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

* Finetune 2.0: consolidate and simplify the entire training pipeline

Consolidate ~2,800 lines of duplicated code across 12 files into 5 clean,
well-documented files targeting Qwen3-1.7B end-to-end.

Key changes:
- Extract reward function into single source of truth (reward.py)
  Previously duplicated 3x with divergent bugs across rl.py,
  train_1.7B_grpo.py, and train_4B_grpo.py
- Unify training into one script with sft/grpo subcommands (train.py)
  Replaces train.py + rl.py + train_1.7B_grpo.py + train_4B_grpo.py
- Merge eval generate+score into single eval.py
  Replaces evals/run.py + evals/score.py
- Parameterize GGUF conversion by --size (convert_gguf.py)
  Replaces convert_1.7B_gguf.py + convert_4B_gguf.py
- Fix critical bug: rl.py silently ignored beta/temperature from config,
  causing the exact catastrophic drift its own comments warned about
- Fix prompt consistency: all files use /no_think chat template format
- Retarget configs from 0.6B to 1.7B
- Comprehensive README documenting the full pipeline

Removed: rl.py, train_1.7B_grpo.py, train_4B_grpo.py, convert_1.7B_gguf.py,
convert_4B_gguf.py, tui.py, evals/run.py, evals/score.py

Net: -3,429 lines, +382 lines

Co-Authored-By: Claude (claude-fudge-eap-cc) <noreply@anthropic.com>

* Add HF Jobs scripts, temporal query examples, and training results

- jobs/sft.py and jobs/grpo.py: self-contained scripts for
  `hf jobs uv run` (no local GPU needed)
- 12 temporal/recency query examples in training data (e.g. "recent
  news about Shopify" -> lex with years 2025/2026)
- 4 temporal test queries in evals/queries.txt
- README updated with HF Jobs workflow, training results, and
  updated file structure
- Remove .beads tracking

SFT and GRPO successfully trained on A10G via HF Jobs:
  SFT: eval loss 0.321, token accuracy 92.4%
  GRPO: mean reward 0.757, 200 steps, KL 0.00048

Co-Authored-By: Claude (claude-fudge-eap-cc) <noreply@anthropic.com>

* Deploy fine-tuned GRPO model as default for query expansion

Switch from generic Qwen3-1.7B-Q8_0 (~2.2GB) to fine-tuned
qmd-query-expansion-1.7B-q4_k_m (~1.1GB). The fine-tuned Q4
scores 91.7% avg with 30/30 Excellent, outperforming the base Q8.

- Update default generate model in src/llm.ts
- Update README model table, architecture diagram, config block
- Add v2 training data, eval scripts, and quantize job
- Remove superseded v1 training data (5,742 → 1,000 examples)
- Update finetune README with v2 results and file structure

Co-Authored-By: Claude (claude-fudge-eap-cc) <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
Tobias Lütke il y a 3 mois
Parent
commit
eb1b77c8cb

+ 0 - 29
.beads/.gitignore

@@ -1,29 +0,0 @@
-# SQLite databases
-*.db
-*.db?*
-*.db-journal
-*.db-wal
-*.db-shm
-
-# Daemon runtime files
-daemon.lock
-daemon.log
-daemon.pid
-bd.sock
-
-# Legacy database files
-db.sqlite
-bd.db
-
-# Merge artifacts (temporary files from 3-way merge)
-beads.base.jsonl
-beads.base.meta.json
-beads.left.jsonl
-beads.left.meta.json
-beads.right.jsonl
-beads.right.meta.json
-
-# Keep JSONL exports and config (source of truth for git)
-!issues.jsonl
-!metadata.json
-!config.json

+ 0 - 1
.beads/.local_version

@@ -1 +0,0 @@
-0.43.0

+ 0 - 81
.beads/README.md

@@ -1,81 +0,0 @@
-# Beads - AI-Native Issue Tracking
-
-Welcome to Beads! This repository uses **Beads** for issue tracking - a modern, AI-native tool designed to live directly in your codebase alongside your code.
-
-## What is Beads?
-
-Beads is issue tracking that lives in your repo, making it perfect for AI coding agents and developers who want their issues close to their code. No web UI required - everything works through the CLI and integrates seamlessly with git.
-
-**Learn more:** [github.com/steveyegge/beads](https://github.com/steveyegge/beads)
-
-## Quick Start
-
-### Essential Commands
-
-```bash
-# Create new issues
-bd create "Add user authentication"
-
-# View all issues
-bd list
-
-# View issue details
-bd show <issue-id>
-
-# Update issue status
-bd update <issue-id> --status in_progress
-bd update <issue-id> --status done
-
-# Sync with git remote
-bd sync
-```
-
-### Working with Issues
-
-Issues in Beads are:
-- **Git-native**: Stored in `.beads/issues.jsonl` and synced like code
-- **AI-friendly**: CLI-first design works perfectly with AI coding agents
-- **Branch-aware**: Issues can follow your branch workflow
-- **Always in sync**: Auto-syncs with your commits
-
-## Why Beads?
-
-✨ **AI-Native Design**
-- Built specifically for AI-assisted development workflows
-- CLI-first interface works seamlessly with AI coding agents
-- No context switching to web UIs
-
-🚀 **Developer Focused**
-- Issues live in your repo, right next to your code
-- Works offline, syncs when you push
-- Fast, lightweight, and stays out of your way
-
-🔧 **Git Integration**
-- Automatic sync with git commits
-- Branch-aware issue tracking
-- Intelligent JSONL merge resolution
-
-## Get Started with Beads
-
-Try Beads in your own projects:
-
-```bash
-# Install Beads
-curl -sSL https://raw.githubusercontent.com/steveyegge/beads/main/scripts/install.sh | bash
-
-# Initialize in your repo
-bd init
-
-# Create your first issue
-bd create "Try out Beads"
-```
-
-## Learn More
-
-- **Documentation**: [github.com/steveyegge/beads/docs](https://github.com/steveyegge/beads/tree/main/docs)
-- **Quick Start Guide**: Run `bd quickstart`
-- **Examples**: [github.com/steveyegge/beads/examples](https://github.com/steveyegge/beads/tree/main/examples)
-
----
-
-*Beads: Issue tracking that moves at the speed of thought* ⚡

+ 0 - 62
.beads/config.yaml

@@ -1,62 +0,0 @@
-# Beads Configuration File
-# This file configures default behavior for all bd commands in this repository
-# All settings can also be set via environment variables (BD_* prefix)
-# or overridden with command-line flags
-
-# Issue prefix for this repository (used by bd init)
-# If not set, bd init will auto-detect from directory name
-# Example: issue-prefix: "myproject" creates issues like "myproject-1", "myproject-2", etc.
-# issue-prefix: ""
-
-# Use no-db mode: load from JSONL, no SQLite, write back after each command
-# When true, bd will use .beads/issues.jsonl as the source of truth
-# instead of SQLite database
-# no-db: false
-
-# Disable daemon for RPC communication (forces direct database access)
-# no-daemon: false
-
-# Disable auto-flush of database to JSONL after mutations
-# no-auto-flush: false
-
-# Disable auto-import from JSONL when it's newer than database
-# no-auto-import: false
-
-# Enable JSON output by default
-# json: false
-
-# Default actor for audit trails (overridden by BD_ACTOR or --actor)
-# actor: ""
-
-# Path to database (overridden by BEADS_DB or --db)
-# db: ""
-
-# Auto-start daemon if not running (can also use BEADS_AUTO_START_DAEMON)
-# auto-start-daemon: true
-
-# Debounce interval for auto-flush (can also use BEADS_FLUSH_DEBOUNCE)
-# flush-debounce: "5s"
-
-# Git branch for beads commits (bd sync will commit to this branch)
-# IMPORTANT: Set this for team projects so all clones use the same sync branch.
-# This setting persists across clones (unlike database config which is gitignored).
-# Can also use BEADS_SYNC_BRANCH env var for local override.
-# If not set, bd sync will require you to run 'bd config set sync.branch <branch>'.
-# sync-branch: "beads-sync"
-
-# Multi-repo configuration (experimental - bd-307)
-# Allows hydrating from multiple repositories and routing writes to the correct JSONL
-# repos:
-#   primary: "."  # Primary repo (where this database lives)
-#   additional:   # Additional repos to hydrate from (read-only)
-#     - ~/beads-planning  # Personal planning repo
-#     - ~/work-planning   # Work planning repo
-
-# Integration settings (access with 'bd config get/set')
-# These are stored in the database, not in this file:
-# - jira.url
-# - jira.project
-# - linear.url
-# - linear.api-key
-# - github.org
-# - github.repo

+ 0 - 48
.beads/issues.jsonl

@@ -1,48 +0,0 @@
-{"id":"qmd-0ic","title":"in qmd status, list all the additonal contexts under the collections that match","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-12T16:41:42.126194-05:00","updated_at":"2025-12-12T17:14:48.268119-05:00","closed_at":"2025-12-12T17:14:48.268119-05:00"}
-{"id":"qmd-18s","title":"Move cleanup/maintenance DB operations to store.ts","description":"Move cleanup operations from cleanup() command to store.ts. Create methods like deleteInactiveDocuments(), vacuumDatabase(), cleanupOrphanedContent(), etc.","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-12T16:36:21.815781-05:00","updated_at":"2025-12-12T16:42:36.896806-05:00","closed_at":"2025-12-12T16:42:36.896806-05:00","dependencies":[{"issue_id":"qmd-18s","depends_on_id":"qmd-29c","type":"parent-child","created_at":"2025-12-12T16:37:03.014111-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-1xd","title":"Update tests for YAML-based collections","description":"Update all tests to use YAML config instead of DB collections. Update test helpers to create temporary YAML configs.","notes":"Test suite has been updated for YAML-based collections. 92 tests passing, 4 skipped, 10 failing.\n\nThe 4 skipped tests call getStatus() which has a bug (queries non-existent collections table).\n\nThe 10 failing tests are due to bugs in store.ts functions (findDocument, getDocumentBody, getDocument, findSimilarFiles, matchFilesByGlob) that need to be updated to use YAML configuration. These are production code bugs, not test bugs.","status":"closed","priority":1,"issue_type":"task","created_at":"2025-12-13T09:54:53.349545-05:00","updated_at":"2025-12-13T11:37:16.935866-05:00","closed_at":"2025-12-13T11:37:16.935866-05:00","dependencies":[{"issue_id":"qmd-1xd","depends_on_id":"qmd-thw","type":"blocks","created_at":"2025-12-13T09:55:08.14305-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-29c","title":"Move all database operations from qmd.ts to store.ts","description":"Currently qmd.ts has ~70 direct database operations (db.prepare, db.exec). All database operations should be moved to store.ts to improve separation of concerns. qmd.ts should only use high-level methods from store.ts that don't require direct SQL knowledge.","notes":"Phase 1 complete: Moved collection operations (listCollections, removeCollection, renameCollection) to store.ts. Created 4 subtasks for remaining work: document indexing, context management, embeddings, and cleanup operations.","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-12T16:32:13.722223-05:00","updated_at":"2025-12-12T16:49:53.829124-05:00","closed_at":"2025-12-12T16:49:53.829124-05:00"}
-{"id":"qmd-2gn","title":"Fix store.ts functions to use YAML collections","description":"Update findDocument(), getDocumentBody(), getDocument(), findSimilarFiles(), matchFilesByGlob(), and getStatus() to use YAML collection configuration instead of querying the collections table. These functions currently fail because they try to query the non-existent collections table.","notes":"Fixed:\n- FTS schema (filepath, title, body columns) \n- getStatus() to use YAML collections\n- searchFTS() to not query collections table\n- findDocument() absolute path matching\n\nTest results: 93 passing (up from 92), 4 skipped, 9 failing\n\nRemaining failures:\n- getDocumentBody (2 tests)\n- getDocument (1 test)  \n- findSimilarFiles (2 tests)\n- matchFilesByGlob (1 test)\n- Integration/context tests (3 tests)","status":"in_progress","priority":1,"issue_type":"bug","created_at":"2025-12-13T11:37:22.706882-05:00","updated_at":"2025-12-13T12:32:38.336752-05:00"}
-{"id":"qmd-3qi","title":"Document docid hash usage in CLAUDE.md","description":"Update CLAUDE.md to document that short hash IDs (#abc123) work with get and multi-get commands. Include examples.","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-15T12:54:30.634116-05:00","updated_at":"2025-12-15T13:12:38.91973-05:00","closed_at":"2025-12-15T13:12:38.91973-05:00","dependencies":[{"issue_id":"qmd-3qi","depends_on_id":"qmd-lwo","type":"parent-child","created_at":"2025-12-15T12:54:52.002856-05:00","created_by":"daemon","metadata":"{}"},{"issue_id":"qmd-3qi","depends_on_id":"qmd-apl","type":"blocks","created_at":"2025-12-15T12:54:52.066036-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-3z9","title":"Design YAML schema and create collections.ts module","description":"Create collections.ts to manage YAML-based collection configuration at ~/.config/qmd/index.yml. Define TypeScript types for collections and contexts. Implement load/save functions with Bun's native YAML support.","design":"YAML structure:\n```yaml\n# Global context for all collections\nglobal_context: \"...\"\n\ncollections:\n  name:\n    path: /absolute/path\n    pattern: \"**/*.md\"\n    context:\n      \"/path/prefix\": \"Description\"\n      \"/\": \"Root context\"\n```\n\nTypeScript types:\n- Collection: { path, pattern, context }\n- CollectionConfig: { global_context?, collections }\n- Functions: loadConfig(), saveConfig(), getCollection(), listCollections()","status":"closed","priority":1,"issue_type":"task","created_at":"2025-12-13T09:54:52.586027-05:00","updated_at":"2025-12-13T09:56:57.309927-05:00","closed_at":"2025-12-13T09:56:57.309927-05:00"}
-{"id":"qmd-4ru","title":"Update document retrieval for new schema","description":"Functions like getDocument, findDocument, getMultipleDocuments need to work with new schema (path instead of filepath, content joins, virtual paths).","status":"closed","priority":0,"issue_type":"task","created_at":"2025-12-12T15:29:53.911881-05:00","updated_at":"2025-12-12T15:56:11.054888-05:00","closed_at":"2025-12-12T15:56:11.054888-05:00","dependencies":[{"issue_id":"qmd-4ru","depends_on_id":"qmd-ama","type":"discovered-from","created_at":"2025-12-12T15:29:53.912607-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-4u4","title":"Move embedding/vector DB operations to store.ts","description":"Move vector indexing DB operations from vectorIndex() to store.ts. Create methods like getHashesForEmbedding(), insertEmbedding(), clearEmbeddings(), etc.","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-12T16:36:21.683434-05:00","updated_at":"2025-12-12T16:42:40.42653-05:00","closed_at":"2025-12-12T16:42:40.42653-05:00","dependencies":[{"issue_id":"qmd-4u4","depends_on_id":"qmd-29c","type":"parent-child","created_at":"2025-12-12T16:37:02.944591-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-6s5","title":"Export current database to index.yml","description":"Write a script to export current collections and path_contexts from SQLite to ~/.config/qmd/index.yml format. Include all collection metadata and contexts.","status":"closed","priority":1,"issue_type":"task","created_at":"2025-12-13T09:54:52.707844-05:00","updated_at":"2025-12-13T09:57:36.650437-05:00","closed_at":"2025-12-13T09:57:36.650437-05:00","dependencies":[{"issue_id":"qmd-6s5","depends_on_id":"qmd-3z9","type":"blocks","created_at":"2025-12-13T09:55:07.606834-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-7ss","title":"remove all the symlinks and stuff in the git repo, clean up the root directory","status":"closed","priority":4,"issue_type":"task","created_at":"2025-12-12T16:40:00.744982-05:00","updated_at":"2025-12-12T17:11:18.034215-05:00","closed_at":"2025-12-12T17:11:18.034215-05:00"}
-{"id":"qmd-8eu","title":"Update documents table schema for collection names","description":"Change documents.collection_id (integer FK) to documents.collection (text). Update all queries and indices. Keep backwards compatibility during transition.","design":"Schema change:\n- Add `collection TEXT` column\n- Migrate data: UPDATE documents SET collection = (SELECT name FROM collections WHERE id = collection_id)\n- Drop collection_id column\n- Update FTS5 trigger\n- Update all queries in store.ts","status":"closed","priority":1,"issue_type":"task","created_at":"2025-12-13T09:54:52.830305-05:00","updated_at":"2025-12-13T10:08:24.88716-05:00","closed_at":"2025-12-13T10:08:24.88716-05:00","dependencies":[{"issue_id":"qmd-8eu","depends_on_id":"qmd-6s5","type":"blocks","created_at":"2025-12-13T09:55:07.662048-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-9ij","title":"Conditional query expansion based on BM25 signal strength","description":"Query expansion helps recall but injects false positives. Skip expansion if original BM25 top-5 has strong signals (exact term hits, high proximity). Only expand when recall is weak.","status":"closed","priority":3,"issue_type":"feature","created_at":"2025-12-20T17:18:41.806447-05:00","updated_at":"2025-12-21T12:05:40.85997-05:00","closed_at":"2025-12-21T12:05:40.85997-05:00"}
-{"id":"qmd-9ua","title":"Update all qmd commands for YAML-based collections","description":"Update qmd.ts commands: collection add/list/remove/rename, status, update, ls. All should use collections.ts instead of store.ts collection functions.","status":"closed","priority":1,"issue_type":"task","created_at":"2025-12-13T09:54:53.14644-05:00","updated_at":"2025-12-13T10:17:39.67707-05:00","closed_at":"2025-12-13T10:17:39.67707-05:00","dependencies":[{"issue_id":"qmd-9ua","depends_on_id":"qmd-u84","type":"blocks","created_at":"2025-12-13T09:55:07.893268-05:00","created_by":"daemon","metadata":"{}"},{"issue_id":"qmd-9ua","depends_on_id":"qmd-oxy","type":"blocks","created_at":"2025-12-13T09:55:07.942221-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-afe","title":"implement qmd collection rename, which changes the global path prefix for the collection","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-12T15:55:54.779325-05:00","updated_at":"2025-12-12T16:29:24.153196-05:00","closed_at":"2025-12-12T16:29:24.153196-05:00"}
-{"id":"qmd-ama","title":"Refactor database system","description":"All documents should be stored as content addressable hash, e.g. hash, doc, created_at,\n┃ updated_at. documents should be a file system layer on top e.g. collection, path, hash,\n┃ created_at, updated_at. (collection,path)\n┃\n┃\n\n┃ All documents should be stored as content addressable hash, e.g. hash, doc, created_at,\n┃ updated_at. documents should be a file system layer on top e.g. collection_id, path, hash,\n┃ created_at, updated_at. (collection,path) is unique. There is also collection which stores PWD\n┃ + glob pattern, name (\\w+). Every document is treated as path qmd://collection.name/","notes":"## Completed\n- ✅ Implemented content-addressable storage (content table with hash→doc mapping)\n- ✅ Refactored documents table as file system layer (collection_id, path, hash)\n- ✅ Added collection names (e.g., \"pages\", \"journals\", \"archive\")\n- ✅ Implemented virtual paths (qmd://collection-name/path/to/file.md)\n- ✅ Added hierarchical context support (collection-scoped)\n- ✅ Successfully migrated existing database\n- ✅ Updated search functions to work with new schema\n- ✅ Updated indexing logic to use content-addressable storage\n- ✅ Orphaned content hash cleanup\n\n## Still TODO\n- Fix migration SQL to properly extract basename (currently needs manual fix)\n- Implement `qmd collection add . --name \u003cname\u003e --mask '**/*.md'`\n- Implement `qmd ls [path]` for exploring virtual file tree","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-10T10:57:35.497489-05:00","updated_at":"2025-12-12T15:39:48.879143-05:00","closed_at":"2025-12-12T15:39:48.879143-05:00"}
-{"id":"qmd-apl","title":"Support docid hash lookup in get and multi-get commands","description":"Allow get and multi-get to accept #hash (6 char) as a file identifier. Add lookup function to resolve short hash to full document path. Handle collisions gracefully.","status":"closed","priority":1,"issue_type":"task","created_at":"2025-12-15T12:54:30.448466-05:00","updated_at":"2025-12-15T13:12:38.741755-05:00","closed_at":"2025-12-15T13:12:38.741755-05:00","dependencies":[{"issue_id":"qmd-apl","depends_on_id":"qmd-lwo","type":"parent-child","created_at":"2025-12-15T12:54:51.903613-05:00","created_by":"daemon","metadata":"{}"},{"issue_id":"qmd-apl","depends_on_id":"qmd-gbt","type":"blocks","created_at":"2025-12-15T12:54:52.031069-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-bs8","title":"Update documentation for YAML configuration","description":"Update CLAUDE.md, README.md with new YAML configuration approach. Document index.yml format and manual editing instructions.","status":"open","priority":2,"issue_type":"task","created_at":"2025-12-13T09:54:53.449584-05:00","updated_at":"2025-12-13T09:54:53.449584-05:00","dependencies":[{"issue_id":"qmd-bs8","depends_on_id":"qmd-1xd","type":"blocks","created_at":"2025-12-13T09:55:08.264615-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-bx1","title":"Fix migration SQL for proper basename extraction","description":"The migration currently generates collection names incorrectly (uses full path instead of basename). Need to fix the SQL in migrateToContentAddressable to properly extract the directory basename.","status":"closed","priority":1,"issue_type":"bug","created_at":"2025-12-12T15:29:53.757723-05:00","updated_at":"2025-12-12T15:50:29.349134-05:00","closed_at":"2025-12-12T15:50:29.349134-05:00","dependencies":[{"issue_id":"qmd-bx1","depends_on_id":"qmd-ama","type":"discovered-from","created_at":"2025-12-12T15:29:53.758524-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-c0m","title":"Comprehensive CLI review and consistency pass","description":"Review entire CLI command structure:\n- Consistent naming (add vs create, remove vs delete)\n- Consistent flag usage (--name, --mask, etc)\n- Update help text for all commands\n- Ensure virtual paths work everywhere\n- Test all commands end-to-end","status":"closed","priority":1,"issue_type":"task","created_at":"2025-12-12T15:29:38.083564-05:00","updated_at":"2025-12-12T16:06:51.544695-05:00","closed_at":"2025-12-12T16:06:51.544695-05:00"}
-{"id":"qmd-clr","title":"fix embed","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-12T16:14:55.292114-05:00","updated_at":"2025-12-12T16:31:27.661829-05:00","closed_at":"2025-12-12T16:31:27.661829-05:00"}
-{"id":"qmd-d00","title":"Add offline evaluation harness for tuning","description":"Create a small benchmark with ~100 labeled queries from real searches. Would enable tuning: expansion on/off threshold, candidate count (30 vs 100), blending weights, reranker threshold.","notes":"Test samples must be: 1) entirely synthetic, OR 2) public documents (e.g., public podcasts, public memos). No private/personal content in eval set.","status":"closed","priority":3,"issue_type":"feature","created_at":"2025-12-20T17:18:42.007265-05:00","updated_at":"2025-12-21T12:10:40.497797-05:00","closed_at":"2025-12-21T12:10:40.497797-05:00"}
-{"id":"qmd-deh","title":"Refactor database introduce qmd collection *","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-10T10:56:04.516137-05:00","updated_at":"2025-12-12T16:12:12.349428-05:00","closed_at":"2025-12-12T16:12:12.349428-05:00"}
-{"id":"qmd-df5","title":"Rerank multiple chunks per document with score aggregation","description":"Currently we only rerank 1 chunk per doc (selected by keyword heuristic). Should rerank top 2-3 chunks per document, then aggregate scores (max, softmax, or top-2 average). This improves ranking for long documents where the keyword-matched chunk isn't always the most relevant.","status":"closed","priority":2,"issue_type":"feature","created_at":"2025-12-20T17:18:41.592575-05:00","updated_at":"2025-12-21T12:04:11.777309-05:00","closed_at":"2025-12-21T12:04:11.777309-05:00"}
-{"id":"qmd-dmi","title":"Implement 'qmd collection' commands","description":"Add explicit collection management:\n- qmd collection add . --name \u003cname\u003e --mask '**/*.md'\n- qmd collection list\n- qmd collection remove \u003cname\u003e\n\nThis gives users control over collection names and patterns.","status":"closed","priority":1,"issue_type":"feature","created_at":"2025-12-12T15:29:53.810666-05:00","updated_at":"2025-12-12T16:02:08.079158-05:00","closed_at":"2025-12-12T16:02:08.079158-05:00","dependencies":[{"issue_id":"qmd-dmi","depends_on_id":"qmd-ama","type":"discovered-from","created_at":"2025-12-12T15:29:53.811294-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-dt1","title":"Redesign context add command for better usability","description":"Current issues: \n1. Virtual path qmd://journals/ is rejected as invalid\n2. Syntax is confusing - sometimes path is first arg, sometimes second\n3. Need to support collection root context (qmd://name/)\n4. Should be intuitive: qmd context add \u003cwhere\u003e \u003cwhat\u003e\nDesign goals:\n- Support qmd://collection/ for collection root context\n- Support qmd://collection/path for path-specific context\n- Clear, consistent syntax\n- Good error messages","status":"closed","priority":1,"issue_type":"task","created_at":"2025-12-13T09:39:19.764114-05:00","updated_at":"2025-12-13T09:41:38.467861-05:00","closed_at":"2025-12-13T09:41:38.467861-05:00"}
-{"id":"qmd-e2c","title":"Implement 'qmd ls' command","description":"Add command to explore virtual file tree:\n- qmd ls → list all collections\n- qmd ls \u003ccollection\u003e → list files in collection\n- qmd ls \u003ccollection\u003e/\u003cpath\u003e → list files under path\nOutput: flat list of qmd:// paths","status":"closed","priority":1,"issue_type":"feature","created_at":"2025-12-12T15:29:53.859804-05:00","updated_at":"2025-12-12T15:55:12.777701-05:00","closed_at":"2025-12-12T15:55:12.777701-05:00","dependencies":[{"issue_id":"qmd-e2c","depends_on_id":"qmd-ama","type":"discovered-from","created_at":"2025-12-12T15:29:53.860535-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-gbt","title":"Add docid field (first 6 chars of hash) to search results","description":"Include docid formatted as #hash[0:5] in all search output formats (CLI, JSON, CSV, Markdown, XML, files). The docid should be the first 6 characters of the document's SHA256 hash.","status":"closed","priority":1,"issue_type":"task","created_at":"2025-12-15T12:54:30.388094-05:00","updated_at":"2025-12-15T13:12:27.553492-05:00","closed_at":"2025-12-15T13:12:27.553492-05:00","dependencies":[{"issue_id":"qmd-gbt","depends_on_id":"qmd-lwo","type":"parent-child","created_at":"2025-12-15T12:54:51.846848-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-ht6","title":"Add --line-numbers CLI option for line-numbered output","description":"Add --line-numbers flag to CLI that formats output with line numbers: each line becomes \"{lineNum}: {content}\". Apply to get, multi-get, and search snippet output.","status":"closed","priority":1,"issue_type":"task","created_at":"2025-12-15T12:54:30.51773-05:00","updated_at":"2025-12-15T13:12:38.810032-05:00","closed_at":"2025-12-15T13:12:38.810032-05:00","dependencies":[{"issue_id":"qmd-ht6","depends_on_id":"qmd-lwo","type":"parent-child","created_at":"2025-12-15T12:54:51.941635-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-hw2","title":"Fix MCP resource listing and display paths","description":"Remove MCP resource listing, fix display paths to include collection name, handelize filenames during indexing, make line-numbers default for MCP snippets.","status":"closed","priority":1,"issue_type":"epic","created_at":"2025-12-16T12:00:02.815946-05:00","updated_at":"2025-12-16T12:26:17.690252-05:00","closed_at":"2025-12-16T12:26:17.690252-05:00"}
-{"id":"qmd-i3t","title":"Move context management DB operations to store.ts","description":"Move path_contexts INSERT/DELETE/SELECT operations from addContext(), listContexts(), removeContext() to store.ts. Create methods like insertContext(), deleteContext(), etc.","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-12T16:36:21.561746-05:00","updated_at":"2025-12-12T16:48:57.271485-05:00","closed_at":"2025-12-12T16:48:57.271485-05:00","dependencies":[{"issue_id":"qmd-i3t","depends_on_id":"qmd-29c","type":"parent-child","created_at":"2025-12-12T16:37:02.866006-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-j9z","title":"Add unit tests for content addressable hashes","description":"add same file from multiple places and verify that they both point at same hash. drop one collection and the content stays.","status":"closed","priority":3,"issue_type":"task","created_at":"2025-12-12T15:39:15.459504-05:00","updated_at":"2025-12-12T16:21:35.473776-05:00","closed_at":"2025-12-12T16:21:35.473776-05:00"}
-{"id":"qmd-kf8","title":"Move document indexing DB operations to store.ts","description":"Move INSERT/UPDATE/DELETE operations for documents and content tables from indexFiles() to store.ts. Create methods like insertDocument(), updateDocument(), deactivateDocuments(), etc.","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-12T16:36:14.558702-05:00","updated_at":"2025-12-12T16:45:38.830978-05:00","closed_at":"2025-12-12T16:45:38.830978-05:00","dependencies":[{"issue_id":"qmd-kf8","depends_on_id":"qmd-29c","type":"parent-child","created_at":"2025-12-12T16:37:02.770251-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-ltg","title":"look for missing context","description":"i ran qmd context list and thats only one bit of context, i had a lot more. i think the path matching isn't quite working right","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-12T16:42:57.324769-05:00","updated_at":"2025-12-12T17:16:27.835047-05:00","closed_at":"2025-12-12T17:16:27.835047-05:00"}
-{"id":"qmd-lwo","title":"Add short docid hash references and line numbers to output","description":"Add short 6-character hash IDs (docid #abc123) to search results and document retrieval, support these IDs in get/multi-get commands, and add --line-numbers option for line-numbered output.","status":"closed","priority":1,"issue_type":"epic","created_at":"2025-12-15T12:54:30.335556-05:00","updated_at":"2025-12-15T13:12:43.758557-05:00","closed_at":"2025-12-15T13:12:43.758557-05:00"}
-{"id":"qmd-mro","title":"Add lineNumbers boolean to MCP tools","description":"Add lineNumbers boolean parameter to MCP get, multi_get, and search tools. When true, return content with line numbers prefixed to each line.","status":"closed","priority":1,"issue_type":"task","created_at":"2025-12-15T12:54:30.581671-05:00","updated_at":"2025-12-15T13:12:38.8682-05:00","closed_at":"2025-12-15T13:12:38.8682-05:00","dependencies":[{"issue_id":"qmd-mro","depends_on_id":"qmd-lwo","type":"parent-child","created_at":"2025-12-15T12:54:51.970363-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-oxy","title":"Update context system to use YAML","description":"Remove path_contexts table. Implement context management in collections.ts. Update context add/list/rm commands to modify YAML file instead of database.","status":"closed","priority":1,"issue_type":"task","created_at":"2025-12-13T09:54:53.042839-05:00","updated_at":"2025-12-13T10:16:07.680285-05:00","closed_at":"2025-12-13T10:16:07.680285-05:00","dependencies":[{"issue_id":"qmd-oxy","depends_on_id":"qmd-3z9","type":"blocks","created_at":"2025-12-13T09:55:07.842488-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-p1h","title":"Create collection add|remove","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-10T10:57:00.717864-05:00","updated_at":"2025-12-12T16:12:00.557003-05:00","closed_at":"2025-12-12T16:12:00.557003-05:00"}
-{"id":"qmd-rck","title":"move the source files to src/*, clean up teh directory","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-12T16:40:19.198119-05:00","updated_at":"2025-12-12T17:12:22.502746-05:00","closed_at":"2025-12-12T17:12:22.502746-05:00"}
-{"id":"qmd-rhd","title":"Fix 'qmd status' output for new schema","description":"Update status to show collections by name, cleaner context display, virtual path examples.","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-12T15:29:54.020596-05:00","updated_at":"2025-12-12T16:13:28.08389-05:00","closed_at":"2025-12-12T16:13:28.08389-05:00","dependencies":[{"issue_id":"qmd-rhd","depends_on_id":"qmd-ama","type":"discovered-from","created_at":"2025-12-12T15:29:54.021095-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-s1y","title":"Update 'qmd add-context' for collection scoping","description":"Update add-context to work with collection-scoped contexts using new path_contexts schema.","notes":"Refactoring to:\n- qmd context add [path] \"text\" (defaults to current collection if in one)\n- qmd context list\n- qmd context rm \u003cpath\u003e\n- Support \"/\" for global/system context\n- Auto-detect collection from pwd","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-12T15:29:54.076582-05:00","updated_at":"2025-12-12T15:37:47.683263-05:00","closed_at":"2025-12-12T15:37:47.683263-05:00"}
-{"id":"qmd-thw","title":"Drop collections and path_contexts tables","description":"Remove collections and path_contexts tables from schema. Update initDb() to not create these tables. Only keep documents, content, and search indices.","status":"closed","priority":1,"issue_type":"task","created_at":"2025-12-13T09:54:53.247136-05:00","updated_at":"2025-12-13T10:27:06.54374-05:00","closed_at":"2025-12-13T10:27:06.54374-05:00","dependencies":[{"issue_id":"qmd-thw","depends_on_id":"qmd-9ua","type":"blocks","created_at":"2025-12-13T09:55:08.027101-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-u84","title":"Refactor store.ts to use collections.ts","description":"Replace all collection DB queries with collections.ts calls. Remove getCollectionById, getCollectionByName, listCollections DB functions. Use YAML config instead.","status":"closed","priority":1,"issue_type":"task","created_at":"2025-12-13T09:54:52.936782-05:00","updated_at":"2025-12-13T10:16:07.681047-05:00","closed_at":"2025-12-13T10:16:07.681047-05:00","dependencies":[{"issue_id":"qmd-u84","depends_on_id":"qmd-3z9","type":"blocks","created_at":"2025-12-13T09:55:07.720439-05:00","created_by":"daemon","metadata":"{}"},{"issue_id":"qmd-u84","depends_on_id":"qmd-8eu","type":"blocks","created_at":"2025-12-13T09:55:07.782051-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-vro","title":"Update 'qmd get' to support virtual paths","description":"Allow qmd get to accept both virtual paths (qmd://journals/...) and filesystem paths, plus fuzzy matching by filename.","status":"closed","priority":0,"issue_type":"task","created_at":"2025-12-12T15:29:53.963113-05:00","updated_at":"2025-12-12T15:47:29.178955-05:00","closed_at":"2025-12-12T15:47:29.178955-05:00","dependencies":[{"issue_id":"qmd-vro","depends_on_id":"qmd-ama","type":"discovered-from","created_at":"2025-12-12T15:29:53.963641-05:00","created_by":"daemon","metadata":"{}"}]}
-{"id":"qmd-x19","title":"Update 'qmd add-context' for collection-scoped contexts","description":"Update add-context to work with collections:\n- qmd add-context \u003ccollection\u003e/\u003cpath\u003e \"context description\"\n- Support both virtual and filesystem paths\n- Update to use new path_contexts schema","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-12T15:29:38.142575-05:00","updated_at":"2025-12-12T15:53:00.525001-05:00","closed_at":"2025-12-12T15:53:00.525001-05:00"}
-{"id":"qmd-x64","title":"for each collection, on update, check if there is a .git directory, if so write out the git status, add --pull as a qmd update --pull parameter which also executes git pull before reindexing\n","status":"closed","priority":2,"issue_type":"task","created_at":"2025-12-12T17:04:15.994054-05:00","updated_at":"2025-12-12T17:14:40.107181-05:00","closed_at":"2025-12-12T17:14:40.107181-05:00"}
-{"id":"qmd-yzj","title":"Add optional update: command support to collections YAML","description":"Collections can now specify an optional 'update:' key with a bash command that will be executed during 'qmd update' before indexing files. Runs in cwd, prints output, stops on error.","status":"closed","priority":2,"issue_type":"feature","created_at":"2025-12-13T11:16:32.527608-05:00","updated_at":"2025-12-13T11:17:57.500434-05:00","closed_at":"2025-12-13T11:17:57.500434-05:00"}
-{"id":"qmd-zin","title":"Improve qmd ls command to be more like ls -l with colors","description":"Make qmd ls more Unix-like:\n1. Format like ls -l with columns (permissions, size, date, name)\n2. Add colors (directories, files, etc.)\n3. Dim the qmd:// prefix to show it's optional\n4. Show file sizes in human-readable format\n5. Show modification times\n6. Consider adding -l flag for long format","status":"closed","priority":1,"issue_type":"task","created_at":"2025-12-13T09:44:48.703843-05:00","updated_at":"2025-12-13T09:48:22.298822-05:00","closed_at":"2025-12-13T09:48:22.298822-05:00"}

+ 0 - 4
.beads/metadata.json

@@ -1,4 +0,0 @@
-{
-  "database": "beads.db",
-  "jsonl_export": "issues.jsonl"
-}

+ 0 - 2
CLAUDE.md

@@ -1,7 +1,5 @@
 # QMD - Quick Markdown Search
 
-**Note**: This project uses [bd (beads)](https://github.com/steveyegge/beads) for issue tracking. Use `bd` commands instead of markdown TODOs. See AGENTS.md for workflow details.
-
 Use Bun instead of Node.js (`bun` not `node`, `bun install` not `npm install`).
 
 ## Commands

+ 3 - 3
README.md

@@ -112,7 +112,7 @@ Although the tool works perfectly fine when you just tell your agent to use it o
                         ▼                             ▼
                ┌────────────────┐            ┌────────────────┐
                │ Query Expansion│            │  Original Query│
-               │   (Qwen3-1.7B) │            │   (×2 weight)  │
+               │  (fine-tuned)  │            │   (×2 weight)  │
                └───────┬────────┘            └───────┬────────┘
                        │                             │
                        │ 2 alternative queries       │
@@ -213,7 +213,7 @@ QMD uses three local GGUF models (auto-downloaded on first use):
 |-------|---------|------|
 | `embeddinggemma-300M-Q8_0` | Vector embeddings | ~300MB |
 | `qwen3-reranker-0.6b-q8_0` | Re-ranking | ~640MB |
-| `Qwen3-1.7B-Q8_0` | Query expansion | ~2.2GB |
+| `qmd-query-expansion-1.7B-q4_k_m` | Query expansion (fine-tuned) | ~1.1GB |
 
 Models are downloaded from HuggingFace and cached in `~/.cache/qmd/models/`.
 
@@ -515,7 +515,7 @@ Models are configured in `src/llm.ts` as HuggingFace URIs:
 ```typescript
 const DEFAULT_EMBED_MODEL = "hf:ggml-org/embeddinggemma-300M-GGUF/embeddinggemma-300M-Q8_0.gguf";
 const DEFAULT_RERANK_MODEL = "hf:ggml-org/Qwen3-Reranker-0.6B-Q8_0-GGUF/qwen3-reranker-0.6b-q8_0.gguf";
-const DEFAULT_GENERATE_MODEL = "hf:ggml-org/Qwen3-1.7B-GGUF/Qwen3-1.7B-Q8_0.gguf";
+const DEFAULT_GENERATE_MODEL = "hf:tobil/qmd-query-expansion-1.7B-gguf/qmd-query-expansion-1.7B-q4_k_m.gguf";
 ```
 
 ### EmbeddingGemma Prompt Format

+ 19 - 0
finetune/.gitignore

@@ -0,0 +1,19 @@
+# Model checkpoints (stored on HuggingFace Hub)
+qmd-query-expansion-*/
+*.pt
+*.safetensors
+
+# Processed data files (regenerated by prepare_data.py)
+data/train/
+data/train_v2/train.jsonl
+data/train_v2/train_chat.jsonl
+data/train_v2/val.jsonl
+data/qmd_expansion_cleaned.jsonl
+data/quality_report.txt
+
+# Eval results
+evals/results_*.jsonl
+
+# Python cache
+__pycache__/
+*.pyc

+ 286 - 0
finetune/README.md

@@ -0,0 +1,286 @@
+# QMD Query Expansion Fine-Tuning
+
+Train small language models to expand search queries for [QMD](https://github.com/tobi/qmd)'s hybrid retrieval pipeline.
+
+## What This Does
+
+Given a raw search query like `"auth config"`, the trained model produces structured expansions:
+
+```
+lex: authentication configuration
+lex: auth settings setup
+vec: how to configure authentication settings
+vec: authentication configuration options
+hyde: Authentication can be configured by setting the AUTH_SECRET environment variable.
+```
+
+These feed into QMD's three search backends:
+- **`lex:`** lines go to BM25 full-text search (short, keyword-focused)
+- **`vec:`** lines go to vector similarity search (natural language phrases)
+- **`hyde:`** is a hypothetical document passage for embedding-based retrieval ([HyDE](https://arxiv.org/abs/2212.10496) technique)
+
+## Quick Start
+
+### Cloud training via HuggingFace Jobs (no GPU needed)
+
+```bash
+# 1. SFT: teach the model the output format (~45 min on A10G, ~$1.50)
+hf jobs uv run --flavor a10g-large --secrets HF_TOKEN --timeout 2h jobs/sft.py
+
+# 2. GRPO: RL refinement on top of SFT (~20 min on A10G, ~$0.50)
+hf jobs uv run --flavor a10g-large --secrets HF_TOKEN --timeout 4h jobs/grpo.py
+
+# 3. Evaluate against test queries (needs local GPU or use eval job)
+uv run eval.py --model tobil/qmd-query-expansion-1.7B-grpo \
+               --sft-model tobil/qmd-query-expansion-1.7B-sft
+
+# 4. Convert to GGUF for local deployment (Ollama, llama.cpp)
+uv run convert_gguf.py --size 1.7B
+```
+
+### Local training (if you have a GPU)
+
+```bash
+uv run train.py sft  --config configs/sft.yaml
+uv run train.py grpo --config configs/grpo.yaml
+```
+
+### Monitoring HF Jobs
+
+```bash
+hf jobs ps                           # list running jobs
+hf jobs inspect <job-id>             # check status
+hf jobs logs <job-id>                # stream logs
+hf jobs cancel <job-id>              # cancel a job
+```
+
+## Prompt Format
+
+All tools use the same prompt — **Qwen3 chat template with `/no_think`**:
+
+```
+<|im_start|>user
+/no_think Expand this search query: {query}<|im_end|>
+<|im_start|>assistant
+```
+
+The `/no_think` directive suppresses Qwen3's chain-of-thought mode, producing
+direct `lex:/vec:/hyde:` output without `<think>` blocks.
+
+## File Structure
+
+```
+finetune/
+├── reward.py          # Scoring/reward function (single source of truth)
+├── train.py           # Unified SFT + GRPO training (two subcommands)
+├── eval.py            # Generate expansions and score them
+├── convert_gguf.py    # GGUF conversion for Ollama/llama.cpp
+├── jobs/
+│   ├── sft.py         # Self-contained SFT for HuggingFace Jobs
+│   ├── grpo.py        # Self-contained GRPO for HuggingFace Jobs
+│   ├── eval.py        # Self-contained eval for HuggingFace Jobs
+│   ├── eval_common.py # Shared eval utilities
+│   └── quantize.py    # GGUF quantization for HuggingFace Jobs
+├── configs/
+│   ├── sft.yaml       # SFT hyperparameters for Qwen3-1.7B
+│   └── grpo.yaml      # GRPO hyperparameters for Qwen3-1.7B
+├── evals/
+│   └── queries.txt    # 31 test queries across 8 categories
+├── data/
+│   └── qmd_expansion_v2.jsonl  # Source training data (1,000 high-quality examples)
+├── dataset/
+│   ├── generate_data.py         # Generate data via Claude API
+│   ├── generate_data_offline.py # Generate from existing HF dataset
+│   ├── prepare_data.py          # Format for Qwen3 chat template
+│   └── clean_data.py            # Detect technical term misinterpretations
+├── SCORING.md         # Detailed scoring rubric reference
+└── README.md          # This file
+```
+
+## Training Pipeline
+
+### Stage 1: SFT (Supervised Fine-Tuning)
+
+Teaches the model the `lex:/vec:/hyde:` output format from labeled examples.
+
+| Parameter | Value |
+|-----------|-------|
+| Base model | `Qwen/Qwen3-1.7B` |
+| Method | LoRA (rank 16, alpha 32) |
+| Target modules | All projection layers (q/k/v/o/gate/up/down) |
+| Dataset | ~2,290 examples (train split) |
+| Effective batch size | 16 (4 × 4 gradient accumulation) |
+| Epochs | 5 |
+| Learning rate | 2e-4 (cosine schedule) |
+
+```bash
+uv run train.py sft --config configs/sft.yaml
+uv run train.py sft --config configs/sft.yaml --dry-run  # preview config
+```
+
+### Stage 2: GRPO (Group Relative Policy Optimization)
+
+Reinforcement learning on top of the merged SFT weights. The model generates
+multiple expansions per query, they are scored by the reward function, and the
+model is updated to prefer higher-scoring outputs.
+
+| Parameter | Value |
+|-----------|-------|
+| Base | Merged SFT checkpoint |
+| Method | LoRA (rank 4, alpha 8) — smaller for RL stability |
+| Target modules | q_proj, v_proj only |
+| Reward | `reward.py` (rule-based, 5 dimensions) |
+| KL beta | 0.04 — prevents drift from SFT checkpoint |
+| Generations per prompt | 4 |
+| Max steps | 200 |
+| Learning rate | 5e-7 |
+
+**Important:** `beta > 0` is critical. With `beta=0` the model experiences
+catastrophic drift and scores drop to 0%.
+
+```bash
+uv run train.py grpo --config configs/grpo.yaml
+uv run train.py grpo --config configs/grpo.yaml --dry-run  # test reward function
+```
+
+## Evaluation
+
+`eval.py` generates expansions from a model and scores them against test queries:
+
+```bash
+# Evaluate an SFT model
+uv run eval.py --model tobil/qmd-query-expansion-1.7B-sft
+
+# Evaluate a GRPO model (needs SFT adapter merged first)
+uv run eval.py --model tobil/qmd-query-expansion-1.7B-grpo \
+               --sft-model tobil/qmd-query-expansion-1.7B-sft
+
+# Verbose output with deduction details
+uv run eval.py --model tobil/qmd-query-expansion-1.7B-sft -v
+
+# Save detailed scores to JSON
+uv run eval.py --model tobil/qmd-query-expansion-1.7B-sft -o scores.json
+
+# Score an existing JSONL file (backwards compat with old run.py output)
+uv run eval.py --score-only evals/results_old.jsonl
+```
+
+## Reward Function
+
+`reward.py` is the single source of truth for scoring. It is used both as the
+GRPO reward signal during training and for evaluation.
+
+Five scoring dimensions (max 120 without hyde, 140 with):
+
+| Dimension | Points | What It Measures |
+|-----------|--------|------------------|
+| **Format** | 0-30 | Has lex/vec lines, no invalid lines |
+| **Diversity** | 0-30 | Multiple expansion types, diverse content, no query echoes |
+| **HyDE** | 0-20 | Present, 50-200 chars, single line, not repetitive |
+| **Quality** | 0-20 | Lex shorter than vec, natural language, preserves key terms |
+| **Entity** | -45 to +20 | Named entities preserved in lex and vec lines |
+| **Think bonus** | 0-20 | Reward for NOT using `<think>` mode |
+
+**Hard failures** (instant 0.0):
+- Chat template leakage (`<|im_start|>`, `<|im_end|>`, etc.)
+- Any line without a valid `lex:`, `vec:`, or `hyde:` prefix
+
+```bash
+# Self-test the reward function
+uv run reward.py
+```
+
+## GGUF Conversion
+
+Merges base + SFT + GRPO adapters into a single model and produces
+quantized GGUF files for deployment:
+
+```bash
+# Use preset for 1.7B
+uv run convert_gguf.py --size 1.7B
+
+# Use preset for 4B
+uv run convert_gguf.py --size 4B
+
+# Custom models
+uv run convert_gguf.py --base Qwen/Qwen3-1.7B \
+                       --sft tobil/qmd-query-expansion-1.7B-sft \
+                       --grpo tobil/qmd-query-expansion-1.7B-grpo \
+                       --output tobil/qmd-query-expansion-1.7B-gguf
+```
+
+### Using with Ollama
+
+```bash
+huggingface-cli download tobil/qmd-query-expansion-1.7B-gguf \
+    qmd-query-expansion-1.7B-q4_k_m.gguf --local-dir .
+
+echo 'FROM ./qmd-query-expansion-1.7B-q4_k_m.gguf' > Modelfile
+ollama create qmd-expand -f Modelfile
+ollama run qmd-expand
+```
+
+## Data Pipeline
+
+The training data (1,000 examples in `data/qmd_expansion_v2.jsonl`) was generated
+from two sources and cleaned for quality. To regenerate:
+
+```bash
+# Generate from existing HuggingFace dataset (bulk, no API needed)
+uv run dataset/generate_data_offline.py
+
+# Generate via Claude API (higher quality, needs ANTHROPIC_API_KEY)
+uv run dataset/generate_data.py --count 100
+
+# Detect and fix technical term misinterpretations
+uv run dataset/clean_data.py
+
+# Format for Qwen3 chat template, add short-query augmentation, split train/val
+uv run dataset/prepare_data.py
+```
+
+## Architecture Notes
+
+The two-stage training approach (SFT → GRPO) is standard for structured-output models:
+
+1. **SFT** establishes format compliance and basic query understanding. It uses
+   a large LoRA (rank 16, all projection layers) because it needs to learn a
+   new output format from scratch.
+
+2. **GRPO** refines quality within the learned format. It uses a small LoRA
+   (rank 4, q/v only) and KL regularization to make incremental improvements
+   without losing what SFT taught.
+
+The reward function is entirely rule-based (no LLM judge) which makes it fast,
+deterministic, and suitable as an RL signal. See `SCORING.md` for the full rubric.
+
+## Training Results (Qwen3-1.7B, v2)
+
+### SFT
+
+| Metric | Value |
+|--------|-------|
+| Final train loss | 0.472 |
+| Final eval loss | 0.304 |
+| Token accuracy (train) | 97.4% |
+| Token accuracy (eval) | 93.8% |
+| Epochs | 5 |
+| Hardware | A10G (24 GB VRAM) |
+
+### GRPO
+
+| Metric | Value |
+|--------|-------|
+| Mean reward | 0.757 |
+| Final loss | 0.0005 |
+| KL divergence | 0.00048 |
+| Mean completion length | ~58 tokens |
+| Training time | ~19 min (200 steps) |
+| Hardware | A10G (24 GB VRAM) |
+
+### Evaluation Scores
+
+| Model | Average Score | Excellent (30) |
+|-------|--------------|-----------------|
+| SFT | 92.0% | 30/30 |
+| GRPO | 91.7% | 30/30 |

+ 286 - 0
finetune/SCORING.md

@@ -0,0 +1,286 @@
+# QMD Query Expansion Scoring
+
+## Goal
+
+Transform a random typed query into a great set of retrieval-optimized expansions.
+
+**Input:** `"auth config"`
+**Output:**
+```
+lex: authentication configuration
+lex: auth settings setup
+vec: how to configure authentication settings
+vec: authentication configuration options
+hyde: Authentication can be configured by setting the AUTH_SECRET environment variable and enabling the auth middleware in your application's config file.
+```
+
+## Output Format
+
+| Prefix | Purpose | Required | Count |
+|--------|---------|----------|-------|
+| `lex:` | BM25 keyword variations (shorter, keyword-focused) | Yes | 1-3 |
+| `vec:` | Semantic reformulations (natural language) | Yes | 1-3 |
+| `hyde:` | Hypothetical document passage | Optional | 0-1 |
+
+## Scoring Criteria
+
+### 1. Format Compliance (0-30 points)
+
+| Criterion | Points | Deduction |
+|-----------|--------|-----------|
+| Has at least one `lex:` line | +10 | -10 if missing |
+| Has at least one `vec:` line | +10 | -10 if missing |
+| All lines have valid prefix (`lex:`, `vec:`, `hyde:`) | +10 | -5 per invalid line |
+| No garbage/prose outside of prefixed lines | - | -10 if present |
+
+### 2. Diversity & Coverage (0-30 points)
+
+| Criterion | Points | Deduction |
+|-----------|--------|-----------|
+| 2+ different types present (lex + vec) | +10 | -10 if only one type |
+| 2+ total expansions | +5 | -5 if only one |
+| Multiple lex: lines are diverse (edit distance > 3) | +5 | -2 per duplicate pair |
+| Multiple vec: lines are diverse (edit distance > 5) | +5 | -2 per duplicate pair |
+| lex/vec not identical to original query | +5 | -5 per line that equals query |
+
+### 3. Hyde Quality (0-20 points, optional bonus)
+
+| Criterion | Points | Deduction |
+|-----------|--------|-----------|
+| Hyde present and well-formed | +5 | - |
+| Hyde is concise (50-200 chars) | +5 | -3 if too short, -5 if too long |
+| Hyde has no newlines | +5 | -5 if contains newlines |
+| Hyde has no excessive repetition | +5 | -3 if word repeats 3+ times |
+
+### 4. Content Quality (0-20 points)
+
+| Criterion | Points | Deduction |
+|-----------|--------|-----------|
+| Base relevance | +5 | Subjective |
+| Lex lines preserve key terms from query | +5 | -5 if lex is generic |
+| Lex lines are keyword-focused (shorter) | +5 | -2 if lex is longer than vec |
+| Vec lines are natural language (complete phrases) | +5 | -2 if vec is just keywords |
+
+### 5. Named Entity Preservation (0-20 points, CRITICAL)
+
+Named entities are proper nouns, brand names, technical terms, and acronyms that MUST appear in lex queries. This prevents generic expansions that lose the specific topic.
+
+| Criterion | Points | Deduction |
+|-----------|--------|-----------|
+| All lex lines contain at least one entity | +15 | - |
+| Some lex lines contain entities | +5 | - |
+| NO lex lines contain entities | - | **-30 HEAVY PENALTY** |
+| Generic filler phrases in lex | - | -15 per phrase |
+| Entities also in vec lines | +5 | - |
+
+**Named Entity Detection:**
+- All-caps acronyms: `TDS`, `API`, `GPU`, `AWS`
+- Capitalized proper nouns: `React`, `Docker`, `Kubernetes`
+- Technical terms: `node.js`, `C++`, `.NET`
+- CamelCase: `JavaScript`, `TypeScript`
+- Compound names: `TDS motorsports` → both words are entities
+
+**Generic Filler Phrases (BANNED in lex):**
+- "find information about"
+- "search for", "look up"
+- "get information", "learn about"
+- "details about", "guide to"
+
+**Examples:**
+
+| Query | Bad Lex (Score: 0.30) | Good Lex (Score: 1.00) |
+|-------|----------------------|------------------------|
+| `who is TDS motorsports` | `lex: find information about` | `lex: TDS motorsports history` |
+| | `lex: company details` | `lex: TDS motorsports founders` |
+| `how to use React hooks` | `lex: programming tutorial` | `lex: React hooks tutorial` |
+| | `lex: how to code` | `lex: useEffect useState hooks` |
+
+**Key Rule**: If a query mentions a specific entity (brand, product, technology), EVERY lex line should include that entity or a direct variation of it.
+
+## Score Calculation
+
+```
+Total Score = Format + Diversity + Hyde + Quality + Entity
+Max Score = 120 (100 without hyde)
+Normalized = Total / Max (0.0 - 1.0)
+```
+
+**Rating:**
+- 0.80-1.00: Excellent
+- 0.60-0.79: Good
+- 0.40-0.59: Acceptable
+- 0.20-0.39: Poor
+- 0.00-0.19: Failed
+
+**Note:** Entity score can go negative, heavily penalizing outputs that miss named entities.
+
+## Examples
+
+### Good Example (Score: 85/100)
+
+**Query:** `"react hooks"`
+
+**Output:**
+```
+lex: react hooks tutorial
+lex: usestate useeffect
+vec: how to use react hooks in functional components
+vec: react hooks best practices guide
+hyde: React Hooks allow you to use state and lifecycle features in functional components without writing a class.
+```
+
+**Scoring:**
+- Format: 30/30 (all prefixes valid)
+- Diversity: 28/30 (2 types, 4 expansions, good diversity)
+- Hyde: 17/20 (present, good length, no repetition)
+- Quality: 10/20 (relevant, good structure)
+
+### Bad Example (Score: 15/100)
+
+**Query:** `"auth"`
+
+**Output:**
+```
+auth is an important concept that relates to authentication.
+The answer should be in Chinese.
+The answer should be in Chinese.
+```
+
+**Scoring:**
+- Format: 0/30 (no valid prefixes)
+- Diversity: 0/30 (no valid expansions)
+- Hyde: 0/20 (N/A)
+- Quality: 15/20 (some relevance but wrong format)
+
+### Medium Example (Score: 55/100)
+
+**Query:** `"docker networking"`
+
+**Output:**
+```
+lex: docker networking
+vec: docker networking
+hyde: Docker networking is an important concept. Docker networking is used for container communication. Docker networking configuration is essential.
+```
+
+**Scoring:**
+- Format: 30/30 (valid prefixes)
+- Diversity: 10/30 (lex=vec=query, no diversity)
+- Hyde: 5/20 (too repetitive - "docker networking" 3x)
+- Quality: 10/20 (relevant but low effort)
+
+## Heuristics
+
+### Repetition Detection
+
+```python
+def word_repetition_score(text):
+    words = text.lower().split()
+    counts = Counter(words)
+    # Deduct for words appearing 3+ times (excluding stopwords)
+    stopwords = {'the', 'a', 'an', 'is', 'are', 'to', 'for', 'of', 'in', 'and', 'or'}
+    repeated = sum(1 for w, c in counts.items() if c >= 3 and w not in stopwords)
+    return max(0, 5 - repeated * 2)
+```
+
+### Diversity Check (Simple)
+
+```python
+def is_diverse(a, b, min_distance=3):
+    """Check if two strings are sufficiently different."""
+    a, b = a.lower().strip(), b.lower().strip()
+    if a == b:
+        return False
+    # Simple: check if one is not a substring of the other
+    if a in b or b in a:
+        return False
+    # Check edit distance (simplified)
+    return len(set(a.split()) ^ set(b.split())) >= min_distance
+```
+
+### Query Echo Detection
+
+```python
+def echoes_query(expansion, query):
+    """Check if expansion is just echoing the query."""
+    exp = expansion.lower().strip()
+    q = query.lower().strip()
+    return exp == q or exp in q or q in exp
+```
+
+### Named Entity Extraction
+
+```python
+KEY_TERM_STOPWORDS = {'what', 'is', 'how', 'to', 'the', 'a', 'an', 'in', 'on', 'for', 'of',
+                      'and', 'or', 'with', 'my', 'your', 'do', 'does', 'can', 'i', 'me', 'we',
+                      'who', 'where', 'when', 'why', 'which', 'find', 'get', 'show', 'tell'}
+
+def extract_named_entities(query: str) -> set:
+    """Extract named entities using simple heuristics."""
+    entities = set()
+    words = query.split()
+    prev_was_entity = False
+
+    for i, word in enumerate(words):
+        clean = word.strip('.,!?:;()[]"\'')
+        if not clean:
+            prev_was_entity = False
+            continue
+
+        is_entity = False
+
+        # All-caps acronyms: TDS, API, GPU
+        if clean.isupper() and len(clean) >= 2:
+            entities.add(clean.lower())
+            is_entity = True
+        # Capitalized proper nouns (not first word)
+        elif i > 0 and clean[0].isupper() and clean.lower() not in KEY_TERM_STOPWORDS:
+            entities.add(clean.lower())
+            is_entity = True
+        # Technical terms: node.js, C++
+        elif any(c in clean for c in '.+-#@') and len(clean) >= 2:
+            entities.add(clean.lower())
+            is_entity = True
+        # CamelCase: JavaScript
+        elif len(clean) > 1 and any(c.isupper() for c in clean[1:]) and clean[0].isupper():
+            entities.add(clean.lower())
+            is_entity = True
+        # Word following an entity (compound names: TDS motorsports)
+        elif prev_was_entity and clean.lower() not in KEY_TERM_STOPWORDS:
+            entities.add(clean.lower())
+            is_entity = True
+
+        prev_was_entity = is_entity
+
+    return entities
+```
+
+### Generic Phrase Detection
+
+```python
+GENERIC_LEX_PHRASES = {
+    'find information about', 'search for', 'look up', 'get information',
+    'learn about', 'information on', 'details about', 'find out about',
+    'what is', 'how to', 'guide to', 'help with'
+}
+
+def lex_is_generic(lex_line: str) -> bool:
+    """Check if lex line is a useless generic filler."""
+    lex_lower = lex_line.lower().strip()
+    for phrase in GENERIC_LEX_PHRASES:
+        if phrase in lex_lower:
+            # Check if there's specific content beyond the generic phrase
+            remaining = lex_lower
+            for word in phrase.split():
+                remaining = remaining.replace(word, '', 1).strip()
+            if len(remaining) < 3:  # Nothing specific left
+                return True
+    return False
+```
+
+## Training Data Requirements
+
+1. **EOM tokens**: Ensure training examples end with proper end-of-message tokens
+2. **Diverse examples**: Include varied query types (short, long, technical, casual)
+3. **Quality hyde**: Hyde passages should be informative, not template-y
+4. **No repetition**: Avoid "This is important. This is very important." patterns

+ 43 - 0
finetune/configs/grpo.yaml

@@ -0,0 +1,43 @@
+# GRPO Training Config for QMD Query Expansion
+# Target: Qwen3-1.7B, trained on top of merged SFT weights
+#
+# Usage: uv run train.py grpo --config configs/grpo.yaml
+#
+# The reward function (reward.py) scores expansions on format compliance,
+# diversity, hyde quality, content quality, and named entity preservation.
+# beta > 0 is critical to prevent drift from the SFT checkpoint.
+
+model:
+  base: "Qwen/Qwen3-1.7B"
+  sft: "tobil/qmd-query-expansion-1.7B-sft"
+  output: "tobil/qmd-query-expansion-1.7B-grpo"
+
+dataset:
+  name: "tobil/qmd-query-expansion-train-v2"
+  prompt_field: "messages"
+  max_samples: 1000
+
+training:
+  epochs: 1
+  batch_size: 2
+  gradient_accumulation_steps: 8
+  learning_rate: 5e-7
+  max_grad_norm: 0.5
+  max_steps: 200
+
+grpo:
+  num_generations: 4
+  max_completion_length: 200
+  beta: 0.04  # KL regularization - prevents drift from SFT checkpoint
+
+lora:
+  rank: 4
+  alpha: 8
+  dropout: 0.05
+  target_modules:
+    - "q_proj"
+    - "v_proj"
+
+tracking:
+  project: "qmd-query-expansion"
+  run_name: "grpo-1.7B"

+ 40 - 0
finetune/configs/sft.yaml

@@ -0,0 +1,40 @@
+# SFT Training Config for QMD Query Expansion
+# Target: Qwen3-1.7B with LoRA
+#
+# Usage: uv run train.py sft --config configs/sft.yaml
+
+model:
+  base: "Qwen/Qwen3-1.7B"
+  output: "tobil/qmd-query-expansion-1.7B-sft"
+
+dataset:
+  name: "tobil/qmd-query-expansion-train-v2"
+  text_field: "text"
+  split: "train"
+  eval_split: 0.1
+
+training:
+  epochs: 5
+  batch_size: 4
+  gradient_accumulation_steps: 4
+  learning_rate: 2e-4
+  max_length: 512
+  warmup_ratio: 0.03
+  lr_scheduler: "cosine"
+
+lora:
+  rank: 16
+  alpha: 32
+  dropout: 0.0
+  target_modules:
+    - "q_proj"
+    - "k_proj"
+    - "v_proj"
+    - "o_proj"
+    - "gate_proj"
+    - "up_proj"
+    - "down_proj"
+
+tracking:
+  project: "qmd-query-expansion"
+  run_name: "sft-1.7B"

+ 221 - 0
finetune/convert_gguf.py

@@ -0,0 +1,221 @@
+#!/usr/bin/env python3
+# /// script
+# requires-python = ">=3.10"
+# dependencies = [
+#     "transformers>=4.36.0",
+#     "peft>=0.7.0",
+#     "torch>=2.0.0",
+#     "accelerate>=0.24.0",
+#     "huggingface_hub>=0.20.0",
+#     "sentencepiece>=0.1.99",
+#     "protobuf>=3.20.0",
+#     "numpy",
+#     "gguf",
+# ]
+# ///
+"""
+Convert QMD query expansion model to GGUF format.
+
+Loads the base model, merges SFT and GRPO adapters, then converts to
+GGUF with multiple quantizations for use with Ollama/llama.cpp/LM Studio.
+
+Usage:
+    uv run convert_gguf.py --size 1.7B
+    uv run convert_gguf.py --size 4B --skip-quantize
+    uv run convert_gguf.py --base Qwen/Qwen3-1.7B \
+                           --sft tobil/qmd-query-expansion-1.7B-sft \
+                           --grpo tobil/qmd-query-expansion-1.7B-grpo \
+                           --output tobil/qmd-query-expansion-1.7B-gguf
+"""
+
+import argparse
+import os
+import subprocess
+import sys
+
+import torch
+from huggingface_hub import HfApi, login
+from peft import PeftModel
+from transformers import AutoModelForCausalLM, AutoTokenizer
+
+# Preset configurations for each model size
+PRESETS = {
+    "1.7B": {
+        "base": "Qwen/Qwen3-1.7B",
+        "sft": "tobil/qmd-query-expansion-1.7B-sft",
+        "grpo": "tobil/qmd-query-expansion-1.7B-grpo",
+        "output": "tobil/qmd-query-expansion-1.7B-gguf",
+        "ollama_name": "qmd-expand",
+    },
+    "4B": {
+        "base": "Qwen/Qwen3-4B",
+        "sft": "tobil/qmd-query-expansion-4B-sft",
+        "grpo": "tobil/qmd-query-expansion-4B-grpo",
+        "output": "tobil/qmd-query-expansion-4B-gguf",
+        "ollama_name": "qmd-expand-4b",
+    },
+}
+
+
+def run_cmd(cmd, description):
+    """Run a shell command with error handling."""
+    print(f"  {description}...")
+    try:
+        subprocess.run(cmd, check=True, capture_output=True, text=True)
+        return True
+    except subprocess.CalledProcessError as e:
+        print(f"  FAILED: {' '.join(cmd)}")
+        if e.stderr:
+            print(f"  {e.stderr[:500]}")
+        return False
+    except FileNotFoundError:
+        print(f"  Command not found: {cmd[0]}")
+        return False
+
+
+def main():
+    parser = argparse.ArgumentParser(description="Convert QMD model to GGUF")
+    parser.add_argument("--size", choices=PRESETS.keys(), help="Use preset config for model size")
+    parser.add_argument("--base", help="Base model (overrides preset)")
+    parser.add_argument("--sft", help="SFT adapter (overrides preset)")
+    parser.add_argument("--grpo", help="GRPO adapter (overrides preset)")
+    parser.add_argument("--output", help="Output HF repo (overrides preset)")
+    parser.add_argument("--skip-quantize", action="store_true", help="Only produce FP16 GGUF")
+    parser.add_argument("--no-upload", action="store_true", help="Don't upload to HF Hub")
+    args = parser.parse_args()
+
+    # Resolve config
+    if args.size:
+        preset = PRESETS[args.size]
+        base_model = args.base or preset["base"]
+        sft_model = args.sft or preset["sft"]
+        grpo_model = args.grpo or preset["grpo"]
+        output_repo = args.output or preset["output"]
+    elif args.base and args.sft and args.grpo and args.output:
+        base_model = args.base
+        sft_model = args.sft
+        grpo_model = args.grpo
+        output_repo = args.output
+    else:
+        parser.error("Either --size or all of --base/--sft/--grpo/--output are required")
+
+    model_name = output_repo.split("/")[-1].replace("-gguf", "")
+    print(f"QMD GGUF Conversion: {model_name}")
+    print("=" * 60)
+
+    # Install build tools (for Colab/cloud environments)
+    print("\nInstalling build dependencies...")
+    subprocess.run(["apt-get", "update", "-qq"], capture_output=True)
+    subprocess.run(["apt-get", "install", "-y", "-qq", "build-essential", "cmake", "git"], capture_output=True)
+
+    # Login
+    hf_token = os.environ.get("HF_TOKEN")
+    if hf_token:
+        print("Logging in to HuggingFace...")
+        login(token=hf_token)
+
+    # Step 1: Load and merge
+    print(f"\nStep 1: Loading base model {base_model}...")
+    model = AutoModelForCausalLM.from_pretrained(
+        base_model, torch_dtype=torch.bfloat16, device_map="auto", trust_remote_code=True,
+    )
+
+    print(f"Step 2: Merging SFT adapter {sft_model}...")
+    model = PeftModel.from_pretrained(model, sft_model)
+    model = model.merge_and_unload()
+
+    print(f"Step 3: Merging GRPO adapter {grpo_model}...")
+    model = PeftModel.from_pretrained(model, grpo_model)
+    model = model.merge_and_unload()
+
+    tokenizer = AutoTokenizer.from_pretrained(base_model, trust_remote_code=True)
+
+    # Step 2: Save merged model
+    merged_dir = "/tmp/merged_model"
+    print(f"\nStep 4: Saving merged model to {merged_dir}...")
+    model.save_pretrained(merged_dir, safe_serialization=True)
+    tokenizer.save_pretrained(merged_dir)
+
+    # Step 3: Setup llama.cpp
+    print("\nStep 5: Setting up llama.cpp...")
+    if not os.path.exists("/tmp/llama.cpp"):
+        run_cmd(["git", "clone", "--depth", "1", "https://github.com/ggerganov/llama.cpp.git", "/tmp/llama.cpp"],
+                "Cloning llama.cpp")
+    subprocess.run([sys.executable, "-m", "pip", "install", "-q", "-r", "/tmp/llama.cpp/requirements.txt"],
+                   capture_output=True)
+
+    # Step 4: Convert to FP16 GGUF
+    gguf_dir = "/tmp/gguf_output"
+    os.makedirs(gguf_dir, exist_ok=True)
+    gguf_file = f"{gguf_dir}/{model_name}-f16.gguf"
+
+    print(f"\nStep 6: Converting to FP16 GGUF...")
+    if not run_cmd([sys.executable, "/tmp/llama.cpp/convert_hf_to_gguf.py",
+                    merged_dir, "--outfile", gguf_file, "--outtype", "f16"],
+                   "Converting"):
+        sys.exit(1)
+
+    size_mb = os.path.getsize(gguf_file) / (1024 * 1024)
+    print(f"  FP16: {size_mb:.1f} MB")
+
+    # Step 5: Quantize
+    quantized_files = []
+    if not args.skip_quantize:
+        print("\nStep 7: Building quantize tool...")
+        os.makedirs("/tmp/llama.cpp/build", exist_ok=True)
+        run_cmd(["cmake", "-B", "/tmp/llama.cpp/build", "-S", "/tmp/llama.cpp", "-DGGML_CUDA=OFF"],
+                "CMake configure")
+        run_cmd(["cmake", "--build", "/tmp/llama.cpp/build", "--target", "llama-quantize", "-j", "4"],
+                "Building llama-quantize")
+        quantize_bin = "/tmp/llama.cpp/build/bin/llama-quantize"
+
+        print("\nStep 8: Quantizing...")
+        for quant_type, desc in [("Q4_K_M", "4-bit"), ("Q5_K_M", "5-bit"), ("Q8_0", "8-bit")]:
+            qfile = f"{gguf_dir}/{model_name}-{quant_type.lower()}.gguf"
+            if run_cmd([quantize_bin, gguf_file, qfile, quant_type], f"{quant_type} ({desc})"):
+                qsize = os.path.getsize(qfile) / (1024 * 1024)
+                print(f"  {quant_type}: {qsize:.1f} MB")
+                quantized_files.append((qfile, quant_type))
+
+    # Step 6: Upload
+    if not args.no_upload:
+        print(f"\nStep 9: Uploading to {output_repo}...")
+        api = HfApi()
+        api.create_repo(repo_id=output_repo, repo_type="model", exist_ok=True)
+
+        api.upload_file(path_or_fileobj=gguf_file,
+                        path_in_repo=f"{model_name}-f16.gguf", repo_id=output_repo)
+        for qfile, qtype in quantized_files:
+            api.upload_file(path_or_fileobj=qfile,
+                            path_in_repo=f"{model_name}-{qtype.lower()}.gguf", repo_id=output_repo)
+
+        # Upload README
+        readme = f"""---
+base_model: {base_model}
+tags: [gguf, llama.cpp, quantized, query-expansion, qmd]
+---
+# {model_name} (GGUF)
+
+GGUF conversion of the QMD Query Expansion model.
+
+## Details
+- **Base:** {base_model}
+- **SFT:** {sft_model}
+- **GRPO:** {grpo_model}
+- **Task:** Query expansion (lex/vec/hyde format)
+
+## Prompt Format
+```
+<|im_start|>user
+/no_think Expand this search query: your query here<|im_end|>
+<|im_start|>assistant
+```
+"""
+        api.upload_file(path_or_fileobj=readme.encode(),
+                        path_in_repo="README.md", repo_id=output_repo)
+
+    print(f"\nDone! Repository: https://huggingface.co/{output_repo}")
+
+
+if __name__ == "__main__":
+    main()

+ 1000 - 0
finetune/data/qmd_expansion_v2.jsonl

@@ -0,0 +1,1000 @@
+{"input": "how tourism affects local cultures", "output": "lex: tourism impact local culture\nlex: cultural effects mass tourism communities\nlex: tourist influence indigenous traditions\nvec: what are the positive and negative effects of tourism on local cultural traditions and communities\nvec: how does mass tourism change the customs, language, and daily life of host communities\nhyde: Tourism often leads to cultural commodification, where traditional dances, crafts, and rituals are adapted to meet tourist expectations. In Bali, temple ceremonies have been shortened and repackaged as entertainment, diluting their spiritual significance for locals."}
+{"input": "how to ferment foods at home", "output": "lex: home fermentation vegetables guide\nlex: lacto fermentation salt brine method\nlex: homemade sauerkraut kimchi ferment\nvec: what is the step-by-step process for fermenting vegetables at home using salt brine\nvec: how do you safely ferment foods like sauerkraut and kimchi in your kitchen\nhyde: To ferment vegetables at home, submerge them in a 2-3% salt brine in a mason jar. Keep at room temperature (65-75°F) for 3-7 days, burping the jar daily to release CO2. Taste after day 3 and refrigerate once the tanginess is to your liking."}
+{"input": "how to mix modern and vintage decor", "output": "lex: modern vintage decor mix interior design\nlex: combining antique furniture contemporary style\nvec: how do you blend vintage furniture and antique pieces with modern interior design elements\nvec: what are effective ways to combine mid-century or antique decor with contemporary minimalist style\nhyde: Pair a vintage wooden dresser with a sleek modern mirror. Use neutral wall colors as a backdrop and let one statement antique piece anchor each room. Mix textures—a velvet mid-century sofa with clean-lined metal side tables creates visual contrast without clashing."}
+{"input": "how to perform a scientific experiment", "output": "lex: scientific experiment steps procedure\nlex: scientific method hypothesis variables control\nlex: lab experiment design methodology\nvec: what are the steps to design and carry out a controlled scientific experiment\nvec: how do you formulate a hypothesis, set up controls, and collect data in a scientific experiment\nhyde: Step 1: Define your research question. Step 2: Formulate a testable hypothesis. Step 3: Identify independent, dependent, and controlled variables. Step 4: Design your procedure with a control group. Step 5: Collect and record data systematically. Step 6: Analyze results and draw conclusions."}
+{"input": "web mail", "output": "lex: webmail client email browser\nlex: web-based email service provider\nlex: online email login inbox access\nvec: how to access and use web-based email services like Gmail, Outlook, or Yahoo Mail through a browser\nvec: what are the most popular webmail providers and how do their features compare\nhyde: Webmail allows you to access your email through a web browser without installing a desktop client. Popular services include Gmail (mail.google.com), Outlook.com, Yahoo Mail, and ProtonMail. Log in with your credentials to read, compose, and manage messages from any device."}
+{"input": "what does the quran cover", "output": "lex: quran topics contents themes\nlex: quran teachings subjects covered\nvec: what are the main topics and themes discussed in the Quran\nvec: what subjects does the Quran address including theology, law, morality, and prophetic stories\nhyde: The Quran covers topics including monotheism (tawhid), the Day of Judgment, stories of prophets from Adam to Muhammad, ethical conduct, family law, dietary rules, charity (zakat), prayer, and the relationship between God and humanity. It contains 114 surahs organized roughly by length."}
+{"input": "web config", "output": "lex: web.config file IIS ASP.NET\nlex: web server configuration settings\nlex: web.config XML settings authentication\nvec: how to configure a web.config file for IIS and ASP.NET applications\nvec: what settings and sections are available in a web.config file for web server configuration\nhyde: The web.config file is an XML configuration file used by IIS and ASP.NET. It controls settings such as authentication, authorization, custom errors, connection strings, and HTTP handlers. Place it in the root of your application directory. Example: <configuration><system.web><compilation debug=\"true\"/></system.web></configuration>"}
+{"input": "how to choose farm equipment", "output": "lex: farm equipment selection tractor implements\nlex: agricultural machinery buying guide\nlex: choosing tractor size horsepower acreage\nvec: what factors should you consider when selecting farm equipment like tractors and implements for your land\nvec: how do you match the right agricultural machinery to your farm size, crop type, and budget\nhyde: Match tractor horsepower to your acreage: 25-45 HP for under 50 acres, 45-85 HP for 50-200 acres, and 100+ HP for large operations. Consider PTO power for running implements like mowers and tillers. Evaluate whether two-wheel or four-wheel drive suits your terrain. Used equipment can save 40-60% over new."}
+{"input": "how do thought experiments aid philosophical reasoning", "output": "lex: thought experiments philosophy reasoning\nlex: philosophical thought experiment trolley problem examples\nvec: how do philosophers use thought experiments like the trolley problem to test moral and logical intuitions\nvec: what role do hypothetical scenarios play in advancing philosophical arguments and theories\nhyde: Thought experiments isolate specific variables in complex problems by constructing hypothetical scenarios. Judith Jarvis Thomson's violinist argument tests bodily autonomy intuitions, while the trolley problem probes deontological vs. consequentialist reasoning. They help philosophers identify hidden assumptions and clarify conceptual boundaries."}
+{"input": "what is the significance of logic in philosophy", "output": "lex: logic philosophy significance role\nlex: formal logic philosophical argument validity\nvec: why is logic considered foundational to philosophical inquiry and argumentation\nvec: how does formal and informal logic help philosophers evaluate the validity of arguments\nhyde: Logic provides the structural framework for all philosophical reasoning. Aristotle's syllogistic logic established rules for valid deduction. Modern formal logic, including propositional and predicate calculus, allows philosophers to precisely evaluate argument validity, identify fallacies, and construct rigorous proofs."}
+{"input": "how to train for a 5k run", "output": "lex: 5k run training plan beginner\nlex: couch to 5k running program schedule\nvec: what is a good beginner training plan to prepare for running a 5k race\nvec: how many weeks does it take to train for a 5k and what should each week look like\nhyde: An 8-week 5K training plan for beginners: Weeks 1-2, alternate 1 min running and 2 min walking for 20 minutes, 3 days per week. Weeks 3-4, run 3 min, walk 1 min. Weeks 5-6, run 5 min, walk 1 min. Weeks 7-8, run continuously for 25-30 minutes. Include rest days between runs."}
+{"input": "how to engage with political dialogues", "output": "lex: political dialogue conversation civil discourse\nlex: discussing politics constructively disagreement\nvec: how can you have productive political conversations with people who hold different views\nvec: what techniques help maintain respectful and constructive political dialogue across ideological divides\nhyde: Start by listening actively and asking clarifying questions rather than immediately countering. Use \"I\" statements instead of accusations. Acknowledge shared values before addressing disagreements. Avoid strawmanning—restate the other person's position accurately before responding. Focus on specific policies rather than party labels."}
+{"input": "what is competitive analysis", "output": "lex: competitive analysis business strategy\nlex: competitor analysis market research framework\nvec: what is competitive analysis in business and how do companies use it to inform strategy\nvec: what frameworks and methods are used to conduct a competitive analysis of rival companies\nhyde: Competitive analysis is the process of identifying competitors and evaluating their strategies, strengths, and weaknesses relative to your own. Key frameworks include Porter's Five Forces, SWOT analysis, and competitor profiling. Analyze pricing, product features, market share, marketing channels, and customer reviews."}
+{"input": "how does the united nations operate", "output": "lex: united nations structure operations governance\nlex: UN general assembly security council agencies\nvec: how is the United Nations structured and what are the roles of its main bodies like the General Assembly and Security Council\nvec: how does the UN make decisions, enforce resolutions, and coordinate international action\nhyde: The UN operates through six principal organs: the General Assembly (all 193 members, one vote each), the Security Council (15 members, 5 permanent with veto power), the Secretariat, the International Court of Justice, ECOSOC, and the Trusteeship Council. Resolutions require majority votes; Security Council decisions need 9 of 15 votes with no P5 veto."}
+{"input": "what are the crusades?", "output": "lex: crusades medieval holy wars Jerusalem\nlex: crusades history 1096 Christian Muslim\nvec: what were the Crusades and why did European Christians launch military campaigns to the Holy Land\nvec: what were the major Crusades, their outcomes, and their lasting impact on Europe and the Middle East\nhyde: The Crusades were a series of religious wars between 1096 and 1291, initiated by the Latin Church to recapture the Holy Land from Muslim rule. The First Crusade (1096-1099) captured Jerusalem. Subsequent crusades had mixed results, and the last Crusader stronghold at Acre fell in 1291."}
+{"input": "what is a literary theme?", "output": "lex: literary theme definition examples\nlex: theme in literature central idea meaning\nvec: what is a literary theme and how does it differ from the subject or plot of a story\nvec: how do authors develop and convey themes throughout a work of literature\nhyde: A literary theme is the underlying message or central idea explored in a work of fiction. Unlike the subject (what the story is about), the theme is what the story says about that subject. For example, a novel's subject might be war, while its theme could be \"war dehumanizes both victors and victims.\""}
+{"input": "what is the ethical significance of consent", "output": "lex: consent ethics moral significance\nlex: informed consent autonomy medical ethics\nvec: why is consent considered ethically important in medical, legal, and interpersonal contexts\nvec: how does the concept of informed consent protect individual autonomy and human dignity\nhyde: Consent is ethically significant because it respects individual autonomy—the right of persons to make decisions about their own bodies and lives. In medical ethics, informed consent requires that patients understand the risks, benefits, and alternatives before agreeing to treatment. Without valid consent, actions become coercive regardless of their intent."}
+{"input": "paint mix", "output": "lex: paint color mixing guide ratios\nlex: acrylic oil paint mixing technique\nlex: paint color chart combinations blending\nvec: how do you mix paint colors to achieve specific shades and hues\nvec: what are the basic color mixing ratios and techniques for acrylic and oil paints\nhyde: Start with the three primary colors: red, blue, and yellow. Mix red and blue for purple, blue and yellow for green, red and yellow for orange. Add white to lighten (tint) and black to darken (shade). Mix small amounts gradually—it takes less dark paint to shift a light color than the reverse."}
+{"input": "how to conserve energy in the office?", "output": "lex: office energy conservation tips\nlex: reduce electricity workplace energy saving\nvec: what are practical ways to reduce energy consumption in an office or workplace\nvec: how can offices save electricity through lighting, HVAC, and equipment management\nhyde: Switch to LED lighting and install occupancy sensors in conference rooms and restrooms. Set computers to sleep mode after 10 minutes of inactivity. Use smart power strips to eliminate phantom loads. Set thermostats to 68°F in winter and 76°F in summer. These measures typically reduce office energy use by 20-30%."}
+{"input": "how to test soil ph?", "output": "lex: soil pH test kit method\nlex: test soil acidity alkalinity garden\nvec: how do you test the pH level of garden soil using a home test kit or meter\nvec: what methods are available for measuring soil pH and interpreting the results for gardening\nhyde: Insert a soil pH meter probe 4-6 inches into moist soil for a quick reading. For more accuracy, use a chemical test kit: mix one part soil with one part distilled water, let settle, then add the indicator solution. Compare the color to the chart. Most garden plants prefer pH 6.0-7.0."}
+{"input": "navigating sustainable building certifications", "output": "lex: sustainable building certification LEED BREEAM\nlex: green building standards certification process\nvec: what are the main sustainable building certifications like LEED, BREEAM, and WELL, and how do you achieve them\nvec: how do you navigate the requirements and application process for green building certifications\nhyde: LEED (Leadership in Energy and Environmental Design) awards points across categories: energy, water, materials, indoor quality, and site selection. Projects need 40-49 points for Certified, 50-59 for Silver, 60-79 for Gold, and 80+ for Platinum. BREEAM is more common in Europe and uses a percentage-based scoring system."}
+{"input": "what is the role of religious leaders?", "output": "lex: religious leaders role function community\nlex: clergy priests imams rabbis duties responsibilities\nvec: what roles do religious leaders like priests, imams, and rabbis play in their communities\nvec: how do religious leaders guide spiritual practice, provide counsel, and serve their congregations\nhyde: Religious leaders serve as spiritual guides, interpreters of sacred texts, and community organizers. A parish priest administers sacraments, leads worship, and provides pastoral care. An imam leads prayers, delivers Friday sermons (khutbah), and offers religious guidance. Rabbis teach Torah, arbitrate Jewish law, and counsel congregants."}
+{"input": "how to maintain a balanced diet", "output": "lex: balanced diet nutrition food groups\nlex: healthy eating meal plan macronutrients\nvec: how do you maintain a balanced diet with the right proportions of proteins, carbohydrates, fats, and vitamins\nvec: what does a daily balanced meal plan look like for an average adult\nhyde: A balanced diet includes roughly 45-65% carbohydrates, 20-35% fats, and 10-35% protein. Fill half your plate with fruits and vegetables, a quarter with whole grains, and a quarter with lean protein. Aim for 25-30g of fiber daily. Limit added sugars to under 25g and sodium to under 2300mg per day."}
+{"input": "what is moral philosophy", "output": "lex: moral philosophy ethics definition branches\nlex: ethics normative metaethics applied\nvec: what is moral philosophy and what are its main branches including normative ethics and metaethics\nvec: how does moral philosophy address questions of right and wrong, virtue, and duty\nhyde: Moral philosophy, or ethics, is the branch of philosophy concerned with questions of right and wrong conduct. It includes three main branches: metaethics (the nature of moral judgments), normative ethics (frameworks like utilitarianism, deontology, and virtue ethics), and applied ethics (specific issues like abortion or euthanasia)."}
+{"input": "how to use a light meter", "output": "lex: light meter photography exposure reading\nlex: incident reflected light meter settings\nvec: how do you use a handheld light meter to measure exposure for photography\nvec: what is the difference between incident and reflected light metering and when should you use each\nhyde: Point an incident light meter at the camera from the subject's position with the dome facing the lens. It reads the light falling on the subject, giving accurate exposure regardless of subject brightness. For reflected metering, point the meter at the subject from the camera position. Set the ISO first, then read the recommended aperture and shutter speed."}
+{"input": "what is the significance of creative writing?", "output": "lex: creative writing significance purpose value\nlex: creative writing literary expression storytelling\nvec: why is creative writing significant as a form of artistic expression and communication\nvec: how does creative writing contribute to culture, self-expression, and empathy\nhyde: Creative writing allows individuals to explore complex emotions, construct meaning, and communicate experiences that resist straightforward exposition. Through fiction, poetry, and memoir, writers develop empathy by inhabiting other perspectives. Studies show that reading literary fiction improves theory of mind and emotional intelligence."}
+{"input": "what are the key principles of confucianism?", "output": "lex: confucianism key principles ren li xiao\nlex: confucian philosophy five relationships virtues\nvec: what are the core principles and virtues of Confucianism such as ren, li, and filial piety\nvec: how do the five key relationships in Confucianism structure social and moral order\nhyde: The key principles of Confucianism include Ren (benevolence/humaneness), Li (ritual propriety), Xiao (filial piety), Yi (righteousness), and Zhi (wisdom). The Five Relationships define social bonds: ruler-subject, parent-child, husband-wife, elder-younger sibling, and friend-friend. Each relationship carries reciprocal obligations."}
+{"input": "what is agile project management", "output": "lex: agile project management scrum kanban\nlex: agile methodology sprints iterative development\nvec: what is agile project management and how does it differ from traditional waterfall approaches\nvec: how do agile frameworks like Scrum and Kanban organize work into sprints and iterations\nhyde: Agile project management is an iterative approach that delivers work in short cycles called sprints (typically 1-4 weeks). Teams hold daily standups, plan sprint backlogs, and conduct retrospectives. Key frameworks include Scrum (with defined roles: Product Owner, Scrum Master, Team) and Kanban (continuous flow with WIP limits)."}
+{"input": "what is the significance of the harlem renaissance", "output": "lex: Harlem Renaissance significance African American culture\nlex: Harlem Renaissance 1920s literature art music\nvec: what was the Harlem Renaissance and why was it significant for African American culture and arts\nvec: which writers, artists, and musicians defined the Harlem Renaissance and what impact did they have\nhyde: The Harlem Renaissance (1920s-1930s) was a cultural explosion centered in Harlem, New York, that transformed African American literature, music, and art. Langston Hughes, Zora Neale Hurston, and Claude McKay produced groundbreaking literary works. Jazz and blues flourished at the Cotton Club. The movement asserted Black identity and challenged racial stereotypes."}
+{"input": "what triggered world war i", "output": "lex: World War I causes triggers assassination\nlex: WWI outbreak 1914 Franz Ferdinand alliances\nvec: what events and conditions triggered the start of World War I in 1914\nvec: how did the assassination of Archduke Franz Ferdinand lead to a full-scale world war through the alliance system\nhyde: The assassination of Archduke Franz Ferdinand of Austria-Hungary on June 28, 1914, in Sarajevo triggered WWI. Austria-Hungary issued an ultimatum to Serbia. The alliance system pulled in Russia (allied with Serbia), Germany (allied with Austria-Hungary), France (allied with Russia), and Britain (allied with France and Belgium)."}
+{"input": "how to improve drawing skills?", "output": "lex: improve drawing skills practice techniques\nlex: learn to draw exercises sketching\nvec: what exercises and practice routines help improve drawing and sketching skills for beginners\nvec: how can you develop better hand-eye coordination and observational skills for drawing\nhyde: Practice gesture drawing daily: set a timer for 30-60 seconds and sketch the overall pose of a figure or object without lifting your pencil. Draw from life, not just photos. Study basic forms—spheres, cylinders, boxes—and learn to see complex objects as combinations of these shapes. Fill a sketchbook page every day."}
+{"input": "what is international relations", "output": "lex: international relations definition political science\nlex: IR theory realism liberalism diplomacy\nvec: what is the field of international relations and what theories explain how states interact\nvec: how does international relations study diplomacy, conflict, trade, and cooperation between nations\nhyde: International relations (IR) is a subfield of political science that studies interactions between states, international organizations, and non-state actors. Major theoretical frameworks include realism (states pursue power in an anarchic system), liberalism (institutions and cooperation reduce conflict), and constructivism (social norms shape state behavior)."}
+{"input": "what is the human genome project", "output": "lex: Human Genome Project HGP DNA sequencing\nlex: human genome mapping genes 2003 completed\nvec: what was the Human Genome Project and what did it accomplish in mapping human DNA\nvec: how has the Human Genome Project influenced genetics, medicine, and our understanding of human biology\nhyde: The Human Genome Project (1990-2003) was an international research effort to sequence all 3.2 billion base pairs of human DNA and identify approximately 20,500 genes. Completed in April 2003, it cost $2.7 billion and has enabled advances in personalized medicine, genetic testing, and understanding of hereditary diseases."}
+{"input": "how to assess a neighborhood safety", "output": "lex: neighborhood safety assessment crime check\nlex: evaluate neighborhood crime rate walkability\nvec: how do you assess whether a neighborhood is safe before moving there\nvec: what factors and data sources help evaluate neighborhood safety including crime statistics and local conditions\nhyde: Check crime maps on sites like CrimeMapping.com or SpotCrime using the ZIP code. Walk the neighborhood at different times of day and night. Look for signs of community investment: maintained properties, street lighting, and active businesses. Talk to residents and visit the local police precinct for crime statistics."}
+{"input": "what are the characteristics of a just society", "output": "lex: just society characteristics principles fairness\nlex: social justice equality Rawls distributive justice\nvec: what are the defining characteristics of a just society according to political philosophy\nvec: how do philosophers like John Rawls define justice and the principles of a fair society\nhyde: John Rawls argued a just society is one where principles are chosen behind a \"veil of ignorance\"—not knowing your own position. His two principles: (1) equal basic liberties for all, and (2) social and economic inequalities are arranged to benefit the least advantaged (difference principle) with fair equality of opportunity."}
+{"input": "what is the significance of the narrative arc?", "output": "lex: narrative arc significance story structure\nlex: narrative arc exposition climax resolution\nvec: what is a narrative arc and why is it significant in storytelling and fiction writing\nvec: how do the stages of a narrative arc—exposition, rising action, climax, falling action, resolution—shape a story\nhyde: The narrative arc structures a story's progression from exposition through rising action to climax, then falling action and resolution. Gustav Freytag formalized this as a five-act pyramid. A strong arc creates tension, develops characters through conflict, and delivers emotional payoff, keeping readers engaged from beginning to end."}
+{"input": "what is bioethics", "output": "lex: bioethics definition medical ethics biology\nlex: bioethics issues euthanasia cloning genetic engineering\nvec: what is bioethics and what moral questions does it address in medicine and biological science\nvec: how does bioethics evaluate issues like genetic engineering, euthanasia, and organ transplantation\nhyde: Bioethics is an interdisciplinary field that examines ethical issues arising from advances in biology and medicine. Core principles include autonomy (patient choice), beneficence (do good), non-maleficence (do no harm), and justice (fair distribution). It addresses topics such as end-of-life care, genetic editing (CRISPR), stem cell research, and clinical trial ethics."}
+{"input": "what is the significance of reincarnation in hinduism", "output": "lex: reincarnation hinduism samsara karma\nlex: Hindu rebirth cycle moksha atman\nvec: what role does reincarnation play in Hindu belief and how is it connected to karma and moksha\nvec: how does the concept of samsara and the cycle of rebirth shape Hindu spiritual practice\nhyde: In Hinduism, reincarnation (samsara) is the cycle of death and rebirth of the atman (soul). Karma—the accumulated results of actions—determines the conditions of each rebirth. The ultimate goal is moksha: liberation from the cycle of samsara, achieved through jnana (knowledge), bhakti (devotion), or karma yoga (selfless action)."}
+{"input": "learn code", "output": "lex: learn programming coding beginner\nlex: learn to code online courses tutorials\nlex: programming language beginner Python JavaScript\nvec: how can a beginner start learning to code and which programming language should they learn first\nvec: what are the best free resources and online courses for learning programming from scratch\nhyde: Start with Python or JavaScript—both have gentle learning curves and wide applications. Free resources include freeCodeCamp.org, Codecademy, and CS50 on edX. Begin with variables, loops, and functions, then build small projects. Practice daily on coding challenges at sites like LeetCode or Codewars."}
+{"input": "what is the significance of the enlightenment?", "output": "lex: Enlightenment significance 18th century philosophy\nlex: Age of Enlightenment reason science liberty\nvec: what was the Enlightenment and why is it considered a turning point in Western intellectual history\nvec: how did Enlightenment thinkers like Voltaire, Locke, and Kant influence modern democracy and science\nhyde: The Enlightenment (c. 1685-1815) emphasized reason, individual liberty, and scientific inquiry over tradition and religious authority. Thinkers like John Locke (natural rights), Voltaire (freedom of speech), and Kant (\"dare to know\") laid the intellectual foundations for democratic revolutions, constitutional government, and the separation of church and state."}
+{"input": "google docs", "output": "lex: Google Docs word processor cloud\nlex: Google Docs collaboration editing sharing\nlex: Google Docs templates formatting features\nvec: how do you use Google Docs to create, edit, and collaborate on documents online\nvec: what features does Google Docs offer for real-time collaboration, formatting, and sharing\nhyde: Google Docs is a free cloud-based word processor at docs.google.com. It supports real-time collaboration—multiple users can edit simultaneously with changes tracked by color. Share documents via link or email with view, comment, or edit permissions. It auto-saves to Google Drive and supports export to .docx, .pdf, and other formats."}
+{"input": "how to perform statistical analysis in research", "output": "lex: statistical analysis research methods\nlex: statistical tests t-test ANOVA regression research\nvec: how do researchers choose and perform appropriate statistical analyses for their data\nvec: what are the common statistical methods used in academic research and when should each be applied\nhyde: Choose your statistical test based on your data type and research question. Use t-tests for comparing two group means, ANOVA for three or more groups, chi-square for categorical data, and regression for predicting outcomes. Check assumptions: normality (Shapiro-Wilk test), homogeneity of variance (Levene's test), and independence of observations."}
+{"input": "what is the role of physics in engineering", "output": "lex: physics role engineering applications\nlex: physics principles mechanical electrical civil engineering\nvec: how do physics principles apply to engineering disciplines like mechanical, electrical, and civil engineering\nvec: what fundamental physics concepts are essential for engineers to understand and apply\nhyde: Physics underpins all engineering disciplines. Mechanical engineers apply Newton's laws and thermodynamics to design engines and machines. Electrical engineers use Maxwell's equations and semiconductor physics to build circuits. Civil engineers rely on statics and material strength calculations to design buildings and bridges that withstand loads."}
+{"input": "how to read a topographic map?", "output": "lex: topographic map reading contour lines\nlex: topo map elevation contour interval legend\nvec: how do you read contour lines and elevation data on a topographic map\nvec: what do the symbols, contour lines, and colors on a USGS topographic map represent\nhyde: Contour lines connect points of equal elevation. Lines close together indicate steep terrain; lines far apart indicate gentle slopes. The contour interval (stated in the legend) is the elevation difference between adjacent lines. Every fifth line is an index contour, drawn thicker with the elevation labeled. Brown lines show terrain, blue shows water."}
+{"input": "how to choose car speakers?", "output": "lex: car speakers choosing size type\nlex: car audio speakers coaxial component upgrade\nvec: how do you choose aftermarket car speakers that fit your vehicle and sound preferences\nvec: what is the difference between coaxial and component car speakers and which should you buy\nhyde: Check your car's speaker sizes (common: 6.5\", 6x9\", 5.25\") using a fitment guide. Coaxial speakers are all-in-one replacements—easy to install with tweeter built in. Component speakers separate the woofer, tweeter, and crossover for better sound staging but require more installation work. Look for sensitivity (85+ dB) and RMS power handling matching your head unit or amp."}
+{"input": "where to buy organic seeds?", "output": "lex: buy organic seeds online garden\nlex: organic seed suppliers heirloom non-GMO\nvec: where can you buy certified organic and heirloom seeds for a home garden\nvec: which online seed companies sell high-quality organic and non-GMO vegetable and flower seeds\nhyde: Trusted organic seed suppliers include Johnny's Selected Seeds, High Mowing Organic Seeds, Seed Savers Exchange, and Baker Creek Heirloom Seeds. Look for USDA Certified Organic labels and non-GMO verification. Order in January-February for spring planting. Many offer sampler packs for beginners."}
+{"input": "challenges of digital transformation", "output": "lex: digital transformation challenges obstacles\nlex: enterprise digital transformation barriers legacy systems\nvec: what are the main challenges organizations face when undergoing digital transformation\nvec: how do legacy systems, culture resistance, and skill gaps hinder digital transformation efforts\nhyde: Common digital transformation challenges include resistance to change from employees, integrating legacy systems with new platforms, data silos across departments, cybersecurity risks during migration, and shortage of skilled talent. McKinsey reports that 70% of digital transformation initiatives fail, often due to organizational culture rather than technology."}
+{"input": "what makes a good thriller novel?", "output": "lex: thriller novel elements writing techniques\nlex: good thriller pacing suspense plot twists\nvec: what elements make a thriller novel compelling including pacing, suspense, and plot structure\nvec: how do successful thriller writers build tension and keep readers turning pages\nhyde: A great thriller has a high-stakes central conflict, a ticking clock, and a protagonist under escalating pressure. Pacing is crucial—short chapters and cliffhanger endings drive momentum. Plant red herrings and misdirection, then deliver a twist that recontextualizes earlier clues. The antagonist should be intelligent and formidable, making the hero's victory feel earned."}
+{"input": "what is the composition of the earth's atmosphere", "output": "lex: earth atmosphere composition gases percentages\nlex: atmospheric gases nitrogen oxygen argon CO2\nvec: what gases make up the Earth's atmosphere and in what proportions\nvec: what is the chemical composition of Earth's atmosphere including trace gases\nhyde: Earth's atmosphere is composed of 78.09% nitrogen (N₂), 20.95% oxygen (O₂), 0.93% argon (Ar), and 0.04% carbon dioxide (CO₂). Trace gases include neon, helium, methane, krypton, and water vapor (0-4% depending on humidity). The atmosphere extends roughly 480 km above the surface and is divided into five layers: troposphere, stratosphere, mesosphere, thermosphere, and exosphere."}
+{"input": "how to file a petition to government", "output": "lex: file petition government civic action\nlex: government petition create submit signatures\nvec: how do you create and file a formal petition to a government body or elected representative\nvec: what is the process for submitting a petition to local, state, or federal government\nhyde: To file a petition, clearly state your request and supporting reasons. Collect signatures from eligible constituents—most jurisdictions require a minimum number based on population. File the petition with the appropriate government office (city clerk, state legislature, or Congress). Online platforms like Change.org can amplify support but may not satisfy legal petition requirements."}
+{"input": "how to grow rhododendrons?", "output": "lex: grow rhododendrons planting care soil\nlex: rhododendron acidic soil shade watering\nvec: how do you plant and care for rhododendrons including soil, light, and watering requirements\nvec: what soil pH and growing conditions do rhododendrons need to thrive\nhyde: Rhododendrons require acidic soil (pH 4.5-6.0), partial shade, and consistent moisture. Plant in well-drained soil amended with peat moss or composted pine bark. Mulch with 2-3 inches of pine needles. Water deeply once a week—they have shallow root systems sensitive to drought. Avoid planting too deep; keep the root ball crown at soil level."}
+{"input": "what is the ethics of surveillance", "output": "lex: surveillance ethics privacy government\nlex: mass surveillance civil liberties Fourth Amendment\nvec: what are the ethical issues surrounding government and corporate surveillance of citizens\nvec: how do privacy rights conflict with security justifications for mass surveillance programs\nhyde: Mass surveillance raises fundamental questions about the balance between security and privacy. Critics argue programs like the NSA's PRISM violate Fourth Amendment protections against unreasonable search. Proponents claim surveillance prevents terrorism. The chilling effect—self-censorship by citizens who know they're watched—threatens free expression and democratic participation."}
+{"input": "regex match", "output": "lex: regex match pattern regular expression\nlex: regex syntax matching groups capture\nlex: regular expression examples tutorial\nvec: how do you write and use regular expressions to match patterns in text\nvec: what is the syntax for regex pattern matching including groups, quantifiers, and character classes\nhyde: A regex (regular expression) matches text patterns. Common syntax: `.` matches any character, `*` means zero or more, `+` means one or more, `?` means optional. `[a-z]` matches lowercase letters. `\\d` matches digits. Capture groups use parentheses: `(\\d{3})-(\\d{4})` matches and captures phone number parts. Use `^` for start and `$` for end of line."}
+{"input": "what is the ethics of research", "output": "lex: research ethics principles IRB\nlex: ethical research human subjects informed consent\nvec: what ethical principles govern scientific and academic research involving human subjects\nvec: how do institutional review boards ensure ethical standards in research studies\nhyde: Research ethics are governed by the Belmont Report's three principles: respect for persons (informed consent), beneficence (minimize harm, maximize benefit), and justice (fair selection of subjects). Institutional Review Boards (IRBs) review all human subjects research. Key requirements include voluntary participation, confidentiality, right to withdraw, and risk-benefit assessment."}
+{"input": "how to set intentions for the day?", "output": "lex: set daily intentions morning routine\nlex: intention setting mindfulness journaling\nvec: how do you set meaningful daily intentions as part of a morning routine\nvec: what is the practice of setting intentions and how does it differ from goal-setting\nhyde: Each morning, sit quietly for 2-3 minutes and ask yourself: \"How do I want to feel today?\" and \"What matters most today?\" Write one to three intentions in a journal—e.g., \"I will be present in conversations\" or \"I will approach challenges with curiosity.\" Intentions focus on how you show up, not on tasks to complete. Review them at midday and evening."}
+{"input": "what is the role of sacred music in worship?", "output": "lex: sacred music worship role function\nlex: religious hymns chants liturgical music\nvec: what role does sacred music play in religious worship services across different faiths\nvec: how do hymns, chants, and liturgical music enhance the experience of communal worship\nhyde: Sacred music serves multiple functions in worship: it creates a contemplative atmosphere, unifies the congregation through shared singing, reinforces theological themes through lyrics, and marks liturgical transitions. Gregorian chant in Catholic Mass, bhajans in Hindu puja, and the Islamic adhan each use distinct musical forms to invoke the sacred and facilitate prayer."}
+{"input": "what are the features of ancient roman society?", "output": "lex: ancient Roman society features structure\nlex: Roman social classes patricians plebeians republic\nvec: what were the defining features of ancient Roman society including social classes, government, and daily life\nvec: how was ancient Roman society structured in terms of class hierarchy, citizenship, and law\nhyde: Roman society was divided into patricians (aristocratic families), plebeians (common citizens), freedmen, and slaves. Citizens had legal rights including voting and property ownership. The Senate held political power, though plebeians gained representation through tribunes. Roman law (Twelve Tables, 450 BC) codified legal principles still influential today. The paterfamilias held authority over extended households."}
+{"input": "what is the role of family in society", "output": "lex: family role society function socialization\nlex: family structure social institution support\nvec: what roles does the family unit play in society including socialization, support, and cultural transmission\nvec: how do families function as the primary social institution for raising children and maintaining social order\nhyde: The family is society's primary unit of socialization, teaching children language, norms, and values. Functionalist sociologists identify four key roles: socialization of children, economic cooperation, emotional support, and regulation of sexual behavior. Families also transmit cultural identity, religious traditions, and social status across generations."}
+{"input": "what is quantitative easing explained", "output": "lex: quantitative easing QE monetary policy\nlex: quantitative easing central bank bond buying\nvec: what is quantitative easing and how do central banks use it to stimulate the economy\nvec: how does the Federal Reserve's quantitative easing program work and what are its effects on inflation and interest rates\nhyde: Quantitative easing (QE) is an unconventional monetary policy where a central bank buys government bonds and other securities to inject money into the economy. When the Fed buys bonds, it increases bank reserves, lowers long-term interest rates, and encourages lending. The Fed used QE after 2008 and during COVID-19, expanding its balance sheet to over $8 trillion."}
+{"input": "what is guerrilla marketing", "output": "lex: guerrilla marketing unconventional low-cost\nlex: guerrilla marketing examples campaigns street\nvec: what is guerrilla marketing and how do businesses use unconventional tactics to promote products\nvec: what are examples of successful guerrilla marketing campaigns and what makes them effective\nhyde: Guerrilla marketing uses unconventional, low-cost tactics to create memorable brand experiences in unexpected places. Examples include flash mobs, street art installations, viral stunts, and ambient advertising placed in surprising locations. Jay Conrad Levinson coined the term in 1984. Success depends on creativity, surprise, and shareability rather than large advertising budgets."}
+{"input": "what is the study of geology", "output": "lex: geology study earth science rocks minerals\nlex: geology branches mineralogy tectonics stratigraphy\nvec: what is geology and what do geologists study about the Earth's structure, materials, and history\nvec: what are the main branches of geology including mineralogy, petrology, and plate tectonics\nhyde: Geology is the scientific study of the Earth's structure, composition, and processes. Geologists examine rocks, minerals, fossils, and landforms to understand Earth's 4.5-billion-year history. Major branches include mineralogy (minerals), petrology (rocks), stratigraphy (rock layers), paleontology (fossils), and tectonics (plate movement and earthquakes)."}
+{"input": "how to photograph artwork?", "output": "lex: photograph artwork lighting camera setup\nlex: art photography reproduction color accuracy\nvec: how do you photograph paintings and artwork with accurate color and minimal glare\nvec: what camera settings, lighting, and techniques produce high-quality photographs of artwork\nhyde: Use two identical lights at 45-degree angles to the artwork to eliminate glare and ensure even illumination. Mount the camera on a tripod, centered and parallel to the surface. Shoot in RAW at ISO 100, f/8 for sharpness. Include a color checker card in one frame for accurate white balance. Use a remote shutter to avoid camera shake."}
+{"input": "what are smart home technologies", "output": "lex: smart home technologies devices IoT\nlex: smart home automation hub Alexa Google Home\nvec: what smart home technologies are available for automating lighting, security, climate, and entertainment\nvec: how do smart home devices and IoT platforms like Alexa, Google Home, and HomeKit work together\nhyde: Smart home technologies connect devices via Wi-Fi, Zigbee, Z-Wave, or Matter protocol to a central hub or voice assistant. Common categories include smart lighting (Philips Hue), thermostats (Nest, Ecobee), security cameras (Ring, Arlo), locks (August, Yale), and speakers (Amazon Echo, Google Nest). Automations trigger actions based on time, location, or sensor data."}
+{"input": "how sports influence youth development", "output": "lex: sports youth development influence benefits\nlex: youth athletics child development teamwork discipline\nvec: how does participation in sports influence the physical, social, and emotional development of young people\nvec: what benefits do organized sports provide for youth including teamwork, discipline, and mental health\nhyde: Research shows youth sports participation improves physical fitness, teaches teamwork and leadership, and builds self-esteem. A 2019 study in the Journal of Sport and Health Science found that adolescents who play organized sports report lower rates of depression and anxiety. However, excessive pressure and early specialization can lead to burnout and injury."}
+{"input": "how to build self-confidence", "output": "lex: build self-confidence techniques self-esteem\nlex: improve confidence self-worth mindset\nvec: what are practical strategies for building self-confidence and overcoming self-doubt\nvec: how can someone develop greater self-confidence through daily habits and mindset shifts\nhyde: Start by setting small, achievable goals and completing them—each success builds evidence of competence. Practice self-compassion: replace harsh self-criticism with the tone you'd use with a friend. Keep a \"wins\" journal and review it weekly. Gradually expand your comfort zone by doing one slightly uncomfortable thing each day. Confidence grows from accumulated experience, not positive thinking alone."}
+{"input": "how to plan a family field trip?", "output": "lex: family field trip planning kids activities\nlex: family outing day trip educational fun\nvec: how do you plan an enjoyable and educational family field trip with children\nvec: what are tips for organizing a family day trip including choosing destinations, packing, and budgeting\nhyde: Choose an age-appropriate destination: museums, nature centers, farms, or historical sites. Check hours, admission costs, and accessibility online. Pack snacks, water, sunscreen, and a first-aid kit. Plan for shorter attention spans—schedule breaks every 60-90 minutes. Involve kids in planning by letting them choose one activity. Bring a scavenger hunt list to keep them engaged."}
+{"input": "what is a scientific model", "output": "lex: scientific model definition types examples\nlex: scientific models simulation representation theory\nvec: what is a scientific model and how do scientists use models to explain and predict natural phenomena\nvec: what are the different types of scientific models including physical, mathematical, and computational models\nhyde: A scientific model is a simplified representation of a system or phenomenon used to explain observations and make predictions. Models can be physical (a globe representing Earth), mathematical (equations describing gravity), or computational (climate simulations). All models are approximations—George Box wrote, \"All models are wrong, but some are useful.\""}
+{"input": "io file", "output": "lex: file I/O input output operations\nlex: file read write programming IO\nlex: file handling open close stream\nvec: how do you perform file input and output operations in programming languages\nvec: what are the common methods for reading from and writing to files in Python, Java, or C\nhyde: File I/O involves opening a file, reading or writing data, and closing it. In Python: `with open('file.txt', 'r') as f: data = f.read()` for reading, and `with open('file.txt', 'w') as f: f.write('hello')` for writing. The `with` statement ensures the file is properly closed. Use 'a' mode to append, 'rb'/'wb' for binary files."}
+{"input": "what are creative portrait ideas?", "output": "lex: creative portrait photography ideas techniques\nlex: portrait photo ideas poses lighting creative\nvec: what are unique and creative portrait photography ideas for interesting and artistic results\nvec: how can you use lighting, props, angles, and locations for creative portrait photography\nhyde: Try shooting through prisms or crystal balls for rainbow light effects. Use fairy lights wrapped around the subject for warm bokeh. Photograph through rain-covered glass for a moody feel. Use dramatic side lighting with one bare bulb for chiaroscuro portraits. Shoot reflections in puddles, mirrors, or sunglasses. Double exposure combining portraits with textures or nature works well in-camera or in post."}
+{"input": "fix hair", "output": "lex: fix hair repair damaged broken\nlex: hair repair treatment dry frizzy damaged\nlex: hairstyle fix bad hair day\nvec: how do you fix and repair damaged, dry, or frizzy hair\nvec: what are quick fixes for a bad hair day and long-term solutions for hair damage\nhyde: For damaged hair, use a deep conditioning mask with keratin or argan oil once a week. Trim split ends every 6-8 weeks. Reduce heat styling—if you must, use a heat protectant spray at 300°F max. For a quick bad hair day fix, try dry shampoo at the roots, a slicked-back bun, or braids. Sleep on a silk pillowcase to reduce friction and breakage."}
+{"input": "build up", "output": "lex: build up strength fitness training\nlex: build up muscle mass exercise\nlex: buildup gradual increase accumulation\nvec: how do you progressively build up strength and muscle through a structured training program\nvec: what does it mean to build up endurance, skills, or resources gradually over time\nhyde: To build up strength, follow progressive overload: gradually increase weight, reps, or sets each week. A beginner program like Starting Strength adds 5 lbs to compound lifts every session. Eat adequate protein (0.7-1g per pound bodyweight). Rest 48 hours between training the same muscle group. Consistency over 8-12 weeks produces measurable strength gains."}
+{"input": "how to participate in a protest", "output": "lex: participate protest rally demonstration rights\nlex: protest safety tips First Amendment rights\nvec: how do you safely and effectively participate in a protest or public demonstration\nvec: what should you know about your legal rights and safety precautions when attending a protest\nhyde: Know your rights: the First Amendment protects peaceful assembly on public property. Bring water, snacks, a phone charger, and ID. Write an emergency contact number on your arm. Stay with a buddy and agree on a meeting point. Wear comfortable shoes and weather-appropriate clothing. If tear gas is used, move upwind. Document police interactions by filming at a safe distance."}
+{"input": "what is the principle of utility?", "output": "lex: principle of utility utilitarianism Bentham Mill\nlex: utility principle greatest happiness greatest number\nvec: what is the principle of utility in utilitarian ethics as defined by Bentham and Mill\nvec: how does the utilitarian principle of utility evaluate actions based on their consequences for overall happiness\nhyde: The principle of utility, formulated by Jeremy Bentham, states that the morally right action is the one that produces the greatest happiness for the greatest number. Bentham's felicific calculus measured pleasure by intensity, duration, certainty, and extent. John Stuart Mill refined this, distinguishing higher (intellectual) pleasures from lower (bodily) pleasures."}
+{"input": "how to create a brand logo", "output": "lex: brand logo design create process\nlex: logo design principles typography color branding\nvec: how do you design an effective brand logo from concept to final design\nvec: what principles of logo design ensure a brand mark is memorable, scalable, and versatile\nhyde: Start by researching the brand's values, target audience, and competitors. Sketch 20-30 rough concepts on paper before going digital. A strong logo works in black and white, at small sizes (favicon), and large formats (billboard). Limit to 2-3 colors and one typeface. Test on business cards, websites, and merchandise. Tools: Adobe Illustrator, Figma, or Affinity Designer for vector-based design."}
+{"input": "how to check tire pressure?", "output": "lex: check tire pressure gauge PSI\nlex: tire pressure TPMS correct level car\nvec: how do you check and adjust tire pressure using a tire gauge\nvec: what is the correct tire pressure for a car and how often should it be checked\nhyde: Check tire pressure when tires are cold (before driving or 3+ hours after). Remove the valve cap, press a tire gauge firmly onto the valve stem, and read the PSI. Compare to the recommended pressure on the driver's door jamb sticker (not the tire sidewall—that's the maximum). Add air at a gas station if low. Check all four tires plus the spare monthly."}
+{"input": "how to cook quinoa", "output": "lex: cook quinoa recipe instructions stovetop\nlex: quinoa cooking ratio water time\nvec: what is the correct method for cooking quinoa on the stovetop with the right water ratio\nvec: how do you cook fluffy quinoa and what is the water to quinoa ratio\nhyde: Rinse 1 cup quinoa in a fine mesh strainer to remove bitter saponins. Combine with 2 cups water and a pinch of salt in a saucepan. Bring to a boil, reduce to low, cover, and simmer for 15 minutes. Remove from heat and let steam with the lid on for 5 minutes. Fluff with a fork. Yields about 3 cups cooked quinoa."}
+{"input": "how to prevent identity theft", "output": "lex: prevent identity theft protection tips\nlex: identity theft prevention credit freeze monitor\nvec: what steps can you take to protect yourself from identity theft and fraud\nvec: how do credit freezes, strong passwords, and monitoring help prevent identity theft\nhyde: Freeze your credit at all three bureaus (Equifax, Experian, TransUnion)—it's free and prevents unauthorized accounts. Use unique passwords with a password manager. Enable two-factor authentication on all financial accounts. Shred documents with personal information. Monitor bank statements weekly and check your credit report annually at AnnualCreditReport.com."}
+{"input": "how to start a blog", "output": "lex: start blog setup hosting platform\nlex: blogging beginners WordPress Substack setup\nvec: how do you start a blog from scratch including choosing a platform, domain, and writing your first posts\nvec: what are the steps to launch a successful blog and attract readers\nhyde: Choose a platform: WordPress.org for full control (needs hosting), or Substack/Ghost for simplicity. Pick a niche you can write about consistently. Register a domain name ($10-15/year). Write 5-10 posts before launching so visitors find content immediately. Optimize for SEO with clear titles and headers. Share on social media and engage with other bloggers in your niche."}
+{"input": "documentary photography", "output": "lex: documentary photography style techniques\nlex: documentary photojournalism storytelling long-term\nvec: what is documentary photography and how does it differ from photojournalism and street photography\nvec: what techniques and approaches do documentary photographers use to tell stories through images\nhyde: Documentary photography aims to chronicle real events, conditions, or people over time to create a truthful narrative. Unlike photojournalism's focus on breaking news, documentary work unfolds over weeks, months, or years. Key practitioners include Dorothea Lange (Great Depression), Sebastião Salgado (workers, migration), and James Nachtwey (conflict). Shoot with available light, build trust with subjects, and caption extensively."}
+{"input": "what causes tides", "output": "lex: tides causes moon gravitational pull\nlex: tidal forces moon sun earth gravity\nvec: what causes ocean tides and how do the gravitational forces of the moon and sun create them\nvec: how does the moon's gravitational pull create high and low tides on Earth\nhyde: Tides are primarily caused by the gravitational pull of the Moon on Earth's oceans. The side of Earth facing the Moon experiences a direct gravitational pull creating a tidal bulge (high tide). A second bulge forms on the opposite side due to inertial forces. The Sun's gravity also contributes—spring tides (highest) occur during full and new moons when Sun and Moon align."}
+{"input": "what is the history of christianity?", "output": "lex: history Christianity origins spread timeline\nlex: Christianity history Jesus apostles church development\nvec: what is the history of Christianity from its origins with Jesus to the modern era\nvec: how did Christianity spread from a small Jewish sect to a global religion over two millennia\nhyde: Christianity originated in 1st-century Judea with the teachings of Jesus of Nazareth. After his crucifixion (c. 30 AD), apostles like Paul spread the faith across the Roman Empire. Constantine legalized it in 313 AD (Edict of Milan). The Great Schism (1054) split Eastern Orthodox and Roman Catholic churches. The Protestant Reformation began in 1517 with Martin Luther."}
+{"input": "what is the industrial revolution", "output": "lex: Industrial Revolution history manufacturing 18th century\nlex: Industrial Revolution steam engine factories Britain\nvec: what was the Industrial Revolution and how did it transform manufacturing, society, and the economy\nvec: when and where did the Industrial Revolution begin and what were its major innovations and consequences\nhyde: The Industrial Revolution began in Britain around 1760-1840, transforming agrarian economies into industrial ones. Key innovations included the steam engine (James Watt), spinning jenny (textile production), and iron smelting with coke. Factories replaced cottage industries. Urbanization accelerated as workers moved to cities. It brought economic growth but also child labor, pollution, and harsh working conditions."}
+{"input": "what is sustainable forestry?", "output": "lex: sustainable forestry management practices\nlex: sustainable logging forest stewardship FSC\nvec: what is sustainable forestry and how does it balance timber harvesting with forest ecosystem health\nvec: what practices and certifications like FSC ensure forests are managed sustainably\nhyde: Sustainable forestry manages forests to meet current timber needs without compromising future generations' resources. Practices include selective logging (harvesting individual trees rather than clearcutting), replanting harvested areas, maintaining buffer zones near waterways, and preserving biodiversity corridors. The Forest Stewardship Council (FSC) certifies sustainably managed forests."}
+{"input": "what is character arc?", "output": "lex: character arc definition types fiction\nlex: character arc development flat dynamic transformation\nvec: what is a character arc in fiction and how do characters change throughout a story\nvec: what are the different types of character arcs including positive, negative, and flat arcs\nhyde: A character arc is the transformation a character undergoes from the beginning to the end of a story. In a positive arc, the character overcomes a flaw or false belief (e.g., Scrooge in A Christmas Carol). In a negative arc, they descend (Walter White in Breaking Bad). In a flat arc, the character's beliefs remain constant but they change the world around them."}
+{"input": "how to address ethical dilemmas in research", "output": "lex: ethical dilemmas research handling IRB\nlex: research ethics conflict resolution informed consent\nvec: how should researchers identify and address ethical dilemmas that arise during scientific studies\nvec: what frameworks and procedures help resolve ethical conflicts in academic and clinical research\nhyde: When facing an ethical dilemma in research, consult your IRB or ethics committee immediately. Common dilemmas include conflicts between maximizing data quality and minimizing participant burden, handling incidental findings, and balancing confidentiality with mandatory reporting obligations. Document your reasoning and decisions. The Belmont Report provides foundational guidance: respect for persons, beneficence, and justice."}
+{"input": "how to manage stress effectively", "output": "lex: manage stress effectively coping techniques\nlex: stress management relaxation anxiety reduction\nvec: what are evidence-based techniques for managing stress and reducing anxiety in daily life\nvec: how can you manage chronic stress through exercise, mindfulness, and lifestyle changes\nhyde: Effective stress management combines multiple approaches. Exercise 30 minutes daily—even walking reduces cortisol. Practice diaphragmatic breathing: inhale 4 counts, hold 4, exhale 6. Limit caffeine after noon. Maintain consistent sleep and wake times. Cognitive reframing: identify catastrophic thoughts and replace them with realistic assessments. Social connection is protective—schedule regular time with supportive people."}
+{"input": "how does the philosophy of science address scientific change", "output": "lex: philosophy of science scientific change paradigm shift\nlex: Kuhn paradigm revolution Popper falsification Lakatos\nvec: how do philosophers of science like Kuhn, Popper, and Lakatos explain scientific revolutions and theory change\nvec: what does the philosophy of science say about how scientific knowledge evolves and paradigms shift\nhyde: Thomas Kuhn argued science progresses through paradigm shifts: periods of \"normal science\" within an accepted framework are punctuated by revolutionary crises when anomalies accumulate. Karl Popper proposed that science advances through falsification—theories must be testable and those that survive rigorous attempts at refutation are provisionally accepted. Lakatos offered a middle ground with his research programme methodology."}
+{"input": "what are the rituals of judaism", "output": "lex: Judaism rituals practices observances\nlex: Jewish rituals Shabbat Passover bar mitzvah kosher\nvec: what are the major rituals and religious observances in Judaism\nvec: how do Jewish rituals like Shabbat, Passover, and bar/bat mitzvah mark life and calendar events\nhyde: Key Jewish rituals include Shabbat (weekly rest from Friday sunset to Saturday night with candle lighting, kiddush, and challah), the Passover seder (retelling the Exodus), Yom Kippur fasting, circumcision (brit milah) on the 8th day, bar/bat mitzvah at 13/12, and daily prayer (Shacharit, Mincha, Ma'ariv). Keeping kosher governs dietary laws separating meat and dairy."}
+{"input": "how do scientists communicate their findings", "output": "lex: scientists communicate findings publications\nlex: scientific communication peer review journal conference\nvec: how do scientists share and publish their research findings with the scientific community and public\nvec: what are the channels scientists use to communicate results including journals, conferences, and preprints\nhyde: Scientists communicate findings through peer-reviewed journal articles (the gold standard), conference presentations (talks and posters), and preprint servers like arXiv and bioRxiv for rapid dissemination. The publication process involves writing a manuscript, submitting to a journal, peer review by 2-3 experts, revision, and acceptance. Increasingly, scientists also use social media and press releases to reach the public."}
+{"input": "mock test", "output": "lex: mock test practice exam preparation\nlex: mock exam sample questions test prep\nlex: practice test online free exam\nvec: how do you use mock tests and practice exams to prepare for standardized tests and certifications\nvec: where can you find free mock tests and practice exams for tests like SAT, GRE, or professional certifications\nhyde: Mock tests simulate real exam conditions—same time limits, question types, and format. Take full-length practice tests under timed conditions every 1-2 weeks during preparation. Review every wrong answer to identify weak areas. Free mock tests are available on Khan Academy (SAT), ETS (GRE), and official certification body websites. Score trends across mock tests predict actual performance."}
+{"input": "what is the purpose of foreshadowing?", "output": "lex: foreshadowing purpose literary device fiction\nlex: foreshadowing examples narrative technique\nvec: what is the purpose of foreshadowing in literature and how do authors use it to build suspense\nvec: how does foreshadowing create anticipation and cohesion in a story's plot\nhyde: Foreshadowing plants clues or hints about future events in a narrative, building suspense and making plot developments feel earned rather than arbitrary. Chekhov's gun principle—if a gun appears in Act 1, it must fire by Act 3—is a classic example. Effective foreshadowing is subtle enough to miss on first reading but obvious in retrospect, rewarding rereading."}
+{"input": "what is trail running?", "output": "lex: trail running off-road terrain\nlex: trail running shoes gear technique\nvec: what is trail running and how does it differ from road running\nvec: what gear, technique, and training do you need for trail running on off-road terrain\nhyde: Trail running is running on unpaved surfaces—dirt paths, mountain trails, forest tracks, and rocky terrain. Unlike road running, it requires navigating elevation changes, uneven footing, and obstacles. Use trail shoes with aggressive lugs for grip and rock plates for protection. Shorten your stride on technical terrain. Popular distances range from 5K to ultramarathons (50+ miles)."}
+{"input": "what was the impact of the cold war?", "output": "lex: Cold War impact consequences effects\nlex: Cold War legacy geopolitics nuclear arms race\nvec: what were the major political, social, and economic impacts of the Cold War on the world\nvec: how did the Cold War shape international relations, the nuclear arms race, and proxy conflicts\nhyde: The Cold War (1947-1991) divided the world into Western (NATO) and Eastern (Warsaw Pact) blocs. Its impacts include the nuclear arms race (peaking at 70,000+ warheads), proxy wars in Korea, Vietnam, and Afghanistan, the Space Race, decolonization movements influenced by superpower competition, and the eventual collapse of the Soviet Union in 1991 leading to U.S. unipolarity."}
+{"input": "street photography ethics", "output": "lex: street photography ethics legal rights\nlex: street photography consent privacy public space\nvec: what are the ethical considerations and legal rights involved in street photography\nvec: is it ethical to photograph strangers in public and what are the legal rules around street photography\nhyde: In most countries, photographing people in public spaces is legally permitted since there is no expectation of privacy. However, ethical street photographers follow principles: avoid exploiting vulnerable people, don't photograph children without parental awareness, respect requests to delete images, and consider whether the image dignifies or demeans the subject. Some photographers adopt a \"golden rule\" approach."}
+{"input": "vitosha mountain", "output": "lex: Vitosha mountain Sofia Bulgaria\nlex: Vitosha hiking trails Cherni Vrah peak\nvec: what are the hiking trails and attractions on Vitosha mountain near Sofia, Bulgaria\nvec: what is Vitosha mountain and what outdoor activities are available in Vitosha Nature Park\nhyde: Vitosha is a mountain massif on the outskirts of Sofia, Bulgaria, reaching 2,290m at Cherni Vrah (Black Peak). Vitosha Nature Park offers hiking trails, ski runs at Aleko, and the Boyana Waterfall. The golden bridges stone river is a popular landmark. Access from Sofia takes 30 minutes by car or bus. The mountain is a popular day trip for Sofia residents year-round."}
+{"input": "what is an anthology?", "output": "lex: anthology definition literary collection\nlex: anthology book short stories poems collected works\nvec: what is an anthology and how are literary anthologies compiled and organized\nvec: what types of works are typically collected in an anthology such as short stories, poems, or essays\nhyde: An anthology is a curated collection of literary works—short stories, poems, essays, or excerpts—by various authors, assembled around a common theme, genre, or time period. Editors select and arrange pieces to create a coherent reading experience. Examples include The Norton Anthology of English Literature and Best American Short Stories, published annually."}
+{"input": "what is the significance of the yom kippur?", "output": "lex: Yom Kippur significance Jewish holy day\nlex: Yom Kippur Day of Atonement fasting prayer\nvec: what is Yom Kippur and why is it the most significant holy day in Judaism\nvec: how do Jewish people observe Yom Kippur through fasting, prayer, and repentance\nhyde: Yom Kippur (Day of Atonement) is the holiest day in Judaism, falling on the 10th of Tishrei. Observers fast for 25 hours from sunset to sunset, abstaining from food, water, leather shoes, and bathing. The day is spent in synagogue prayer, including the Kol Nidre service and the Neilah closing prayer. It is a day of repentance (teshuvah) for sins against God, concluding the ten Days of Awe."}
+{"input": "what is clean camping?", "output": "lex: clean camping Leave No Trace principles\nlex: clean camping eco-friendly minimal impact\nvec: what is clean camping and how do you minimize your environmental impact while camping outdoors\nvec: what are the Leave No Trace principles and how do they apply to clean camping practices\nhyde: Clean camping follows Leave No Trace principles: plan ahead, travel on durable surfaces, dispose of waste properly, leave what you find, minimize campfire impact, respect wildlife, and be considerate of others. Pack out all trash including food scraps. Use biodegradable soap 200 feet from water sources. Dig catholes 6-8 inches deep for human waste. Leave campsites cleaner than you found them."}
+{"input": "how to evaluate scientific claims critically", "output": "lex: evaluate scientific claims critical thinking\nlex: scientific literacy evidence evaluation peer review\nvec: how do you critically evaluate scientific claims and distinguish credible research from misinformation\nvec: what criteria should you use to assess whether a scientific study's conclusions are reliable\nhyde: Check the source: is it published in a peer-reviewed journal? Look for sample size, control groups, and statistical significance (p < 0.05). Distinguish correlation from causation. Check if results have been replicated by independent researchers. Evaluate conflicts of interest and funding sources. Be skeptical of single studies—look for systematic reviews and meta-analyses that synthesize multiple studies."}
+{"input": "what is the significance of song in worship?", "output": "lex: song worship significance religious singing\nlex: worship music congregational singing hymns praise\nvec: what role does congregational singing and worship music play in religious services\nvec: why is song considered a significant form of spiritual expression and communal worship across faiths\nhyde: Singing in worship engages the whole person—body, mind, and emotions—in ways that spoken word alone cannot. Neuroscience shows group singing synchronizes heart rates and releases oxytocin, fostering communal bonding. In Christian worship, hymns reinforce theology through memorable lyrics. The Psalms themselves are songs, and Paul urged believers to address one another \"in psalms, hymns, and spiritual songs\" (Ephesians 5:19)."}
+{"input": "what is the significance of algae in ecosystems", "output": "lex: algae ecosystem role food chain\nlex: algae oxygen production aquatic ecosystems\nlex: algae photosynthesis carbon cycle\nvec: what role do algae play in aquatic and marine ecosystems\nvec: how do algae contribute to oxygen production and food webs\nhyde: Algae produce approximately 50% of the world's oxygen through photosynthesis and form the base of aquatic food chains. Phytoplankton, a type of microalgae, supports marine ecosystems by providing energy to zooplankton, fish, and larger organisms."}
+{"input": "how to train for a marathon", "output": "lex: marathon training plan schedule\nlex: long distance running program beginner\nlex: marathon race preparation mileage\nvec: what is a good training plan for running a first marathon\nvec: how to build weekly mileage for marathon race preparation\nhyde: A typical 16-week marathon training plan starts with a base of 15-20 miles per week, gradually increasing the long run by 1-2 miles each week. Include easy runs, tempo runs at marathon pace, and one rest day. Taper volume 2-3 weeks before race day."}
+{"input": "how to handle a child's tantrum in public?", "output": "lex: child tantrum public calm techniques\nlex: toddler meltdown coping strategies\nvec: what are effective ways to calm a toddler having a tantrum in a public place\nvec: how should parents respond when their child has a meltdown in a store or restaurant\nhyde: When your child has a tantrum in public, stay calm and speak in a low, steady voice. Get down to their eye level, acknowledge their feelings, and offer simple choices. If needed, move to a quieter spot and wait for the intensity to pass before addressing the behavior."}
+{"input": "how to invest in index funds", "output": "lex: index fund investing brokerage account\nlex: S&P 500 index fund buy shares\nlex: passive investing index ETF\nvec: how to open a brokerage account and buy index funds for long-term investing\nvec: what are the steps to start investing in S&P 500 or total market index funds\nhyde: To invest in index funds, open a brokerage account with a provider like Vanguard, Fidelity, or Schwab. Choose a broad market index fund such as VTSAX or an S&P 500 ETF like VOO. Set up automatic contributions and reinvest dividends for compound growth."}
+{"input": "what is data science", "output": "lex: data science statistics machine learning\nlex: data science analysis programming Python R\nvec: what does data science involve and what skills are needed to work in the field\nvec: how does data science combine statistics, programming, and domain knowledge\nhyde: Data science is an interdisciplinary field that uses statistical methods, machine learning algorithms, and programming to extract insights from structured and unstructured data. Practitioners typically work with Python or R, use tools like pandas and scikit-learn, and apply techniques such as regression, classification, and clustering."}
+{"input": "how to improve concentration skills?", "output": "lex: improve focus concentration techniques\nlex: attention span exercises deep work\nvec: what are practical techniques to improve focus and concentration during work or study\nvec: how can I train my brain to maintain attention for longer periods\nhyde: To improve concentration, try the Pomodoro technique: work for 25 minutes, then take a 5-minute break. Eliminate distractions by silencing notifications and using website blockers. Regular exercise, adequate sleep, and mindfulness meditation have all been shown to increase sustained attention."}
+{"input": "how to participate in earth hour?", "output": "lex: Earth Hour participation lights off event\nlex: Earth Hour date 2026 how to join\nvec: how do I participate in the annual Earth Hour lights-off event\nvec: what can individuals and businesses do during Earth Hour to show support\nhyde: Earth Hour takes place on the last Saturday of March each year. To participate, turn off all non-essential lights for one hour starting at 8:30 PM local time. You can also share your participation on social media using #EarthHour and organize community events."}
+{"input": "what are nanotechnologies", "output": "lex: nanotechnology nanomaterials nanoscale engineering\nlex: nanotech applications medicine electronics\nvec: what is nanotechnology and how are nanoscale materials used in different industries\nvec: what are the main applications of nanotechnology in medicine and electronics\nhyde: Nanotechnology involves manipulating matter at the nanoscale, typically between 1 and 100 nanometers. Applications include targeted drug delivery using nanoparticles, carbon nanotube transistors in electronics, and nanocoatings that repel water and resist corrosion."}
+{"input": "how to create a color palette for painting?", "output": "lex: color palette painting color theory\nlex: mixing paint colors warm cool complementary\nvec: how do artists create a cohesive color palette for a painting using color theory\nvec: what techniques help choose harmonious paint colors for an artwork\nhyde: Start with a limited palette of 4-6 colors: a warm and cool version of each primary (e.g., cadmium yellow, lemon yellow, ultramarine blue, cerulean blue, alizarin crimson, cadmium red). Mix swatches to map out your range. Use complementary colors for contrast and analogous colors for harmony."}
+{"input": "how to make homemade pasta", "output": "lex: homemade pasta recipe dough eggs flour\nlex: fresh pasta making rolling cutting\nvec: what is the recipe and technique for making fresh pasta dough from scratch\nvec: how to roll and cut homemade pasta without a pasta machine\nhyde: Combine 2 cups of 00 flour with 3 large eggs on a clean surface. Knead the dough for 8-10 minutes until smooth and elastic. Wrap in plastic and rest for 30 minutes. Roll out thin with a rolling pin or pasta machine, then cut into desired shapes like fettuccine or tagliatelle."}
+{"input": "how to reduce stress", "output": "lex: stress reduction techniques relaxation\nlex: manage stress exercise meditation breathing\nvec: what are effective daily habits for reducing stress and improving mental health\nvec: how can breathing exercises and physical activity help lower stress levels\nhyde: Regular physical activity releases endorphins that naturally reduce stress. Practice deep breathing: inhale for 4 counts, hold for 4, exhale for 6. Other effective strategies include progressive muscle relaxation, journaling, limiting caffeine, and maintaining a consistent sleep schedule of 7-9 hours."}
+{"input": "how to develop a research hypothesis", "output": "lex: research hypothesis formulation testable\nlex: hypothesis writing independent dependent variable\nvec: how do you write a clear and testable research hypothesis for a study\nvec: what are the steps to develop a hypothesis from a research question\nhyde: A research hypothesis is a specific, testable prediction about the relationship between variables. Start by identifying your research question, then review existing literature. Formulate the hypothesis as an if-then or directional statement, clearly defining the independent and dependent variables."}
+{"input": "what is social contract theory", "output": "lex: social contract theory Hobbes Locke Rousseau\nlex: social contract political philosophy government legitimacy\nvec: what is social contract theory and how did Hobbes, Locke, and Rousseau differ in their views\nvec: how does social contract theory explain the legitimacy of government authority\nhyde: Social contract theory proposes that individuals consent, either explicitly or tacitly, to surrender some freedoms to a governing authority in exchange for social order. Hobbes argued for an absolute sovereign, Locke emphasized natural rights and limited government, and Rousseau stressed the general will of the people."}
+{"input": "code share", "output": "lex: code sharing platform snippet pastebin\nlex: codeshare live collaborative editor\nlex: share code online GitHub Gist\nvec: what are the best platforms for sharing code snippets with others online\nvec: how to share code collaboratively in real time with another developer\nhyde: CodeShare.io is a free online editor for sharing code in real time. Paste or type your code, share the generated URL, and others can view or edit simultaneously. For permanent sharing, GitHub Gists let you create public or secret snippets with syntax highlighting and version history."}
+{"input": "what is the significance of the american revolution", "output": "lex: American Revolution significance independence 1776\nlex: American Revolution impact democracy constitutional government\nvec: why was the American Revolution historically significant for democracy and self-governance\nvec: how did the American Revolution influence other independence movements worldwide\nhyde: The American Revolution (1775-1783) established the United States as an independent nation and introduced a constitutional republic based on Enlightenment principles. The Declaration of Independence asserted natural rights, and the resulting Constitution created a framework of representative government that influenced the French Revolution and Latin American independence movements."}
+{"input": "how to understand political ideologies", "output": "lex: political ideologies left right spectrum\nlex: liberalism conservatism socialism political theory\nvec: how can someone learn about different political ideologies and where they fall on the spectrum\nvec: what are the main differences between liberalism, conservatism, socialism, and libertarianism\nhyde: Political ideologies are organized systems of beliefs about governance and society. The left-right spectrum places socialism and progressivism on the left, emphasizing equality and collective action, while conservatism and libertarianism sit on the right, prioritizing individual freedom and tradition. Each ideology has distinct views on the role of government, economics, and social policy."}
+{"input": "how to build confidence in social situations?", "output": "lex: social confidence building shyness overcome\nlex: social anxiety tips conversation skills\nvec: what are practical steps to feel more confident when talking to people at social events\nvec: how can someone overcome social anxiety and build self-confidence in group settings\nhyde: Start small: make eye contact and greet one new person at each event. Prepare a few open-ended questions in advance. Focus on listening rather than performing. After each interaction, note what went well. Gradual exposure reduces anxiety over time—the more you practice, the more natural conversations become."}
+{"input": "what to pack for a day hike", "output": "lex: day hike packing list gear essentials\nlex: hiking backpack water food first aid\nvec: what should I bring in my backpack for a day hike in the mountains\nvec: what are the essential items to pack for a full-day hiking trip\nhyde: Day hike essentials: 2 liters of water, trail snacks (nuts, bars, fruit), map or GPS device, sun protection (hat, sunscreen, sunglasses), first aid kit, rain layer, extra warm layer, headlamp, and a fully charged phone. Wear moisture-wicking layers and broken-in hiking boots."}
+{"input": "what is digital collage art?", "output": "lex: digital collage art Photoshop mixed media\nlex: digital collage techniques layers composition\nvec: what is digital collage art and how is it created using software\nvec: what tools and techniques do artists use to make digital collages\nhyde: Digital collage art combines photographs, illustrations, textures, and graphic elements assembled in software like Photoshop, Procreate, or Canva. Artists layer, mask, blend, and transform images to create surreal or thematic compositions. Unlike physical collage, digital tools allow non-destructive editing and infinite experimentation with scale and color."}
+{"input": "how to fix a car radiator leak?", "output": "lex: car radiator leak repair fix sealant\nlex: radiator hose replacement coolant leak\nvec: how to diagnose and fix a leaking car radiator or radiator hose\nvec: can radiator stop-leak sealant permanently fix a small coolant leak\nhyde: For a small radiator leak, a stop-leak product like Bar's Leaks can provide a temporary fix. Add it to the coolant reservoir and run the engine. For permanent repair, locate the leak by pressurizing the cooling system, then either solder the radiator, replace the damaged hose, or install a new radiator if the damage is severe."}
+{"input": "where to buy saffron", "output": "lex: buy saffron threads online spice shop\nlex: saffron purchase quality grade price\nvec: where is the best place to buy high-quality saffron threads online or in stores\nvec: how to find genuine saffron and avoid counterfeit or adulterated products\nhyde: Buy saffron from reputable spice retailers like Penzeys, Burlap & Barrel, or specialty grocery stores. Look for grade 1 (Sargol or Negin) Iranian or Spanish saffron. Expect to pay $8-15 per gram. Avoid suspiciously cheap saffron—it may be dyed safflower or corn silk."}
+{"input": "what is mahayana buddhism", "output": "lex: Mahayana Buddhism bodhisattva teachings\nlex: Mahayana vs Theravada Buddhism sutras\nvec: what are the core beliefs and practices of Mahayana Buddhism\nvec: how does Mahayana Buddhism differ from Theravada Buddhism\nhyde: Mahayana Buddhism, the \"Great Vehicle,\" emerged around the 1st century CE and emphasizes the bodhisattva ideal—the aspiration to attain enlightenment for the benefit of all sentient beings, not just oneself. Key texts include the Heart Sutra and Lotus Sutra. Major traditions include Zen, Pure Land, and Tibetan Buddhism."}
+{"input": "what is utilitarianism in ethics", "output": "lex: utilitarianism ethics greatest happiness principle\nlex: utilitarianism Bentham Mill consequentialism\nvec: what is utilitarianism and how does it determine right and wrong actions\nvec: how did Jeremy Bentham and John Stuart Mill develop utilitarian ethics\nhyde: Utilitarianism is a consequentialist ethical theory holding that the morally right action is the one that produces the greatest happiness for the greatest number. Jeremy Bentham proposed a quantitative \"felicific calculus,\" while John Stuart Mill distinguished between higher and lower pleasures, arguing quality of happiness matters as much as quantity."}
+{"input": "what is climate change?", "output": "lex: climate change global warming greenhouse gases\nlex: climate change causes effects CO2 emissions\nvec: what causes climate change and what are its effects on the planet\nvec: how do greenhouse gas emissions from human activity drive global warming\nhyde: Climate change refers to long-term shifts in global temperatures and weather patterns. Since the Industrial Revolution, burning fossil fuels has released carbon dioxide and methane, trapping heat in the atmosphere. This has caused average global temperatures to rise by about 1.1°C, leading to melting ice caps, rising sea levels, and more extreme weather events."}
+{"input": "what is the difference between positive and negative rights", "output": "lex: positive rights negative rights difference\nlex: positive negative rights examples entitlements liberties\nvec: what is the distinction between positive and negative rights in political philosophy\nvec: can you explain positive rights versus negative rights with examples\nhyde: Negative rights require others to refrain from interfering—examples include freedom of speech, the right to privacy, and freedom from torture. Positive rights require others to provide something—examples include the right to education, healthcare, or a minimum standard of living. The distinction is central to debates between libertarians and welfare-state advocates."}
+{"input": "what causes migraines", "output": "lex: migraine causes triggers brain\nlex: migraine headache serotonin vascular nerve\nvec: what are the biological causes and common triggers of migraine headaches\nvec: why do some people get migraines and what happens in the brain during one\nhyde: Migraines involve abnormal brain activity affecting nerve signals, chemicals, and blood vessels. Cortical spreading depression—a wave of electrical activity across the cortex—triggers the trigeminal nerve, releasing inflammatory peptides. Common triggers include stress, hormonal changes, certain foods (aged cheese, alcohol), sleep disruption, and bright lights."}
+{"input": "how to talk to kids about bullying?", "output": "lex: talk children bullying conversation advice\nlex: kids bullying prevention parent discussion\nvec: how should parents talk to their children about bullying at school\nvec: what are age-appropriate ways to discuss bullying with kids and help them respond\nhyde: Start the conversation calmly by asking open-ended questions: \"Has anyone at school been mean to you or someone else?\" Listen without overreacting. Teach your child to say \"Stop, I don't like that\" firmly, walk away, and tell a trusted adult. Role-play scenarios so they can practice responses."}
+{"input": "when to replace windshield wipers?", "output": "lex: replace windshield wipers signs worn\nlex: wiper blade replacement frequency lifespan\nvec: how often should windshield wipers be replaced and what are signs they need changing\nvec: what are the signs that windshield wiper blades are worn out and need replacement\nhyde: Replace windshield wipers every 6-12 months or when you notice streaking, skipping, squeaking, or smearing. Inspect the rubber edge for cracks, tears, or stiffness. If wipers leave unwiped areas or chatter across the glass, it's time for new blades. Extreme heat and cold accelerate deterioration."}
+{"input": "how to aerate lawn manually?", "output": "lex: aerate lawn manually core aeration fork\nlex: lawn aeration by hand spike tool\nvec: how to aerate a lawn by hand without a machine using a garden fork or manual aerator\nvec: what is the best technique for manually aerating compacted soil in a yard\nhyde: To aerate manually, push a garden fork or manual core aerator into the soil every 4-6 inches, rocking it slightly to loosen the earth. Work in rows across the lawn. The best time to aerate is early fall for cool-season grasses or late spring for warm-season grasses. Water the lawn the day before to soften the soil."}
+{"input": "how to improve business communication", "output": "lex: business communication skills effective workplace\nlex: professional email writing clear messaging\nvec: how can employees improve their written and verbal communication skills at work\nvec: what techniques make business emails and presentations clearer and more effective\nhyde: Effective business communication starts with clarity: state the purpose in the first sentence, use short paragraphs, and include a clear call to action. In meetings, summarize key points and assign action items. Avoid jargon when possible. Active listening—paraphrasing what others say—builds rapport and reduces misunderstandings."}
+{"input": "how to manage anxiety naturally", "output": "lex: manage anxiety natural remedies without medication\nlex: anxiety relief breathing exercise meditation\nvec: what are natural ways to manage anxiety without medication\nvec: how can exercise, breathing techniques, and lifestyle changes reduce anxiety symptoms\nhyde: Natural anxiety management includes regular aerobic exercise (30 minutes, 5 days a week), diaphragmatic breathing, progressive muscle relaxation, and limiting caffeine and alcohol. Cognitive behavioral techniques like thought journaling help identify and challenge anxious thinking patterns. Herbal supplements such as chamomile and ashwagandha show some evidence of benefit."}
+{"input": "how to draft a lease agreement", "output": "lex: lease agreement draft template rental\nlex: residential lease contract terms clauses\nvec: what should be included when drafting a residential lease agreement\nvec: how to write a legally sound rental lease agreement between landlord and tenant\nhyde: A residential lease agreement should include: names of landlord and tenant, property address, lease term (start/end dates), monthly rent amount and due date, security deposit amount and return conditions, maintenance responsibilities, pet policy, late fee terms, and termination/renewal clauses. Both parties should sign and retain copies."}
+{"input": "what is burnout?", "output": "lex: burnout syndrome workplace exhaustion\nlex: burnout symptoms causes recovery\nvec: what is burnout and what are its symptoms, causes, and effects on health\nvec: how does chronic work stress lead to burnout and what does it feel like\nhyde: Burnout is a state of chronic physical and emotional exhaustion caused by prolonged stress, typically work-related. The WHO classifies it by three dimensions: energy depletion, increased mental distance or cynicism toward one's job, and reduced professional efficacy. Symptoms include fatigue, insomnia, irritability, and difficulty concentrating."}
+{"input": "how to let go of negative thoughts?", "output": "lex: let go negative thoughts techniques\nlex: negative thinking patterns CBT mindfulness\nvec: how to stop dwelling on negative thoughts and break rumination cycles\nvec: what mindfulness or cognitive techniques help release negative thinking\nhyde: To let go of negative thoughts, practice cognitive defusion: observe the thought without engaging it, label it (\"I'm having the thought that...\"), and let it pass like a cloud. Mindfulness meditation trains this skill. Write recurring worries in a journal, then close it—this externalizes them. Challenge distortions by asking: \"Is this thought based on facts or assumptions?\""}
+{"input": "how to brew the perfect cup of tea", "output": "lex: brew tea temperature steep time\nlex: tea brewing method loose leaf\nvec: what are the correct water temperatures and steeping times for different types of tea\nvec: how to brew loose leaf tea properly for the best flavor\nhyde: Water temperature and steep time vary by tea type. Black tea: 200-212°F for 3-5 minutes. Green tea: 160-180°F for 2-3 minutes. White tea: 160-185°F for 4-5 minutes. Oolong: 185-205°F for 3-5 minutes. Use 1 teaspoon of loose leaf per 8 oz cup. Pre-warm the teapot with hot water for consistent extraction."}
+{"input": "what is anarchism", "output": "lex: anarchism political philosophy anti-state\nlex: anarchism theory Kropotkin Bakunin mutual aid\nvec: what is anarchism as a political philosophy and what do anarchists believe\nvec: how do different branches of anarchism envision a society without government\nhyde: Anarchism is a political philosophy that rejects involuntary, coercive hierarchy—particularly the state—and advocates for voluntary, cooperative social organization. Major branches include anarcho-communism (Kropotkin), which envisions communal ownership, anarcho-syndicalism, which organizes through labor unions, and individualist anarchism, which emphasizes personal autonomy."}
+{"input": "how to stay motivated daily?", "output": "lex: daily motivation habits discipline routine\nlex: stay motivated goals productivity tips\nvec: what are practical strategies to stay motivated and productive every day\nvec: how to maintain motivation when working toward long-term goals\nhyde: Set one clear priority each morning rather than a long to-do list. Break large goals into small daily tasks. Track streaks—visual progress reinforces consistency. Pair difficult tasks with rewards. On low-motivation days, commit to just 5 minutes; starting is the hardest part, and momentum usually follows."}
+{"input": "list sort", "output": "lex: sort list programming algorithm\nlex: list sort Python Java ascending descending\nlex: array sorting methods comparison\nvec: how to sort a list or array in different programming languages\nvec: what sorting algorithms are used for lists and how do they compare in performance\nhyde: In Python, sort a list in-place with list.sort() or return a new sorted list with sorted(). Use key= for custom sorting: sorted(items, key=lambda x: x.name). In Java, use Collections.sort() or List.sort(). Common algorithms include quicksort (O(n log n) average), mergesort (stable, O(n log n)), and timsort (Python/Java default)."}
+{"input": "what was the renaissance period", "output": "lex: Renaissance period 14th-17th century Europe\nlex: Renaissance art culture Florence rebirth\nvec: what was the Renaissance period and why was it significant in European history\nvec: how did the Renaissance transform art, science, and culture in Europe\nhyde: The Renaissance (14th-17th century) was a cultural movement that began in Florence, Italy, marking the transition from the medieval period to modernity. It saw a revival of classical Greek and Roman art and philosophy. Key figures include Leonardo da Vinci, Michelangelo, and Galileo. The invention of the printing press accelerated the spread of new ideas across Europe."}
+{"input": "what is a smart thermostat?", "output": "lex: smart thermostat WiFi programmable Nest Ecobee\nlex: smart thermostat energy savings features\nvec: what is a smart thermostat and how does it save energy compared to a regular thermostat\nvec: how do smart thermostats like Nest and Ecobee learn and control home temperature\nhyde: A smart thermostat connects to WiFi and can be controlled via a smartphone app. Models like the Nest Learning Thermostat and Ecobee use sensors and machine learning to build a schedule based on your habits. They adjust heating and cooling automatically, reducing energy use by 10-15% on average compared to standard programmable thermostats."}
+{"input": "what is the great barrier reef", "output": "lex: Great Barrier Reef Australia coral ecosystem\nlex: Great Barrier Reef marine biodiversity coral bleaching\nvec: what is the Great Barrier Reef and why is it important for marine biodiversity\nvec: where is the Great Barrier Reef located and what threats does it face\nhyde: The Great Barrier Reef, off the coast of Queensland, Australia, is the world's largest coral reef system, stretching over 2,300 kilometers. It comprises nearly 3,000 individual reef systems and supports over 1,500 fish species, 400 coral species, and 30 species of whales and dolphins. Coral bleaching from rising ocean temperatures is its greatest threat."}
+{"input": "what is the significance of the sacred heart?", "output": "lex: Sacred Heart Jesus Catholic devotion\nlex: Sacred Heart significance symbolism Christianity\nvec: what does the Sacred Heart of Jesus symbolize in Catholic tradition\nvec: what is the history and religious significance of devotion to the Sacred Heart\nhyde: The Sacred Heart is a devotional image in Catholicism representing Jesus Christ's divine love for humanity. Popularized by St. Margaret Mary Alacoque's 17th-century visions, it depicts Christ's heart surrounded by a crown of thorns, flames, and a cross. The feast of the Sacred Heart is celebrated 19 days after Pentecost."}
+{"input": "what is survival camping?", "output": "lex: survival camping wilderness skills bushcraft\nlex: survival camping gear shelter fire water\nvec: what is survival camping and what skills do you need to camp with minimal gear\nvec: how to prepare for a survival camping trip in the wilderness\nhyde: Survival camping means spending time outdoors with minimal or no modern gear, relying on wilderness skills. Core skills include building a debris shelter, starting fire with a ferro rod or bow drill, purifying water by boiling or filtering, navigating with a map and compass, and foraging or trapping for food."}
+{"input": "how to fix wifi connection dropping", "output": "lex: WiFi dropping connection fix troubleshoot\nlex: WiFi disconnecting frequently router reset\nvec: how to troubleshoot a WiFi connection that keeps dropping or disconnecting\nvec: why does my WiFi keep cutting out and how do I fix it\nhyde: If your WiFi keeps dropping, try these steps: 1) Restart your router and modem by unplugging for 30 seconds. 2) Move closer to the router or remove obstructions. 3) Change the WiFi channel in router settings to reduce interference. 4) Update router firmware. 5) Check for driver updates on your device. 6) Disable power-saving mode for your wireless adapter."}
+{"input": "what are the key elements of horror writing?", "output": "lex: horror writing elements techniques atmosphere\nlex: horror fiction suspense tension dread\nvec: what literary elements and techniques make horror writing effective\nvec: how do horror authors create suspense, tension, and fear in their stories\nhyde: Effective horror writing relies on atmosphere, pacing, and the unknown. Build dread through setting—dark, isolated, claustrophobic spaces. Use sensory details to ground the reader. Withhold information: what the reader imagines is scarier than what you show. Escalate tension gradually, then release it with a shock. Relatable characters make the stakes feel real."}
+{"input": "what is the importance of free press", "output": "lex: free press importance democracy journalism\nlex: freedom of press First Amendment accountability\nvec: why is a free press important for democracy and holding governments accountable\nvec: what role does press freedom play in protecting civil liberties and public information\nhyde: A free press serves as a watchdog on government and powerful institutions, exposing corruption, fraud, and abuse. The First Amendment protects press freedom in the United States. Without it, citizens lack access to independent information needed to make informed decisions. Countries with restricted press freedoms consistently rank lower on democracy indices."}
+{"input": "what are the best national parks?", "output": "lex: best national parks USA visit\nlex: top national parks Yellowstone Yosemite Zion\nvec: what are the most popular and scenic national parks to visit in the United States\nvec: which national parks offer the best hiking, scenery, and wildlife experiences\nhyde: Top US national parks include Yellowstone (geysers, wildlife), Yosemite (granite cliffs, waterfalls), Grand Canyon (layered red rock), Zion (slot canyons, river hikes), Glacier (pristine alpine lakes), and Acadia (Atlantic coastline). Visit during shoulder season (May or September) for fewer crowds and pleasant weather."}
+{"input": "what is deconstruction", "output": "lex: deconstruction Derrida literary theory philosophy\nlex: deconstruction meaning binary oppositions text\nvec: what is deconstruction in philosophy and literary theory as developed by Jacques Derrida\nvec: how does deconstructionist analysis challenge fixed meaning in texts\nhyde: Deconstruction, associated with Jacques Derrida, is a method of critical analysis that examines how meaning in texts is constructed through binary oppositions (speech/writing, presence/absence). Derrida argued that meaning is never fixed; it is always deferred through a chain of signifiers. Deconstruction reveals the internal contradictions and assumptions hidden within texts."}
+{"input": "how to repair a leaky faucet", "output": "lex: leaky faucet repair fix dripping\nlex: faucet washer O-ring cartridge replacement\nvec: how to fix a dripping faucet by replacing the washer or cartridge\nvec: what are the step-by-step instructions for repairing a leaky kitchen or bathroom faucet\nhyde: Turn off the water supply valves under the sink. Remove the faucet handle by unscrewing the decorative cap and handle screw. Pull out the stem or cartridge. For compression faucets, replace the rubber washer and O-ring. For cartridge faucets, replace the entire cartridge. Reassemble, turn the water back on, and test for leaks."}
+{"input": "what is the significance of the ganges river in hinduism?", "output": "lex: Ganges River Hinduism sacred significance\nlex: Ganga river Hindu rituals purification\nvec: why is the Ganges River considered sacred in Hinduism\nvec: what religious rituals and beliefs are associated with the Ganges in Hindu tradition\nhyde: The Ganges (Ganga) is Hinduism's holiest river, personified as the goddess Ganga. Hindus believe bathing in the Ganges washes away sins and that immersing ashes of the dead in the river frees the soul from the cycle of rebirth. The cities of Varanasi and Haridwar along the Ganges host major pilgrimage sites and cremation ghats."}
+{"input": "best places to buy bonsai trees", "output": "lex: buy bonsai trees online nursery shop\nlex: bonsai tree purchase quality species\nvec: where are the best places to buy bonsai trees online or at local nurseries\nvec: which online retailers and nurseries sell high-quality bonsai trees for beginners\nhyde: Reputable bonsai retailers include Bonsai Boy of New York, Brussel's Bonsai, and Eastern Leaf (online). Local bonsai nurseries and Japanese garden shops often carry better-quality specimens. For beginners, start with hardy species like Chinese elm, ficus, or juniper. Expect to pay $30-80 for a quality starter tree."}
+{"input": "what are the principles of physics", "output": "lex: physics principles fundamental laws\nlex: Newton's laws thermodynamics relativity quantum\nvec: what are the fundamental principles and laws of physics\nvec: how do Newton's laws, thermodynamics, and quantum mechanics form the foundations of physics\nhyde: The fundamental principles of physics include Newton's three laws of motion, the law of universal gravitation, the laws of thermodynamics (energy conservation, entropy), Maxwell's equations for electromagnetism, Einstein's special and general relativity, and quantum mechanics. These describe how matter, energy, space, and time interact at all scales."}
+{"input": "how to optimize website for seo", "output": "lex: SEO optimization website search engine ranking\nlex: on-page SEO meta tags keywords content\nvec: what are the key steps to optimize a website for search engine rankings\nvec: how to improve on-page and technical SEO for better Google search results\nhyde: On-page SEO: use target keywords in title tags, H1 headings, and meta descriptions. Write unique, high-quality content over 1,000 words. Optimize images with alt text and compression. Technical SEO: ensure fast page load times (under 3 seconds), mobile responsiveness, HTTPS, clean URL structure, and an XML sitemap submitted to Google Search Console."}
+{"input": "what are the sacred texts of buddhism", "output": "lex: Buddhist sacred texts scriptures Tripitaka\nlex: Buddhism sutras Pali Canon Mahayana texts\nvec: what are the main sacred texts and scriptures of Buddhism\nvec: how do the Pali Canon and Mahayana sutras differ as Buddhist scriptures\nhyde: The primary Buddhist scripture is the Tripitaka (Pali Canon), composed of three \"baskets\": the Vinaya Pitaka (monastic rules), Sutta Pitaka (discourses of the Buddha), and Abhidhamma Pitaka (philosophical analysis). Mahayana Buddhism adds texts like the Heart Sutra, Diamond Sutra, and Lotus Sutra, emphasizing the bodhisattva path."}
+{"input": "how to participate in public hearings", "output": "lex: public hearing participation attend testify\nlex: public hearing comment speak local government\nvec: how can citizens participate and give testimony at public hearings\nvec: what are the steps to attend and speak at a local government public hearing\nhyde: To participate in a public hearing, check your local government website for upcoming meetings and agendas. Sign up to speak in advance if required. Prepare a concise statement (usually 2-3 minutes). State your name and address for the record. Focus on facts and personal impact. You can also submit written comments before the deadline."}
+{"input": "what is a hypothesis", "output": "lex: hypothesis definition scientific research\nlex: hypothesis testable prediction experiment\nvec: what is a hypothesis in the scientific method and how is one formed\nvec: what makes a good scientific hypothesis and how is it different from a theory\nhyde: A hypothesis is a testable prediction about the relationship between two or more variables. In the scientific method, it follows observation and research: based on existing knowledge, you propose an explanation that can be tested through experimentation. A hypothesis must be falsifiable—there must be a possible outcome that would prove it wrong."}
+{"input": "what is extreme sports photography?", "output": "lex: extreme sports photography action camera\nlex: adventure sports photography techniques shutter speed\nvec: what is extreme sports photography and what equipment and techniques does it require\nvec: how do photographers capture high-speed action shots in extreme sports\nhyde: Extreme sports photography captures athletes performing in high-risk activities like surfing, snowboarding, rock climbing, and base jumping. Photographers use fast shutter speeds (1/1000s or faster), continuous autofocus, and burst mode. Key gear includes weather-sealed DSLRs or mirrorless cameras, telephoto lenses (70-200mm), and GoPro-style action cameras for POV shots."}
+{"input": "how to live sustainably?", "output": "lex: sustainable living tips eco-friendly lifestyle\nlex: reduce waste carbon footprint daily habits\nvec: what are practical everyday habits for living a more sustainable and eco-friendly life\nvec: how can individuals reduce their carbon footprint and waste in daily living\nhyde: Sustainable living starts with reducing consumption: buy less, choose durable goods, and repair before replacing. Eat more plant-based meals, which have a lower carbon footprint. Use public transit, bike, or walk. Reduce waste through composting and recycling. Switch to renewable energy and use LED lighting. Carry reusable bags, bottles, and containers."}
+{"input": "what is epistemological relativism", "output": "lex: epistemological relativism knowledge truth\nlex: epistemological relativism philosophy objectivity\nvec: what is epistemological relativism and how does it challenge objective truth claims\nvec: how does epistemological relativism argue that knowledge is relative to perspective or culture\nhyde: Epistemological relativism holds that knowledge and truth are not absolute but are relative to the social, cultural, or historical context in which they are produced. Different communities may have equally valid but incompatible knowledge systems. Critics argue this leads to self-refutation: the claim that all knowledge is relative is itself presented as an absolute truth."}
+{"input": "what is mixed media art?", "output": "lex: mixed media art techniques materials\nlex: mixed media collage painting assemblage\nvec: what is mixed media art and what materials and techniques are commonly used\nvec: how do artists combine different media like paint, paper, and found objects in mixed media artwork\nhyde: Mixed media art combines two or more artistic media in a single work—for example, acrylic paint with collaged paper, fabric, ink, and found objects. Techniques include layering, texturing with gels and paste, image transfers, and assemblage. The combination of materials creates visual depth and tactile richness that single-medium works cannot achieve."}
+{"input": "how to work at microsoft?", "output": "lex: Microsoft jobs hiring apply career\nlex: Microsoft interview process software engineer\nvec: how to apply for a job at Microsoft and what is the interview process like\nvec: what qualifications and steps are needed to get hired at Microsoft\nhyde: Apply through Microsoft's careers portal at careers.microsoft.com. Most technical roles require a CS degree or equivalent experience. The interview process typically includes a phone screen, online coding assessment, and an on-site loop of 4-5 interviews covering algorithms, system design, and behavioral questions. Prepare with LeetCode and system design practice."}
+{"input": "what are the characteristics of haiku?", "output": "lex: haiku characteristics syllable structure\nlex: haiku poetry 5-7-5 Japanese nature\nvec: what are the defining characteristics and rules of haiku poetry\nvec: how is a traditional Japanese haiku structured and what themes does it explore\nhyde: Haiku is a Japanese poetic form traditionally consisting of three lines with a 5-7-5 syllable pattern (or 17 morae in Japanese). Haiku typically captures a moment in nature and includes a kigo (seasonal word) and a kireji (cutting word) that creates a pause or shift. The poem juxtaposes two images to evoke emotion through suggestion rather than direct statement."}
+{"input": "what is plato's theory of forms", "output": "lex: Plato theory of Forms Ideas philosophy\nlex: Platonic Forms abstract reality idealism\nvec: what is Plato's theory of Forms and how does it explain reality\nvec: how did Plato distinguish between the world of Forms and the physical world\nhyde: Plato's theory of Forms posits that the physical world is a shadow of a higher, non-material reality consisting of perfect, eternal Forms (Ideas). A beautiful object participates in the Form of Beauty; a just action reflects the Form of Justice. True knowledge comes from understanding these abstract Forms through reason, not through sensory experience of the changeable physical world."}
+{"input": "what is the law of attraction?", "output": "lex: law of attraction manifestation positive thinking\nlex: law of attraction belief visualization\nvec: what is the law of attraction and how is it supposed to work\nvec: does the law of attraction have any scientific basis or evidence\nhyde: The law of attraction is the belief that positive or negative thoughts bring positive or negative experiences into a person's life. Proponents, popularized by the book \"The Secret,\" claim that visualizing desired outcomes and maintaining a positive mindset attracts those outcomes. Scientists generally consider it pseudoscience, though positive thinking can influence motivation and goal-directed behavior."}
+{"input": "what is literary parody?", "output": "lex: literary parody satire imitation genre\nlex: parody literature examples humor exaggeration\nvec: what is literary parody and how does it use imitation for comedic or critical effect\nvec: what are famous examples of parody in literature\nhyde: Literary parody imitates the style, conventions, or content of a specific work or genre for comedic or critical effect. It exaggerates distinctive features to expose flaws or absurdities. Examples include Don Quixote (parodying chivalric romances), Northanger Abbey (Gothic novels), and The Hitchhiker's Guide to the Galaxy (science fiction tropes)."}
+{"input": "how to invest in cryptocurrency safely?", "output": "lex: cryptocurrency investing safely beginner\nlex: crypto investment security wallet exchange\nvec: how can beginners invest in cryptocurrency safely and minimize risk of loss\nvec: what security measures should you take when buying and storing cryptocurrency\nhyde: To invest in crypto safely: use reputable exchanges like Coinbase or Kraken with two-factor authentication. Never invest more than you can afford to lose. Transfer holdings to a hardware wallet (Ledger, Trezor) for long-term storage. Diversify across Bitcoin and Ethereum rather than speculative altcoins. Beware of phishing scams and never share your seed phrase."}
+{"input": "what is a protagonist?", "output": "lex: protagonist definition literature main character\nlex: protagonist role story narrative hero\nvec: what is a protagonist in literature and what role do they play in a story\nvec: how does the protagonist differ from the antagonist in narrative fiction\nhyde: The protagonist is the central character of a narrative, the one whose goals and conflicts drive the plot. The story is told from their perspective or follows their journey. Protagonists are not always heroes—they can be antiheroes or morally ambiguous characters. The antagonist opposes the protagonist, creating the central conflict of the story."}
+{"input": "how to prepare for a promotion review?", "output": "lex: promotion review preparation performance\nlex: job promotion meeting self-assessment achievements\nvec: how should an employee prepare for a promotion review meeting with their manager\nvec: what documentation and evidence should you gather before a promotion discussion\nhyde: Before your promotion review, compile a list of key accomplishments with measurable results (revenue generated, projects delivered, efficiency improvements). Gather positive feedback from colleagues and clients. Align your achievements with the next-level job description. Prepare specific examples demonstrating leadership, initiative, and impact. Practice articulating your case concisely."}
+{"input": "how to reduce personal water usage?", "output": "lex: reduce water usage conservation tips home\nlex: save water household low-flow fixtures\nvec: what are practical ways to reduce water consumption at home\nvec: how can individuals conserve water in their daily routines and household\nhyde: Install low-flow showerheads (2 GPM or less) and faucet aerators. Fix leaky faucets—a drip wastes up to 3,000 gallons per year. Take shorter showers (5 minutes saves 12 gallons). Run dishwashers and washing machines only with full loads. Water gardens in the early morning to reduce evaporation. Collect rainwater for outdoor use."}
+{"input": "sustainable technology", "output": "lex: sustainable technology green tech renewable energy\nlex: sustainable technology clean energy innovation\nvec: what are examples of sustainable technologies that reduce environmental impact\nvec: how is technology being used to promote sustainability and fight climate change\nhyde: Sustainable technologies aim to reduce environmental impact while meeting human needs. Examples include solar panels and wind turbines for clean energy, electric vehicles, energy-efficient building materials, carbon capture systems, biodegradable plastics, precision agriculture that reduces water and pesticide use, and smart grids that optimize energy distribution."}
+{"input": "how do i vote in person", "output": "lex: vote in person polling place Election Day\nlex: in-person voting process ID requirements\nvec: what are the steps to vote in person at a polling place on Election Day\nvec: what do I need to bring and expect when voting in person for the first time\nhyde: To vote in person, check your registration status and find your polling location at vote.org or your state's election website. Bring a valid photo ID if required by your state. On Election Day, go to your assigned polling place, check in with a poll worker, receive your ballot, mark your choices, and submit your ballot through the scanner or ballot box."}
+{"input": "what is aquaponics?", "output": "lex: aquaponics fish plants symbiotic system\nlex: aquaponics setup grow food fish tank\nvec: what is aquaponics and how does it combine fish farming with plant growing\nvec: how does an aquaponics system work and what can you grow with it\nhyde: Aquaponics is a food production system that combines aquaculture (raising fish) with hydroponics (growing plants in water). Fish waste provides natural fertilizer for the plants, and the plants filter the water for the fish, creating a symbiotic cycle. Common setups use tilapia or goldfish with leafy greens, herbs, and tomatoes."}
+{"input": "what is the significance of the hajj in islam?", "output": "lex: Hajj Islam pilgrimage Mecca significance\nlex: Hajj pillar Islam Kaaba rituals\nvec: why is the Hajj pilgrimage to Mecca significant in Islam\nvec: what are the rituals and spiritual meaning of the Hajj for Muslims\nhyde: The Hajj is the fifth pillar of Islam, requiring every able-bodied Muslim who can afford it to make the pilgrimage to Mecca at least once in their lifetime. Performed during Dhul Hijjah, the rituals include circling the Kaaba seven times (tawaf), walking between Safa and Marwah, standing at Arafat, and the symbolic stoning of the devil at Mina."}
+{"input": "what is a hypothesis testing", "output": "lex: hypothesis testing statistics null alternative\nlex: hypothesis test p-value significance level\nvec: what is hypothesis testing in statistics and how does it work\nvec: how do you perform a hypothesis test using null and alternative hypotheses\nhyde: Hypothesis testing is a statistical method for making decisions using data. You state a null hypothesis (H0, no effect) and an alternative hypothesis (H1, effect exists). Collect data and calculate a test statistic. If the p-value is below the significance level (typically 0.05), reject H0. Common tests include t-test, chi-square, and ANOVA."}
+{"input": "how to publish a scientific article", "output": "lex: publish scientific article journal peer review\nlex: scientific paper submission academic journal\nvec: what are the steps to publish a research article in a peer-reviewed scientific journal\nvec: how does the peer review and journal submission process work for scientific papers\nhyde: To publish a scientific article: 1) Write the manuscript following IMRAD format (Introduction, Methods, Results, Discussion). 2) Choose a target journal matching your topic and impact level. 3) Format per the journal's author guidelines. 4) Submit through the journal's online portal. 5) Respond to peer reviewer comments during revision. The process typically takes 3-12 months."}
+{"input": "what are common themes in poetry?", "output": "lex: poetry themes common literary motifs\nlex: poetry themes love death nature identity\nvec: what are the most common themes explored in poetry across different periods\nvec: how do poets use recurring themes like love, death, and nature in their work\nhyde: Common poetry themes include love and desire, mortality and the passage of time, nature and the seasons, loss and grief, identity and self-discovery, war and conflict, beauty, spirituality, and social justice. These universal themes recur across periods—from Sappho's love lyrics to Keats's meditations on mortality to contemporary poets exploring identity."}
+{"input": "how to write a resume", "output": "lex: write resume format template job\nlex: resume writing tips work experience skills\nvec: how to write an effective resume that stands out to employers and recruiters\nvec: what should be included in a resume and how should it be formatted\nhyde: A strong resume includes: contact information, a brief professional summary (2-3 sentences), work experience in reverse chronological order with bullet-point achievements, education, and relevant skills. Use action verbs (\"led,\" \"built,\" \"increased\") and quantify results (\"increased sales by 25%\"). Keep it to one page for under 10 years of experience. Tailor it to each job posting."}
+{"input": "what are key performance indicators", "output": "lex: key performance indicators KPIs metrics\nlex: KPI examples business performance measurement\nvec: what are key performance indicators and how are they used to measure business success\nvec: how do companies choose and track the right KPIs for their goals\nhyde: Key Performance Indicators (KPIs) are measurable values that demonstrate how effectively a company is achieving its objectives. Examples include revenue growth rate, customer acquisition cost, employee retention rate, and net promoter score. Effective KPIs are specific, measurable, achievable, relevant, and time-bound (SMART). They should align directly with strategic goals."}
+{"input": "how to find art inspiration online?", "output": "lex: art inspiration online websites platforms\nlex: art inspiration Pinterest Behance DeviantArt\nvec: where can artists find creative inspiration and references online\nvec: what websites and platforms are best for discovering art inspiration\nhyde: Top platforms for art inspiration include Pinterest (curated mood boards), Behance and Dribbble (professional portfolios), ArtStation (digital and concept art), DeviantArt (community art), and Instagram art hashtags. Museums also offer virtual collections: Google Arts & Culture, the Met's Open Access, and the Rijksmuseum's digital archive."}
+{"input": "what is the significance of logic in ethics?", "output": "lex: logic ethics moral reasoning philosophy\nlex: logical arguments ethical theory validity\nvec: what role does logic play in ethical reasoning and moral philosophy\nvec: how do philosophers use logical arguments to evaluate ethical claims\nhyde: Logic provides the structural framework for ethical reasoning. Valid arguments require that conclusions follow necessarily from premises. In ethics, logic helps identify fallacies, test the consistency of moral principles, and evaluate whether ethical claims are well-supported. For example, the logical form of universalizability in Kant's categorical imperative tests moral maxims for contradiction."}
+{"input": "how to engage in sustainable urban living?", "output": "lex: sustainable urban living city eco-friendly\nlex: urban sustainability public transit green housing\nvec: what are practical ways to live sustainably in a city environment\nvec: how can urban residents reduce their environmental footprint in daily life\nhyde: Sustainable urban living includes using public transit, biking, or walking instead of driving. Choose an energy-efficient apartment, reduce food waste through composting and meal planning, shop at local farmers markets, and support community gardens. Use shared resources like tool libraries and car-sharing services to reduce individual consumption."}
+{"input": "what is the concept of shalom in judaism?", "output": "lex: shalom Judaism peace concept meaning\nlex: shalom Hebrew wholeness completeness Jewish\nvec: what does the concept of shalom mean in Judaism beyond just peace\nvec: how is shalom understood as wholeness and completeness in Jewish theology\nhyde: Shalom in Judaism means far more than the absence of conflict. Derived from the Hebrew root meaning \"wholeness\" or \"completeness,\" shalom encompasses peace, harmony, welfare, and flourishing. It describes right relationships between people, with God, and with creation. The pursuit of shalom (rodef shalom) is a central ethical obligation in Jewish life."}
+{"input": "how do structuralism and functionalism differ", "output": "lex: structuralism functionalism differences psychology\nlex: structuralism Wundt functionalism James psychology\nvec: what are the differences between structuralism and functionalism in psychology\nvec: how did Wundt's structuralism differ from William James's functionalism\nhyde: Structuralism, founded by Wilhelm Wundt, sought to break down mental processes into their basic elements through introspection—analyzing the structure of consciousness. Functionalism, led by William James, focused instead on the purpose of mental processes—how the mind helps organisms adapt to their environment. Structuralism asked \"what is consciousness?\" while functionalism asked \"what is consciousness for?\""}
+{"input": "duolingo courses", "output": "lex: Duolingo language courses available\nlex: Duolingo app languages learn\nvec: what language courses are available on Duolingo and which are the most popular\nvec: how effective is Duolingo for learning a new language and what languages does it offer\nhyde: Duolingo offers courses in over 40 languages, including Spanish, French, German, Japanese, Korean, Mandarin, Italian, Portuguese, and Hindi. Each course uses gamified lessons with speaking, listening, reading, and writing exercises. Popular courses include Spanish for English speakers (the most enrolled) and English for Spanish speakers."}
+{"input": "how to hang artwork without nails", "output": "lex: hang artwork without nails wall\nlex: picture hanging command strips adhesive hooks\nvec: how to hang pictures and artwork on walls without using nails or drilling holes\nvec: what are the best no-damage methods for hanging frames on walls\nhyde: Command Strips by 3M hold up to 16 lbs and leave no wall damage—press firmly for 30 seconds and wait 1 hour before hanging. Other nail-free options include adhesive hooks, velcro strips, magnetic frames, and picture hanging wire with adhesive anchors. For heavier pieces, use monkey hooks which require only a tiny hole, no hammer needed."}
+{"input": "how augmented reality is applied in different fields", "output": "lex: augmented reality applications fields industry\nlex: AR technology healthcare education retail\nvec: how is augmented reality being used in healthcare, education, and retail industries\nvec: what are real-world applications of augmented reality across different fields\nhyde: Augmented reality overlays digital content onto the real world and is applied across many fields. In healthcare, surgeons use AR to visualize anatomy during procedures. In education, AR apps bring textbook content to life in 3D. Retailers like IKEA use AR to let customers preview furniture in their homes. In manufacturing, AR guides workers through assembly with step-by-step overlays."}
+{"input": "what are the best soil types for roses", "output": "lex: best soil roses growing type\nlex: rose garden soil pH loam drainage\nvec: what type of soil do roses grow best in and how should it be prepared\nvec: what soil pH and composition are ideal for growing healthy rose bushes\nhyde: Roses thrive in well-draining loamy soil with a pH between 6.0 and 6.5. Amend heavy clay soil with compost and coarse sand to improve drainage. Mix in aged manure or rose-specific fertilizer before planting. Ensure soil holds moisture without becoming waterlogged. Mulch with 2-3 inches of organic material to retain moisture and regulate temperature."}
+{"input": "how to encourage siblings to get along?", "output": "lex: siblings get along fighting conflict resolution\nlex: sibling rivalry reduce cooperation strategies\nvec: how can parents encourage their children to get along and reduce sibling rivalry\nvec: what strategies help siblings resolve conflicts and build positive relationships\nhyde: Give each child one-on-one time to reduce competition for attention. Avoid comparing siblings or labeling them (\"the smart one\"). Teach conflict resolution: help them express feelings with \"I\" statements and find compromises. Praise cooperation when you see it. Set clear family rules about physical aggression and name-calling."}
+{"input": "what is the great wall of china?", "output": "lex: Great Wall China history construction\nlex: Great Wall China length dynasty defense\nvec: what is the Great Wall of China and why was it built\nvec: how long is the Great Wall of China and which dynasties built it\nhyde: The Great Wall of China is a series of fortifications built over centuries to protect Chinese states and empires from northern invasions. The most well-known sections were built during the Ming Dynasty (1368-1644). The total length, including all branches and sections across dynasties, is approximately 21,196 kilometers (13,171 miles)."}
+{"input": "how to attend a political rally", "output": "lex: attend political rally event tips\nlex: political rally preparation safety what to bring\nvec: how to find and attend a political rally or campaign event in your area\nvec: what should you know before attending your first political rally\nhyde: Find rallies through candidate websites, social media, or event platforms like Eventbrite. Register if required (RSVP is often free). Arrive early as venues fill up. Bring water, sunscreen if outdoors, a charged phone, and valid ID. Wear comfortable shoes. Be aware of your surroundings and know the exit locations. Follow posted rules about signs and bags."}
+{"input": "what is the function of a narrative arc?", "output": "lex: narrative arc function story structure\nlex: narrative arc exposition climax resolution plot\nvec: what is a narrative arc and how does it structure a story from beginning to end\nvec: what are the parts of a narrative arc and why is it important in storytelling\nhyde: A narrative arc is the structure that shapes a story's progression. It typically follows five stages: exposition (introduces characters and setting), rising action (builds conflict and tension), climax (the turning point), falling action (consequences unfold), and resolution (conflict is resolved). The arc gives readers a satisfying sense of progression and closure."}
+{"input": "arg parse", "output": "lex: argparse Python command line arguments\nlex: argument parser CLI Python module\nvec: how to use Python argparse module to parse command line arguments\nvec: how to define positional and optional arguments with argparse\nhyde: Use argparse to handle CLI arguments: parser = argparse.ArgumentParser(); parser.add_argument(\"file\"); args = parser.parse_args(). Supports positional args, optional flags, subcommands, and type validation."}
+{"input": "how to draw realistic portraits?", "output": "lex: draw realistic portrait pencil technique\nlex: portrait drawing face proportions shading\nvec: how to draw a realistic human portrait with accurate proportions and shading\nvec: what techniques do artists use to draw lifelike faces with pencil\nhyde: Start with a lightly sketched oval. Divide the face: eyes sit at the midpoint, the nose halfway between eyes and chin, and the mouth one-third below the nose. Use a grid or Loomis method for proportions. Build tonal values gradually—light layers first, then darker shadows. Blend with a tortillon for smooth skin textures. Pay close attention to the light source direction."}
+{"input": "what is the impact of religion on culture?", "output": "lex: religion impact culture society influence\nlex: religion culture art morality traditions\nvec: how has religion shaped culture, art, and social norms throughout history\nvec: what influence does religion have on cultural values, laws, and traditions\nhyde: Religion has profoundly shaped cultures worldwide—influencing art (Gothic cathedrals, Islamic calligraphy, Hindu temple sculpture), moral codes, legal systems (Sharia, Canon law), dietary practices, marriage customs, holidays, and music. Religious narratives provide shared identity and meaning. The Protestant work ethic, for example, influenced Western capitalism according to Max Weber."}
+{"input": "what is the ethics of war", "output": "lex: ethics of war just war theory morality\nlex: just war ethics military conflict jus ad bellum\nvec: what is just war theory and the ethical principles governing warfare\nvec: how do philosophers evaluate whether a war is morally justified\nhyde: Just war theory establishes criteria for morally permissible warfare. Jus ad bellum (right to go to war) requires just cause, legitimate authority, right intention, last resort, proportionality, and reasonable chance of success. Jus in bello (right conduct in war) requires distinction between combatants and civilians and proportional use of force."}
+{"input": "how to analyze scientific data statistically", "output": "lex: statistical analysis scientific data methods\nlex: statistical tests data analysis research t-test ANOVA\nvec: how to choose and apply the right statistical tests for analyzing scientific research data\nvec: what are the steps for performing statistical analysis on experimental data\nhyde: Choose your statistical test based on data type and research question. For comparing two group means, use an independent t-test (parametric) or Mann-Whitney U (non-parametric). For three or more groups, use one-way ANOVA. For correlations, use Pearson's r (continuous) or Spearman's rho (ordinal). Report effect sizes and confidence intervals alongside p-values."}
+{"input": "how to analyze experimental data", "output": "lex: analyze experimental data methods results\nlex: experimental data analysis visualization interpretation\nvec: what are the steps to properly analyze and interpret experimental research data\nvec: how to organize, visualize, and draw conclusions from experimental results\nhyde: Start by cleaning the data: remove outliers using predefined criteria and check for missing values. Calculate descriptive statistics (mean, median, standard deviation). Visualize distributions with histograms or box plots. Apply appropriate statistical tests to evaluate hypotheses. Interpret results in context of your research question and note limitations."}
+{"input": "climate action", "output": "lex: climate action policy emissions reduction\nlex: climate action carbon neutral renewable energy 2026\nvec: what actions are governments and individuals taking to combat climate change\nvec: what are the most effective climate action strategies for reducing greenhouse gas emissions\nhyde: Climate action encompasses policies and initiatives to reduce greenhouse gas emissions and adapt to climate change. Key strategies include transitioning to renewable energy, electrifying transportation, improving energy efficiency in buildings, protecting forests, and implementing carbon pricing. The Paris Agreement aims to limit warming to 1.5°C above pre-industrial levels."}
+{"input": "what are the main teachings of shinto?", "output": "lex: Shinto teachings beliefs practices Japan\nlex: Shinto kami nature purity rituals\nvec: what are the core beliefs and teachings of the Shinto religion in Japan\nvec: how does Shinto view nature, purity, and the spiritual world\nhyde: Shinto, Japan's indigenous religion, centers on the worship of kami—spirits inhabiting natural features, ancestors, and sacred places. Core teachings emphasize purity (physical and spiritual cleanliness), harmony with nature, respect for ancestors, and community ritual. There is no single scripture; practice focuses on shrine worship, seasonal festivals (matsuri), and purification rites (harae)."}
+{"input": "chronic pain management clinics", "output": "lex: chronic pain management clinic treatment\nlex: pain clinic multidisciplinary therapy near me\nvec: what services do chronic pain management clinics offer and how do they treat patients\nvec: how to find a reputable chronic pain management clinic for long-term treatment\nhyde: Chronic pain management clinics use a multidisciplinary approach combining medication management, physical therapy, cognitive behavioral therapy, nerve blocks, and interventional procedures like epidural steroid injections. Teams typically include pain medicine physicians, physical therapists, and psychologists. Ask your primary care doctor for a referral or search the American Academy of Pain Medicine directory."}
+{"input": "what is a business consultant", "output": "lex: business consultant role responsibilities\nlex: management consulting services\nlex: business advisory consultant\nvec: what does a business consultant do and what services do they provide\nvec: what qualifications and skills are needed to become a business consultant\nhyde: A business consultant is a professional who advises organizations on strategy, operations, and management. They analyze business problems, identify inefficiencies, and recommend solutions to improve performance and profitability."}
+{"input": "what are the characteristics of classic literature?", "output": "lex: classic literature characteristics traits\nlex: literary classics defining features\nvec: what qualities make a work of fiction considered classic literature\nvec: what distinguishes classic literature from other genres or time periods\nhyde: Classic literature is defined by its enduring relevance, universal themes, and artistic merit. These works explore the human condition through complex characters, moral dilemmas, and language that transcends the era in which they were written."}
+{"input": "what is blockchain technology", "output": "lex: blockchain technology distributed ledger\nlex: blockchain decentralized cryptographic\nlex: blockchain consensus mechanism\nvec: how does blockchain technology work as a distributed ledger system\nvec: what are the technical components that make up a blockchain\nhyde: Blockchain is a distributed ledger technology where transactions are recorded in blocks linked by cryptographic hashes. Each block contains a timestamp and transaction data, forming an immutable chain validated by a network of nodes through consensus mechanisms."}
+{"input": "where to buy luxury bedding sets", "output": "lex: luxury bedding sets buy online\nlex: high-end sheets duvet comforter\nlex: premium Egyptian cotton bedding\nvec: where can I purchase high-quality luxury bedding sets online or in stores\nvec: which brands sell the best luxury sheets and duvet covers\nhyde: Shop our collection of luxury bedding sets crafted from 100% Egyptian cotton and Italian-woven sateen. Thread counts from 400 to 1000. Free shipping on orders over $200. Available in king, queen, and California king sizes."}
+{"input": "how to retire early", "output": "lex: early retirement financial planning\nlex: FIRE financial independence retire early\nlex: early retirement savings rate\nvec: how much money do you need to save to retire before age 50\nvec: what financial strategies allow people to retire early through the FIRE movement\nhyde: To retire early, aim to save 50-70% of your income and invest in low-cost index funds. At a 4% safe withdrawal rate, you need roughly 25x your annual expenses. A person spending $40,000/year needs about $1 million to retire."}
+{"input": "how climate change affects farming", "output": "lex: climate change agriculture crop yields\nlex: global warming farming drought impact\nlex: climate change food production\nvec: how does rising global temperature affect crop yields and food production\nvec: what effects does climate change have on soil quality and growing seasons for farmers\nhyde: Rising temperatures and shifting precipitation patterns reduce crop yields by 2-6% per decade. Droughts, heat stress, and unpredictable frost dates disrupt planting schedules, while increased CO2 levels alter nutrient content in staple crops like wheat and rice."}
+{"input": "how to assess car tire damage?", "output": "lex: car tire damage inspection signs\nlex: tire wear tread depth sidewall\nlex: tire replacement damage indicators\nvec: how do you inspect car tires for damage and know when they need replacement\nvec: what are the signs of dangerous tire wear or sidewall damage on a vehicle\nhyde: Check tire tread depth using the penny test—insert a penny with Lincoln's head facing down. If you can see the top of his head, the tread is below 2/32\" and the tire needs replacing. Also inspect sidewalls for bulges, cracks, or cuts."}
+{"input": "kindle library", "output": "lex: kindle library ebook collection\nlex: Amazon Kindle digital library management\nlex: kindle book organization archive\nvec: how to manage and organize your ebook library on a Kindle device\nvec: how to borrow library books on Kindle through Libby or OverDrive\nhyde: Your Kindle Library stores all purchased and borrowed ebooks. Access it by tapping 'Library' on the home screen. Filter by 'Downloaded' or 'All' to see books stored on the device or in the cloud. Use collections to organize titles by genre or topic."}
+{"input": "how to plant wildflowers in clay soil?", "output": "lex: wildflower planting clay soil\nlex: wildflower seeds heavy clay ground\nvec: what is the best method for growing wildflowers in heavy clay soil\nvec: which wildflower species thrive in clay soil conditions\nhyde: To plant wildflowers in clay soil, amend the top 2-3 inches with coarse sand and compost to improve drainage. Choose clay-tolerant species like black-eyed Susan, coneflower, and bee balm. Sow seeds in fall or early spring, pressing them into the surface without burying deeply."}
+{"input": "how to photograph the milky way", "output": "lex: milky way astrophotography camera settings\nlex: night sky photography milky way\nlex: milky way photo long exposure\nvec: what camera settings and equipment do you need to photograph the milky way\nvec: how to find the best location and time for milky way photography\nhyde: Set your camera to manual mode with an aperture of f/2.8 or wider, ISO 3200-6400, and a shutter speed of 15-25 seconds using the 500 rule. Use a sturdy tripod and a wide-angle lens. Shoot during a new moon away from light pollution."}
+{"input": "what are ocean currents", "output": "lex: ocean currents thermohaline circulation\nlex: ocean surface currents deep water\nlex: ocean current patterns global\nvec: what causes ocean currents and how do they circulate water around the globe\nvec: what is the difference between surface ocean currents and deep water thermohaline circulation\nhyde: Ocean currents are continuous, directed movements of seawater driven by wind, temperature, salinity, and the Earth's rotation. Surface currents are driven primarily by wind patterns, while deep-water thermohaline circulation is driven by differences in water density."}
+{"input": "what is the concept of moral absolutism?", "output": "lex: moral absolutism ethical theory\nlex: moral absolutism objective right wrong\nvec: what does moral absolutism mean as an ethical philosophy\nvec: how does moral absolutism differ from moral relativism in determining right and wrong\nhyde: Moral absolutism holds that certain actions are inherently right or wrong regardless of context, culture, or consequence. Under this view, ethical rules are universal and unchanging—lying is always wrong, for example, even if it could prevent harm."}
+{"input": "how to set up a smart home?", "output": "lex: smart home setup devices hub\nlex: home automation WiFi Zigbee Z-Wave\nlex: smart home starter guide speakers lights\nvec: what devices and hubs do you need to set up a smart home automation system\nvec: how to connect smart lights thermostats and speakers in a home network\nhyde: Start with a smart speaker like Amazon Echo or Google Nest as your central hub. Connect smart bulbs (Philips Hue, LIFX) and a smart thermostat (Nest, Ecobee) over WiFi or Zigbee. Use the companion app to create automations like turning off lights at bedtime."}
+{"input": "what is cycling commute?", "output": "lex: cycling commute bike to work\nlex: bicycle commuting urban transportation\nvec: what does it mean to commute by bicycle and what are the benefits\nvec: how do people use cycling as their daily commute to work in cities\nhyde: Cycling commute refers to using a bicycle as your primary transportation to and from work. Bike commuters typically ride 3-15 miles each way, saving on fuel costs while getting daily exercise. Many cities now have protected bike lanes and bike-share programs."}
+{"input": "how to approach ethical decision-making", "output": "lex: ethical decision-making framework steps\nlex: ethical reasoning moral dilemma process\nvec: what frameworks or steps help with making ethical decisions in difficult situations\nvec: how do you systematically evaluate moral choices when facing an ethical dilemma\nhyde: A structured approach to ethical decision-making involves: (1) identify the ethical issue, (2) gather relevant facts, (3) consider stakeholders affected, (4) evaluate options using ethical frameworks like utilitarianism or deontology, and (5) make and justify your decision."}
+{"input": "how to find a reliable realtor", "output": "lex: find reliable realtor real estate agent\nlex: choosing trustworthy real estate agent\nvec: how do you find and vet a trustworthy real estate agent for buying or selling a home\nvec: what qualities and credentials should you look for in a reliable realtor\nhyde: Check that the realtor is licensed in your state and has no disciplinary actions. Read online reviews, ask for references from recent clients, and verify their transaction history. A good agent should know the local market and communicate promptly."}
+{"input": "how to lease a car?", "output": "lex: car lease process terms payments\nlex: vehicle leasing agreement negotiation\nvec: what are the steps to lease a car and what terms should you negotiate\nvec: how do car lease payments work and what fees are involved\nhyde: To lease a car, negotiate the capitalized cost (sale price), money factor (interest rate), and residual value. Monthly payments are based on the difference between the cap cost and residual, divided by the lease term, plus a finance charge. Typical leases run 24-36 months."}
+{"input": "how do different cultures commemorate death?", "output": "lex: death rituals funeral customs cultures\nlex: cultural death commemoration ceremonies\nvec: what are the different ways cultures around the world honor and commemorate the dead\nvec: how do funeral rituals and mourning traditions vary across religions and cultures\nhyde: In Mexico, Día de los Muertos celebrates deceased loved ones with altars, marigolds, and sugar skulls. Hindu cremation ceremonies release the soul for reincarnation. In Ghana, elaborate fantasy coffins reflect the deceased's life. Japanese Obon festivals welcome ancestral spirits home."}
+{"input": "how to change a tire", "output": "lex: change flat tire steps jack\nlex: car tire replacement spare\nvec: what are the step-by-step instructions for changing a flat tire on the side of the road\nvec: how to safely jack up a car and replace a flat tire with the spare\nhyde: Loosen the lug nuts slightly before jacking. Place the jack under the vehicle frame near the flat tire and raise until the tire clears the ground. Remove lug nuts, pull off the flat, mount the spare, and hand-tighten the nuts in a star pattern. Lower the car and torque to 80-100 ft-lbs."}
+{"input": "how to develop a positive mindset?", "output": "lex: positive mindset development habits\nlex: positive thinking mental attitude techniques\nvec: what daily habits and techniques help develop and maintain a positive mindset\nvec: how can you train your brain to think more positively and overcome negative thought patterns\nhyde: Developing a positive mindset starts with awareness of negative self-talk. Replace \"I can't\" with \"I'm learning to.\" Practice daily gratitude by writing three things you're thankful for. Surround yourself with supportive people and limit exposure to negativity."}
+{"input": "what is bioinformatics", "output": "lex: bioinformatics computational biology genomics\nlex: bioinformatics DNA sequence analysis\nvec: what is the field of bioinformatics and how does it apply computational methods to biological data\nvec: how is bioinformatics used to analyze DNA sequences and genomic data\nhyde: Bioinformatics is an interdisciplinary field that combines biology, computer science, and statistics to analyze biological data. It involves developing algorithms and software to process DNA sequences, protein structures, and gene expression data from high-throughput experiments."}
+{"input": "how to prepare for a triathlon", "output": "lex: triathlon training plan preparation\nlex: swim bike run triathlon training\nvec: what training plan should a beginner follow to prepare for their first triathlon\nvec: how to balance swimming cycling and running workouts when training for a triathlon\nhyde: A 12-week sprint triathlon plan builds endurance across all three disciplines. Week 1: swim 2x (20 min), bike 2x (30 min), run 3x (20 min). Gradually increase volume by 10% per week. Include one brick workout (bike-to-run) weekly to simulate race-day transitions."}
+{"input": "how to paint a car?", "output": "lex: car paint job spray booth steps\nlex: automotive painting primer clearcoat\nvec: what is the step-by-step process for painting a car at home or in a garage\nvec: what preparation and materials are needed to repaint a car yourself\nhyde: Sand the existing paint with 400-grit wet sandpaper until smooth. Apply 2-3 coats of automotive primer, sanding between coats with 600-grit. Spray the base color in thin, even passes, allowing 15 minutes flash time between coats. Finish with 2-3 coats of clearcoat."}
+{"input": "lab test", "output": "lex: lab test blood work results\nlex: laboratory diagnostic testing medical\nlex: lab test ordered interpretation\nvec: what types of medical lab tests are commonly ordered and what do the results mean\nvec: how to understand blood test results from a laboratory\nhyde: Common lab tests include CBC (complete blood count), CMP (comprehensive metabolic panel), lipid panel, and thyroid function tests. A CBC measures white blood cells, red blood cells, hemoglobin, and platelets. Results outside the reference range may indicate infection, anemia, or other conditions."}
+{"input": "where to buy iphone 14", "output": "lex: buy iPhone 14 price deals\nlex: iPhone 14 purchase Apple store carrier\nvec: where can you buy an iPhone 14 at the best price online or in retail stores\nvec: which stores and carriers currently sell the iPhone 14 and offer trade-in deals\nhyde: Buy iPhone 14 starting at $599 from Apple.com, or save with carrier deals from Verizon, AT&T, and T-Mobile. Trade in your old device for up to $400 off. Also available at Best Buy, Walmart, and Amazon with financing options."}
+{"input": "what is the categorical imperative", "output": "lex: categorical imperative Kant ethics\nlex: Kantian categorical imperative universal law\nvec: what is Kant's categorical imperative and how does it function as a moral principle\nvec: how does the categorical imperative test whether an action is morally permissible\nhyde: The categorical imperative, formulated by Immanuel Kant, states: \"Act only according to that maxim by which you can at the same time will that it should become a universal law.\" It requires that moral rules apply unconditionally to all rational beings, regardless of personal desires."}
+{"input": "latest research on renewable agriculture", "output": "lex: renewable agriculture research 2025 2026\nlex: regenerative sustainable farming research\nlex: renewable agriculture soil carbon sequestration\nvec: what are the latest scientific findings on regenerative and renewable agriculture techniques\nvec: what recent research has been published on sustainable farming and soil health in 2025 or 2026\nhyde: A 2025 study in Nature Food found that cover cropping and no-till practices increased soil organic carbon by 8-12% over five years. Researchers also demonstrated that integrating livestock grazing with crop rotation improved soil microbial diversity by 23%."}
+{"input": "cloud deploy", "output": "lex: cloud deployment pipeline CI/CD\nlex: cloud deploy AWS Azure GCP\nlex: cloud infrastructure deployment automation\nvec: how to deploy applications to cloud platforms like AWS, Azure, or Google Cloud\nvec: what tools and pipelines are used for automated cloud deployment\nhyde: Deploy to the cloud using `gcloud deploy` or configure a CI/CD pipeline with GitHub Actions. Define your infrastructure with Terraform or CloudFormation, build container images, push to a registry, and roll out to Kubernetes or serverless environments."}
+{"input": "what is the significance of day of the dead", "output": "lex: Day of the Dead Día de los Muertos significance\nlex: Day of the Dead Mexican tradition meaning\nvec: what is the cultural and spiritual significance of Day of the Dead in Mexican tradition\nvec: why is Día de los Muertos celebrated and what does it mean to families in Mexico\nhyde: Día de los Muertos, celebrated November 1-2, is a Mexican tradition honoring deceased loved ones. Families build ofrendas (altars) decorated with marigolds, photos, and the departed's favorite foods. It blends pre-Columbian Aztec beliefs with Catholic All Saints' and All Souls' Days."}
+{"input": "what is stonehenge", "output": "lex: Stonehenge prehistoric monument England\nlex: Stonehenge purpose construction history\nvec: what is Stonehenge and why was it built on Salisbury Plain in England\nvec: what do archaeologists know about the history and purpose of Stonehenge\nhyde: Stonehenge is a prehistoric stone circle on Salisbury Plain in Wiltshire, England, built in stages from roughly 3000 to 2000 BCE. The massive sarsen stones, some weighing 25 tons, were transported from Marlborough Downs 25 miles north. Its alignment with the summer solstice sunrise suggests astronomical or ceremonial function."}
+{"input": "bug fix", "output": "lex: bug fix debugging software\nlex: bug fix code patch issue\nlex: software bug troubleshooting resolution\nvec: how to identify and fix bugs in software code effectively\nvec: what is the process for debugging and resolving code issues\nhyde: To fix a bug, first reproduce it reliably and identify the exact conditions that trigger it. Use a debugger or add logging to narrow down the faulty code path. Write a regression test that captures the bug, then modify the code until the test passes."}
+{"input": "how to wax a car?", "output": "lex: car wax application steps\nlex: wax car paint protection polish\nvec: what is the proper technique for waxing a car to protect the paint finish\nvec: how often should you wax a car and what products work best\nhyde: Wash and dry the car thoroughly before waxing. Apply a thin layer of carnauba or synthetic wax with a foam applicator pad using circular motions. Work one panel at a time, let it haze for 5-10 minutes, then buff off with a clean microfiber towel."}
+{"input": "what is the veil of ignorance", "output": "lex: veil of ignorance Rawls justice\nlex: John Rawls original position veil of ignorance\nvec: what is John Rawls' veil of ignorance thought experiment in political philosophy\nvec: how does the veil of ignorance help determine principles of justice in a fair society\nhyde: The veil of ignorance is a thought experiment by John Rawls in A Theory of Justice (1971). It asks people to choose principles of justice from an \"original position\" where they don't know their own race, gender, wealth, or abilities. Rawls argues this produces fair, impartial rules."}
+{"input": "what are the challenges of multiculturalism", "output": "lex: multiculturalism challenges social integration\nlex: multicultural society tensions cultural diversity\nvec: what social and political challenges arise in multicultural societies\nvec: how do multicultural nations deal with cultural conflict and integration difficulties\nhyde: Multicultural societies face challenges including language barriers, cultural misunderstandings, and tensions between assimilation and cultural preservation. Debates arise over shared national identity, religious accommodation in public institutions, and equitable representation of minority groups."}
+{"input": "what are smart cities?", "output": "lex: smart city technology IoT urban\nlex: smart cities infrastructure data sensors\nvec: what defines a smart city and what technologies do they use\nvec: how do smart cities use IoT sensors and data analytics to improve urban infrastructure\nhyde: Smart cities integrate IoT sensors, data analytics, and connected infrastructure to improve urban services. Examples include adaptive traffic signals that reduce congestion by 25%, smart grids that optimize energy distribution, and sensors that monitor air quality and water systems in real time."}
+{"input": "how to optimize supply chain", "output": "lex: supply chain optimization logistics\nlex: supply chain efficiency inventory management\nvec: what strategies and tools can companies use to optimize their supply chain operations\nvec: how do businesses reduce supply chain costs while improving delivery speed and reliability\nhyde: Optimize your supply chain by implementing demand forecasting with machine learning, reducing safety stock through just-in-time inventory, and diversifying suppliers to mitigate risk. Use real-time tracking and warehouse management systems to cut lead times by 15-30%."}
+{"input": "what is an elevator pitch", "output": "lex: elevator pitch short business presentation\nlex: elevator pitch 30-second summary\nvec: what is an elevator pitch and how do you structure an effective one\nvec: how do you deliver a compelling 30-second pitch for a business idea or job opportunity\nhyde: An elevator pitch is a concise, 30-60 second summary of who you are and what you offer. Structure it as: hook (attention-grabbing opening), problem you solve, your solution, and a call to action. Practice until it sounds conversational, not rehearsed."}
+{"input": "how to rotate car tires?", "output": "lex: car tire rotation pattern schedule\nlex: tire rotation front rear cross\nvec: how often should you rotate car tires and what pattern should you follow\nvec: what is the correct tire rotation procedure for front-wheel and all-wheel drive vehicles\nhyde: Rotate tires every 5,000-7,500 miles. For front-wheel drive, move fronts straight to the rear and cross the rears to the front. For rear-wheel drive, move rears straight forward and cross the fronts to the rear. All-wheel drive uses the rearward cross pattern."}
+{"input": "how to participate in a pow wow", "output": "lex: pow wow Native American attend participate\nlex: pow wow etiquette attendance protocol\nvec: how can non-Native people respectfully attend and participate in a pow wow\nvec: what are the etiquette rules and customs visitors should follow at a pow wow\nhyde: When attending a pow wow, stand during grand entry and honor songs. Don't touch dancers' regalia without permission. Ask before photographing. Bring a lawn chair, as seating is limited. Some dances are intertribal and open to all—the emcee will announce when visitors may join the circle."}
+{"input": "car rust", "output": "lex: car rust prevention treatment\nlex: automotive rust repair body panel\nlex: car rust removal undercarriage\nvec: how to prevent and treat rust on a car body and undercarriage\nvec: what causes rust on cars and how can you repair rusted panels\nhyde: Car rust forms when bare metal is exposed to moisture and salt. Treat surface rust by sanding to bare metal, applying rust converter, priming, and repainting. For structural rust, cut out the damaged section and weld in a patch panel. Prevent rust with regular washing and undercoating."}
+{"input": "what is moral obligation", "output": "lex: moral obligation ethical duty\nlex: moral obligation philosophy definition\nvec: what does moral obligation mean in ethics and where do moral duties come from\nvec: how do philosophers define and justify moral obligations people have toward others\nhyde: A moral obligation is a duty to act in accordance with ethical principles, regardless of legal requirements. For example, one may feel morally obligated to help a stranger in danger. Philosophers debate whether moral obligations stem from reason (Kant), consequences (Mill), or social contracts."}
+{"input": "what is the purpose of a thesis statement?", "output": "lex: thesis statement purpose essay writing\nlex: thesis statement argument academic paper\nvec: what role does a thesis statement play in an essay or academic paper\nvec: why is a strong thesis statement important and how should it be written\nhyde: A thesis statement presents the central argument of an essay in one or two sentences, typically at the end of the introduction. It tells the reader what the paper will argue and provides a roadmap for the evidence and analysis that follow. A strong thesis is specific, debatable, and supportable."}
+{"input": "how to attend a diplomatic event", "output": "lex: diplomatic event attendance protocol etiquette\nlex: diplomatic reception dress code invitation\nvec: what are the etiquette rules and dress codes for attending a diplomatic event or reception\nvec: how do you get invited to and properly conduct yourself at a diplomatic function\nhyde: At diplomatic events, follow the dress code specified on the invitation (black tie, business formal). Arrive punctually, greet the host first, and address ambassadors as \"Your Excellency.\" Exchange business cards with both hands. Avoid discussing controversial political topics unless invited to do so."}
+{"input": "what is renewable energy", "output": "lex: renewable energy sources solar wind\nlex: renewable energy types clean power\nvec: what are the main types of renewable energy and how do they generate electricity\nvec: how do renewable energy sources like solar and wind power differ from fossil fuels\nhyde: Renewable energy comes from naturally replenishing sources: solar, wind, hydroelectric, geothermal, and biomass. Solar panels convert sunlight into electricity using photovoltaic cells. Wind turbines capture kinetic energy from moving air. These sources produce little or no greenhouse gas emissions during operation."}
+{"input": "what is machine learning", "output": "lex: machine learning algorithms training data\nlex: machine learning AI neural networks\nvec: what is machine learning and how do algorithms learn from data to make predictions\nvec: how does machine learning differ from traditional programming and rule-based systems\nhyde: Machine learning is a subset of artificial intelligence where algorithms learn patterns from training data rather than following explicit rules. Given labeled examples, a supervised learning model adjusts its parameters to minimize prediction error. Common algorithms include linear regression, decision trees, and neural networks."}
+{"input": "what is the role of the protagonist?", "output": "lex: protagonist role literary fiction\nlex: protagonist main character story function\nvec: what role does the protagonist play in driving the plot of a novel or story\nvec: how does the protagonist function as the central character in literary fiction\nhyde: The protagonist is the central character whose goals and conflicts drive the narrative. They face obstacles, make choices, and undergo transformation through the story arc. Readers experience the plot primarily through the protagonist's perspective, creating emotional investment in their journey."}
+{"input": "api test", "output": "lex: API testing automated endpoint\nlex: REST API test Postman integration\nlex: API endpoint validation testing\nvec: how to write automated tests for REST API endpoints\nvec: what tools and methods are used for API testing and validation\nhyde: Test API endpoints using Postman or write automated tests with a framework like Jest or pytest. Send requests to each endpoint and assert status codes, response bodies, and headers. Example: `expect(response.status).toBe(200)` and validate the JSON schema of the response."}
+{"input": "how to improve civic engagement", "output": "lex: civic engagement participation community\nlex: civic engagement voting local government\nvec: what are effective ways to increase civic engagement and community participation\nvec: how can citizens get more involved in local government and community decision-making\nhyde: Improve civic engagement by attending city council meetings, volunteering for local organizations, and contacting elected officials about issues you care about. Register to vote and participate in every election, including local and midterm races. Join neighborhood associations and community boards."}
+{"input": "sustainable agriculture", "output": "lex: sustainable agriculture farming methods\nlex: sustainable agriculture soil health crop rotation\nlex: sustainable agriculture environmental impact\nvec: what farming practices make agriculture sustainable and environmentally friendly\nvec: how does sustainable agriculture balance food production with environmental conservation\nhyde: Sustainable agriculture maintains productivity while protecting natural resources. Key practices include crop rotation, cover cropping, integrated pest management, reduced tillage, and efficient water use. These methods improve soil health, reduce erosion, and lower dependence on synthetic fertilizers and pesticides."}
+{"input": "how to fix car door lock?", "output": "lex: car door lock repair fix stuck\nlex: car door lock actuator replacement\nvec: how to diagnose and fix a car door lock that is stuck or not working\nvec: how to replace a broken car door lock actuator or mechanism\nhyde: If the car door lock won't engage, check the fuse first. Test the lock with the key and remote separately. If the remote works but the button doesn't, the switch is faulty. If neither works, the lock actuator has likely failed. Remove the door panel, disconnect the actuator, and replace it."}
+{"input": "drug test", "output": "lex: drug test urine screening types\nlex: drug test employment panel detection\nlex: drug testing workplace results\nvec: what types of drug tests are used for employment and what substances do they detect\nvec: how long do drugs stay detectable in urine blood and hair drug tests\nhyde: The standard 5-panel drug test screens for marijuana (THC), cocaine, opiates, amphetamines, and PCP. Urine tests detect most substances for 1-7 days, except marijuana which can be detected for up to 30 days in heavy users. Hair follicle tests cover approximately 90 days."}
+{"input": "how to participate in lobbying efforts", "output": "lex: lobbying participation advocacy government\nlex: citizen lobbying elected officials\nvec: how can ordinary citizens participate in lobbying and advocacy to influence legislation\nvec: what steps are involved in organizing a lobbying effort for a political cause\nhyde: Citizens can lobby by contacting representatives via phone, email, or scheduled meetings. Prepare a one-page brief on your issue with specific policy asks. Join advocacy organizations that coordinate lobbying days at state capitols. Grassroots lobbying involves petitions, public comment periods, and organized letter-writing campaigns."}
+{"input": "how do you find inspiration for photography?", "output": "lex: photography inspiration ideas creative\nlex: photography creative motivation techniques\nvec: where do photographers find creative inspiration for new projects and subjects\nvec: what techniques help overcome creative block and find fresh ideas for photography\nhyde: Find photography inspiration by studying the work of photographers you admire on platforms like Flickr, 500px, and Instagram. Try a 365-day photo challenge. Walk familiar routes at different times of day. Limit yourself to one lens or shoot only in black and white to force creative thinking."}
+{"input": "how to install car led lights?", "output": "lex: car LED lights installation wiring\nlex: LED headlight bulb install car\nvec: how to install aftermarket LED lights on a car including wiring and connections\nvec: step-by-step guide for replacing car headlights or interior lights with LEDs\nhyde: To install LED headlights, open the hood and locate the headlight housing. Twist the bulb holder counterclockwise to remove the old halogen bulb. Insert the LED bulb, secure the heat sink or fan module, and connect the driver if included. Test both low and high beams before reassembling."}
+{"input": "how to critically analyze research papers", "output": "lex: research paper critical analysis evaluation\nlex: academic paper critique methodology\nvec: how do you critically evaluate the methodology and conclusions of a research paper\nvec: what framework should you use to analyze the strengths and weaknesses of an academic study\nhyde: When analyzing a research paper, evaluate: (1) Is the research question clearly stated? (2) Is the methodology appropriate and reproducible? (3) Is the sample size adequate? (4) Do the results support the conclusions? (5) Are limitations acknowledged? Check for conflicts of interest and citation of relevant prior work."}
+{"input": "what is mindfulness meditation", "output": "lex: mindfulness meditation practice technique\nlex: mindfulness meditation awareness breathing\nvec: what is mindfulness meditation and how do you practice it\nvec: what are the mental and physical health benefits of regular mindfulness meditation\nhyde: Mindfulness meditation involves focusing attention on the present moment without judgment. Sit comfortably, close your eyes, and observe your breath. When thoughts arise, acknowledge them without engaging and gently return focus to breathing. Start with 5-10 minutes daily and gradually increase duration."}
+{"input": "what is the digital divide", "output": "lex: digital divide internet access inequality\nlex: digital divide technology gap socioeconomic\nvec: what is the digital divide and how does it affect people without internet access\nvec: what factors contribute to the technology gap between different socioeconomic groups\nhyde: The digital divide refers to the gap between those who have access to computers and the internet and those who do not. Roughly 2.7 billion people worldwide remain offline. Factors include income, geography, age, and education. Rural areas and developing countries are disproportionately affected."}
+{"input": "what is nihilism", "output": "lex: nihilism philosophy meaning Nietzsche\nlex: nihilism existential moral meaning\nvec: what is nihilism as a philosophical position and what does it claim about meaning and values\nvec: how did Nietzsche and other philosophers develop and respond to nihilism\nhyde: Nihilism is the philosophical view that life lacks objective meaning, purpose, or intrinsic value. Existential nihilism holds that no action is inherently meaningful. Friedrich Nietzsche warned that the \"death of God\" would lead to nihilism but urged individuals to create their own values through the will to power."}
+{"input": "how to improve self-discipline?", "output": "lex: self-discipline improvement habits willpower\nlex: self-discipline strategies consistency\nvec: what daily habits and strategies help build stronger self-discipline\nvec: how can you train yourself to stay disciplined and follow through on goals\nhyde: Build self-discipline by starting with small commitments and increasing gradually. Make your bed every morning. Use the two-minute rule: if a task takes less than two minutes, do it now. Remove temptations from your environment and track your streaks to maintain momentum."}
+{"input": "what are the core practices of the bahá'í faith?", "output": "lex: Bahá'í faith core practices worship\nlex: Bahá'í religion prayer fasting principles\nvec: what are the main spiritual practices and rituals observed in the Bahá'í faith\nvec: what daily practices and religious obligations do Bahá'ís follow\nhyde: Core Bahá'í practices include daily obligatory prayer (one of three prayers chosen by the individual), fasting during the Nineteen-Day Fast in March, participation in Nineteen-Day Feasts, and the recitation of \"Alláh-u-Abhá\" 95 times daily. Bahá'ís also observe the prohibition on backbiting and alcohol."}
+{"input": "what is highlining?", "output": "lex: highlining slackline extreme height\nlex: highlining equipment safety rigging\nvec: what is highlining and how does it differ from regular slacklining\nvec: what equipment and safety precautions are required for highlining at extreme heights\nhyde: Highlining is the practice of walking a slackline anchored at significant height, often between cliffs, buildings, or over canyons. Unlike standard slacklining, highliners wear a climbing harness tethered to the line with a leash. Lines are rigged with redundant anchors using static rope or webbing."}
+{"input": "how to travel to bali", "output": "lex: travel Bali Indonesia flights visa\nlex: Bali trip planning itinerary transportation\nvec: how to plan a trip to Bali including flights, visas, and transportation\nvec: what do you need to know before traveling to Bali Indonesia for the first time\nhyde: Fly into Ngurah Rai International Airport (DPS) in southern Bali. Many countries receive a 30-day visa on arrival for $500,000 IDR (~$35). Book a private driver for around $40-50/day to explore the island. Popular areas include Ubud for culture, Seminyak for dining, and Uluwatu for surfing."}
+{"input": "what caused the fall of the roman empire", "output": "lex: fall Roman Empire causes decline\nlex: Roman Empire collapse reasons factors\nvec: what were the main political military and economic causes of the fall of the Roman Empire\nvec: why did the Western Roman Empire collapse in 476 AD\nhyde: The fall of the Western Roman Empire in 476 AD resulted from multiple factors: military overextension, barbarian invasions (Visigoths, Vandals, Ostrogoths), economic decline from debasement of currency, political instability with rapid emperor turnover, and the shift of power to Constantinople."}
+{"input": "what is philosophy of mind", "output": "lex: philosophy of mind consciousness problem\nlex: philosophy of mind mental states dualism\nvec: what does the philosophy of mind study about consciousness and mental states\nvec: what are the main theories in philosophy of mind such as dualism and physicalism\nhyde: Philosophy of mind examines the nature of mental states, consciousness, and their relationship to the physical brain. Central questions include the mind-body problem: how do subjective experiences (qualia) arise from neural processes? Key positions include dualism, physicalism, functionalism, and property dualism."}
+{"input": "how to build a personal brand", "output": "lex: personal brand building online presence\nlex: personal branding strategy social media\nvec: how do you build a strong personal brand for career growth or entrepreneurship\nvec: what steps should you take to develop a recognizable personal brand online\nhyde: Build your personal brand by defining your niche and unique value proposition. Create consistent profiles across LinkedIn, Twitter, and a personal website. Publish content regularly—blog posts, videos, or podcasts—that demonstrates your expertise. Engage authentically with your audience and network at industry events."}
+{"input": "what is the significance of dialogue in philosophy?", "output": "lex: dialogue philosophy Socratic method\nlex: philosophical dialogue significance discourse\nvec: why is dialogue important as a method of philosophical inquiry and reasoning\nvec: how did Socratic dialogue shape Western philosophical tradition\nhyde: Dialogue has been central to philosophy since Plato's Socratic dialogues, where truth emerges through questioning and exchange rather than dogmatic assertion. The dialectical method exposes contradictions in arguments, refines ideas through challenge and response, and models philosophy as collaborative inquiry."}
+{"input": "what does it mean to write a biography?", "output": "lex: biography writing nonfiction life story\nlex: biography research subject narrative\nvec: what is involved in writing a biography of someone's life\nvec: how do biographers research and structure a narrative about a person's life\nhyde: Writing a biography means researching and narrating the story of a real person's life. Biographers conduct interviews, examine letters and documents, and verify facts through multiple sources. The narrative typically follows chronological structure while weaving in themes that defined the subject's character and impact."}
+{"input": "how to develop a writing habit?", "output": "lex: writing habit daily routine discipline\nlex: writing habit consistency productivity\nvec: how do you build and maintain a consistent daily writing habit\nvec: what strategies help writers overcome procrastination and write regularly\nhyde: Set a specific time and place to write every day, even if only for 15-20 minutes. Track your word count or time spent writing. Don't edit while drafting—just get words on the page. Use writing prompts if you're stuck. Many successful authors, including Stephen King, recommend writing at least 1,000 words daily."}
+{"input": "what is green technology", "output": "lex: green technology clean environmental\nlex: green technology sustainable energy efficiency\nvec: what is green technology and what industries does it apply to\nvec: how does green technology help reduce environmental impact and promote sustainability\nhyde: Green technology encompasses innovations that reduce environmental impact, including solar panels, electric vehicles, energy-efficient buildings, biodegradable materials, and water purification systems. These technologies aim to conserve resources, reduce waste, and lower carbon emissions across manufacturing, energy, and transportation sectors."}
+{"input": "how to connect car bluetooth?", "output": "lex: car Bluetooth pairing phone connect\nlex: car Bluetooth setup audio streaming\nvec: how to pair a smartphone to a car's Bluetooth system for calls and music\nvec: step-by-step instructions for connecting a phone to car Bluetooth for the first time\nhyde: To connect via Bluetooth, enable Bluetooth on your phone and car infotainment system. On the car stereo, go to Settings > Bluetooth > Add Device. Select your car's name on your phone's Bluetooth list. Confirm the pairing code on both devices. The phone should automatically reconnect on future drives."}
+{"input": "what are the building blocks of life", "output": "lex: building blocks of life molecules biochemistry\nlex: amino acids nucleic acids proteins cells\nvec: what are the fundamental molecular building blocks that make up all living organisms\nvec: how do amino acids, nucleic acids, and lipids form the basis of life on Earth\nhyde: The building blocks of life are four types of organic molecules: proteins (made from amino acids), nucleic acids (DNA and RNA from nucleotides), carbohydrates (sugars and polysaccharides), and lipids (fats and phospholipids). These molecules self-assemble into cells, the basic unit of all living organisms."}
+{"input": "what is the role of a cinematographer?", "output": "lex: cinematographer role film camera director of photography\nlex: cinematographer lighting shot composition\nvec: what does a cinematographer do on a film set and what creative decisions do they make\nvec: how does the director of photography control lighting, camera, and visual storytelling in film\nhyde: The cinematographer, or director of photography (DP), is responsible for the visual look of a film. They select cameras, lenses, and lighting setups, and work with the director to plan shot composition and camera movement. The DP oversees the camera and electrical departments on set."}
+{"input": "landscape photography", "output": "lex: landscape photography techniques composition\nlex: landscape photography camera lens settings\nlex: landscape photography golden hour\nvec: what camera settings and techniques produce stunning landscape photographs\nvec: how to compose and shoot landscape photography with proper exposure and depth of field\nhyde: For landscape photography, use a wide-angle lens (16-35mm), aperture of f/8-f/11 for maximum sharpness, and a low ISO (100). Shoot during golden hour for warm, directional light. Use a tripod, compose with the rule of thirds, and include a strong foreground element to create depth."}
+{"input": "what are literary movements?", "output": "lex: literary movements periods history\nlex: literary movements Romanticism Modernism Realism\nvec: what are the major literary movements in history and what defines each one\nvec: how do literary movements like Romanticism, Realism, and Modernism differ from each other\nhyde: Literary movements are periods defined by shared styles, themes, and philosophies. Romanticism (1800-1850) emphasized emotion and nature. Realism (1850-1900) depicted ordinary life accurately. Modernism (1900-1945) experimented with form and stream of consciousness. Postmodernism questioned grand narratives through irony and fragmentation."}
+{"input": "what is the capital of france?", "output": "lex: capital France Paris\nlex: Paris capital city France\nvec: what city is the capital of France\nvec: where is the capital of France located and what is it known for\nhyde: Paris is the capital and largest city of France, located on the Seine River in northern France. With a population of over 2 million in the city proper and 12 million in the metropolitan area, it is the country's political, economic, and cultural center."}
+{"input": "golf play", "output": "lex: golf playing tips beginner\nlex: golf swing technique course\nlex: golf rules gameplay etiquette\nvec: how do you play golf and what are the basic rules for beginners\nvec: what techniques and etiquette should new golfers learn before playing on a course\nhyde: A round of golf consists of 18 holes. At each hole, tee off from the tee box, play through the fairway, and putt on the green. The objective is to complete each hole in the fewest strokes. Beginners should start at a driving range, learn basic grip and stance, and play executive (par-3) courses."}
+{"input": "build a treehouse", "output": "lex: treehouse building construction plans\nlex: treehouse DIY wood platform tree\nvec: how to design and build a treehouse safely in a backyard tree\nvec: what materials and tools do you need to build a treehouse for kids\nhyde: Choose a healthy hardwood tree (oak, maple, beech) with a trunk at least 12 inches in diameter. Use treehouse attachment bolts (TABs) rather than nails, which damage the tree. Build the platform at 6-8 feet high using pressure-treated lumber. Frame with 2x6 joists on 16-inch centers and deck with 5/4 boards."}
+{"input": "where to buy classic car parts", "output": "lex: classic car parts buy online supplier\nlex: vintage car parts restoration OEM\nvec: where can you purchase replacement parts for classic and vintage cars\nvec: which online stores and suppliers specialize in classic car restoration parts\nhyde: Find classic car parts at specialty suppliers like Summit Racing, Classic Industries, and Hemmings. Year One stocks OEM-quality parts for GM, Ford, and Mopar vehicles from the 1950s-80s. JEGS and Rock Auto also carry a wide selection. Check eBay Motors and swap meets for rare NOS (new old stock) parts."}
+{"input": "how to set business goals", "output": "lex: business goals setting SMART strategy\nlex: business goal planning objectives targets\nvec: how to set effective business goals using the SMART framework\nvec: what process should entrepreneurs follow to define and track business objectives\nhyde: Set business goals using the SMART framework: Specific (\"increase monthly revenue by 15%\"), Measurable (track with KPIs), Achievable (realistic given resources), Relevant (aligned with company mission), and Time-bound (complete by Q3). Break annual goals into quarterly milestones and review progress monthly."}
+{"input": "what are the characteristics of neolithic societies?", "output": "lex: Neolithic society characteristics agriculture settlement\nlex: Neolithic period farming tools social structure\nvec: what were the key characteristics of Neolithic societies after the agricultural revolution\nvec: how did Neolithic communities organize their social structure, farming, and settlements\nhyde: Neolithic societies (approximately 10,000-3,000 BCE) were characterized by the transition from hunting-gathering to agriculture. People domesticated plants and animals, formed permanent settlements, developed pottery and polished stone tools, and created increasingly complex social hierarchies with specialized labor roles."}
+{"input": "what is the significance of rituals in judaism?", "output": "lex: Judaism rituals significance religious practice\nlex: Jewish rituals Shabbat observance tradition\nvec: what role do rituals play in Jewish religious life and spiritual practice\nvec: why are rituals like Shabbat, kashrut, and prayer important in Judaism\nhyde: Rituals in Judaism (mitzvot) structure daily, weekly, and yearly life around sacred observance. Shabbat, observed from Friday evening to Saturday night, sanctifies time through rest, prayer, and family meals. Rituals connect Jews to their covenant with God, collective memory, and community identity across generations."}
+{"input": "how to increase productivity at work?", "output": "lex: productivity work increase tips\nlex: workplace productivity time management techniques\nvec: what proven strategies help people increase their productivity at work\nvec: how can you manage your time better to get more done during the workday\nhyde: Increase workplace productivity by time-blocking your calendar in 90-minute focus sessions. Tackle your hardest task first (eat the frog). Batch similar tasks like email and meetings. Eliminate distractions by silencing notifications. Use the Pomodoro Technique: 25 minutes of work, 5-minute break, repeat."}
+{"input": "what is panorama photography?", "output": "lex: panorama photography wide angle stitching\nlex: panoramic photo technique camera rotation\nvec: what is panorama photography and how do you capture and stitch panoramic images\nvec: what camera techniques and software are used to create panoramic photographs\nhyde: Panorama photography captures wide scenes by shooting multiple overlapping images and stitching them together. Use a tripod with a panoramic head, shoot in manual mode to keep exposure consistent, and overlap each frame by 30-50%. Stitch in software like Lightroom, PTGui, or Hugin."}
+{"input": "what are the key periods in chinese history", "output": "lex: Chinese history periods dynasties timeline\nlex: China historical periods Qin Han Tang\nvec: what are the major periods and dynasties in Chinese history from ancient to modern times\nvec: how is Chinese history divided into dynastic periods and what defined each era\nhyde: Key periods in Chinese history include: Shang Dynasty (1600-1046 BCE), Zhou Dynasty (1046-256 BCE), Qin Dynasty (221-206 BCE, first unified empire), Han Dynasty (206 BCE-220 CE), Tang Dynasty (618-907, golden age), Song Dynasty (960-1279), Ming Dynasty (1368-1644), Qing Dynasty (1644-1912), and the People's Republic (1949-present)."}
+{"input": "what are the elements of a good story?", "output": "lex: story elements plot character setting\nlex: storytelling elements narrative structure\nvec: what are the essential elements that make a story compelling and well-crafted\nvec: how do plot, character, setting, and conflict work together in a good story\nhyde: A good story requires compelling characters, a clear conflict, a structured plot (beginning, rising action, climax, resolution), a vivid setting, and a consistent point of view. Theme gives the story meaning beyond its events. Strong dialogue reveals character and advances the plot naturally."}
+{"input": "latest news in artificial intelligence research", "output": "lex: artificial intelligence research news 2025 2026\nlex: AI research breakthroughs latest developments\nlex: machine learning AI news recent\nvec: what are the most recent breakthroughs and developments in artificial intelligence research in 2025-2026\nvec: what new AI models and techniques have been published in the latest research\nhyde: In 2025-2026, AI research advanced with larger multimodal models capable of reasoning across text, image, and video. Key developments include improved chain-of-thought reasoning, AI agents that can use tools and write code, and open-weight models matching proprietary performance."}
+{"input": "what are the main beliefs of new age spirituality?", "output": "lex: New Age spirituality beliefs practices\nlex: New Age movement spiritual holistic\nvec: what are the central beliefs and practices of New Age spirituality\nvec: how does the New Age movement define spirituality, consciousness, and healing\nhyde: New Age spirituality encompasses diverse beliefs including holistic healing, the interconnectedness of all life, personal spiritual growth, and the existence of higher consciousness. Practitioners may draw from Eastern religions, astrology, crystal healing, meditation, and the idea that individuals can channel divine energy."}
+{"input": "how to plan a camping trip with kids", "output": "lex: camping trip kids family planning\nlex: family camping children gear checklist\nvec: how to plan and prepare for a family camping trip with young children\nvec: what gear and activities should you bring when camping with kids for the first time\nhyde: Plan a family camping trip by choosing a campground with bathrooms and short hiking trails. Pack extra layers, rain gear, and familiar snacks. Bring activities: nature scavenger hunts, glow sticks, and star charts. Set up camp early to let kids explore. Practice tent setup in the backyard first."}
+{"input": "how do philosophers conceptualize identity", "output": "lex: personal identity philosophy self\nlex: identity philosophy Locke consciousness persistence\nvec: how do philosophers define and explain personal identity and what makes someone the same person over time\nvec: what are the major philosophical theories of identity from Locke to modern philosophy of mind\nhyde: Philosophers debate what constitutes personal identity over time. John Locke argued identity rests on continuity of consciousness and memory. David Hume denied a fixed self, viewing identity as a bundle of perceptions. Derek Parfit argued identity is not what matters—psychological continuity is."}
+{"input": "what is the role of civil society in politics", "output": "lex: civil society political role organizations\nlex: civil society democracy NGOs advocacy\nvec: what role do civil society organizations play in democratic politics and governance\nvec: how does civil society influence government policy and hold political leaders accountable\nhyde: Civil society—NGOs, advocacy groups, unions, and community organizations—serves as a check on government power. These groups mobilize citizens, advocate for policy changes, monitor elections, and provide services the state cannot. A strong civil society is considered essential for healthy democracy and government accountability."}
+{"input": "how to handle inflation impact", "output": "lex: inflation impact personal finance manage\nlex: inflation coping strategies budget investment\nvec: how can individuals protect their finances and manage the impact of high inflation\nvec: what financial strategies help people cope with rising prices and reduced purchasing power\nhyde: To handle inflation, review your budget and cut discretionary spending. Move savings to high-yield accounts or I-bonds that adjust for inflation. Lock in fixed-rate loans before rates rise. Invest in assets that historically outpace inflation: equities, real estate, and TIPS (Treasury Inflation-Protected Securities)."}
+{"input": "how is energy conserved during chemical reactions", "output": "lex: energy conservation chemical reactions thermodynamics\nlex: chemical reaction energy transfer exothermic endothermic\nvec: how does the law of conservation of energy apply to chemical reactions\nvec: how is energy transferred and conserved in exothermic and endothermic chemical reactions\nhyde: In chemical reactions, energy is neither created nor destroyed (first law of thermodynamics). Exothermic reactions release energy—bonds formed in products are stronger than bonds broken in reactants. Endothermic reactions absorb energy—more energy is needed to break reactant bonds than is released forming product bonds."}
+{"input": "how to make sourdough bread", "output": "lex: sourdough bread recipe starter\nlex: sourdough bread baking fermentation dough\nvec: what is the step-by-step process for making sourdough bread from a starter\nvec: how do you feed a sourdough starter and bake a loaf of sourdough bread at home\nhyde: Mix 100g active starter, 375g water, 500g bread flour, and 10g salt. Stretch and fold every 30 minutes for 2 hours, then bulk ferment 4-8 hours until doubled. Shape, place in a banneton, and cold-proof in the fridge overnight. Bake in a Dutch oven at 450°F: 20 min covered, 20 min uncovered."}
+{"input": "what is the philosophy of aesthetics", "output": "lex: aesthetics philosophy beauty art\nlex: philosophy aesthetics theory judgment taste\nvec: what is the philosophy of aesthetics and how does it define beauty and art\nvec: how do philosophers like Kant and Hume approach questions of aesthetic judgment and taste\nhyde: Aesthetics is the branch of philosophy concerned with the nature of beauty, art, and taste. Kant argued that aesthetic judgments are subjective yet claim universal validity—when we call something beautiful, we expect others to agree. Hume held that taste varies but can be refined through experience and education."}
+{"input": "what to pack for a hike?", "output": "lex: hiking packing list gear essentials\nlex: hiking pack checklist day hike\nvec: what essential items should you pack for a day hike in the outdoors\nvec: what gear and supplies do you need to bring on a hiking trip for safety and comfort\nhyde: The ten essentials for hiking: navigation (map/compass/GPS), sun protection, insulation (extra layers), illumination (headlamp), first aid kit, fire starter, repair tools, nutrition (extra food), hydration (extra water), and emergency shelter. Also bring a whistle, trekking poles, and broken-in boots."}
+{"input": "what is the philosophy of existentialism?", "output": "lex: existentialism philosophy Sartre Kierkegaard\nlex: existentialism existence precedes essence freedom\nvec: what is existentialist philosophy and what are its core claims about human freedom and meaning\nvec: how did Sartre, Kierkegaard, and Camus define existentialism and its key ideas\nhyde: Existentialism holds that existence precedes essence—humans are not born with a fixed nature but create themselves through choices. Sartre argued we are \"condemned to be free,\" fully responsible for our actions. Kierkegaard emphasized the anxiety of individual choice, while Camus explored the absurdity of seeking meaning in an indifferent universe."}
+{"input": "battery test", "output": "lex: battery test multimeter voltage\nlex: battery test car 12V load\nlex: battery testing health capacity\nvec: how to test a battery's charge level and health using a multimeter or load tester\nvec: how to check if a car battery or device battery needs replacement\nhyde: Test a 12V car battery with a multimeter set to DC volts. A fully charged battery reads 12.6V or higher. Between 12.0-12.4V indicates partial charge. Below 12.0V means the battery is discharged. For a load test, apply a load equal to half the CCA rating for 15 seconds—voltage should stay above 9.6V."}
+{"input": "what is hdr photography?", "output": "lex: HDR photography high dynamic range\nlex: HDR photo bracketing tone mapping\nvec: what is HDR photography and how does it capture a wider range of light and shadow\nvec: how do you shoot and process HDR photos using exposure bracketing and tone mapping\nhyde: HDR (High Dynamic Range) photography combines multiple exposures of the same scene—typically 3-5 bracketed shots—to capture detail in both highlights and shadows. The images are merged using software like Photomatix or Lightroom, then tone-mapped to produce a single image with a wider dynamic range than a single exposure."}
+{"input": "what is the significance of literary awards?", "output": "lex: literary awards significance publishing\nlex: literary prizes Nobel Pulitzer Booker impact\nvec: why are literary awards significant for authors and the publishing industry\nvec: how do prizes like the Nobel, Pulitzer, and Booker Prize affect book sales and literary reputation\nhyde: Literary awards elevate authors' visibility and boost book sales—Booker Prize winners typically see a 600% increase in sales. Awards canonize works in literary culture, influence academic curricula, and bring attention to underrepresented voices. They also shape publishers' marketing strategies and readers' choices."}
+{"input": "what is cubism?", "output": "lex: Cubism art movement Picasso Braque\nlex: Cubism painting geometric abstraction\nvec: what is Cubism as an art movement and how did it change visual representation in painting\nvec: how did Picasso and Braque develop Cubism and what are its defining visual characteristics\nhyde: Cubism, pioneered by Pablo Picasso and Georges Braque around 1907-1914, broke objects into geometric fragments and depicted multiple viewpoints simultaneously on a flat canvas. Analytic Cubism (1907-1912) deconstructed forms into monochrome facets. Synthetic Cubism (1912-1914) introduced collage, color, and simpler shapes."}
+{"input": "cache hit", "output": "lex: cache hit rate ratio\nlex: CPU cache hit miss latency\nlex: web cache hit response time\nvec: what happens when data is found in cache memory\nvec: how cache hits improve application performance versus cache misses\nhyde: A cache hit occurs when the requested data is found in the cache layer, avoiding a slower lookup to the backing store. Hit rates above 90% typically indicate effective caching."}
+{"input": "current applications of machine learning in research", "output": "lex: machine learning research applications 2025 2026\nlex: ML models scientific research use cases\nlex: deep learning academic research tools\nvec: how is machine learning being applied in scientific research today\nvec: what are the latest ways researchers use ML models in their studies\nhyde: Machine learning is now routinely used in genomics for variant calling, in climate science for weather prediction, and in materials science for discovering novel compounds. Recent breakthroughs include protein structure prediction and automated literature review."}
+{"input": "how to plant a vegetable garden", "output": "lex: vegetable garden planting steps\nlex: backyard vegetable garden soil preparation\nlex: raised bed vegetable garden layout\nvec: what are the steps to start a vegetable garden from scratch\nvec: how to prepare soil and plant vegetables for beginners\nhyde: Choose a site with 6-8 hours of direct sunlight. Amend the soil with compost, till to 12 inches deep, and plant seedlings after the last frost date. Space rows 18-24 inches apart depending on the crop."}
+{"input": "how does existentialism view authenticity", "output": "lex: existentialism authenticity Sartre Heidegger\nlex: authentic existence existentialist philosophy\nvec: what does authenticity mean in existentialist philosophy\nvec: how do existentialist thinkers define living an authentic life\nhyde: For Sartre, authenticity means acknowledging radical freedom and refusing bad faith—the self-deception of pretending our choices are determined by external forces. Heidegger's Eigentlichkeit calls us to own our finitude rather than losing ourselves in das Man."}
+{"input": "what is the great depression", "output": "lex: Great Depression 1929 economic collapse\nlex: Great Depression causes unemployment stock market crash\nvec: what caused the Great Depression and how did it affect the economy\nvec: what were the major events and consequences of the Great Depression in the 1930s\nhyde: The Great Depression began with the stock market crash of October 1929 and lasted until the late 1930s. Unemployment peaked at 25%, thousands of banks failed, and GDP fell by nearly 30%. The New Deal introduced federal relief programs."}
+{"input": "what is the international court of justice", "output": "lex: International Court of Justice ICJ United Nations\nlex: ICJ jurisdiction Hague rulings\nvec: what is the purpose and function of the International Court of Justice\nvec: how does the ICJ at The Hague resolve disputes between countries\nhyde: The International Court of Justice (ICJ) is the principal judicial organ of the United Nations, located in The Hague, Netherlands. It settles legal disputes between states and gives advisory opinions on questions referred by UN organs."}
+{"input": "what is influencer marketing", "output": "lex: influencer marketing social media brand promotion\nlex: influencer campaigns Instagram TikTok sponsorship\nvec: how does influencer marketing work for promoting brands on social media\nvec: what is influencer marketing and why do companies pay content creators\nhyde: Influencer marketing is a strategy where brands partner with social media creators who have engaged followings to promote products. Campaigns may involve sponsored posts, affiliate links, or product reviews. ROI is measured through engagement rates, conversions, and reach."}
+{"input": "how to change a flat tire?", "output": "lex: change flat tire steps jack lug nuts\nlex: flat tire replacement spare wheel\nvec: step-by-step instructions for changing a flat tire on the side of the road\nvec: how to safely jack up a car and replace a flat tire with the spare\nhyde: Loosen the lug nuts before jacking. Place the jack under the frame near the flat tire, raise the vehicle, remove the lug nuts, swap in the spare, hand-tighten the nuts in a star pattern, lower the car, then torque to 80-100 ft-lbs."}
+{"input": "what is the significance of the lotus in buddhism?", "output": "lex: lotus flower Buddhism symbolism\nlex: lotus Buddhist enlightenment purity\nvec: why is the lotus flower an important symbol in Buddhism\nvec: what does the lotus represent in Buddhist art and teachings\nhyde: The lotus grows from muddy water yet blooms immaculately, symbolizing the journey from suffering to enlightenment. In Buddhist iconography, the Buddha is often depicted seated on a lotus throne, representing purity of mind arising from the world of samsara."}
+{"input": "code lint", "output": "lex: code linter static analysis\nlex: linting tools ESLint Pylint code quality\nlex: lint rules syntax errors warnings\nvec: what is code linting and how do linting tools check source code for errors\nvec: how to set up a code linter for catching bugs and enforcing style rules\nhyde: A linter performs static analysis on source code to detect syntax errors, stylistic issues, and potential bugs without executing the program. Popular linters include ESLint for JavaScript, Pylint for Python, and Clippy for Rust."}
+{"input": "what is content marketing", "output": "lex: content marketing strategy blog SEO\nlex: content marketing audience engagement brand\nvec: what is content marketing and how does it attract customers\nvec: how do businesses use content marketing to drive traffic and build trust\nhyde: Content marketing focuses on creating and distributing valuable, relevant content—blog posts, videos, podcasts, whitepapers—to attract and retain a target audience. Rather than directly promoting a product, it builds authority and nurtures leads through the sales funnel."}
+{"input": "what is the meaning of hanukkah", "output": "lex: Hanukkah meaning Jewish festival of lights\nlex: Hanukkah menorah Maccabees temple rededication\nvec: what is the history and significance of Hanukkah in Judaism\nvec: why do Jewish people celebrate Hanukkah and what does it commemorate\nhyde: Hanukkah commemorates the rededication of the Second Temple in Jerusalem after the Maccabean revolt against the Seleucid Empire in 164 BCE. The miracle of the oil—one day's supply lasting eight days—is celebrated by lighting the menorah each night."}
+{"input": "what is existential angst", "output": "lex: existential angst anxiety Kierkegaard\nlex: existential dread absurdity freedom\nvec: what does existential angst mean in philosophy\nvec: how do existentialist philosophers describe the feeling of existential anxiety\nhyde: Existential angst, or Angst, is the deep anxiety that arises from confronting freedom, mortality, and the absence of inherent meaning. Kierkegaard described it as the dizziness of freedom; Heidegger linked it to awareness of one's Being-toward-death."}
+{"input": "how to style open shelves", "output": "lex: open shelf styling tips decor\nlex: kitchen open shelving arrangement display\nvec: how to arrange and decorate open shelves so they look good\nvec: what are tips for styling open shelves in a kitchen or living room\nhyde: Group items in odd numbers and vary heights. Mix functional pieces like dishes with decorative objects like plants or small art. Leave 30% of the shelf empty to avoid clutter. Use a consistent color palette to tie everything together."}
+{"input": "linkedin profile", "output": "lex: LinkedIn profile optimization headline\nlex: LinkedIn profile tips summary photo\nlex: LinkedIn profile writing professional\nvec: how to create an effective LinkedIn profile that attracts recruiters\nvec: what should you include in your LinkedIn profile headline and summary\nhyde: Your LinkedIn headline should go beyond your job title—include keywords and your value proposition. Use the summary section to tell your professional story in first person. Add a professional headshot; profiles with photos get 21x more views."}
+{"input": "what are the benefits of yoga", "output": "lex: yoga benefits health flexibility stress\nlex: yoga physical mental health advantages\nvec: what are the physical and mental health benefits of practicing yoga regularly\nvec: how does yoga improve flexibility, strength, and stress levels\nhyde: Regular yoga practice improves flexibility, builds core strength, and lowers cortisol levels. Studies show it reduces chronic back pain, lowers blood pressure, and decreases symptoms of anxiety and depression. Even 20 minutes daily produces measurable benefits."}
+{"input": "what is virtue ethics", "output": "lex: virtue ethics Aristotle character moral\nlex: virtue ethics eudaimonia moral philosophy\nvec: what is virtue ethics and how does it differ from other moral theories\nvec: how does Aristotle's virtue ethics define moral character and the good life\nhyde: Virtue ethics, rooted in Aristotle's Nicomachean Ethics, holds that morality centers on developing virtuous character traits—courage, temperance, justice, prudence—rather than following rules or calculating consequences. The goal is eudaimonia, or human flourishing."}
+{"input": "how to calculate carbon emissions?", "output": "lex: carbon emissions calculation formula CO2\nlex: carbon footprint calculator methodology\nvec: how do you calculate the carbon emissions from energy use and transportation\nvec: what formulas and data are used to measure carbon dioxide emissions\nhyde: To calculate CO2 emissions, multiply the activity data (e.g., kWh of electricity, liters of fuel) by the appropriate emission factor. For gasoline: 2.31 kg CO2 per liter burned. For grid electricity, use the regional emission factor, typically 0.3-0.9 kg CO2/kWh."}
+{"input": "how to start rock climbing", "output": "lex: rock climbing beginner indoor gym\nlex: rock climbing gear shoes harness belay\nvec: how to get started with rock climbing as a complete beginner\nvec: what equipment and skills do beginners need for indoor rock climbing\nhyde: Start at an indoor climbing gym where you can rent shoes and a harness. Take a belay certification class to learn rope handling. Begin on easy routes graded V0-V1 for bouldering or 5.6-5.8 for top-rope. Focus on footwork over arm strength."}
+{"input": "how to create a moon garden?", "output": "lex: moon garden white flowers night-blooming plants\nlex: moon garden design layout fragrant plants\nvec: how to plan and plant a garden designed to be enjoyed at night\nvec: what plants and flowers work best in a moon garden\nhyde: A moon garden features white and pale-colored flowers, silver foliage, and night-blooming plants that glow under moonlight. Include moonflower (Ipomoea alba), white nicotiana, night-blooming jasmine, dusty miller, and lamb's ear. Add light-colored gravel paths for reflection."}
+{"input": "what is the significance of the bildungsroman?", "output": "lex: bildungsroman coming-of-age novel literary genre\nlex: bildungsroman significance literature examples\nvec: what is a bildungsroman and why is it an important literary genre\nvec: how does the bildungsroman novel trace a character's growth and development\nhyde: The bildungsroman, or coming-of-age novel, follows a protagonist's psychological and moral development from youth to adulthood. Examples include Goethe's Wilhelm Meister, Dickens' Great Expectations, and Joyce's A Portrait of the Artist as a Young Man."}
+{"input": "what is moral behavior", "output": "lex: moral behavior ethics right wrong conduct\nlex: moral behavior definition philosophy psychology\nvec: what defines moral behavior and how do people distinguish right from wrong\nvec: what is moral behavior according to ethics and psychology\nhyde: Moral behavior refers to actions that conform to standards of right conduct within a society or ethical framework. It involves making choices that consider the well-being of others, guided by principles such as fairness, honesty, empathy, and respect for autonomy."}
+{"input": "how to use a rototiller?", "output": "lex: rototiller operation tilling soil garden\nlex: rototiller how to use depth settings\nvec: step-by-step instructions for using a rototiller to prepare garden soil\nvec: how to operate a rototiller safely and effectively\nhyde: Set the tilling depth to 6-8 inches for new beds. Walk slowly and let the tines do the work—don't force it forward. Make overlapping passes in parallel rows. Avoid tilling wet soil, which creates compaction. Clean tines after each use."}
+{"input": "how to build a greenhouse?", "output": "lex: greenhouse build DIY construction plans\nlex: greenhouse frame polycarbonate panels foundation\nvec: how to build a small greenhouse in your backyard step by step\nvec: what materials and design are needed to construct a DIY greenhouse\nhyde: Start with a level foundation of treated lumber or concrete blocks. Build the frame from galvanized steel or cedar. Cover with 8mm twin-wall polycarbonate panels, which insulate better than glass. Include ridge vents for airflow and a door on the south-facing end."}
+{"input": "how to handle sibling rivalry?", "output": "lex: sibling rivalry parenting tips conflict\nlex: sibling fighting jealousy children strategies\nvec: how can parents manage sibling rivalry and reduce fighting between children\nvec: what strategies help siblings get along and resolve conflicts\nhyde: Avoid comparing siblings or taking sides. Acknowledge each child's feelings before mediating. Teach conflict resolution skills: use I-statements, take turns speaking, and brainstorm solutions together. Spend one-on-one time with each child to reduce jealousy."}
+{"input": "how to polish car paint?", "output": "lex: car paint polish compound buffing\nlex: auto paint polishing scratch removal swirl marks\nvec: how to polish car paint to remove scratches and restore shine\nvec: what is the correct technique for machine polishing automotive paint\nhyde: Wash and clay bar the surface first. Apply a small amount of polishing compound to a foam pad on a dual-action polisher. Work in 2x2 foot sections at 1200-1500 RPM with medium pressure. Wipe residue with a microfiber towel, then apply sealant or wax."}
+{"input": "what is intrinsic value", "output": "lex: intrinsic value philosophy ethics\nlex: intrinsic value stock valuation finance\nvec: what does intrinsic value mean in philosophy and in finance\nvec: how is intrinsic value defined as something valuable in itself regardless of consequences\nhyde: In philosophy, intrinsic value is the worth something has in itself, independent of its usefulness. Kant argued that rational beings have intrinsic value as ends in themselves. In finance, intrinsic value refers to the calculated true worth of an asset based on fundamentals."}
+{"input": "how to get rid of weeds naturally", "output": "lex: natural weed killer organic herbicide\nlex: remove weeds without chemicals mulch vinegar\nvec: what are natural methods for killing and preventing weeds in a garden\nvec: how to get rid of weeds without using chemical herbicides\nhyde: Apply a 3-4 inch layer of mulch to suppress weed growth. Pour boiling water directly on weeds in cracks. Spray a mixture of white vinegar, salt, and dish soap on foliage in full sun. Hand-pull weeds after rain when roots come out easily."}
+{"input": "what is the concept of original sin", "output": "lex: original sin Christian theology Adam Eve\nlex: original sin doctrine fall of man\nvec: what is original sin in Christian theology and where does the idea come from\nvec: how does the concept of original sin explain human nature in Christianity\nhyde: Original sin is the Christian doctrine that humanity inherited a sinful nature from Adam and Eve's disobedience in the Garden of Eden. Augustine of Hippo formalized the teaching, arguing that all humans are born in a state of sin, redeemable only through divine grace."}
+{"input": "how to build a successful brand", "output": "lex: brand building strategy identity positioning\nlex: brand identity logo messaging target audience\nvec: what steps are needed to build a strong and recognizable brand\nvec: how do companies create a successful brand identity and positioning\nhyde: Define your brand's mission, values, and target audience. Develop a distinctive visual identity—logo, color palette, typography. Craft a consistent brand voice across all channels. Differentiate with a clear value proposition and deliver on your brand promise consistently."}
+{"input": "what are the teachings of the baha'i faith?", "output": "lex: Baha'i faith teachings principles Baha'u'llah\nlex: Baha'i beliefs unity humanity religion\nvec: what are the core beliefs and teachings of the Baha'i faith\nvec: what did Baha'u'llah teach about unity, equality, and world peace\nhyde: The Baha'i faith, founded by Baha'u'llah in 19th-century Persia, teaches the oneness of God, the oneness of religion, and the oneness of humanity. Core principles include elimination of prejudice, equality of men and women, universal education, and harmony of science and religion."}
+{"input": "how to potty train a toddler?", "output": "lex: potty training toddler tips methods\nlex: toddler toilet training readiness signs\nvec: how to potty train a toddler and what are the signs of readiness\nvec: what is the best approach to potty training a 2-year-old child\nhyde: Watch for readiness signs: staying dry for 2 hours, showing interest in the toilet, and communicating the need to go. Start with a child-sized potty, establish a routine after meals and naps, use positive reinforcement, and expect accidents—avoid punishment."}
+{"input": "how to reduce waste in everyday life?", "output": "lex: reduce waste zero waste lifestyle tips\nlex: waste reduction recycling composting reuse\nvec: what are practical ways to reduce household waste in daily life\nvec: how can individuals cut down on trash and move toward zero waste living\nhyde: Bring reusable bags, bottles, and containers when shopping. Buy in bulk to reduce packaging. Compost food scraps instead of sending them to landfill. Choose products with minimal packaging, repair items before replacing, and donate what you no longer need."}
+{"input": "how international relations affect trade", "output": "lex: international relations trade policy tariffs\nlex: geopolitics trade agreements bilateral multilateral\nvec: how do international political relationships influence global trade and tariffs\nvec: what is the connection between diplomacy and international trade policy\nhyde: Diplomatic relations directly shape trade flows through tariffs, sanctions, and trade agreements. Countries with strong bilateral ties negotiate favorable terms—like the USMCA between the US, Mexico, and Canada—while geopolitical tensions can trigger trade wars and export controls."}
+{"input": "what is business continuity planning", "output": "lex: business continuity planning BCP disaster recovery\nlex: BCP risk assessment contingency plan\nvec: what is a business continuity plan and why do organizations need one\nvec: how do companies create a business continuity plan for disaster recovery\nhyde: Business continuity planning (BCP) ensures an organization can maintain critical functions during and after a disruption. It includes risk assessment, identifying essential operations, establishing recovery time objectives, and defining procedures for communication, IT recovery, and alternate work sites."}
+{"input": "how to have a successful playdate?", "output": "lex: playdate tips children toddler socializing\nlex: kids playdate activities hosting\nvec: how to plan and host a successful playdate for young children\nvec: what tips help make a playdate fun and smooth for kids and parents\nhyde: Keep playdates short—90 minutes is ideal for toddlers. Prepare a few structured activities but allow free play. Put away special toys to avoid conflicts. Have snacks ready, discuss allergies with the other parent beforehand, and supervise without hovering."}
+{"input": "what are the major forms of poetry?", "output": "lex: poetry forms types sonnet haiku epic\nlex: poetic forms verse structures literary\nvec: what are the main types and forms of poetry in literature\nvec: how do different poetry forms like sonnets, haiku, and free verse differ\nhyde: Major poetic forms include the sonnet (14 lines, iambic pentameter), haiku (3 lines, 5-7-5 syllables), epic (long narrative), ballad (storytelling with rhyme), ode (lyrical praise), limerick (humorous five-line form), villanelle (19 lines with refrains), and free verse (no fixed structure)."}
+{"input": "when to plant tulip bulbs?", "output": "lex: tulip bulbs planting time season fall\nlex: tulip bulb planting depth spacing\nvec: what time of year should you plant tulip bulbs for spring blooms\nvec: when is the best season to plant tulips and how deep should the bulbs go\nhyde: Plant tulip bulbs in fall, 6-8 weeks before the ground freezes—typically October to November in most zones. Set bulbs 6-8 inches deep, pointed end up, spaced 4-6 inches apart. They need a cold period of 12-16 weeks to bloom in spring."}
+{"input": "where to buy raised garden beds?", "output": "lex: raised garden beds buy online store\nlex: raised bed garden kits cedar metal\nvec: where can I buy raised garden beds and what materials are best\nvec: what are the best places to purchase raised bed garden kits\nhyde: Raised garden beds are available at Home Depot, Lowe's, and garden centers. Online retailers like Gardener's Supply, Amazon, and Birdies offer metal and cedar kits. Cedar is rot-resistant and long-lasting; galvanized steel beds are durable and modern-looking."}
+{"input": "how to plant a tree properly?", "output": "lex: tree planting technique hole depth root ball\nlex: plant tree correctly mulch watering\nvec: what is the correct way to plant a tree so it grows healthy\nvec: how deep and wide should the hole be when planting a new tree\nhyde: Dig a hole 2-3 times wider than the root ball but only as deep. Set the tree so the root flare sits at ground level. Backfill with native soil, water deeply, and apply 2-4 inches of mulch in a ring, keeping it away from the trunk to prevent rot."}
+{"input": "what is the role of enzymes in digestion", "output": "lex: enzymes digestion amylase protease lipase\nlex: digestive enzymes stomach intestine breakdown\nvec: how do enzymes help break down food during the digestive process\nvec: what role do specific enzymes like amylase and protease play in digestion\nhyde: Digestive enzymes catalyze the breakdown of macronutrients into absorbable units. Amylase in saliva and the pancreas breaks starch into sugars. Pepsin in the stomach cleaves proteins. Lipase from the pancreas breaks fats into fatty acids and glycerol in the small intestine."}
+{"input": "what to wear for rock climbing", "output": "lex: rock climbing clothing gear outfit\nlex: climbing shoes harness chalk bag apparel\nvec: what clothes and gear should you wear for indoor or outdoor rock climbing\nvec: what is the best clothing to wear when rock climbing for comfort and safety\nhyde: Wear stretchy, moisture-wicking pants or shorts that allow full range of motion. Choose a fitted athletic shirt—avoid loose fabric that catches on holds. Climbing shoes should fit snugly. Bring a chalk bag for grip and a harness for roped routes."}
+{"input": "latest uses of bioinformatics in research", "output": "lex: bioinformatics research applications 2025 2026\nlex: bioinformatics genomics proteomics computational biology\nvec: how is bioinformatics being used in current scientific research\nvec: what are the newest bioinformatics tools and applications in genomics and drug discovery\nhyde: Recent bioinformatics advances include single-cell RNA sequencing analysis pipelines, AlphaFold-based protein structure prediction for drug targets, CRISPR off-target analysis algorithms, and large-scale metagenomic assembly for microbiome studies."}
+{"input": "how the scientific community addresses research bias", "output": "lex: research bias scientific community peer review\nlex: scientific bias mitigation replication reproducibility\nvec: how do scientists identify and reduce bias in research studies\nvec: what methods does the scientific community use to address research bias and ensure reproducibility\nhyde: To combat research bias, journals require pre-registration of study protocols, blinded peer review, and reporting of negative results. Replication studies verify findings. Statistical safeguards like p-value corrections and effect size reporting reduce publication bias."}
+{"input": "what is ethical dilemma in real life", "output": "lex: ethical dilemma real life examples\nlex: moral dilemma everyday situations conflict\nvec: what are examples of ethical dilemmas people face in everyday life\nvec: how do real-life ethical dilemmas force people to choose between conflicting values\nhyde: A common ethical dilemma is discovering a coworker falsifying expense reports—report them and risk the relationship, or stay silent and condone dishonesty. Other examples include whistleblowing, end-of-life medical decisions, and allocating scarce resources during emergencies."}
+{"input": "best techniques for street photography", "output": "lex: street photography techniques composition tips\nlex: street photography candid camera settings\nvec: what are the best techniques for capturing compelling street photographs\nvec: how do street photographers take candid shots of people in public spaces\nhyde: Shoot at f/8 for deep depth of field and zone focus at 3 meters for quick candid shots. Use a 28mm or 35mm lens. Anticipate moments—find good light or backgrounds and wait for subjects to enter the frame. Shoot from the hip to stay inconspicuous."}
+{"input": "how to become a researcher", "output": "lex: become researcher academic career path\nlex: research career PhD graduate school publish\nvec: what steps do you need to take to become a professional researcher\nvec: how do you build a career in academic or scientific research\nhyde: Start with an undergraduate degree in your field, seek research assistant positions, and publish early. Apply to graduate programs for a master's or PhD. Build a publication record, attend conferences, and network with established researchers. Postdoctoral positions lead to faculty or industry research roles."}
+{"input": "web socket", "output": "lex: WebSocket protocol real-time connection\nlex: WebSocket API JavaScript server client\nlex: WebSocket vs HTTP persistent connection\nvec: how do WebSockets work for real-time bidirectional communication\nvec: how to implement a WebSocket connection between a client and server\nhyde: WebSocket provides full-duplex communication over a single TCP connection. After an HTTP upgrade handshake, client and server can send messages in both directions without polling. Use `new WebSocket('ws://host/path')` on the client and a library like ws on the server."}
+{"input": "what is lean manufacturing", "output": "lex: lean manufacturing Toyota production system\nlex: lean manufacturing waste reduction kaizen\nvec: what is lean manufacturing and what principles does it follow\nvec: how does lean manufacturing eliminate waste and improve production efficiency\nhyde: Lean manufacturing, derived from the Toyota Production System, aims to minimize waste (muda) while maximizing value. Its five principles: define value from the customer's perspective, map the value stream, create flow, establish pull, and pursue perfection through continuous improvement (kaizen)."}
+{"input": "what are writing prompts?", "output": "lex: writing prompts creative fiction ideas\nlex: writing prompts exercises journal story starters\nvec: what are writing prompts and how do writers use them for inspiration\nvec: how do writing prompts help overcome writer's block and spark creativity\nhyde: Writing prompts are short scenarios, questions, or opening lines designed to spark creative writing. Examples: \"Write about a door that appeared overnight\" or \"Describe your earliest memory from a stranger's perspective.\" They help overcome writer's block and build a daily writing habit."}
+{"input": "how to capture bokeh effect", "output": "lex: bokeh effect photography aperture lens\nlex: bokeh background blur shallow depth of field\nvec: how to achieve a bokeh effect with blurred background in photography\nvec: what camera settings and lenses produce the best bokeh\nhyde: Use a wide aperture (f/1.4 to f/2.8) to create shallow depth of field. A fast prime lens like a 50mm f/1.8 or 85mm f/1.4 produces smooth bokeh. Increase the distance between subject and background, and get close to your subject for maximum blur."}
+{"input": "what is a controlled experiment", "output": "lex: controlled experiment scientific method variables\nlex: control group experimental group independent variable\nvec: what is a controlled experiment and how does it work in science\nvec: how do scientists set up control and experimental groups in a controlled experiment\nhyde: A controlled experiment tests a hypothesis by changing one independent variable while keeping all other conditions constant. The control group receives no treatment, while the experimental group does. Comparing outcomes isolates the effect of the variable being tested."}
+{"input": "what is telemedicine", "output": "lex: telemedicine telehealth virtual doctor visit\nlex: telemedicine remote healthcare video consultation\nvec: what is telemedicine and how does it deliver healthcare remotely\nvec: how do patients use telemedicine for virtual doctor appointments\nhyde: Telemedicine uses video calls, phone consultations, and remote monitoring to deliver healthcare without in-person visits. Patients can consult doctors from home for diagnoses, prescriptions, and follow-ups. It expanded rapidly during COVID-19 and now covers specialties from dermatology to psychiatry."}
+{"input": "what are the teachings of jainism", "output": "lex: Jainism teachings principles ahimsa karma\nlex: Jain philosophy non-violence Mahavira\nvec: what are the core teachings and beliefs of Jainism\nvec: what did Mahavira teach about non-violence and the path to liberation in Jainism\nhyde: Jainism, taught by Mahavira in the 6th century BCE, centers on ahimsa (non-violence), satya (truth), and aparigraha (non-attachment). Jains believe the soul is eternal, bound by karma accumulated through actions. Liberation (moksha) is achieved through right faith, right knowledge, and right conduct."}
+{"input": "what is sustainable living", "output": "lex: sustainable living eco-friendly lifestyle\nlex: sustainable living reduce reuse recycle carbon footprint\nvec: what does sustainable living mean and how can people practice it\nvec: what are the key principles and habits of a sustainable lifestyle\nhyde: Sustainable living means reducing your environmental impact by consuming fewer resources, choosing renewable energy, eating locally, minimizing waste, and favoring durable goods over disposable ones. It applies to housing, transportation, food, clothing, and daily consumption habits."}
+{"input": "xml parse", "output": "lex: XML parser parsing library\nlex: XML DOM SAX parser programming\nlex: XML parse Python JavaScript Java\nvec: how to parse XML documents programmatically in different languages\nvec: what are the common methods for reading and parsing XML files in code\nhyde: To parse XML in Python, use `xml.etree.ElementTree`: `tree = ET.parse('file.xml'); root = tree.getroot()`. For streaming large files, use SAX with `xml.sax`. In JavaScript, use `DOMParser` or libraries like `fast-xml-parser`."}
+{"input": "how does compound interest work", "output": "lex: compound interest formula calculation rate\nlex: compound interest savings investment growth\nvec: how does compound interest grow money over time compared to simple interest\nvec: what is the formula for compound interest and how is it calculated\nhyde: Compound interest is calculated on both the principal and accumulated interest. The formula is A = P(1 + r/n)^(nt), where P is principal, r is annual rate, n is compounding frequency, and t is time in years. Monthly compounding on $10,000 at 5% yields $16,470 after 10 years."}
+{"input": "what is the role of reason in ethics", "output": "lex: reason ethics moral philosophy rationalism\nlex: reason morality Kant rational ethical judgment\nvec: what role does reason play in making moral and ethical decisions\nvec: how do philosophers like Kant argue that reason is the foundation of ethics\nhyde: Kant held that reason alone can determine moral duty through the categorical imperative: act only according to maxims you could universalize. Rationalist ethics contrasts with sentimentalism (Hume), which grounds morality in emotion rather than rational deliberation."}
+{"input": "videography tips", "output": "lex: videography tips filming techniques camera\nlex: video production shooting composition stabilization\nvec: what are practical tips for improving videography and video shooting quality\nvec: how to shoot better video with camera movement, lighting, and composition techniques\nhyde: Stabilize shots with a gimbal or tripod. Follow the rule of thirds for framing. Shoot at 24fps for cinematic feel or 60fps for smooth slow motion. Use three-point lighting. Record clean audio separately with a lavalier or shotgun mic—audio quality matters more than resolution."}
+{"input": "how to choose a daycare?", "output": "lex: daycare choose selection criteria childcare\nlex: daycare center evaluation safety ratio\nvec: what should parents look for when choosing a daycare for their child\nvec: how to evaluate and compare daycare centers for quality and safety\nhyde: Visit multiple centers and observe interactions between staff and children. Check the staff-to-child ratio (1:4 for infants is ideal), licensing status, cleanliness, and safety measures. Ask about daily routines, curriculum, discipline policies, and staff qualifications and turnover."}
+{"input": "how to replace car alternator?", "output": "lex: replace car alternator DIY steps\nlex: alternator replacement belt removal installation\nvec: step-by-step instructions for replacing a car alternator yourself\nvec: how to remove and install a new alternator in a vehicle\nhyde: Disconnect the negative battery terminal. Remove the serpentine belt by releasing the tensioner. Unplug the electrical connectors and unbolt the alternator. Install the new unit, reconnect the wiring, route the belt back on, and reconnect the battery. Test by checking voltage at 13.5-14.5V."}
+{"input": "how to create a youtube channel", "output": "lex: create YouTube channel setup steps\nlex: YouTube channel start grow subscribers content\nvec: how to set up and launch a new YouTube channel from scratch\nvec: what steps do you need to take to create and grow a YouTube channel\nhyde: Sign in to YouTube with a Google account, click Create a Channel, and choose your channel name. Upload a profile picture and banner. Write a channel description with keywords. Plan a content schedule, create your first video, and optimize titles, thumbnails, and tags for search."}
+{"input": "what is dualism in mind-body philosophy", "output": "lex: mind-body dualism Descartes substance\nlex: dualism philosophy of mind mental physical\nvec: what is mind-body dualism and how does Descartes explain the relationship between mind and body\nvec: how does dualism in philosophy argue that mind and body are separate substances\nhyde: Cartesian dualism, proposed by René Descartes, holds that mind and body are two distinct substances: res cogitans (thinking substance) and res extensa (extended substance). The mind is non-physical and conscious; the body is physical and mechanistic. Their interaction remains the central problem."}
+{"input": "what is cliffhanger?", "output": "lex: cliffhanger literary device narrative suspense\nlex: cliffhanger ending story plot tension\nvec: what is a cliffhanger in storytelling and how does it create suspense\nvec: how do writers use cliffhangers to keep readers or viewers engaged\nhyde: A cliffhanger is a narrative device that ends a chapter, episode, or story at a moment of high suspense, leaving the outcome unresolved. It compels the audience to continue reading or watching. The term originates from serialized fiction where characters were literally left hanging from cliffs."}
+{"input": "how to volunteer for civic initiatives", "output": "lex: volunteer civic initiatives community service\nlex: volunteering local government community projects\nvec: how can someone find and volunteer for civic engagement and community initiatives\nvec: what are ways to get involved in local civic volunteer opportunities\nhyde: Check your city's website or community board for volunteer openings on advisory committees, park cleanups, and voter registration drives. Organizations like VolunteerMatch and local nonprofits connect volunteers with civic projects. Attend town hall meetings to learn about current needs."}
+{"input": "how does hinduism view the divine cycle of creation?", "output": "lex: Hinduism creation cycle Brahma Vishnu Shiva\nlex: Hindu cosmology srishti sthiti pralaya\nvec: how does Hinduism explain the cosmic cycle of creation, preservation, and destruction\nvec: what is the Hindu view of the divine cycle involving Brahma, Vishnu, and Shiva\nhyde: In Hindu cosmology, creation is cyclical. Brahma creates the universe, Vishnu preserves it, and Shiva destroys it so it can be reborn. Each cycle spans a kalpa (4.32 billion years). The universe undergoes endless cycles of srishti (creation), sthiti (preservation), and pralaya (dissolution)."}
+{"input": "what is consequentialist ethics", "output": "lex: consequentialism ethics utilitarianism outcomes\nlex: consequentialist moral theory consequences actions\nvec: what is consequentialist ethics and how does it judge the morality of actions\nvec: how does consequentialism differ from deontological ethics in evaluating right and wrong\nhyde: Consequentialism judges actions solely by their outcomes. The most influential form, utilitarianism (Bentham, Mill), holds that the right action maximizes overall happiness or well-being. Unlike deontology, which focuses on duties and rules, consequentialism permits any action if the results are good."}
+{"input": "how to promote environmental awareness?", "output": "lex: environmental awareness promotion education campaigns\nlex: promote environmental sustainability community outreach\nvec: how can individuals and organizations promote environmental awareness in their communities\nvec: what are effective strategies for raising public awareness about environmental issues\nhyde: Organize community cleanups, host documentary screenings, and partner with schools for environmental education programs. Use social media campaigns with clear calls to action. Start a local recycling or composting initiative. Create informational signage at parks and public spaces."}
+{"input": "how to practice self-love", "output": "lex: self-love self-care practices mental health\nlex: self-love habits self-compassion boundaries\nvec: what are practical ways to practice self-love and self-compassion daily\nvec: how to build self-love through healthy habits and positive self-talk\nhyde: Practice self-love by setting boundaries, speaking to yourself with kindness, and prioritizing rest without guilt. Journal about what you appreciate about yourself. Replace self-criticism with curiosity: ask \"what do I need right now?\" instead of \"what's wrong with me?\""}
+{"input": "what is companion planting with vegetables", "output": "lex: companion planting vegetables garden chart\nlex: companion planting tomato basil marigold\nvec: what is companion planting and which vegetables grow well together\nvec: how does companion planting benefit vegetable gardens and deter pests\nhyde: Companion planting pairs vegetables that benefit each other. Basil planted near tomatoes repels aphids and may improve flavor. Marigolds deter nematodes around most vegetables. The Three Sisters—corn, beans, and squash—is a classic trio: corn supports beans, beans fix nitrogen, squash shades soil."}
+{"input": "how to set achievable goals?", "output": "lex: set achievable goals SMART goal setting\nlex: goal setting strategy actionable realistic\nvec: how to set realistic and achievable goals using the SMART framework\nvec: what techniques help people set goals they can actually accomplish\nhyde: Use the SMART framework: Specific (define exactly what you want), Measurable (quantify progress), Achievable (within your capabilities), Relevant (aligned with larger objectives), Time-bound (set a deadline). Break large goals into weekly milestones and track progress visually."}
+{"input": "how do scientists study animal behavior", "output": "lex: animal behavior study ethology methods\nlex: animal behavior research observation field experiments\nvec: what methods do scientists use to study and analyze animal behavior\nvec: how do ethologists observe and research animal behavior in the wild and in labs\nhyde: Ethologists use direct observation, video tracking, and GPS telemetry to study animal behavior in natural habitats. Lab experiments control variables to test hypotheses about cognition and social behavior. Focal sampling follows one individual; scan sampling records group behavior at intervals."}
+{"input": "how to maintain motivation through challenges?", "output": "lex: maintain motivation challenges resilience\nlex: staying motivated difficult times strategies\nvec: how to stay motivated when facing setbacks and difficult challenges\nvec: what strategies help maintain motivation during tough periods in life or work\nhyde: Break the challenge into small wins to maintain a sense of progress. Revisit your original purpose—why did you start? Celebrate incremental achievements. Build accountability through a partner or group. Accept setbacks as data rather than failure, and adjust your approach rather than your goal."}
+{"input": "what is the philosophy of mind", "output": "lex: philosophy of mind consciousness mental states\nlex: philosophy of mind problem qualia dualism physicalism\nvec: what is the philosophy of mind and what questions does it explore\nvec: how does philosophy of mind address consciousness, mental states, and the mind-body problem\nhyde: Philosophy of mind investigates the nature of consciousness, mental states, and their relationship to the physical brain. Central questions include the hard problem of consciousness (why subjective experience exists), whether mental states reduce to brain states, and the nature of intentionality and qualia."}
+{"input": "enum class", "output": "lex: enum class C++ Java strongly typed\nlex: enum class Python enumeration members\nlex: enum class scoped enumeration\nvec: how to define and use enum classes in C++ or Java for type-safe enumerations\nvec: what is the difference between an enum and an enum class in C++\nhyde: In C++11, `enum class` creates a scoped, strongly typed enumeration. Unlike plain enums, values don't implicitly convert to int and must be accessed with the scope operator: `enum class Color { Red, Green, Blue }; Color c = Color::Red;`"}
+{"input": "how to sell art on etsy?", "output": "lex: sell art Etsy shop setup listing\nlex: Etsy art shop pricing shipping prints\nvec: how to set up an Etsy shop to sell original art and prints\nvec: what tips help artists successfully sell artwork on Etsy\nhyde: Create an Etsy seller account and set up your shop with a clear brand name and banner. Photograph art in natural light with a neutral background. Write detailed listings with keywords buyers search for. Price to cover materials, time, Etsy fees (6.5%), and shipping. Offer prints alongside originals."}
+{"input": "what is virtue epistemology", "output": "lex: virtue epistemology intellectual virtues knowledge\nlex: virtue epistemology Sosa Zagzebski epistemic\nvec: what is virtue epistemology and how does it differ from traditional theories of knowledge\nvec: how does virtue epistemology evaluate knowledge based on intellectual character traits\nhyde: Virtue epistemology evaluates beliefs based on the intellectual character of the knower rather than just the properties of the belief. Ernest Sosa's reliabilism treats virtues as reliable cognitive faculties; Linda Zagzebski's responsibilism focuses on traits like open-mindedness, intellectual courage, and thoroughness."}
+{"input": "what is ethical egoism", "output": "lex: ethical egoism moral theory self-interest\nlex: ethical egoism Ayn Rand rational selfishness\nvec: what is ethical egoism and how does it differ from psychological egoism\nvec: how does ethical egoism argue that acting in self-interest is morally right\nhyde: Ethical egoism holds that agents ought to act in their own self-interest. Unlike psychological egoism (a descriptive claim that people always act selfishly), ethical egoism is normative—it prescribes self-interest as the moral standard. Ayn Rand's rational self-interest is a well-known variant."}
+{"input": "tech fix", "output": "lex: tech troubleshooting fix repair computer\nlex: technology fix common problems software hardware\nlex: tech support fix device issue\nvec: how to troubleshoot and fix common technology problems with computers and devices\nvec: what are basic tech fixes for common software and hardware issues\nhyde: Start with a restart—it resolves most transient issues. Clear browser cache for web problems. Check cables and connections for hardware failures. Update drivers and firmware. For persistent crashes, check event logs and run diagnostics. Factory reset as a last resort after backing up data."}
+{"input": "how to evaluate scientific sources", "output": "lex: evaluate scientific sources credibility peer-reviewed\nlex: scientific source evaluation criteria journal\nvec: how to evaluate whether a scientific source or study is credible and reliable\nvec: what criteria should you use to assess the quality of scientific research papers\nhyde: Check if the study is published in a peer-reviewed journal with an impact factor. Examine the sample size, methodology, and statistical analysis. Look for conflicts of interest in funding disclosures. Verify the authors' credentials and institutional affiliations. Check citation count and whether results have been replicated."}
+{"input": "what is taoism", "output": "lex: Taoism Daoism Lao Tzu Tao Te Ching\nlex: Taoism philosophy wu wei yin yang\nvec: what are the core beliefs and principles of Taoism\nvec: what did Lao Tzu teach in the Tao Te Ching about the way and harmony with nature\nhyde: Taoism (Daoism) is a Chinese philosophical and spiritual tradition rooted in the Tao Te Ching by Lao Tzu. The Tao (\"the Way\") is the fundamental, nameless force underlying all things. Core concepts include wu wei (effortless action), yin-yang balance, simplicity, and harmony with nature."}
+{"input": "how neural networks function", "output": "lex: neural network layers neurons weights backpropagation\nlex: neural network deep learning forward pass activation\nvec: how do artificial neural networks process data and learn from training\nvec: what is the architecture and learning mechanism of a neural network\nhyde: A neural network processes input through layers of interconnected neurons. Each neuron computes a weighted sum of its inputs, applies an activation function (ReLU, sigmoid), and passes the result forward. Training uses backpropagation to adjust weights by computing gradients of the loss function."}
+{"input": "how to maintain a bonsai tree?", "output": "lex: bonsai tree care maintenance watering pruning\nlex: bonsai trimming repotting soil fertilizer\nvec: how to properly care for and maintain a bonsai tree at home\nvec: what are the watering, pruning, and soil requirements for bonsai trees\nhyde: Water bonsai when the top half-inch of soil feels dry—never on a schedule. Place in bright indirect light for indoor species or full sun for outdoor varieties. Prune new growth to maintain shape. Repot every 2-3 years in spring using well-draining akadama-based soil. Fertilize biweekly during growing season."}
+{"input": "what role does language play in philosophy", "output": "lex: language philosophy linguistic turn Wittgenstein\nlex: philosophy of language meaning reference semantics\nvec: what role does language play in philosophical inquiry and analysis\nvec: how did Wittgenstein and analytic philosophers view the relationship between language and thought\nhyde: The linguistic turn of the 20th century made language central to philosophy. Wittgenstein argued that philosophical problems arise from misunderstandings of language. Analytic philosophers examine how meaning, reference, and truth conditions work. Ordinary language philosophy holds that everyday usage resolves many metaphysical puzzles."}
+{"input": "how to fight pests organically", "output": "lex: organic pest control garden insects\nlex: organic pesticide neem oil insecticidal soap\nvec: how to control garden pests using organic and natural methods\nvec: what organic pest control methods work for vegetable gardens\nhyde: Spray neem oil or insecticidal soap to kill soft-bodied pests like aphids and whiteflies. Introduce beneficial insects: ladybugs eat aphids, parasitic wasps target caterpillars. Use row covers to physically exclude pests. Apply diatomaceous earth around plant bases for slugs and beetles."}
+{"input": "what is the role of research institutions", "output": "lex: research institutions universities role science\nlex: research institutions funding labs innovation\nvec: what role do research institutions and universities play in advancing science\nvec: how do research institutions contribute to knowledge creation and innovation\nhyde: Research institutions—universities, government labs, and private research organizations—drive scientific progress through funded investigations, peer-reviewed publications, and training of new researchers. They provide infrastructure (labs, equipment, libraries), facilitate collaboration, and translate findings into real-world applications."}
+{"input": "what is narrative ethics", "output": "lex: narrative ethics storytelling moral philosophy\nlex: narrative ethics literature moral reasoning\nvec: what is narrative ethics and how does storytelling relate to moral understanding\nvec: how do narrative ethicists use stories and literature to explore moral questions\nhyde: Narrative ethics holds that moral understanding is shaped by the stories we tell and hear. Rather than abstract principles, it emphasizes particular cases and lived experience. Literature, patient narratives in medicine, and personal testimony illuminate moral complexity that rules-based ethics may miss."}
+{"input": "ai ops", "output": "lex: AIOps artificial intelligence IT operations\nlex: AIOps monitoring anomaly detection automation\nlex: AIOps MLOps machine learning operations\nvec: what is AIOps and how does AI improve IT operations management\nvec: how do AIOps platforms use machine learning for monitoring and incident response\nhyde: AIOps (Artificial Intelligence for IT Operations) applies machine learning to IT operations data—logs, metrics, events—to detect anomalies, predict outages, and automate incident response. Platforms like Datadog, Splunk, and Moogsoft correlate alerts to reduce noise and speed up root cause analysis."}
+{"input": "how to negotiate a business deal", "output": "lex: negotiate business deal tactics strategy\nlex: business negotiation skills contract terms\nvec: what are effective strategies for negotiating a business deal successfully\nvec: how to prepare for and conduct a business negotiation to reach a favorable agreement\nhyde: Prepare by researching the other party's priorities and constraints. Define your BATNA (best alternative to a negotiated agreement) and walk-away point. Open with an ambitious but defensible anchor. Listen more than you talk. Focus on interests, not positions, to find creative win-win solutions."}
+{"input": "how to protest peacefully", "output": "lex: peaceful protest demonstration rights organizing\nlex: nonviolent protest civil disobedience activism\nvec: how to organize and participate in a peaceful protest effectively\nvec: what are the principles and logistics of peaceful demonstration and nonviolent activism\nhyde: Know your rights: peaceful assembly is protected by the First Amendment. Organize with clear goals, designated marshals, and a planned route. Coordinate with local authorities for permits. Bring water, ID, and emergency contacts. Stay nonviolent, document with video, and have legal observers present."}
+{"input": "how to start oil painting?", "output": "lex: oil painting beginner supplies techniques\nlex: oil painting start canvas brushes paints medium\nvec: how to get started with oil painting as a beginner\nvec: what supplies and techniques do beginners need to start oil painting\nhyde: Start with a basic set of oil paints: titanium white, cadmium yellow, cadmium red, ultramarine blue, and burnt umber. Use medium-grade bristle brushes in sizes 4, 8, and 12. Work on pre-primed canvas. Thin early layers with odorless mineral spirits and use linseed oil for later layers (fat over lean)."}
+{"input": "what is the significance of archetypes?", "output": "lex: archetypes Carl Jung collective unconscious\nlex: archetypes significance literature psychology\nvec: what is the significance of archetypes in psychology and literature\nvec: how did Carl Jung define archetypes and why do they appear across cultures\nhyde: Carl Jung described archetypes as universal, inherited patterns in the collective unconscious—the Hero, the Shadow, the Trickster, the Great Mother. They recur across myths, dreams, and stories worldwide because they reflect fundamental human experiences and psychological structures shared by all cultures."}
+{"input": "how to mix colors in oil painting?", "output": "lex: oil painting color mixing palette technique\nlex: mix oil paint colors complementary warm cool\nvec: how to mix oil paint colors to achieve the right hues and values\nvec: what is the proper technique for blending and mixing colors in oil painting\nhyde: Mix on a glass or wood palette using a palette knife for clean blends. Start with the lighter color and add the darker one gradually. To mute a color, mix in its complement: add green to red, purple to yellow. Mix value (light/dark) separately from hue for better control."}
+{"input": "how do different religions define good and evil?", "output": "lex: good evil religion definition theology\nlex: good evil Christianity Islam Buddhism Hinduism\nvec: how do different world religions define and explain the concepts of good and evil\nvec: what are the religious perspectives on good versus evil across Christianity, Islam, Buddhism, and Hinduism\nhyde: Christianity frames evil as separation from God through sin, with goodness as alignment with divine will. Islam teaches that evil arises from disobeying Allah's commands. Buddhism sees evil as rooted in ignorance, greed, and hatred rather than a cosmic force. Hinduism links good and evil to dharma and karma."}
+{"input": "sail boat", "output": "lex: sailboat sailing types rigging\nlex: sailboat buy beginner learn to sail\nlex: sailboat parts hull keel mast\nvec: what are the different types of sailboats and how do they work\nvec: how to get started with sailboat sailing as a beginner\nhyde: Sailboats are propelled by wind acting on sails. Common types include dinghies (small, single-hull), keelboats (weighted keel for stability), catamarans (twin hulls), and sloops (single mast, fore-and-aft rigged). Key parts include the hull, mast, boom, jib, mainsail, rudder, and keel."}
+{"input": "how crispr technology works", "output": "lex: CRISPR Cas9 gene editing mechanism\nlex: CRISPR technology DNA guide RNA\nvec: how does CRISPR-Cas9 gene editing technology work at the molecular level\nvec: what is the mechanism by which CRISPR cuts and edits DNA sequences\nhyde: CRISPR-Cas9 uses a guide RNA (gRNA) complementary to the target DNA sequence. The gRNA directs the Cas9 nuclease to the precise genomic location, where it creates a double-strand break. The cell's repair machinery then either disrupts the gene (NHEJ) or inserts a new sequence (HDR) using a provided template."}
+{"input": "hair cut", "output": "lex: haircut styles men women trends\nlex: haircut salon barbershop near me\nlex: haircut techniques layered fade trim\nvec: what are the popular haircut styles and how to choose the right one\nvec: how to communicate what haircut you want to a stylist or barber\nhyde: Popular haircuts include the bob, pixie cut, and layers for women, and the fade, crew cut, and textured crop for men. Choose based on face shape: round faces suit angular cuts, long faces benefit from volume at the sides. Bring reference photos to your appointment for clear communication."}
+{"input": "how to develop an art portfolio?", "output": "lex: art portfolio development pieces selection\nlex: art portfolio presentation layout artist\nvec: how to build a strong art portfolio for school applications or professional work\nvec: what should an art portfolio include and how should it be organized\nhyde: Select 15-20 of your strongest, most cohesive pieces that demonstrate range and skill. Open and close with your best work. Show process sketches alongside finished pieces. Use consistent, high-quality photography. For digital portfolios, use platforms like Behance or a personal website with clean navigation."}
+{"input": "what is atmospheric science", "output": "lex: atmospheric science meteorology climate weather\nlex: atmospheric science atmosphere composition dynamics\nvec: what is atmospheric science and what topics does it study\nvec: how does atmospheric science explain weather, climate, and the Earth's atmosphere\nhyde: Atmospheric science studies the Earth's atmosphere—its composition, structure, and dynamics. Sub-fields include meteorology (weather forecasting), climatology (long-term patterns), atmospheric chemistry (ozone, pollutants), and atmospheric physics (radiation, cloud formation). It underpins weather prediction and climate change research."}
+{"input": "how to apply for a mortgage", "output": "lex: mortgage application process requirements\nlex: apply mortgage home loan pre-approval credit score\nvec: what are the steps to apply for a home mortgage loan\nvec: how to prepare your finances and documents to apply for a mortgage\nhyde: Check your credit score (aim for 620+, 740+ for best rates). Save for a down payment of 3-20%. Get pre-approved with a lender by submitting W-2s, pay stubs, bank statements, and tax returns. Compare rates from multiple lenders. Once you find a home, submit the full application and await underwriting."}
+{"input": "how to analyze political polls", "output": "lex: political poll analysis methodology\nlex: polling data interpretation margin error\nlex: election survey statistics\nvec: what methods are used to analyze and interpret political polling data\nvec: how to evaluate the accuracy and reliability of election polls\nvec: understanding margin of error and sample size in political surveys\nhyde: To analyze a political poll, start by examining the sample size, methodology, and margin of error. A poll of 1,000 likely voters with a ±3% margin means the true value falls within that range 95% of the time. Compare results across multiple polls using polling averages to reduce noise."}
+{"input": "how does the body maintain homeostasis", "output": "lex: homeostasis regulation human body\nlex: negative feedback loop physiology\nlex: body temperature pH blood glucose regulation\nvec: what mechanisms does the human body use to maintain internal stability\nvec: how do feedback loops help regulate body temperature and blood sugar levels\nhyde: The body maintains homeostasis through negative feedback loops. When blood glucose rises after a meal, the pancreas releases insulin, signaling cells to absorb glucose. When body temperature drops, the hypothalamus triggers shivering and vasoconstriction to conserve heat."}
+{"input": "how to transplant seedlings?", "output": "lex: transplant seedlings garden\nlex: seedling hardening off repotting\nlex: moving seedlings outdoors soil\nvec: what is the correct process for transplanting seedlings from pots into the garden\nvec: when and how should you harden off and transplant young plants outdoors\nhyde: Transplant seedlings after hardening them off for 7-10 days. Dig a hole slightly larger than the root ball, gently remove the seedling from its pot, and place it at the same depth it was growing. Water thoroughly and mulch around the base to retain moisture."}
+{"input": "how to interpret graphs and charts", "output": "lex: reading graphs charts data visualization\nlex: interpret bar line pie chart\nlex: graph axis scale data trends\nvec: how do you read and interpret different types of graphs and charts correctly\nvec: what should you look for when analyzing data presented in visual charts\nhyde: To interpret a graph, first read the title and axis labels to understand what is being measured. Identify the scale and units. For line charts, look at trends over time. For bar charts, compare heights across categories. Always check whether the y-axis starts at zero, as truncated axes can exaggerate differences."}
+{"input": "how to start a sketchbook?", "output": "lex: sketchbook practice beginner drawing\nlex: daily sketching habit art journal\nlex: first sketchbook tips supplies\nvec: how do beginners start and maintain a regular sketchbook practice\nvec: what supplies and techniques should you use when starting your first sketchbook\nhyde: Start your sketchbook by choosing a book with paper weight of at least 80gsm. Begin with simple observational drawings of everyday objects. Draw for 10-15 minutes daily without worrying about perfection. Use pencil, pen, or whatever feels comfortable. Date each page to track your progress."}
+{"input": "what are the main teachings of jainism?", "output": "lex: jainism core teachings principles\nlex: ahimsa anekantavada aparigraha jain\nlex: jain dharma beliefs nonviolence\nvec: what are the central beliefs and philosophical teachings of Jainism\nvec: how do Jain principles like ahimsa and anekantavada guide ethical living\nhyde: Jainism teaches three core principles: ahimsa (nonviolence toward all living beings), anekantavada (many-sidedness of truth), and aparigraha (non-attachment to possessions). The path to liberation involves the Three Jewels: right faith, right knowledge, and right conduct. Jains practice strict vegetarianism and asceticism."}
+{"input": "how to choose curtains for living room", "output": "lex: living room curtain selection fabric\nlex: curtain length style window treatment\nlex: drapes color pattern room decor\nvec: how do you choose the right curtains for a living room based on style and function\nvec: what curtain fabric length and color work best for different living room windows\nhyde: Choose curtains that hang 1-2 inches above the floor for a polished look. For a small living room, use light-colored sheer fabrics to maximize natural light. Mount the curtain rod 4-6 inches above the window frame and extend it 3-8 inches beyond each side to make windows appear larger."}
+{"input": "how to take macro photos", "output": "lex: macro photography technique close-up\nlex: macro lens focus stacking lighting\nlex: close-up photography camera settings\nvec: what camera settings and equipment do you need for macro photography\nvec: how to achieve sharp focus and good lighting in close-up macro shots\nhyde: For macro photography, use a dedicated macro lens (60mm or 100mm) or extension tubes. Set your aperture to f/8-f/16 for sufficient depth of field. Use a tripod and remote shutter to eliminate camera shake. Focus stacking—taking multiple shots at different focus distances—produces sharp images throughout the subject."}
+{"input": "how to write a query letter?", "output": "lex: query letter writing literary agent\nlex: book manuscript submission query format\nlex: query letter hook synopsis comp titles\nvec: how do you write an effective query letter to a literary agent for your novel\nvec: what structure and elements should a query letter include for book submissions\nhyde: A query letter has three paragraphs: the hook (a compelling one-sentence pitch), the mini-synopsis (250 words covering the protagonist, conflict, and stakes), and the bio (your credentials and comp titles). Address the agent by name, mention why you chose them, and keep the entire letter under one page."}
+{"input": "what are plasmids", "output": "lex: plasmid DNA circular extrachromosomal\nlex: plasmid bacteria gene transfer cloning\nlex: plasmid vector molecular biology\nvec: what are plasmids and what role do they play in bacterial genetics\nvec: how are plasmids used as vectors in molecular biology and genetic engineering\nhyde: Plasmids are small, circular, double-stranded DNA molecules found in bacteria that replicate independently of chromosomal DNA. They often carry genes for antibiotic resistance. In genetic engineering, plasmids serve as vectors to insert foreign genes into host cells for cloning and protein expression."}
+{"input": "how do scientists accurately measure time", "output": "lex: atomic clock time measurement precision\nlex: cesium clock seconds SI definition\nlex: timekeeping scientific instruments\nvec: how do atomic clocks and other instruments allow scientists to measure time with extreme precision\nvec: what is the scientific definition of a second and how is it measured\nhyde: The SI second is defined by the cesium-133 atom, which oscillates 9,192,631,770 times per second. Atomic clocks use this transition frequency to achieve accuracy within one second over millions of years. Optical lattice clocks using strontium atoms are even more precise, losing less than one second over the age of the universe."}
+{"input": "how to build a professional network?", "output": "lex: professional networking career connections\nlex: LinkedIn networking events industry contacts\nlex: building professional relationships mentorship\nvec: what are effective strategies for building and maintaining a professional network\nvec: how can attending events and using LinkedIn help grow your career network\nhyde: Build your professional network by attending industry conferences, joining professional associations, and engaging on LinkedIn. Follow up within 48 hours of meeting someone new. Offer value before asking for favors—share articles, make introductions, or provide feedback. Schedule regular coffee chats to maintain relationships."}
+{"input": "what is the significance of sacred symbols?", "output": "lex: sacred symbols religious meaning\nlex: spiritual symbols cross om menorah lotus\nlex: religious iconography symbolism significance\nvec: what role do sacred symbols play in religious and spiritual traditions\nvec: how do symbols like the cross, om, and menorah carry meaning in their respective faiths\nhyde: Sacred symbols serve as tangible expressions of spiritual truths across religions. The Christian cross represents sacrifice and redemption, the Hindu Om embodies the primordial sound of creation, and the Jewish menorah symbolizes divine light. These symbols anchor believers' faith and create shared identity within communities."}
+{"input": "how to succeed in a digital marketing career?", "output": "lex: digital marketing career skills\nlex: SEO social media analytics marketing job\nlex: digital marketing certifications portfolio\nvec: what skills and experience do you need to build a successful digital marketing career\nvec: how to get started in digital marketing and advance to senior roles\nhyde: A digital marketing career requires proficiency in SEO, paid advertising (Google Ads, Meta Ads), content marketing, email marketing, and analytics tools like Google Analytics. Build a portfolio with real campaigns. Earn certifications from Google, HubSpot, or Meta. Entry-level roles include marketing coordinator or social media specialist."}
+{"input": "how to plan a trip to europe?", "output": "lex: Europe trip planning itinerary budget\nlex: European travel visa flights accommodations\nlex: backpacking Europe route booking tips\nvec: how do you plan and budget for a multi-country trip across Europe\nvec: what are the steps for organizing flights, accommodations, and itineraries for European travel\nhyde: Plan your Europe trip 3-6 months ahead. Book flights early for the best fares. Get a Eurail pass if visiting 3+ countries. Budget €50-150/day depending on the country. Book accommodations on Booking.com or Hostelworld. Check visa requirements—US citizens can stay 90 days in the Schengen Area without a visa."}
+{"input": "how machine learning influences businesses", "output": "lex: machine learning business applications\nlex: ML AI enterprise automation prediction\nlex: machine learning revenue customer analytics\nvec: how are businesses using machine learning to improve operations and decision-making\nvec: what impact does machine learning have on business revenue and efficiency\nhyde: Machine learning transforms businesses through demand forecasting, customer churn prediction, fraud detection, and recommendation engines. Retailers use ML to optimize pricing and inventory. Banks deploy ML models for credit scoring. Companies using ML-driven analytics report 5-10% increases in revenue through personalized marketing."}
+{"input": "what are the main characteristics of memoirs?", "output": "lex: memoir characteristics literary genre\nlex: memoir vs autobiography personal narrative\nlex: memoir writing elements structure\nvec: what distinguishes a memoir from other forms of autobiographical writing\nvec: what are the key literary features and structure of a memoir\nhyde: A memoir focuses on a specific theme or period in the author's life, unlike an autobiography which covers an entire life chronologically. Key characteristics include a first-person narrative voice, emotional honesty, reflection on personal growth, vivid sensory details, and a thematic arc that gives the story universal resonance."}
+{"input": "how do sikhs practice their faith", "output": "lex: Sikh faith practices worship\nlex: gurdwara langar five Ks Sikhism\nlex: Sikh prayer Guru Granth Sahib\nvec: what are the daily religious practices and rituals observed by Sikhs\nvec: how do Sikhs worship in the gurdwara and observe the five Ks\nhyde: Sikhs practice their faith through daily prayers (Nitnem), including Japji Sahib at dawn. They worship at the gurdwara, where the Guru Granth Sahib is read aloud. Baptized Sikhs wear the five Ks: kesh (uncut hair), kangha (comb), kara (steel bracelet), kachera (undergarment), and kirpan (ceremonial sword). Langar, the communal kitchen, serves free meals to all visitors."}
+{"input": "what are the foundations of feminist ethics", "output": "lex: feminist ethics care theory foundations\nlex: feminist moral philosophy gender justice\nlex: ethics of care Gilligan Noddings feminist\nvec: what are the core principles and philosophical foundations of feminist ethics\nvec: how does feminist ethics differ from traditional moral philosophy in its approach to care and justice\nhyde: Feminist ethics emerged from Carol Gilligan's critique of Kohlberg's moral development theory, arguing that women's moral reasoning emphasizes care and relationships rather than abstract principles of justice. Nel Noddings developed the ethics of care, centering moral life on attentiveness, responsibility, and responsiveness to the needs of particular others."}
+{"input": "how do antibiotics work", "output": "lex: antibiotics mechanism action bacteria\nlex: antibiotic cell wall protein synthesis inhibition\nlex: bactericidal bacteriostatic penicillin\nvec: how do antibiotics kill or inhibit the growth of bacteria in the human body\nvec: what are the different mechanisms by which antibiotics target bacterial cells\nhyde: Antibiotics work by targeting structures unique to bacteria. Penicillin and cephalosporins inhibit cell wall synthesis, causing bacteria to burst. Tetracyclines block the 30S ribosomal subunit, preventing protein synthesis. Fluoroquinolones inhibit DNA gyrase, stopping bacterial DNA replication. Antibiotics are classified as bactericidal (kill bacteria) or bacteriostatic (stop growth)."}
+{"input": "what is geothermal energy?", "output": "lex: geothermal energy heat earth power\nlex: geothermal power plant electricity generation\nlex: geothermal renewable energy underground\nvec: how does geothermal energy work and how is it used to generate electricity\nvec: what are the advantages and limitations of geothermal energy as a renewable source\nhyde: Geothermal energy harnesses heat from the Earth's interior. Hot water and steam from underground reservoirs drive turbines to generate electricity. Geothermal power plants operate at over 90% capacity factor, far higher than wind or solar. Iceland generates 25% of its electricity from geothermal sources."}
+{"input": "how does a bill become a law", "output": "lex: bill becomes law legislative process\nlex: US Congress legislation committee vote\nlex: bill passage House Senate president sign\nvec: what are the steps a bill goes through in the US Congress to become a law\nvec: how does the legislative process work from bill introduction to presidential signature\nhyde: A bill is introduced in the House or Senate and assigned to a committee. The committee holds hearings, marks up the bill, and votes. If passed, it goes to the full chamber for debate and a vote. Both chambers must pass identical versions. Differences are resolved in a conference committee. The final bill goes to the President, who can sign it into law or veto it."}
+{"input": "what is the difference between ethics and morals", "output": "lex: ethics vs morals difference\nlex: ethics morals philosophy distinction\nlex: moral principles ethical systems comparison\nvec: what is the distinction between ethics and morals in philosophy\nvec: how do personal morals differ from ethical systems and codes of conduct\nhyde: Ethics refers to systematic, philosophical frameworks for determining right and wrong—such as utilitarianism or deontology. Morals are personal beliefs about right and wrong shaped by culture, religion, and upbringing. Ethics are prescriptive rules applied to groups (medical ethics, business ethics), while morals are individual convictions."}
+{"input": "what was the silk road", "output": "lex: Silk Road ancient trade route\nlex: Silk Road China Rome trade network\nlex: Silk Road history commerce cultural exchange\nvec: what was the historical Silk Road and what goods and ideas were traded along it\nvec: how did the Silk Road connect civilizations between China and the Mediterranean\nhyde: The Silk Road was a network of trade routes connecting China to the Mediterranean from the 2nd century BCE to the 15th century CE. Merchants traded silk, spices, gold, and jade. Beyond goods, the Silk Road facilitated the spread of Buddhism, Islam, papermaking, and gunpowder across Eurasia."}
+{"input": "what is the significance of beauty in philosophy", "output": "lex: beauty philosophy aesthetics significance\nlex: aesthetics Kant Plato beauty philosophical\nlex: philosophy of beauty sublime art\nvec: how have philosophers understood and defined the concept of beauty throughout history\nvec: what is the philosophical significance of beauty in aesthetics from Plato to Kant\nhyde: In Plato's Symposium, beauty is a ladder ascending from physical attraction to the Form of Beauty itself. Kant distinguished between the beautiful (harmonious, universal pleasure) and the sublime (overwhelming grandeur). For Hegel, beauty in art reveals truth through sensory form. Contemporary aesthetics debates whether beauty is objective or culturally constructed."}
+{"input": "how to communicate with elected officials", "output": "lex: contact elected officials representatives\nlex: write letter call congressman senator\nlex: constituent advocacy elected official communication\nvec: what are effective ways to communicate your concerns to elected officials\nvec: how to write letters or make phone calls to your congressional representatives\nhyde: The most effective way to reach your elected officials is a phone call to their district office. Identify yourself as a constituent, state the bill number, and clearly state your position in under 60 seconds. Personalized letters are more impactful than form emails. Attend town halls for face-to-face interaction."}
+{"input": "what is phenomenology", "output": "lex: phenomenology philosophy Husserl\nlex: phenomenological method consciousness experience\nlex: phenomenology Heidegger Merleau-Ponty intentionality\nvec: what is phenomenology and how does it study conscious experience\nvec: how did Husserl and Heidegger develop phenomenology as a philosophical method\nhyde: Phenomenology is a philosophical method founded by Edmund Husserl that studies the structures of conscious experience as they appear to the subject. Through \"bracketing\" (epoché), the phenomenologist suspends assumptions about the external world to describe phenomena as they are experienced. Heidegger extended this into an analysis of Being-in-the-world."}
+{"input": "how to enhance concentration", "output": "lex: improve concentration focus techniques\nlex: attention span deep work focus tips\nlex: concentration exercises mindfulness pomodoro\nvec: what techniques and habits can help you improve focus and concentration\nvec: how can mindfulness and time management methods like Pomodoro improve attention\nhyde: Improve concentration by eliminating distractions: silence notifications, use website blockers, and work in a quiet environment. The Pomodoro Technique—25 minutes of focused work followed by a 5-minute break—builds sustained attention. Regular exercise, adequate sleep (7-9 hours), and mindfulness meditation physically strengthen the brain's prefrontal cortex."}
+{"input": "what is the theory of relativity", "output": "lex: theory of relativity Einstein\nlex: special general relativity spacetime gravity\nlex: E=mc2 Einstein relativity physics\nvec: what are Einstein's special and general theories of relativity and what do they explain\nvec: how does the theory of relativity describe the relationship between space time and gravity\nhyde: Einstein's special relativity (1905) states that the speed of light is constant for all observers and that time dilates at high velocities (E=mc²). General relativity (1915) describes gravity not as a force but as the curvature of spacetime caused by mass and energy. Massive objects bend spacetime, and objects follow curved paths."}
+{"input": "what is depth of field?", "output": "lex: depth of field photography aperture\nlex: DOF shallow deep focus bokeh\nlex: aperture f-stop focal length depth field\nvec: what is depth of field in photography and how does aperture affect it\nvec: how do aperture, focal length, and distance control the depth of field in a photo\nhyde: Depth of field (DOF) is the range of distance in a photo that appears acceptably sharp. A wide aperture (f/1.8) produces a shallow DOF with a blurred background (bokeh), ideal for portraits. A narrow aperture (f/16) produces deep DOF where everything is sharp, suited for landscapes. Focal length and subject distance also affect DOF."}
+{"input": "how to write a haiku", "output": "lex: haiku poem writing syllable\nlex: haiku 5-7-5 Japanese poetry\nlex: haiku nature season kigo structure\nvec: what are the rules and structure for writing a traditional haiku poem\nvec: how do you compose a haiku with the 5-7-5 syllable pattern and seasonal reference\nhyde: A haiku is a three-line Japanese poem with a 5-7-5 syllable structure. Traditional haiku includes a kigo (seasonal word) and a kireji (cutting word) that creates a pause or shift. Example: \"An old silent pond / A frog jumps into the pond— / Splash! Silence again.\" Focus on a single moment in nature observed with clarity."}
+{"input": "how to address misinformation in politics", "output": "lex: political misinformation combat fact-checking\nlex: fake news disinformation media literacy\nlex: countering political misinformation strategies\nvec: what strategies can be used to identify and counter political misinformation\nvec: how can media literacy and fact-checking help address false political claims\nhyde: Combat political misinformation by checking claims against nonpartisan fact-checkers like PolitiFact, Snopes, and FactCheck.org. Verify the original source before sharing. Teach media literacy skills: examine the URL, author credentials, and whether other outlets confirm the story. Prebunking—warning people about manipulation techniques before exposure—is more effective than debunking after the fact."}
+{"input": "what is the philosophy of humor?", "output": "lex: philosophy of humor laughter theory\nlex: incongruity superiority relief theory humor\nlex: humor philosophy comedy Bergson\nvec: what are the main philosophical theories that explain why things are funny\nvec: how do incongruity theory, superiority theory, and relief theory explain humor\nhyde: Three major theories explain humor. Superiority theory (Hobbes) says we laugh at others' misfortunes. Relief theory (Freud) says laughter releases nervous energy. Incongruity theory (Kant, Schopenhauer) says humor arises when expectations are violated—we laugh at the gap between what we expect and what occurs."}
+{"input": "how does determinism challenge free will", "output": "lex: determinism free will debate\nlex: causal determinism libertarian compatibilism\nlex: free will philosophy hard determinism\nvec: how does philosophical determinism pose a challenge to the concept of free will\nvec: can free will exist if every event is causally determined by prior events\nhyde: Determinism holds that every event, including human choices, is the inevitable result of prior causes. If our decisions are fully determined by brain states, genetics, and environment, then free will appears illusory. Compatibilists like Hume argue free will means acting on one's desires without external coercion, which is compatible with determinism."}
+{"input": "how to write compelling endings?", "output": "lex: writing compelling story ending\nlex: novel ending techniques resolution climax\nlex: satisfying conclusion fiction writing\nvec: what techniques do authors use to write powerful and satisfying story endings\nvec: how to craft a compelling ending that resolves the plot and resonates emotionally\nhyde: A compelling ending resolves the central conflict while delivering an emotional payoff. Techniques include the circular ending (returning to an opening image with new meaning), the surprise twist (recontextualizing everything), and the resonant final image. Avoid deus ex machina. The ending should feel both surprising and inevitable—earned by what came before."}
+{"input": "how to make scientific presentations engaging", "output": "lex: scientific presentation engaging tips\nlex: science talk slides audience storytelling\nlex: research presentation design delivery\nvec: how can scientists make their research presentations more engaging and accessible\nvec: what techniques improve the delivery and visual design of scientific talks\nhyde: Make scientific presentations engaging by opening with a question or surprising finding rather than an outline slide. Use large visuals and minimal text—no more than 6 words per slide. Tell a story: setup the problem, build tension with the data, and deliver the conclusion as a punchline. Practice to stay under time and make eye contact."}
+{"input": "how to draw with a graphic tablet?", "output": "lex: graphic tablet drawing digital art\nlex: Wacom drawing tablet pen pressure\nlex: digital drawing tablet beginner setup\nvec: how do you set up and start drawing with a graphic tablet for digital art\nvec: what are tips for beginners learning to draw on a Wacom or similar tablet\nhyde: Set up your graphic tablet by installing the driver software and calibrating pen pressure. Start in a drawing program like Clip Studio Paint or Krita. The key challenge is hand-eye coordination—you draw on the tablet but look at the screen. Practice simple lines and circles to build muscle memory. Adjust pressure sensitivity curves to match your drawing style."}
+{"input": "how to build a capsule wardrobe", "output": "lex: capsule wardrobe essentials minimalist\nlex: capsule wardrobe build pieces mix match\nlex: minimalist wardrobe basics clothing\nvec: how do you create a capsule wardrobe with a minimal set of versatile clothing pieces\nvec: what are the essential items and steps to build a functional capsule wardrobe\nhyde: A capsule wardrobe consists of 30-40 versatile pieces that mix and match. Start by choosing a neutral color palette (black, navy, white, beige). Include 2-3 pairs of pants, 5-7 tops, 2 jackets, 2 pairs of shoes, and 1-2 dresses or suits. Remove items you haven't worn in a year. Invest in quality basics over trendy pieces."}
+{"input": "what was the impact of the berlin wall?", "output": "lex: Berlin Wall impact fall 1989\nlex: Berlin Wall Cold War Germany division\nlex: Berlin Wall consequences reunification\nvec: what was the historical impact of the Berlin Wall on Germany and the Cold War\nvec: how did the fall of the Berlin Wall in 1989 change Europe and global politics\nhyde: The Berlin Wall divided East and West Berlin from 1961 to 1989, symbolizing the Iron Curtain between communist and capitalist worlds. Its fall on November 9, 1989, triggered German reunification in 1990 and accelerated the collapse of communist regimes across Eastern Europe, effectively ending the Cold War."}
+{"input": "classic literature", "output": "lex: classic literature novels canon\nlex: classic books literary fiction great works\nlex: classic literature reading list authors\nvec: what are the most important works of classic literature and why are they significant\nvec: which classic novels and authors are considered essential reading in the Western literary canon\nhyde: Classic literature includes works that have stood the test of time for their artistic merit, universal themes, and cultural influence. Essential classics include Homer's Odyssey, Shakespeare's Hamlet, Austen's Pride and Prejudice, Dostoevsky's Crime and Punishment, and Fitzgerald's The Great Gatsby."}
+{"input": "how to make slime at home", "output": "lex: homemade slime recipe DIY\nlex: slime glue borax contact solution\nlex: make slime kids craft\nvec: what ingredients and steps do you need to make slime at home\nvec: how to make homemade slime using glue and borax or contact lens solution\nhyde: Mix 1/2 cup of white PVA glue with 1/2 cup of liquid starch or 1 tablespoon of borax dissolved in 1 cup of water. Stir until the slime pulls away from the bowl. Knead with your hands for 2-3 minutes until smooth. Add food coloring or glitter before mixing for a custom look. Store in an airtight container."}
+{"input": "what is the ethics of climate change", "output": "lex: climate change ethics moral responsibility\nlex: climate ethics justice intergenerational\nlex: environmental ethics carbon emissions moral\nvec: what are the ethical and moral dimensions of climate change and environmental responsibility\nvec: how do philosophers approach questions of climate justice and intergenerational obligation\nhyde: Climate ethics addresses who bears moral responsibility for carbon emissions and their consequences. Key questions include intergenerational justice (obligations to future generations), distributive justice (developing nations suffer most but polluted least), and the tragedy of the commons. Philosophers debate whether current generations owe a carbon debt to those who will inherit a warmer world."}
+{"input": "what are leadership qualities", "output": "lex: leadership qualities traits effective\nlex: leader skills communication vision integrity\nlex: leadership characteristics management\nvec: what personal qualities and traits define an effective leader\nvec: which skills and characteristics are most important for strong leadership\nhyde: Effective leaders demonstrate integrity, clear communication, empathy, and decisiveness. They articulate a compelling vision and inspire others to work toward shared goals. Key qualities include emotional intelligence, accountability, adaptability under pressure, and the ability to delegate while empowering team members to take ownership."}
+{"input": "what is the difference between a credit score and a credit report", "output": "lex: credit score vs credit report difference\nlex: credit report FICO score bureaus\nlex: credit score number credit report history\nvec: what is the difference between a credit score and a credit report\nvec: how does a credit report relate to the credit score number lenders use\nhyde: A credit report is a detailed record of your credit history maintained by bureaus (Equifax, Experian, TransUnion). It lists accounts, payment history, balances, and inquiries. A credit score is a three-digit number (300-850) calculated from your credit report data. FICO scores weigh payment history (35%), amounts owed (30%), length of history (15%), new credit (10%), and credit mix (10%)."}
+{"input": "how to make homemade pizza", "output": "lex: homemade pizza dough recipe\nlex: pizza from scratch oven toppings\nlex: make pizza dough sauce crust\nvec: how do you make pizza from scratch at home with homemade dough and sauce\nvec: what is the best recipe for homemade pizza dough and how do you bake it\nhyde: Mix 3 cups flour, 1 packet yeast, 1 tsp salt, 1 tbsp olive oil, and 1 cup warm water. Knead for 10 minutes and let rise 1 hour. Stretch the dough on a floured surface, spread tomato sauce, add mozzarella and toppings. Bake at 475°F (245°C) on a preheated pizza stone for 10-12 minutes until the crust is golden."}
+{"input": "how to improve workplace productivity", "output": "lex: workplace productivity improvement strategies\nlex: employee productivity time management office\nlex: work efficiency focus deep work\nvec: what strategies and techniques can improve productivity in the workplace\nvec: how can employees and managers increase work output and reduce wasted time\nhyde: Improve workplace productivity by eliminating unnecessary meetings, batching similar tasks together, and protecting blocks of uninterrupted focus time. Use the Eisenhower Matrix to prioritize tasks by urgency and importance. Managers should set clear goals, reduce bureaucratic overhead, and ensure employees have the tools and autonomy they need."}
+{"input": "what is the role of clergy in christianity", "output": "lex: clergy role Christianity priest pastor\nlex: Christian minister ordained church leadership\nlex: priest pastor deacon church clergy duties\nvec: what roles and responsibilities do clergy members serve in Christian churches\nvec: how do priests, pastors, and deacons function within different Christian denominations\nhyde: Christian clergy serve as spiritual leaders, administering sacraments, preaching sermons, and providing pastoral care. In Catholicism, ordained priests celebrate Mass, hear confessions, and perform baptisms. Protestant pastors focus on preaching and teaching Scripture. Deacons serve the community through charity and administrative support. The clergy structure varies widely across denominations."}
+{"input": "how does virtue ethics work", "output": "lex: virtue ethics Aristotle moral character\nlex: virtue ethics eudaimonia character traits\nlex: Aristotelian ethics virtues vices\nvec: how does virtue ethics evaluate moral action based on character rather than rules\nvec: what is Aristotle's approach to virtue ethics and how does it define the good life\nhyde: Virtue ethics, rooted in Aristotle's Nicomachean Ethics, holds that moral action flows from virtuous character rather than following rules (deontology) or maximizing outcomes (consequentialism). Virtues like courage, temperance, and justice are developed through practice. The goal is eudaimonia—human flourishing—achieved by living according to reason and cultivating the mean between excess and deficiency."}
+{"input": "what are the challenges of climate science", "output": "lex: climate science challenges research\nlex: climate modeling uncertainty data gaps\nlex: climate change research limitations predictions\nvec: what are the major scientific challenges in studying and predicting climate change\nvec: why is climate modeling difficult and what uncertainties do climate scientists face\nhyde: Climate science faces challenges including modeling complex feedback loops (clouds, ocean currents, ice sheets), limited historical data from pre-instrumental periods, and the chaotic nature of weather systems. Regional predictions are harder than global ones. Tipping points—thresholds beyond which changes become irreversible—are difficult to predict with current models."}
+{"input": "how to reduce stress naturally", "output": "lex: reduce stress naturally techniques\nlex: stress relief meditation exercise breathing\nlex: natural stress management relaxation\nvec: what natural methods and lifestyle changes can help reduce stress without medication\nvec: how do exercise, meditation, and breathing techniques reduce stress levels\nhyde: Reduce stress naturally by exercising 30 minutes daily—aerobic exercise lowers cortisol and releases endorphins. Practice deep breathing: inhale for 4 counts, hold for 7, exhale for 8. Meditate for 10 minutes each morning. Limit caffeine and alcohol, sleep 7-9 hours, and spend time in nature. Progressive muscle relaxation and journaling also help."}
+{"input": "how to start trail running", "output": "lex: trail running beginner start\nlex: trail running shoes gear technique\nlex: off-road running trails tips\nvec: how do beginners get started with trail running and what gear is needed\nvec: what training tips and safety advice should new trail runners follow\nhyde: Start trail running on well-marked, relatively flat trails. Invest in trail running shoes with lugged soles for traction. Run by effort, not pace—expect to be 1-2 minutes per mile slower than road pace. Walk the uphills, run the flats and downhills. Carry water on runs over 45 minutes. Watch your footing and shorten your stride on technical terrain."}
+{"input": "how to write a literary essay?", "output": "lex: literary essay writing analysis\nlex: literary analysis thesis evidence essay\nlex: English literature essay structure argument\nvec: how do you write a strong literary analysis essay with a clear thesis and evidence\nvec: what is the structure and approach for writing an essay analyzing a work of literature\nhyde: A literary essay argues a specific thesis about a text using evidence from the work itself. Open with a hook and thesis statement. Each body paragraph should present a claim, textual evidence (quotations), and analysis explaining how the evidence supports your argument. Use close reading to examine language, imagery, symbolism, and structure. Conclude by synthesizing your argument."}
+{"input": "sustainable development goals", "output": "lex: sustainable development goals SDGs UN\nlex: SDG 2030 agenda United Nations\nlex: UN sustainability goals poverty climate\nvec: what are the United Nations Sustainable Development Goals and what do they aim to achieve\nvec: how are the 17 SDGs structured and what progress has been made toward the 2030 agenda\nhyde: The 17 Sustainable Development Goals (SDGs) were adopted by the United Nations in 2015 as a universal call to action by 2030. They include: No Poverty (SDG 1), Zero Hunger (SDG 2), Good Health (SDG 3), Quality Education (SDG 4), Gender Equality (SDG 5), Clean Water (SDG 6), and Climate Action (SDG 13), among others."}
+{"input": "how to navigate with gps", "output": "lex: GPS navigation outdoor use\nlex: GPS coordinates waypoint route handheld\nlex: GPS device map navigation hiking\nvec: how do you use a GPS device or app for outdoor navigation and route finding\nvec: how to read GPS coordinates and set waypoints for hiking or travel\nhyde: To navigate with GPS, first mark your starting point as a waypoint. Enter your destination coordinates or select a point on the map. The GPS receiver triangulates your position using signals from at least 4 satellites. Follow the bearing and distance readings to your waypoint. Always carry a paper map and compass as backup in case of battery failure."}
+{"input": "how to conduct a scientific experiment", "output": "lex: scientific experiment method steps\nlex: scientific method hypothesis variables control\nlex: experiment design procedure data collection\nvec: what are the steps involved in designing and conducting a proper scientific experiment\nvec: how do you set up controls, variables, and data collection for a science experiment\nhyde: A scientific experiment follows these steps: 1) Ask a question, 2) Research background, 3) Form a hypothesis, 4) Design the experiment with independent, dependent, and controlled variables, 5) Collect data through repeated trials, 6) Analyze results using statistics, 7) Draw conclusions. Always include a control group and change only one variable at a time."}
+{"input": "digital transformation strategy implementation", "output": "lex: digital transformation strategy enterprise\nlex: digital transformation implementation roadmap\nlex: enterprise digitalization technology adoption\nvec: how do organizations plan and implement a digital transformation strategy\nvec: what are the key phases and challenges of enterprise digital transformation\nhyde: Digital transformation strategy begins with assessing current technology maturity and identifying high-impact processes for digitization. Build a roadmap with quick wins (cloud migration, workflow automation) and long-term goals (data-driven decision making, AI integration). Assign executive sponsorship, train employees, and measure success with KPIs like cycle time reduction and customer satisfaction scores."}
+{"input": "how to improve sleep quality naturally?", "output": "lex: improve sleep quality natural remedies\nlex: sleep hygiene tips better rest\nlex: insomnia natural treatment melatonin\nvec: what natural methods and sleep hygiene habits improve the quality of sleep\nvec: how can you fall asleep faster and sleep more deeply without medication\nhyde: Improve sleep quality by maintaining a consistent schedule—go to bed and wake at the same time daily. Keep your bedroom cool (65-68°F), dark, and quiet. Avoid screens for 1 hour before bed since blue light suppresses melatonin. Limit caffeine after noon. Exercise regularly but not within 3 hours of bedtime. Try magnesium supplements or chamomile tea."}
+{"input": "how to build customer loyalty", "output": "lex: customer loyalty retention strategies\nlex: loyalty program repeat customers brand\nlex: customer retention engagement satisfaction\nvec: what strategies do businesses use to build long-term customer loyalty and retention\nvec: how do loyalty programs and customer experience drive repeat business\nhyde: Build customer loyalty by delivering consistent quality and exceeding expectations. Implement a points-based loyalty program offering meaningful rewards. Personalize communications using purchase history data. Respond to complaints within 24 hours and resolve them generously. Customers who feel valued spend 67% more than new customers. Track Net Promoter Score to measure loyalty over time."}
+{"input": "what is consequentialism", "output": "lex: consequentialism ethics moral theory\nlex: consequentialism utilitarianism outcomes\nlex: consequentialist ethics Mill Bentham\nvec: what is consequentialism and how does it evaluate the morality of actions\nvec: how does consequentialist ethics judge right and wrong based on outcomes and consequences\nhyde: Consequentialism is a moral theory holding that the rightness of an action depends solely on its outcomes. The most well-known form is utilitarianism (Bentham, Mill), which aims to maximize overall happiness or well-being. An action is morally right if it produces the best consequences for the greatest number of people, regardless of the actor's intentions."}
+{"input": "how does philosophy approach artificial intelligence?", "output": "lex: philosophy artificial intelligence AI ethics\nlex: AI philosophy consciousness mind machine\nlex: philosophy of AI Turing test Chinese room\nvec: how do philosophers analyze questions about artificial intelligence and machine consciousness\nvec: what philosophical problems does AI raise about minds, consciousness, and moral status\nhyde: Philosophers approach AI through questions of consciousness (can machines be conscious?), the Chinese Room argument (Searle argued symbol manipulation isn't understanding), the Turing test (behavioral equivalence), and moral status (should sentient AI have rights?). The alignment problem—ensuring AI systems pursue human values—has become a central concern in philosophy of technology."}
+{"input": "how to reduce sugar intake", "output": "lex: reduce sugar intake diet\nlex: cut sugar cravings low sugar eating\nlex: sugar consumption health alternatives\nvec: what practical strategies help reduce daily sugar consumption and manage cravings\nvec: how can you cut back on added sugar in your diet without feeling deprived\nhyde: Reduce sugar intake by reading nutrition labels—sugar hides in sauces, bread, and yogurt under names like dextrose, maltose, and high-fructose corn syrup. Replace sugary drinks with water or sparkling water. Eat whole fruit instead of juice. Gradually reduce sugar in coffee over 2 weeks. Protein and fiber at each meal stabilize blood sugar and reduce cravings."}
+{"input": "building resilience", "output": "lex: building resilience mental toughness\nlex: emotional resilience coping skills adversity\nlex: psychological resilience strategies stress\nvec: how can individuals build emotional and psychological resilience to handle adversity\nvec: what habits and mindset shifts help develop personal resilience and mental toughness\nhyde: Building resilience involves developing a growth mindset, maintaining social connections, and practicing self-care. Reframe setbacks as learning opportunities. Cultivate problem-solving skills rather than ruminating on what went wrong. Regular exercise, adequate sleep, and mindfulness strengthen your capacity to recover from stress. Resilient people accept what they cannot control and focus energy on what they can."}
+{"input": "how to attend a town hall meeting", "output": "lex: town hall meeting attend participate\nlex: local government town hall public forum\nlex: town hall meeting preparation questions\nvec: how do you find and attend a local town hall meeting to participate in government\nvec: what should you prepare before attending a town hall meeting with your representative\nhyde: Find town hall meetings through your representative's website, social media, or local newspaper. Arrive early to get a seat. Prepare a concise question or statement under 60 seconds. Introduce yourself as a constituent and mention your town. Be respectful and specific—reference a bill number or policy. Many representatives also hold virtual town halls you can join online."}
+{"input": "google sheets", "output": "lex: Google Sheets spreadsheet formulas\nlex: Google Sheets tutorial functions tips\nlex: Google Sheets pivot table VLOOKUP\nvec: how to use Google Sheets for data analysis with formulas and functions\nvec: what are the most useful Google Sheets features, formulas, and keyboard shortcuts\nhyde: Google Sheets is a free cloud-based spreadsheet application. Key functions include VLOOKUP for searching data across columns, SUMIF for conditional totals, and QUERY for SQL-like data filtering. Use Ctrl+/ to view keyboard shortcuts. Create pivot tables via Data > Pivot table. Share sheets with collaborators for real-time editing."}
+{"input": "how to manage digital distractions?", "output": "lex: manage digital distractions focus\nlex: phone screen time notification blocking\nlex: digital distraction productivity apps\nvec: how can you reduce digital distractions from phones and social media to stay focused\nvec: what tools and strategies help manage screen time and notification overload\nhyde: Manage digital distractions by turning off non-essential notifications. Use app blockers like Freedom or Cold Turkey during focus periods. Set your phone to Do Not Disturb and place it in another room. Schedule specific times to check email and social media rather than responding in real-time. Use Screen Time (iOS) or Digital Wellbeing (Android) to track and limit usage."}
+{"input": "what are stem cells", "output": "lex: stem cells types function biology\nlex: stem cell embryonic adult pluripotent\nlex: stem cell therapy regenerative medicine\nvec: what are stem cells and what makes them different from regular cells in the body\nvec: how are stem cells used in medical research and regenerative medicine\nhyde: Stem cells are undifferentiated cells that can self-renew and differentiate into specialized cell types. Embryonic stem cells are pluripotent—they can become any cell type. Adult stem cells are multipotent, limited to specific tissues (e.g., hematopoietic stem cells produce blood cells). Induced pluripotent stem cells (iPSCs) are adult cells reprogrammed to an embryonic-like state."}
+{"input": "how does literary geography influence narratives?", "output": "lex: literary geography narrative place setting\nlex: geography literature landscape sense of place\nlex: spatial narrative setting fiction geography\nvec: how does the geography and physical setting of a story influence its narrative and themes\nvec: what role does sense of place and landscape play in shaping literary narratives\nhyde: Literary geography examines how real and imagined places shape narrative meaning. Faulkner's Yoknapatawpha County embodies Southern decay and racial tension. Hardy's Wessex landscapes mirror characters' emotional states. Setting is not just backdrop—it constrains plot, shapes character psychology, and carries symbolic weight. Urban and rural spaces generate distinct narrative possibilities."}
+{"input": "what were the causes of world war ii", "output": "lex: causes World War II WWII origins\nlex: WWII causes Treaty Versailles Hitler aggression\nlex: World War 2 causes appeasement fascism\nvec: what were the main political and economic causes that led to World War II\nvec: how did the Treaty of Versailles, fascism, and appeasement contribute to the outbreak of WWII\nhyde: World War II resulted from multiple causes: the punitive Treaty of Versailles (1919) imposed crippling reparations on Germany, fueling resentment. The Great Depression created economic desperation exploited by fascist movements. Hitler's expansionist aggression—remilitarizing the Rhineland, annexing Austria, and invading Czechoslovakia—met with appeasement from Britain and France until the invasion of Poland in September 1939."}
+{"input": "what is the role of faith in spirituality", "output": "lex: faith role spirituality belief\nlex: spiritual faith trust divine religious\nlex: faith spirituality meaning transcendence\nvec: what role does faith play in spiritual practice and personal transcendence\nvec: how does faith relate to spiritual growth and the search for meaning\nhyde: Faith in spirituality serves as the foundation for trust in a reality beyond the material world. It enables surrender to uncertainty and provides a framework for interpreting suffering and purpose. Unlike dogmatic belief, spiritual faith often involves personal experience—a felt sense of connection to something greater that sustains practice through doubt and difficulty."}
+{"input": "how to contribute to political campaigns", "output": "lex: political campaign contribution donate volunteer\nlex: volunteer political campaign canvassing\nlex: campaign donation fundraising grassroots\nvec: how can individuals contribute to political campaigns through donations or volunteering\nvec: what are the different ways to get involved in a political campaign as a volunteer\nhyde: Contribute to political campaigns by donating through the candidate's official website (individual contributions are limited to $3,300 per election per candidate in federal races). Volunteer to canvass door-to-door, phone bank, or text bank. Attend campaign events, host a house party, or share the candidate's message on social media. Small-dollar donations are increasingly impactful."}
+{"input": "what is the importance of meditation in spirituality?", "output": "lex: meditation spirituality importance practice\nlex: spiritual meditation mindfulness contemplation\nlex: meditation enlightenment inner peace spiritual\nvec: why is meditation considered essential to many spiritual traditions and practices\nvec: how does meditation contribute to spiritual growth and inner transformation\nhyde: Meditation is central to nearly every spiritual tradition. In Buddhism, vipassana meditation cultivates insight into impermanence. Hindu dhyana aims for union with Brahman. Christian contemplative prayer seeks direct experience of God. Across traditions, meditation quiets mental chatter, develops present-moment awareness, and opens practitioners to transcendent experience."}
+{"input": "how to prune fruit trees?", "output": "lex: prune fruit trees technique timing\nlex: fruit tree pruning winter dormant cuts\nlex: apple pear tree pruning branches\nvec: when and how should you prune fruit trees for better growth and fruit production\nvec: what pruning techniques are used for apple, pear, and other fruit trees\nhyde: Prune fruit trees during late winter dormancy (January-March) before buds break. Remove dead, diseased, and crossing branches first. Open the center of the tree to allow sunlight and air circulation. Make cuts at a 45-degree angle just above an outward-facing bud. Remove water sprouts (vertical shoots) and suckers from the base. Never remove more than 25% of the canopy in one season."}
+{"input": "what is conservation biology", "output": "lex: conservation biology biodiversity preservation\nlex: conservation biology endangered species habitat\nlex: wildlife conservation ecology management\nvec: what is conservation biology and what are its main goals and methods\nvec: how do conservation biologists work to protect endangered species and biodiversity\nhyde: Conservation biology is the scientific study of preserving biodiversity and preventing extinction. It combines ecology, genetics, and landscape management to protect threatened species and ecosystems. Key approaches include habitat restoration, establishing wildlife corridors, captive breeding programs, and designating protected areas. The field was formalized in the 1980s by Michael Soulé."}
+{"input": "how do muslims observe hajj?", "output": "lex: Hajj Muslim pilgrimage Mecca rituals\nlex: Hajj rites Kaaba Arafat Mina Islam\nlex: Islamic pilgrimage Hajj steps obligations\nvec: what are the rituals and steps Muslims follow during the Hajj pilgrimage to Mecca\nvec: how do Muslims prepare for and perform the Hajj pilgrimage\nhyde: Hajj occurs annually during Dhul Hijjah, the 12th month of the Islamic calendar. Pilgrims enter a state of ihram (ritual purity) and wear simple white garments. They perform tawaf (circling the Kaaba seven times), sa'i (walking between Safa and Marwah), stand at Arafat in prayer, and stone the pillars at Mina. Hajj concludes with Eid al-Adha, the Festival of Sacrifice."}
+{"input": "digital economy transformation", "output": "lex: digital economy transformation trends\nlex: digital economy e-commerce fintech platform\nlex: economic digitalization technology market 2025\nvec: how is the digital economy transforming traditional industries and business models\nvec: what are the key drivers and trends of digital economic transformation\nhyde: The digital economy encompasses all economic activity enabled by digital technologies. E-commerce, fintech, cloud computing, and platform businesses (Uber, Airbnb) have disrupted traditional industries. By 2025, the digital economy accounts for over 15% of global GDP. Key drivers include mobile internet penetration, AI automation, and the shift to subscription-based and data-driven business models."}
+{"input": "how does philosophy address systemic injustice?", "output": "lex: philosophy systemic injustice structural oppression\nlex: social justice philosophy racial gender inequality\nlex: systemic injustice Rawls critical race theory\nvec: how do philosophers analyze and propose solutions to systemic injustice and structural oppression\nvec: what philosophical frameworks address racial, gender, and economic systemic inequality\nhyde: Philosophers address systemic injustice through multiple frameworks. Rawls's veil of ignorance argues just institutions would be designed without knowing one's social position. Critical race theory examines how legal and social structures perpetuate racial inequality. Iris Marion Young distinguished five faces of oppression: exploitation, marginalization, powerlessness, cultural imperialism, and violence."}
+{"input": "how to analyze a political speech", "output": "lex: political speech analysis rhetoric\nlex: speech analysis persuasion ethos pathos logos\nlex: rhetorical analysis political discourse\nvec: what techniques are used to analyze the rhetoric and persuasive strategies in political speeches\nvec: how do you evaluate a political speech for logical arguments, emotional appeals, and credibility\nhyde: Analyze a political speech by examining its rhetorical appeals: ethos (credibility—does the speaker establish authority?), pathos (emotion—what feelings are evoked?), and logos (logic—are arguments supported by evidence?). Identify rhetorical devices like repetition, anaphora, and metaphor. Consider the audience, context, and what the speaker wants listeners to do."}
+{"input": "how to support clean energy initiatives?", "output": "lex: clean energy support renewable initiatives\nlex: renewable energy advocacy solar wind policy\nlex: clean energy action community support\nvec: how can individuals and communities support clean energy initiatives and policies\nvec: what actions can people take to promote renewable energy adoption in their area\nhyde: Support clean energy by installing solar panels or subscribing to community solar. Switch to a green electricity provider. Contact elected officials to support renewable energy legislation and tax credits. Invest in clean energy funds. Drive electric or hybrid vehicles. Advocate for local building codes that require energy efficiency standards. Join or donate to organizations like the Sierra Club or local clean energy cooperatives."}
+{"input": "how to diagnose car starting problems?", "output": "lex: car starting problems diagnosis troubleshoot\nlex: car won't start battery starter ignition\nlex: engine cranks no start fuel spark\nvec: how do you diagnose why a car won't start and identify the root cause\nvec: what are the common reasons a car fails to start and how to troubleshoot them\nhyde: If the car clicks but won't crank, the battery is likely dead—test with a multimeter (should read 12.6V). If the engine cranks but won't start, check fuel delivery (listen for the fuel pump whine) and spark (pull a plug and check for spark). A no-crank, no-click condition often points to a failed starter motor or corroded battery terminals."}
+{"input": "how to identify personal values and beliefs?", "output": "lex: identify personal values beliefs self-reflection\nlex: core values assessment life priorities\nlex: personal values exercise self-awareness\nvec: how can you identify and clarify your core personal values and beliefs\nvec: what exercises and reflection methods help discover what you truly value in life\nhyde: Identify your core values by reflecting on peak experiences—moments when you felt most fulfilled and authentic. Write down 10-15 values (integrity, creativity, family, freedom) and narrow to your top 5. Ask: what angers you when it's violated? What would you fight for? A values card sort exercise—ranking printed values—can clarify priorities you struggle to articulate."}
+{"input": "what is the significance of the gnostic gospels?", "output": "lex: gnostic gospels significance Nag Hammadi\nlex: gnostic texts Gospel Thomas early Christianity\nlex: gnostic gospels meaning heresy Christian\nvec: what are the gnostic gospels and why are they significant for understanding early Christianity\nvec: how did the Nag Hammadi discovery change our knowledge of gnostic Christian texts\nhyde: The gnostic gospels are early Christian texts discovered at Nag Hammadi, Egypt in 1945. They include the Gospel of Thomas, Gospel of Philip, and Gospel of Truth. These texts reveal diverse beliefs in early Christianity—including the idea that salvation comes through secret knowledge (gnosis) rather than faith alone. They were excluded from the biblical canon as heretical by the 4th century church."}
+{"input": "russia train", "output": "lex: Russia train travel Trans-Siberian railway\nlex: Russian railway routes tickets booking\nlex: Trans-Siberian Express Moscow Vladivostok\nvec: how to travel by train in Russia and what are the major railway routes\nvec: what is the Trans-Siberian Railway and how do you book tickets for Russian trains\nhyde: The Trans-Siberian Railway is the longest railway line in the world, spanning 9,289 km from Moscow to Vladivostok over 6 days. Book tickets through Russian Railways (RZD) at rzd.ru or through agents like RealRussia. Classes include platzkart (open berth), kupe (4-person compartment), and SV (2-person sleeper). Bring your own food for long journeys."}
+{"input": "how do you write an effective book review?", "output": "lex: book review writing effective structure\nlex: write book review summary critique\nlex: book review template opinion analysis\nvec: how do you write a thoughtful and effective book review with summary and analysis\nvec: what structure and elements make a strong book review for publication or school\nhyde: An effective book review opens with the book's title, author, genre, and a one-sentence summary. Discuss the main themes and the author's writing style. Include specific examples and short quotations. Evaluate strengths and weaknesses honestly. Avoid spoilers for fiction. End with a recommendation and who would enjoy the book. Aim for 500-800 words."}
+{"input": "how to practice self-compassion?", "output": "lex: self-compassion practice exercises\nlex: self-compassion Kristin Neff mindfulness\nlex: self-kindness inner critic self-care\nvec: what are practical ways to practice self-compassion and quiet your inner critic\nvec: how does Kristin Neff's framework for self-compassion work in daily life\nhyde: Kristin Neff defines self-compassion as three components: self-kindness (treating yourself as you would a friend), common humanity (recognizing suffering is shared), and mindfulness (acknowledging pain without over-identifying). Practice by placing your hand on your heart when distressed and saying: \"This is a moment of suffering. Suffering is part of life. May I be kind to myself.\""}
+{"input": "what is the significance of pilgrimage in religion?", "output": "lex: pilgrimage religion significance spiritual\nlex: religious pilgrimage Mecca Jerusalem Varanasi\nlex: pilgrimage sacred journey faith tradition\nvec: why is pilgrimage important across different religious traditions\nvec: what spiritual significance does the act of pilgrimage carry in major world religions\nhyde: Pilgrimage holds deep significance across religions. Muslims perform Hajj to Mecca as one of the Five Pillars. Christians journey to Jerusalem, Rome, and Santiago de Compostela. Hindus bathe in the Ganges at Varanasi. The physical journey symbolizes an inner spiritual transformation—leaving ordinary life, enduring hardship, and arriving at a sacred place of renewal and encounter with the divine."}
+{"input": "api doc", "output": "lex: API documentation reference endpoints\nlex: REST API docs developer guide\nlex: API documentation Swagger OpenAPI\nvec: how to read and use API documentation for integrating with a web service\nvec: what tools and formats are used for creating and hosting API documentation\nhyde: API documentation describes available endpoints, request/response formats, authentication methods, and error codes. RESTful APIs typically document each endpoint with its HTTP method (GET, POST, PUT, DELETE), URL path, query parameters, request body schema, and example responses. Tools like Swagger/OpenAPI generate interactive docs where developers can test endpoints directly."}
+{"input": "how to boil an egg perfectly", "output": "lex: boil egg perfectly soft hard\nlex: boiled egg timing minutes technique\nlex: perfect hard soft boiled egg recipe\nvec: how long do you boil an egg for soft-boiled and hard-boiled results\nvec: what is the best technique for boiling eggs so they peel easily and cook perfectly\nhyde: Place eggs in a single layer in a pot and cover with cold water by 1 inch. Bring to a rolling boil, then remove from heat and cover. For soft-boiled: 6-7 minutes. For medium: 9-10 minutes. For hard-boiled: 12-13 minutes. Transfer immediately to an ice bath for 5 minutes. Older eggs (7-10 days) peel more easily than fresh ones."}
+{"input": "how to create a home office space", "output": "lex: home office setup design workspace\nlex: home office desk chair ergonomic\nlex: work from home office organization\nvec: how do you set up a productive and ergonomic home office workspace\nvec: what furniture, lighting, and layout create the best home office environment\nhyde: Set up your home office in a quiet room with natural light. Invest in an ergonomic chair with lumbar support and a desk at elbow height (28-30 inches). Position your monitor at arm's length with the top at eye level. Use a desk lamp with 4000-5000K color temperature. Keep cables organized and add a plant—studies show greenery reduces stress and improves focus."}
+{"input": "what are the basic laws of thermodynamics", "output": "lex: laws of thermodynamics basic physics\nlex: thermodynamics first second third law entropy\nlex: thermodynamic laws energy heat transfer\nvec: what are the four laws of thermodynamics and what does each one describe\nvec: how do the laws of thermodynamics govern energy transfer and entropy\nhyde: The zeroth law establishes thermal equilibrium: if A and B are each in equilibrium with C, they are in equilibrium with each other. The first law states energy cannot be created or destroyed (conservation of energy). The second law says entropy in a closed system always increases—heat flows from hot to cold, never the reverse. The third law states entropy approaches zero as temperature approaches absolute zero."}
+{"input": "how to create a home yoga space", "output": "lex: home yoga space setup room\nlex: yoga room design mat props space\nlex: home yoga studio create practice area\nvec: how do you set up a dedicated yoga practice space in your home\nvec: what equipment and room setup do you need for a home yoga studio\nhyde: Create a home yoga space in an area with at least 6x8 feet of clear floor space. Use a non-slip yoga mat (6mm thickness for comfort). Add blocks, a strap, and a bolster for supported poses. Keep the space clutter-free and at a comfortable temperature (68-72°F). Soft natural light and a small speaker for calming music enhance the atmosphere."}
+{"input": "what is the bible?", "output": "lex: Bible Christian scripture holy book\nlex: Bible Old New Testament books\nlex: Bible history composition canon\nvec: what is the Bible and how is it organized into Old and New Testaments\nvec: how was the Bible composed and compiled over time as a sacred text\nhyde: The Bible is the sacred scripture of Christianity, consisting of the Old Testament (39 books in Protestant tradition, 46 in Catholic) and the New Testament (27 books). The Old Testament includes the Torah, historical books, poetry, and prophets, written primarily in Hebrew. The New Testament contains the Gospels, Acts, Epistles, and Revelation, written in Greek during the 1st century CE."}
+{"input": "how does virtue ethics differ from other ethical theories", "output": "lex: virtue ethics vs deontology consequentialism\nlex: virtue ethics comparison ethical theories\nlex: Aristotle virtue ethics Kant Mill contrast\nvec: how does virtue ethics differ from deontological and consequentialist moral theories\nvec: what makes virtue ethics unique compared to rule-based and outcome-based ethical frameworks\nhyde: Virtue ethics (Aristotle) asks \"What kind of person should I be?\" rather than \"What should I do?\" Deontology (Kant) focuses on following moral rules regardless of outcomes. Consequentialism (Mill) judges actions by their results. Virtue ethics emphasizes developing moral character through habit and practical wisdom, while the others prescribe universal principles or calculations."}
+{"input": "how genetic research impacts medicine", "output": "lex: genetic research medicine impact\nlex: genomics personalized medicine gene therapy\nlex: genetic testing pharmacogenomics CRISPR\nvec: how has genetic research transformed medical treatments and diagnosis\nvec: what advances in genomics and gene therapy are changing the future of medicine\nhyde: Genetic research has revolutionized medicine through pharmacogenomics (tailoring drug dosages to genetic profiles), gene therapy (correcting defective genes, as in the FDA-approved Luxturna for inherited blindness), and CRISPR gene editing (potential cures for sickle cell disease). Genetic testing identifies cancer risk (BRCA1/2 mutations) enabling early screening and prevention."}
+{"input": "how to fix car scratches?", "output": "lex: fix car scratches paint repair\nlex: car scratch removal polish compound\nlex: auto paint scratch repair DIY\nvec: how do you repair and remove scratches from a car's paint finish at home\nvec: what products and techniques fix different types of car paint scratches\nhyde: Car scratches fall into three categories: clear coat scratches (light, fingernail doesn't catch), base coat scratches (deeper, white visible), and primer/metal scratches (deepest). For clear coat scratches, use rubbing compound followed by polish. For deeper scratches, apply touch-up paint matching your car's color code (found on the door jamb sticker), then clear coat and wet sand with 2000-grit."}
+{"input": "how digital currencies work", "output": "lex: digital currency cryptocurrency blockchain\nlex: Bitcoin cryptocurrency how it works\nlex: digital currency blockchain mining wallet\nvec: how do digital currencies like Bitcoin use blockchain technology to process transactions\nvec: what is the technical process behind cryptocurrency transactions and mining\nhyde: Digital currencies operate on blockchain technology—a decentralized ledger distributed across thousands of computers. When you send Bitcoin, the transaction is broadcast to the network. Miners validate transactions by solving cryptographic puzzles (proof of work), adding them to a block. Each block links to the previous one, creating an immutable chain. Wallets store private keys that prove ownership."}
+{"input": "what is existentialism", "output": "lex: existentialism philosophy Sartre Kierkegaard\nlex: existentialism existence precedes essence freedom\nlex: existentialist philosophy meaning absurd\nvec: what is existentialism and what are its core philosophical claims about human existence\nvec: how did Sartre, Kierkegaard, and Camus develop existentialist philosophy\nhyde: Existentialism holds that existence precedes essence—humans are not born with a fixed nature but create meaning through choices and actions. Kierkegaard emphasized individual faith and anxiety. Sartre declared we are \"condemned to be free\"—radical freedom brings radical responsibility. Camus confronted the absurd: life has no inherent meaning, yet we must live as if it does."}
+{"input": "what are the key concepts in marxist philosophy", "output": "lex: Marxist philosophy key concepts\nlex: Marx dialectical materialism class struggle surplus\nlex: Marxism alienation historical materialism ideology\nvec: what are the central ideas and concepts in Karl Marx's philosophical framework\nvec: how do dialectical materialism, class struggle, and alienation function in Marxist thought\nhyde: Key concepts in Marxist philosophy include historical materialism (material conditions drive historical change), dialectical materialism (contradictions between productive forces and relations of production), class struggle (bourgeoisie vs. proletariat), alienation (workers separated from their labor's product), surplus value (profit extracted from unpaid labor), and ideology (ruling class ideas that justify the status quo)."}
+{"input": "how to find emotional support", "output": "lex: emotional support resources help\nlex: finding emotional support therapy counseling\nlex: mental health support groups crisis helpline\nvec: where can someone find emotional support during difficult times or mental health challenges\nvec: what resources are available for people seeking emotional support and counseling\nhyde: Find emotional support through multiple channels: talk to a trusted friend or family member. Contact a therapist through Psychology Today's directory or your insurance provider. Call the 988 Suicide and Crisis Lifeline (dial 988) for immediate help. Join support groups through NAMI or local community centers. Online therapy platforms like BetterHelp and Talkspace offer accessible counseling."}
+{"input": "relationship goals", "output": "lex: relationship goals healthy couple\nlex: relationship goals communication trust partnership\nlex: healthy relationship habits couples\nvec: what are realistic and healthy relationship goals for couples to work toward\nvec: how do couples build a strong relationship through communication and shared goals\nhyde: Healthy relationship goals include open and honest communication, maintaining individual identities while building shared experiences, resolving conflicts respectfully without contempt or stonewalling, expressing appreciation daily, supporting each other's personal growth, maintaining physical intimacy, and aligning on major life decisions like finances, children, and career priorities."}
+{"input": "what is the role of media in politics", "output": "lex: media role politics influence\nlex: political media coverage news bias\nlex: media politics democracy journalism fourth estate\nvec: what role does the media play in shaping political discourse and public opinion\nvec: how does news coverage and media bias influence political outcomes and democracy\nhyde: The media serves as the \"fourth estate\" in democracy—informing citizens, holding officials accountable, and setting the public agenda. Media framing shapes which issues voters prioritize. Agenda-setting theory shows that what the media covers becomes what the public considers important. The rise of partisan media and social media algorithms has increased polarization by creating ideological echo chambers."}
+{"input": "what is stream of consciousness", "output": "lex: stream of consciousness literary technique\nlex: stream of consciousness narrative style\nvec: what does stream of consciousness mean as a writing technique in literature\nvec: how does stream of consciousness narration work in novels and fiction\nhyde: Stream of consciousness is a narrative technique that presents a character's continuous flow of thoughts, feelings, and sensory impressions as they occur. Pioneered by writers like Virginia Woolf and James Joyce, it mimics the unstructured way the human mind processes experience."}
+{"input": "where to find budget travel tips", "output": "lex: budget travel tips cheap flights accommodations\nlex: affordable travel planning money saving\nvec: where can I find reliable tips for traveling on a tight budget\nvec: what are the best resources for planning cheap vacations and budget trips\nhyde: To travel on a budget, book flights midweek, use fare comparison tools like Google Flights or Skyscanner, stay in hostels or use house-sitting platforms, and eat at local markets instead of tourist restaurants."}
+{"input": "what is fallibilism", "output": "lex: fallibilism epistemology philosophy\nlex: fallibilism knowledge certainty\nvec: what does fallibilism mean in philosophy and epistemology\nvec: how does fallibilism challenge the idea that knowledge requires absolute certainty\nhyde: Fallibilism is the philosophical doctrine that no belief or claim can ever be conclusively justified or proven beyond all doubt. Associated with Charles Sanders Peirce and Karl Popper, it holds that all human knowledge is provisional and subject to revision."}
+{"input": "auth flow", "output": "lex: authentication flow OAuth JWT\nlex: authorization code flow token exchange\nlex: auth login session management\nvec: how does an authentication and authorization flow work in web applications\nvec: what are the steps in an OAuth 2.0 authorization code flow\nhyde: The OAuth 2.0 authorization code flow begins when the client redirects the user to the authorization server. After login, the server returns an authorization code, which the client exchanges for an access token and refresh token via the token endpoint."}
+{"input": "where to find datasets for scientific research", "output": "lex: scientific research datasets open data repositories\nlex: public datasets academic research download\nvec: where can researchers find free datasets for scientific studies\nvec: what are the best open data repositories for academic and scientific research\nhyde: Public research datasets are available from repositories such as Kaggle, the UCI Machine Learning Repository, NASA's Open Data Portal, NOAA Climate Data, and institutional data archives like Harvard Dataverse and Zenodo."}
+{"input": "ui build", "output": "lex: UI build frontend framework components\nlex: user interface build tooling bundler\nlex: UI component library development\nvec: how to build a user interface for a web or mobile application\nvec: what tools and frameworks are used to build modern frontend UIs\nhyde: To build a responsive UI, start by choosing a component framework such as React, Vue, or Svelte. Use a build tool like Vite or Webpack to bundle assets, and style with CSS modules or Tailwind CSS for rapid layout development."}
+{"input": "how to conserve water at home?", "output": "lex: water conservation home tips\nlex: reduce household water usage\nvec: what are practical ways to conserve water at home and reduce water bills\nvec: how can I use less water in my house for everyday tasks\nhyde: Fix leaky faucets promptly—a single drip can waste over 3,000 gallons per year. Install low-flow showerheads and dual-flush toilets, run dishwashers and washing machines only with full loads, and water your garden early in the morning to minimize evaporation."}
+{"input": "how to obtain information on state legislation", "output": "lex: state legislation tracking bill search\nlex: state law lookup legislative database\nvec: how can I find and track state legislation and bills currently being considered\nvec: what websites or tools let you look up state laws and legislative history\nhyde: To track state legislation, visit your state legislature's official website, which provides bill text, status, and voting records. Tools like LegiScan and the National Conference of State Legislatures (NCSL) aggregate bills across all 50 states."}
+{"input": "what shoes for hiking?", "output": "lex: hiking shoes boots trail footwear\nlex: best hiking boots waterproof ankle support\nvec: what type of shoes or boots should I wear for hiking on trails\nvec: how to choose the right hiking footwear for different terrain and conditions\nhyde: For day hikes on well-maintained trails, lightweight hiking shoes with good tread provide enough support. For rocky or wet terrain, mid-cut waterproof boots with ankle support and Vibram soles offer better protection and stability."}
+{"input": "what is the role of empathy in moral decision-making", "output": "lex: empathy moral decision-making ethics\nlex: empathy role ethical judgment\nvec: how does empathy influence the way people make moral and ethical decisions\nvec: what role does feeling empathy play in moral reasoning and ethical behavior\nhyde: Empathy allows individuals to imagine the experiences of others, which directly influences moral judgment. Studies show that people who score higher on empathy scales are more likely to make prosocial decisions, though critics like Paul Bloom argue empathy can also bias moral reasoning."}
+{"input": "how to improve self-worth?", "output": "lex: improve self-worth self-esteem building\nlex: boost self-confidence self-value exercises\nvec: what are effective strategies to improve your sense of self-worth and self-esteem\nvec: how can someone build stronger self-worth through daily habits and mindset shifts\nhyde: To improve self-worth, start by identifying and challenging negative self-talk. Practice self-compassion, set small achievable goals, keep a journal of accomplishments, and surround yourself with supportive people. Cognitive behavioral techniques can help reframe core beliefs about your value."}
+{"input": "what is cryptography", "output": "lex: cryptography encryption decryption\nlex: cryptographic algorithms symmetric asymmetric\nvec: what is cryptography and how does it protect data through encryption\nvec: how do cryptographic systems work to secure communications and information\nhyde: Cryptography is the science of encoding and decoding information to prevent unauthorized access. It uses algorithms like AES (symmetric) and RSA (asymmetric) to encrypt plaintext into ciphertext. Only parties with the correct key can decrypt the message back to its original form."}
+{"input": "how to photograph reflections", "output": "lex: photography reflections water glass mirror\nlex: reflection photography techniques composition\nvec: what techniques help capture sharp and creative reflection photographs\nvec: how to photograph reflections in water, mirrors, and glass surfaces\nhyde: To photograph reflections, use a polarizing filter to control glare and increase clarity. Shoot at a low angle to maximize the reflected image in water. For mirror or glass reflections, focus manually on the reflected subject rather than the surface itself."}
+{"input": "how do black holes form", "output": "lex: black hole formation stellar collapse\nlex: black holes neutron star supernova\nvec: how do black holes form from dying stars and gravitational collapse\nvec: what is the process by which a massive star becomes a black hole\nhyde: Black holes form when a massive star—typically more than 20 solar masses—exhausts its nuclear fuel and can no longer support itself against gravitational collapse. The core implodes past the neutron star stage, compressing into a singularity surrounded by an event horizon."}
+{"input": "how to conduct literature review in research", "output": "lex: literature review research methodology\nlex: academic literature review systematic search\nvec: how do you conduct a thorough literature review for an academic research paper\nvec: what are the steps to search, organize, and synthesize sources in a literature review\nhyde: Begin by defining your research question, then search databases like PubMed, Google Scholar, and Web of Science using targeted keywords. Screen abstracts for relevance, organize selected papers by theme, and synthesize findings to identify gaps in existing knowledge."}
+{"input": "how do scientists use models", "output": "lex: scientific models simulation prediction\nlex: scientific modeling research methodology\nvec: how do scientists use models to understand and predict natural phenomena\nvec: what types of models do scientists build to test hypotheses and simulate systems\nhyde: Scientists use mathematical, computational, and physical models to represent complex systems. Climate models simulate atmospheric interactions, molecular models predict protein folding, and epidemiological models forecast disease spread. Models are validated against observed data and refined iteratively."}
+{"input": "how to stage a home for sale", "output": "lex: home staging tips selling house\nlex: stage house real estate curb appeal\nvec: how do you stage a home to make it more appealing to potential buyers\nvec: what are the key steps to prepare and stage a house before listing it for sale\nhyde: Declutter every room, remove personal photos, and use neutral paint colors. Arrange furniture to maximize space and natural light. Add fresh flowers, clean all surfaces, and improve curb appeal with trimmed landscaping and a freshly painted front door."}
+{"input": "rim fix", "output": "lex: rim repair bent wheel fix\nlex: alloy rim curb damage repair\nlex: car wheel rim straightening\nvec: how to fix a bent or damaged car wheel rim\nvec: can a curb-damaged alloy rim be repaired and how much does it cost\nhyde: Minor curb rash on alloy rims can be sanded, filled with body filler, and repainted at home. Bent rims require professional straightening on a hydraulic press. If the rim has cracks, replacement is safer than repair."}
+{"input": "what is speculative fiction?", "output": "lex: speculative fiction genre definition\nlex: speculative fiction sci-fi fantasy dystopia\nvec: what is speculative fiction and what genres does it encompass\nvec: how is speculative fiction different from science fiction and fantasy\nhyde: Speculative fiction is an umbrella genre that includes science fiction, fantasy, horror, dystopian, and alternate history literature. It explores \"what if\" scenarios by altering known reality—imagining different technologies, social structures, or natural laws."}
+{"input": "what are algorithms in computer science", "output": "lex: algorithms computer science data structures\nlex: algorithm sorting searching complexity\nvec: what are algorithms in computer science and why are they fundamental\nvec: how do computer science algorithms solve problems through step-by-step procedures\nhyde: An algorithm is a finite sequence of well-defined instructions for solving a class of problems or performing a computation. Common examples include sorting algorithms (quicksort, mergesort), search algorithms (binary search), and graph algorithms (Dijkstra's shortest path)."}
+{"input": "how to calculate car loan payments?", "output": "lex: car loan payment calculator formula\nlex: auto loan monthly payment interest rate\nvec: how do you calculate monthly car loan payments based on principal, interest rate, and term\nvec: what formula is used to determine monthly auto loan payments\nhyde: The monthly car loan payment is calculated using the formula: M = P × [r(1+r)^n] / [(1+r)^n − 1], where P is the principal, r is the monthly interest rate (annual rate divided by 12), and n is the total number of monthly payments."}
+{"input": "how to recycle electronics?", "output": "lex: electronics recycling e-waste disposal\nlex: recycle old computers phones e-waste\nvec: how and where can I recycle old electronics like phones, computers, and TVs\nvec: what is the proper way to dispose of electronic waste responsibly\nhyde: Many retailers like Best Buy and Staples offer free electronics drop-off recycling. Check Earth911.org for local e-waste facilities. Before recycling, wipe personal data from devices. Never throw electronics in regular trash—they contain lead, mercury, and other hazardous materials."}
+{"input": "what is the significance of the anti-hero?", "output": "lex: anti-hero literary significance character\nlex: anti-hero fiction protagonist flawed\nvec: what is the literary significance of the anti-hero as a character type in fiction\nvec: why are anti-heroes important in storytelling and what do they represent\nhyde: The anti-hero challenges traditional notions of heroism by embodying flawed, morally ambiguous traits. Characters like Raskolnikov, Walter White, and Deadpool resonate because they reflect the complexity of human nature, blurring the line between virtue and vice."}
+{"input": "what is the significance of ramadan", "output": "lex: Ramadan significance Islam fasting\nlex: Ramadan holy month Muslim observance\nvec: what is the spiritual and cultural significance of Ramadan in Islam\nvec: why do Muslims observe Ramadan and what does the month represent\nhyde: Ramadan is the ninth month of the Islamic lunar calendar, during which Muslims fast from dawn to sunset. It commemorates the first revelation of the Quran to Prophet Muhammad. The fast cultivates self-discipline, empathy for the hungry, and spiritual closeness to God."}
+{"input": "where to find landscaping stones?", "output": "lex: landscaping stones buy garden rocks\nlex: landscape stone supply yard near me\nvec: where can I buy landscaping stones and decorative rocks for my yard\nvec: what are the best places to find affordable landscaping stones and pavers\nhyde: Landscaping stones can be purchased from home improvement stores like Home Depot and Lowe's, local stone yards, and quarries. For bulk orders, landscape supply companies deliver directly. River rock, flagstone, and pea gravel are popular choices for garden paths and borders."}
+{"input": "where to watch latest movies online", "output": "lex: watch movies online streaming platforms 2026\nlex: latest movies streaming services new releases\nvec: where can I watch the latest movies online through streaming services in 2026\nvec: which streaming platforms have the newest movie releases available to watch\nhyde: New theatrical releases typically arrive on streaming platforms 45-90 days after their cinema debut. Netflix, Amazon Prime Video, Disney+, Apple TV+, and Max each acquire exclusive titles. Check JustWatch.com to see which service currently streams a specific movie."}
+{"input": "what is contemporary art?", "output": "lex: contemporary art definition movement\nlex: contemporary art 21st century modern\nvec: what defines contemporary art and how is it different from modern art\nvec: what are the key characteristics and themes of contemporary art\nhyde: Contemporary art refers to art produced from the late 20th century to the present day. Unlike modern art (roughly 1860s–1970s), contemporary art encompasses a wide range of media—installation, video, digital, and performance—and often engages with identity, globalization, and technology."}
+{"input": "what is the significance of easter", "output": "lex: Easter significance Christianity resurrection\nlex: Easter religious meaning Christian holiday\nvec: what is the religious and cultural significance of Easter in Christianity\nvec: why is Easter considered the most important Christian holiday\nhyde: Easter celebrates the resurrection of Jesus Christ on the third day after his crucifixion, as described in the New Testament Gospels. It is the most important feast in Christianity, marking the fulfillment of prophecy and the foundation of Christian faith in life after death."}
+{"input": "how to install peel and stick wallpaper", "output": "lex: peel and stick wallpaper installation\nlex: self-adhesive wallpaper apply walls\nvec: what are the steps to properly install peel and stick wallpaper on a wall\nvec: how do you apply self-adhesive wallpaper without bubbles or wrinkles\nhyde: Clean the wall surface and let it dry completely. Start at the top, peeling back a few inches of backing at a time. Use a smoothing tool to press the wallpaper flat, working from the center outward to remove air bubbles. Trim excess at the ceiling and baseboard with a sharp blade."}
+{"input": "how do behavioral scientists study behavior", "output": "lex: behavioral science research methods\nlex: behavioral psychology experiments observation\nvec: what methods do behavioral scientists use to study and measure human behavior\nvec: how do behavioral researchers design experiments and observational studies\nhyde: Behavioral scientists study behavior through controlled experiments, field observations, surveys, and neuroimaging. Randomized controlled trials isolate variables, while observational studies capture behavior in natural settings. Eye-tracking and fMRI provide physiological data on decision-making processes."}
+{"input": "soccer training drills", "output": "lex: soccer training drills exercises\nlex: football practice drills passing shooting\nvec: what are effective soccer training drills for improving skills and fitness\nvec: which soccer drills help players improve dribbling, passing, and shooting\nhyde: Set up a cone dribbling course with 10 cones spaced 2 meters apart. Players weave through using inside and outside touches at speed. For passing accuracy, pair players 15 meters apart and practice one-touch passes, alternating feet. Finish sessions with 1v1 attacking drills near the box."}
+{"input": "how to invest in the stock market", "output": "lex: stock market investing beginner guide\nlex: invest stocks brokerage portfolio\nvec: how do beginners start investing in the stock market and building a portfolio\nvec: what are the basic steps to open a brokerage account and buy stocks\nhyde: To start investing, open a brokerage account with a platform like Fidelity, Schwab, or Vanguard. Begin with low-cost index funds that track the S&P 500 for broad diversification. Invest regularly through dollar-cost averaging and avoid trying to time the market."}
+{"input": "what is the role of prophets in christianity?", "output": "lex: prophets Christianity role Bible\nlex: Christian prophets Old Testament New Testament\nvec: what role do prophets play in Christian theology and scripture\nvec: how are prophets understood in Christianity compared to other Abrahamic religions\nhyde: In Christianity, prophets are individuals called by God to deliver divine messages and foretell events. Old Testament prophets like Isaiah and Jeremiah predicted the coming of the Messiah. In the New Testament, Jesus is seen as the ultimate fulfillment of prophetic tradition."}
+{"input": "what is a no-dig garden?", "output": "lex: no-dig garden method sheet mulching\nlex: no-dig gardening lasagna layering technique\nvec: what is a no-dig garden and how do you build one without tilling the soil\nvec: how does the no-dig gardening method work to improve soil health\nhyde: A no-dig garden is built by layering organic materials—cardboard, compost, straw, and leaf mold—directly on top of existing ground. This preserves soil structure, encourages worm activity, suppresses weeds, and builds fertile topsoil without the labor of digging or tilling."}
+{"input": "how to raise startup capital", "output": "lex: raise startup capital funding sources\nlex: startup fundraising seed investors venture capital\nvec: what are the main ways to raise capital for a new startup company\nvec: how do founders raise seed funding and early-stage investment for a startup\nhyde: Startup capital can come from bootstrapping, friends and family, angel investors, venture capital firms, crowdfunding platforms like Kickstarter, or government grants. Prepare a pitch deck with your business model, market size, traction metrics, and financial projections before approaching investors."}
+{"input": "how to save money effectively", "output": "lex: save money tips budgeting strategies\nlex: effective saving habits personal finance\nvec: what are effective strategies and habits for saving money consistently\nvec: how can I create a budget and save more money each month\nhyde: Follow the 50/30/20 rule: allocate 50% of income to needs, 30% to wants, and 20% to savings. Automate transfers to a high-yield savings account on payday. Track spending with an app, cancel unused subscriptions, and build a 3-6 month emergency fund before investing."}
+{"input": "what is the problem of evil", "output": "lex: problem of evil philosophy theodicy\nlex: problem of evil God suffering\nvec: what is the philosophical problem of evil and how does it challenge belief in God\nvec: how do philosophers and theologians respond to the problem of evil and suffering\nhyde: The problem of evil asks: if an omnipotent, omniscient, and benevolent God exists, why does suffering occur? Epicurus first formulated this dilemma. Theodicies like the free will defense and soul-making theodicy attempt to reconcile God's existence with the reality of evil."}
+{"input": "how to register to vote online", "output": "lex: register to vote online voter registration\nlex: online voter registration state website\nvec: how can I register to vote online in my state\nvec: what do I need to register to vote through an online voter registration system\nhyde: Most U.S. states offer online voter registration at vote.org or through the secretary of state's website. You'll need your state-issued ID number or last four digits of your Social Security number, your date of birth, and current residential address."}
+{"input": "what are the principles of evolution", "output": "lex: principles of evolution natural selection\nlex: evolution theory variation inheritance selection\nvec: what are the core principles of biological evolution by natural selection\nvec: how do variation, inheritance, and selection drive the process of evolution\nhyde: Evolution operates through four key principles: variation (individuals differ genetically), inheritance (traits pass from parents to offspring), selection (individuals better adapted to their environment survive and reproduce more), and time (changes accumulate across generations, leading to speciation)."}
+{"input": "explain the ten commandments", "output": "lex: Ten Commandments Bible Exodus Deuteronomy\nlex: Ten Commandments meaning list\nvec: what are the Ten Commandments and what does each one mean\nvec: how are the Ten Commandments explained in the Bible and interpreted by different faiths\nhyde: The Ten Commandments, given to Moses on Mount Sinai, include: (1) You shall have no other gods before me, (2) You shall not make idols, (3) You shall not take the Lord's name in vain, (4) Remember the Sabbath, (5) Honor your father and mother, (6) You shall not murder."}
+{"input": "how to pose people for portraits", "output": "lex: portrait posing techniques photography\nlex: portrait photography poses guide\nvec: what are effective ways to pose people for flattering portrait photographs\nvec: how do professional photographers direct subjects into natural-looking portrait poses\nhyde: Have your subject shift their weight to one foot and angle their body 45 degrees from the camera. Turn the chin slightly down and toward the light. For hands, give them something to hold or rest them naturally. Ask them to breathe out before the shot to relax their expression."}
+{"input": "css grid", "output": "lex: CSS grid layout template columns rows\nlex: CSS grid container gap alignment\nlex: CSS grid-template-areas responsive\nvec: how to create page layouts using CSS grid with rows and columns\nvec: what are the key CSS grid properties for building responsive layouts\nhyde: .container { display: grid; grid-template-columns: repeat(3, 1fr); gap: 1rem; } .item-wide { grid-column: span 2; } CSS Grid allows two-dimensional layout control with explicit row and column definitions, making it ideal for full-page layouts."}
+{"input": "how to go plastic-free in the kitchen?", "output": "lex: plastic-free kitchen alternatives\nlex: reduce plastic kitchen reusable containers\nvec: how can I eliminate single-use plastics from my kitchen\nvec: what are the best plastic-free alternatives for food storage and kitchen items\nhyde: Replace plastic wrap with beeswax wraps or silicone lids. Store food in glass jars or stainless steel containers. Use bar dish soap instead of bottled liquid soap. Buy in bulk using cloth bags, and choose wooden or bamboo utensils over plastic ones."}
+{"input": "what are the teachings of confucius?", "output": "lex: Confucius teachings Confucianism philosophy\nlex: Confucian ethics filial piety ren li\nvec: what are the main teachings and ethical principles of Confucius\nvec: how did Confucius define virtue, proper conduct, and social harmony\nhyde: Confucius emphasized ren (benevolence), li (ritual propriety), xiao (filial piety), and junzi (the ideal of a morally cultivated person). He taught that social harmony comes from fulfilling one's role in relationships—ruler to subject, parent to child, husband to wife, elder to younger, and friend to friend."}
+{"input": "what is performance art?", "output": "lex: performance art definition live medium\nlex: performance art artists examples history\nvec: what is performance art and how does it differ from traditional visual art\nvec: what are the defining characteristics and famous examples of performance art\nhyde: Performance art is a live, time-based art form in which the artist's body and actions are the medium. Emerging in the 1960s and 70s, artists like Marina Abramović, Yoko Ono, and Joseph Beuys blurred boundaries between art and life, often engaging audiences directly."}
+{"input": "how do vaccines work", "output": "lex: vaccines immune system antibodies mechanism\nlex: how vaccines work immunization\nvec: how do vaccines train the immune system to fight diseases\nvec: what is the biological mechanism by which vaccines provide immunity\nhyde: Vaccines introduce a weakened, inactivated, or fragment form of a pathogen (or its mRNA blueprint) into the body. The immune system recognizes it as foreign, produces antibodies, and creates memory cells. If exposed to the real pathogen later, the immune system responds rapidly."}
+{"input": "ai-driven marketing", "output": "lex: AI-driven marketing automation personalization\nlex: artificial intelligence marketing campaigns analytics\nvec: how is artificial intelligence being used to drive marketing strategies and campaigns\nvec: what AI tools and techniques improve marketing personalization and customer targeting\nhyde: AI-driven marketing uses machine learning to segment audiences, predict customer behavior, and personalize content at scale. Tools like predictive analytics, chatbots, and recommendation engines increase conversion rates. A/B testing is automated, and ad spend is optimized in real time by algorithms."}
+{"input": "how to pursue a career in scientific research", "output": "lex: scientific research career path academia\nlex: career scientist PhD research position\nvec: what steps should I take to pursue a career in scientific research\nvec: what education and experience are needed to become a professional researcher in science\nhyde: A career in scientific research typically starts with a bachelor's degree in a STEM field, followed by a PhD program where you specialize in a research area. After completing your doctorate, postdoctoral positions provide additional training before applying for faculty or industry research roles."}
+{"input": "what is cryptocurrency trading?", "output": "lex: cryptocurrency trading buy sell exchange\nlex: crypto trading Bitcoin Ethereum strategies\nvec: what is cryptocurrency trading and how do people buy and sell digital currencies\nvec: how does cryptocurrency trading work on exchanges like Coinbase and Binance\nhyde: Cryptocurrency trading involves buying and selling digital assets like Bitcoin and Ethereum on exchanges. Traders use market orders, limit orders, and stop-losses. Strategies range from long-term holding (HODLing) to day trading based on technical analysis of price charts and volume indicators."}
+{"input": "what is calculus used for", "output": "lex: calculus applications real world uses\nlex: calculus derivatives integrals physics engineering\nvec: what are the real-world applications of calculus in science and engineering\nvec: how is calculus used in physics, economics, and other fields\nhyde: Calculus is used to model rates of change and accumulation. In physics, derivatives describe velocity and acceleration; integrals calculate areas and volumes. Engineers use calculus to design structures, economists model marginal cost and revenue, and biologists model population growth with differential equations."}
+{"input": "how does moral philosophy address human rights", "output": "lex: moral philosophy human rights ethics\nlex: philosophical foundations human rights natural rights\nvec: how does moral philosophy provide a foundation for human rights\nvec: what ethical theories support the concept of universal human rights\nhyde: Moral philosophy grounds human rights through several frameworks: natural law theory holds rights are inherent to human nature, Kantian ethics argues every person deserves dignity as a rational agent, and utilitarianism supports rights as instruments that maximize overall well-being."}
+{"input": "how to choose a writing genre?", "output": "lex: choose writing genre fiction nonfiction\nlex: writing genre selection author style\nvec: how should a writer choose the best genre for their writing style and interests\nvec: what factors help an author decide which literary genre to write in\nhyde: Consider what you love to read—your favorite genre as a reader often translates well. Experiment by writing short pieces in different genres: fantasy, mystery, literary fiction, memoir. Pay attention to which genre energizes you and where your voice feels most natural."}
+{"input": "how to write a standout personal statement", "output": "lex: personal statement writing tips college application\nlex: standout personal statement essay graduate school\nvec: how do you write a compelling personal statement for college or graduate school admissions\nvec: what makes a personal statement stand out to admissions committees\nhyde: Open with a vivid, specific anecdote—not a generic quote. Show rather than tell by describing experiences that shaped your goals. Connect your past to your intended field of study. Be authentic; admissions officers read thousands of essays and recognize genuine voice immediately."}
+{"input": "how to improve sleep quality", "output": "lex: improve sleep quality tips habits\nlex: better sleep hygiene insomnia remedies\nvec: what are proven ways to improve sleep quality and fall asleep faster\nvec: how can I develop better sleep habits to get more restful sleep\nhyde: Maintain a consistent sleep schedule, even on weekends. Keep your bedroom cool (65-68°F), dark, and quiet. Avoid screens for 30 minutes before bed. Limit caffeine after noon. Regular exercise improves sleep, but finish workouts at least 3 hours before bedtime."}
+{"input": "how to stay updated on global affairs", "output": "lex: global affairs news sources current events\nlex: world news reliable sources daily updates\nvec: what are the best ways to stay informed about global affairs and world news\nvec: which news sources and tools help you keep up with international current events\nhyde: Follow reputable outlets like Reuters, AP News, BBC World, and The Economist for balanced global coverage. Use RSS readers or news aggregator apps like Feedly. Subscribe to daily briefing newsletters such as Morning Brew or The Daily from the New York Times."}
+{"input": "what are the characteristics of renaissance architecture?", "output": "lex: Renaissance architecture characteristics features\nlex: Renaissance architecture columns dome symmetry\nvec: what are the defining characteristics of Renaissance architecture in Europe\nvec: how did Renaissance architects use symmetry, columns, and domes in their buildings\nhyde: Renaissance architecture, flourishing in 15th-16th century Italy, revived classical Greek and Roman forms. Key features include symmetrical facades, round arches, columns with Corinthian capitals, hemispherical domes (as in Brunelleschi's Florence Cathedral), and harmonious proportions based on geometry."}
+{"input": "what are color modes in photography?", "output": "lex: color modes photography RGB CMYK sRGB\nlex: photography color space Adobe RGB ProPhoto\nvec: what are the different color modes and color spaces used in digital photography\nvec: how do RGB, sRGB, Adobe RGB, and CMYK color modes affect photo editing and printing\nhyde: Digital photographs use RGB color mode for screens, with sRGB as the standard web color space and Adobe RGB offering a wider gamut for print work. CMYK is used for commercial printing. ProPhoto RGB captures the widest range but requires careful color management to avoid banding."}
+{"input": "how to create a zen garden?", "output": "lex: zen garden create Japanese rock garden\nlex: zen garden design sand gravel stones\nvec: how do you design and create a traditional Japanese zen rock garden\nvec: what materials and layout principles are used in building a zen garden\nhyde: A zen garden (karesansui) uses raked white gravel or sand to represent water, with carefully placed rocks symbolizing mountains or islands. Rake parallel lines for calm or concentric circles around rocks. Keep the design minimal—moss, a few stones, and clean gravel on a flat rectangular area."}
+{"input": "mountain peak", "output": "lex: mountain peak climbing summit elevation\nlex: highest mountain peaks world list\nlex: mountain peak hiking trails\nvec: what are the highest mountain peaks in the world and their elevations\nvec: how to plan a hike or climb to a mountain peak summit\nhyde: Mount Everest stands at 8,849 meters (29,032 ft), the highest peak on Earth. K2 at 8,611 m and Kangchenjunga at 8,586 m follow. For trekkers, peaks like Mont Blanc (4,808 m) and Mount Kilimanjaro (5,895 m) are accessible without technical climbing experience."}
+{"input": "how to follow campaign finance laws", "output": "lex: campaign finance laws compliance regulations\nlex: campaign finance rules FEC political donations\nvec: how do political candidates and organizations comply with campaign finance laws\nvec: what are the key campaign finance regulations and reporting requirements in the U.S.\nhyde: Campaign finance laws require candidates to register with the FEC, disclose all contributions and expenditures, and adhere to contribution limits. Individual donors can give up to $3,300 per candidate per election. PACs and Super PACs have separate rules. File quarterly reports electronically."}
+{"input": "how to advocate for education reform", "output": "lex: education reform advocacy strategies\nlex: advocate education policy change\nvec: how can individuals effectively advocate for education reform in their community\nvec: what strategies work for pushing education policy changes at the local and state level\nhyde: Start by attending school board meetings and building relationships with elected officials. Join or form coalitions with parent groups, teachers' unions, and nonprofits. Write op-eds, organize town halls, and use data on student outcomes to make evidence-based arguments for specific policy changes."}
+{"input": "how do philosophical arguments work", "output": "lex: philosophical arguments logic premises conclusion\nlex: philosophical reasoning deductive inductive\nvec: how are philosophical arguments structured with premises and conclusions\nvec: what makes a philosophical argument valid or sound in logic\nhyde: A philosophical argument consists of premises (claims assumed to be true) and a conclusion that follows from them. In a deductive argument, if the premises are true and the form is valid, the conclusion must be true. An argument is sound when it is both valid and its premises are actually true."}
+{"input": "fix roof", "output": "lex: roof repair fix leak shingles\nlex: roof damage repair DIY contractor\nlex: fix roof leak flashing\nvec: how to repair a damaged or leaking roof at home\nvec: when should you DIY a roof fix versus hiring a professional roofer\nhyde: For minor roof leaks, locate the source from the attic during rain. Replace cracked or missing shingles by lifting surrounding shingles, removing nails, and sliding in a new one. Apply roofing cement under flashing for small gaps. For structural damage or large areas, hire a licensed roofer."}
+{"input": "how to implement csr initiatives", "output": "lex: CSR initiatives corporate social responsibility implementation\nlex: corporate social responsibility programs strategy\nvec: how do companies implement corporate social responsibility initiatives effectively\nvec: what steps should a business take to launch a CSR program\nhyde: Start by conducting a materiality assessment to identify social and environmental issues relevant to your business and stakeholders. Set measurable goals aligned with the UN Sustainable Development Goals. Allocate budget, assign a dedicated CSR team, and report progress annually using GRI standards."}
+{"input": "how to meditate for beginners", "output": "lex: meditation beginners guide mindfulness\nlex: beginner meditation techniques breathing\nvec: how do beginners start a daily meditation practice from scratch\nvec: what are simple meditation techniques for people who have never meditated before\nhyde: Sit comfortably with your back straight. Close your eyes and focus on your breath—notice each inhale and exhale. When thoughts arise, gently return attention to your breathing without judgment. Start with 5 minutes daily and gradually increase. Consistency matters more than duration."}
+{"input": "how to boost immune system naturally", "output": "lex: boost immune system natural remedies\nlex: strengthen immune system diet exercise sleep\nvec: what natural methods help strengthen the immune system\nvec: which foods, supplements, and lifestyle habits boost immune function naturally\nhyde: Eat a diet rich in fruits, vegetables, and lean protein to supply vitamins C, D, and zinc. Exercise moderately for 30 minutes most days. Sleep 7-9 hours per night. Manage stress through meditation or yoga. Fermented foods like yogurt and kimchi support gut health, which is linked to immune function."}
+{"input": "how to bake a cake from scratch", "output": "lex: bake cake from scratch recipe\nlex: homemade cake recipe flour butter eggs\nvec: how do you bake a basic cake from scratch without a box mix\nvec: what is a simple recipe for baking a homemade vanilla or chocolate cake\nhyde: Preheat oven to 350°F (175°C). Mix 2 cups flour, 1.5 cups sugar, 3 eggs, 1 cup butter, 1 cup milk, 2 tsp baking powder, 1 tsp vanilla. Pour into greased 9-inch pans and bake 30-35 minutes until a toothpick comes out clean. Cool before frosting."}
+{"input": "what are the main festivals in hinduism", "output": "lex: Hindu festivals Diwali Holi Navratri\nlex: Hinduism religious festivals celebrations\nvec: what are the major festivals celebrated in Hinduism and their significance\nvec: which Hindu festivals are the most widely observed and what do they celebrate\nhyde: Diwali, the festival of lights, celebrates the triumph of light over darkness and honors Lakshmi. Holi marks the arrival of spring with colored powders. Navratri is a nine-night festival honoring the goddess Durga. Ganesh Chaturthi celebrates the birth of Lord Ganesha with elaborate processions."}
+{"input": "how to replace car air filter?", "output": "lex: replace car air filter engine cabin\nlex: car air filter replacement DIY steps\nvec: how do you replace the engine air filter in a car yourself\nvec: what are the steps to change a car's air filter at home without a mechanic\nhyde: Open the hood and locate the air filter housing—usually a black plastic box near the engine. Unclip the latches, remove the old filter, and note its orientation. Insert the new filter with the rubber rim facing up, close the housing, and secure the clips. Replace every 12,000-15,000 miles."}
+{"input": "digital transformation strategies", "output": "lex: digital transformation strategy enterprise\nlex: digital transformation cloud automation AI\nvec: what strategies do organizations use to drive successful digital transformation\nvec: how do enterprises plan and execute a digital transformation initiative\nhyde: A digital transformation strategy begins with assessing current processes and identifying bottlenecks. Prioritize quick wins like automating manual workflows. Migrate infrastructure to cloud platforms, adopt data analytics for decision-making, and invest in employee training. Measure ROI with KPIs tied to business outcomes."}
+{"input": "how to argument for climate action", "output": "lex: argue climate action policy advocacy\nlex: climate change argument evidence persuasion\nvec: how can you make a compelling argument for urgent climate action\nvec: what evidence and reasoning support the case for strong climate change policies\nhyde: The scientific consensus is clear: global temperatures have risen 1.1°C since pre-industrial levels, causing more extreme weather, rising seas, and ecosystem collapse. Economic analyses show that the cost of inaction—estimated at $23 trillion by 2050—far exceeds the investment needed for a clean energy transition."}
+{"input": "how does human activity affect climate change", "output": "lex: human activity climate change greenhouse gas emissions\nlex: anthropogenic climate change fossil fuels deforestation\nvec: how do human activities like burning fossil fuels contribute to climate change\nvec: what is the scientific evidence linking human activity to global warming\nhyde: Human activities—primarily burning fossil fuels for energy, deforestation, and industrial agriculture—release greenhouse gases like CO2 and methane into the atmosphere. Since 1850, atmospheric CO2 has risen from 280 to over 420 ppm, trapping heat and raising global average temperatures by 1.1°C."}
+{"input": "how to create a wildlife-friendly garden?", "output": "lex: wildlife-friendly garden habitat plants\nlex: garden attract birds bees butterflies\nvec: how can I design a garden that attracts and supports local wildlife\nvec: what plants and features make a garden friendly to birds, bees, and butterflies\nhyde: Plant native flowering species to attract pollinators—coneflower, milkweed, and lavender support bees and butterflies. Add a shallow water dish, leave leaf litter for insects, install nest boxes for birds, and avoid pesticides. A log pile provides habitat for beetles, frogs, and hedgehogs."}
+{"input": "how to prepare for a long hike", "output": "lex: long hike preparation gear checklist\nlex: hiking preparation training nutrition hydration\nvec: how should I prepare physically and logistically for a long day hike or multi-day trek\nvec: what gear, training, and planning is needed before a long hiking trip\nhyde: Train by walking with a loaded pack for progressively longer distances over 4-6 weeks. Pack the ten essentials: navigation, sun protection, insulation, illumination, first aid, fire, tools, nutrition, hydration, and shelter. Check the weather forecast and file a trip plan with someone you trust."}
+{"input": "how to use photoshop for digital painting?", "output": "lex: Photoshop digital painting brushes techniques\nlex: digital painting Photoshop tutorial layers\nvec: how do you use Adobe Photoshop for digital painting and illustration\nvec: what Photoshop tools, brushes, and techniques are essential for digital painting\nhyde: In Photoshop, start a digital painting by creating a new canvas at 300 DPI. Use the Brush tool (B) with pressure sensitivity enabled on a graphics tablet. Block in shapes on separate layers, then refine details. Use layer blend modes like Multiply for shadows and Screen for highlights."}
+{"input": "what changed in kubernetes latest version", "output": "lex: Kubernetes latest version changes release notes 2025 2026\nlex: Kubernetes new features changelog update\nvec: what are the notable changes and new features in the latest Kubernetes release\nvec: what major features were added or deprecated in the most recent Kubernetes version in 2025 or 2026\nhyde: Kubernetes v1.32 introduced improvements to sidecar containers (now GA), enhanced pod scheduling with dynamic resource allocation, graduated the Gateway API to stable, and deprecated legacy in-tree cloud provider integrations in favor of external cloud controller managers."}
+{"input": "what is e-commerce?", "output": "lex: e-commerce electronic commerce online shopping\nlex: e-commerce platform business model\nvec: what is e-commerce and how do online businesses sell products and services\nvec: how does electronic commerce work from storefront to payment processing\nhyde: E-commerce (electronic commerce) is the buying and selling of goods or services over the internet. Business models include B2C (Amazon, Shopify stores), B2B (Alibaba), C2C (eBay, Etsy), and D2C (brands selling directly). Transactions are processed through payment gateways like Stripe or PayPal."}
+{"input": "what is meant by 'the good life' in philosophy", "output": "lex: the good life philosophy eudaimonia ethics\nlex: philosophical good life Aristotle virtue happiness\nvec: what does the concept of the good life mean in philosophy and ethics\nvec: how did Aristotle and other philosophers define what it means to live a good life\nhyde: In Aristotelian ethics, the good life (eudaimonia) is achieved through the practice of virtue and the exercise of reason over a complete lifetime. It is not mere pleasure but a state of flourishing—living in accordance with one's highest capacities within a community."}
+{"input": "how to obtain information on federal legislation", "output": "lex: federal legislation tracking Congress bills\nlex: federal law lookup Congress.gov bill status\nvec: how can I find information about federal legislation and bills in the U.S. Congress\nvec: what resources are available to track federal bills and laws through the legislative process\nhyde: Congress.gov is the official source for federal legislation. Search by bill number, keyword, or sponsor. Each bill page shows full text, status, cosponsors, committee actions, and vote records. GovTrack.us and ProPublica's Congress API provide additional analysis and tracking tools."}
+{"input": "what are the elements of classical music?", "output": "lex: classical music elements melody harmony rhythm\nlex: classical music composition structure form\nvec: what are the fundamental elements and structures of classical music\nvec: how do melody, harmony, rhythm, and form work together in classical music compositions\nhyde: Classical music is built on melody (a sequence of notes forming a theme), harmony (chords supporting the melody), rhythm (the timing and pattern of notes), dynamics (volume changes), and form (the structure, such as sonata, rondo, or theme and variations)."}
+{"input": "what are celtic traditions and customs", "output": "lex: Celtic traditions customs festivals Ireland Scotland\nlex: Celtic culture Samhain Beltane druids\nvec: what are the traditional customs and cultural practices of the Celtic peoples\nvec: which Celtic traditions like Samhain and Beltane are still observed today\nhyde: Celtic traditions include seasonal festivals marking the agricultural calendar: Samhain (Oct 31) honored the dead and the start of winter, Imbolc (Feb 1) marked spring's return, Beltane (May 1) celebrated fertility with bonfires, and Lughnasadh (Aug 1) was the harvest festival. Many survive in Irish and Scottish culture today."}
+{"input": "hash code", "output": "lex: hash code function programming\nlex: hashCode Java hash table implementation\nlex: cryptographic hash function SHA MD5\nvec: what is a hash code and how are hash functions used in programming\nvec: how does the hashCode method work in Java for hash tables and collections\nhyde: A hash code is an integer value computed from an object's data, used to quickly locate it in a hash table. In Java, every object has a hashCode() method. For HashMap, objects with equal hashCodes go to the same bucket, and equals() resolves collisions. Override both hashCode() and equals() together."}
+{"input": "what is artificial intelligence", "output": "lex: artificial intelligence AI machine learning\nlex: artificial intelligence definition applications\nvec: what is artificial intelligence and how does it work at a fundamental level\nvec: what are the main types and applications of artificial intelligence technology\nhyde: Artificial intelligence (AI) is the simulation of human intelligence by computer systems. It encompasses machine learning (learning from data), natural language processing (understanding language), and computer vision (interpreting images). AI systems are trained on large datasets to recognize patterns and make predictions."}
+{"input": "what is interfaith dialogue?", "output": "lex: interfaith dialogue religious traditions\nlex: interfaith dialogue ecumenism interreligious\nvec: what is interfaith dialogue and why is it important for religious communities\nvec: how do different religious groups engage in interfaith dialogue to promote understanding\nhyde: Interfaith dialogue is the cooperative interaction between people of different religious traditions, aimed at mutual understanding rather than conversion. Organizations like the Parliament of the World's Religions bring together leaders from Christianity, Islam, Judaism, Hinduism, Buddhism, and others to discuss shared values and address social issues."}
+{"input": "what is darwin's theory of evolution", "output": "lex: Darwin theory evolution natural selection\nlex: Darwin Origin of Species evolution\nvec: what is Charles Darwin's theory of evolution by natural selection\nvec: how did Darwin explain the origin of species through natural selection and adaptation\nhyde: In On the Origin of Species (1859), Charles Darwin proposed that species evolve over generations through natural selection. Organisms with traits better suited to their environment survive and reproduce more, passing those advantageous traits to offspring. Over time, this leads to new species."}
+{"input": "what is permaculture gardening?", "output": "lex: permaculture gardening design principles\nlex: permaculture garden sustainable agriculture\nvec: what is permaculture gardening and how does it apply ecological design principles\nvec: how do you design a permaculture garden that mimics natural ecosystems\nhyde: Permaculture gardening applies ecological design principles to create self-sustaining food systems. It uses zones radiating from the home, guilds of companion plants, water harvesting with swales, and polyculture instead of monoculture. The goal is a garden that produces food with minimal external inputs."}
+{"input": "how to practice gratitude", "output": "lex: gratitude practice daily journal techniques\nlex: practicing gratitude mental health benefits\nvec: what are effective ways to practice gratitude in everyday life\nvec: how does a daily gratitude practice improve mental health and well-being\nhyde: Keep a gratitude journal and write three specific things you're grateful for each night—not vague statements, but concrete moments. Write a gratitude letter to someone who impacted you. During meals, pause to appreciate the food. Research shows consistent gratitude practice reduces anxiety and improves sleep."}
+{"input": "what are digital credentials?", "output": "lex: digital credentials badges certificates verification\nlex: digital credentials blockchain verifiable\nvec: what are digital credentials and how are they used to verify qualifications\nvec: how do digital badges and verifiable credentials work for education and employment\nhyde: Digital credentials are electronic records that verify a person's qualifications, skills, or achievements. They include digital badges, certificates, and micro-credentials issued by platforms like Credly or Accredible. Verifiable credentials use cryptographic signatures so employers can instantly confirm authenticity without contacting the issuer."}
+{"input": "how does culture influence ethics", "output": "lex: culture ethics moral values influence\nlex: cultural relativism ethics cross-cultural morality\nvec: how does culture shape people's ethical beliefs and moral values\nvec: what is the relationship between cultural norms and ethical decision-making\nhyde: Culture shapes ethics by defining what a society considers right or wrong. Collectivist cultures may prioritize group harmony and duty to family, while individualist cultures emphasize personal autonomy and rights. Cultural relativism argues that moral standards are culturally defined, while universalists hold that some ethical principles transcend culture."}
+{"input": "what is stream of consciousness?", "output": "lex: stream of consciousness writing technique\nlex: stream of consciousness Joyce Woolf literature\nvec: what is the stream of consciousness technique in literature and who pioneered it\nvec: how do authors use stream of consciousness to portray inner thoughts in fiction\nhyde: Stream of consciousness is a literary method that captures the continuous flow of a character's thoughts, memories, and perceptions without conventional structure. James Joyce's Ulysses and Virginia Woolf's Mrs Dalloway are landmark examples, using free-flowing prose, associative leaps, and minimal punctuation."}
+{"input": "how do body systems work together", "output": "lex: body systems interaction physiology\nlex: human body organ systems coordination\nvec: how do the different organ systems in the human body work together to maintain health\nvec: what are examples of body systems interacting with each other in human physiology\nhyde: The circulatory system delivers oxygen absorbed by the respiratory system to muscles controlled by the nervous system. The digestive system breaks down nutrients that the circulatory system distributes. The endocrine system releases hormones that regulate metabolism, growth, and the immune response."}
+{"input": "what are the principles of sustainable development", "output": "lex: sustainable development principles environmental social economic\nlex: sustainable development goals UN SDGs\nvec: what are the core principles of sustainable development and why do they matter\nvec: how do the three pillars of sustainable development balance environmental, social, and economic needs\nhyde: Sustainable development meets present needs without compromising future generations' ability to meet theirs (Brundtland Report, 1987). Its three pillars are environmental protection, social equity, and economic viability. The UN's 17 Sustainable Development Goals (SDGs) provide a framework for global action through 2030."}
+{"input": "how to evaluate startup ideas", "output": "lex: evaluate startup ideas validation framework\nlex: startup idea assessment market viability\nvec: how do entrepreneurs evaluate whether a startup idea is worth pursuing\nvec: what frameworks and criteria help assess the viability of a new startup idea\nhyde: Evaluate a startup idea on four dimensions: problem severity (is this a hair-on-fire problem?), market size (TAM > $1B?), competitive landscape (what's the unfair advantage?), and founder-market fit (do you have unique insight?). Validate by talking to 50+ potential customers before writing any code."}
+{"input": "how to write a business plan", "output": "lex: business plan writing template sections\nlex: business plan executive summary financial projections\nvec: how do you write a comprehensive business plan for a new company\nvec: what sections and information should be included in a startup business plan\nhyde: A business plan includes: executive summary, company description, market analysis, organization structure, product/service line, marketing strategy, funding request, and financial projections. Start with a clear problem statement and your unique solution. Include 3-year revenue forecasts with assumptions clearly stated."}
+{"input": "what are greenhouse gases?", "output": "lex: greenhouse gases CO2 methane atmosphere\nlex: greenhouse gas effect global warming climate\nvec: what are greenhouse gases and how do they contribute to global warming\nvec: which gases trap heat in Earth's atmosphere and cause the greenhouse effect\nhyde: Greenhouse gases—including carbon dioxide (CO2), methane (CH4), nitrous oxide (N2O), and fluorinated gases—trap infrared radiation in the atmosphere, warming the planet. CO2 is the most abundant from fossil fuel combustion. Methane, though shorter-lived, is 80 times more potent over 20 years."}
+{"input": "how do religions interpret the concept of sacredness?", "output": "lex: sacredness religion sacred concept interpretation\nlex: sacred space rituals holy religious traditions\nvec: how do different world religions define and interpret the concept of sacredness\nvec: what does sacredness mean across Christianity, Islam, Hinduism, Buddhism, and indigenous traditions\nhyde: In Christianity, sacredness is conferred by God's presence—churches, sacraments, and scripture are holy. In Hinduism, sacred rivers like the Ganges and temples house divine energy. Indigenous traditions see sacredness in natural features—mountains, groves, and animals. Islam treats the Quran and Mecca as inviolably sacred."}
+{"input": "when to introduce solid foods to a baby?", "output": "lex: introduce solid foods baby age months\nlex: baby first foods solids weaning schedule\nvec: at what age should you start introducing solid foods to a baby\nvec: what are the signs a baby is ready for solid foods and what foods to start with\nhyde: Most pediatricians recommend introducing solid foods around 6 months of age. Signs of readiness include sitting up with support, showing interest in food, and loss of the tongue-thrust reflex. Start with single-ingredient purees like sweet potato, avocado, or iron-fortified cereal, one new food every 3-5 days."}
+{"input": "renaissance literature", "output": "lex: Renaissance literature authors works\nlex: Renaissance literary period Shakespeare Petrarch humanism\nvec: what are the major works and characteristics of Renaissance literature\nvec: how did Renaissance humanism influence literature in Europe during the 14th-17th centuries\nhyde: Renaissance literature (14th-17th century) was shaped by humanism's emphasis on individual experience and classical learning. Key figures include Petrarch (sonnets), Boccaccio (Decameron), Shakespeare (plays and sonnets), Cervantes (Don Quixote), and Machiavelli (The Prince). Vernacular languages replaced Latin as the literary standard."}
+{"input": "how digital twins transform industries", "output": "lex: digital twins industry transformation simulation\nlex: digital twin technology manufacturing IoT\nvec: how are digital twins being used to transform industries like manufacturing and healthcare\nvec: what is digital twin technology and how does it improve operational efficiency in industry\nhyde: A digital twin is a virtual replica of a physical asset, process, or system, updated in real time with IoT sensor data. In manufacturing, digital twins simulate production lines to predict failures. In healthcare, patient-specific organ models guide surgical planning. Energy companies use them to optimize wind turbine performance."}
+{"input": "resilience training programs", "output": "lex: resilience training programs mental toughness\nlex: resilience building workplace employee training\nvec: what are resilience training programs and how do they build mental toughness\nvec: how do organizations implement resilience training for employees and teams\nhyde: Resilience training programs teach participants to manage stress, adapt to adversity, and recover from setbacks. Common frameworks include cognitive behavioral techniques, mindfulness practices, and strengths-based coaching. The U.S. Army's Master Resilience Training and Penn Resilience Program are widely studied evidence-based models."}
+{"input": "how to jump-start a car?", "output": "lex: jump-start car battery jumper cables\nlex: jump start dead car battery steps\nvec: what is the correct procedure to jump-start a car with a dead battery?\nvec: how do you connect jumper cables between two cars to restart a dead battery?\nhyde: To jump-start a car, connect the red clamp to the dead battery positive terminal, then to the donor battery positive. Connect black to donor negative, then to unpainted metal on the dead car. Start the donor car, wait 2 minutes, then start the dead car."}
+{"input": "google maps", "output": "lex: google maps directions navigation\nlex: google maps route planner\nlex: google maps API embed\nvec: how to use Google Maps for turn-by-turn driving directions\nvec: what features does Google Maps offer for route planning and navigation?\nhyde: Open Google Maps on your phone or browser, type your destination in the search bar, and tap \"Directions.\" Choose driving, transit, walking, or cycling. The app will show estimated travel time and alternative routes."}
+{"input": "sail smooth", "output": "lex: smooth sailing techniques\nlex: sailboat trim wind conditions\nlex: reduce boat heeling pitching\nvec: how do you achieve smooth sailing on a sailboat in varying wind conditions?\nvec: what techniques help reduce choppy motion and maintain a comfortable ride while sailing?\nhyde: To sail smoothly, keep the boat balanced by adjusting the mainsheet and jib trim. Ease the sails slightly in gusts to reduce heeling, and steer at an angle that minimizes pitching through waves."}
+{"input": "how to create a value proposition", "output": "lex: value proposition canvas template\nlex: unique value proposition statement\nlex: customer value proposition examples\nvec: how do you write a compelling value proposition for a product or service?\nvec: what framework helps define a unique value proposition that resonates with target customers?\nhyde: A strong value proposition clearly states what your product does, who it's for, and why it's better than alternatives. Use this formula: We help [target customer] achieve [desired outcome] by [unique approach], unlike [competitors] who [limitation]."}
+{"input": "where to buy used cars online", "output": "lex: buy used cars online marketplace\nlex: certified pre-owned cars website\nlex: online used car dealers Carvana AutoTrader\nvec: what are the best websites for buying used cars online with delivery?\nvec: which online platforms sell certified pre-owned vehicles with warranties?\nhyde: Popular online used car marketplaces include Carvana, CarMax, AutoTrader, and Cars.com. Carvana offers home delivery and a 7-day return policy. CarMax provides no-haggle pricing and certified inspections on all vehicles."}
+{"input": "what are the main practices in zoroastrianism?", "output": "lex: zoroastrianism practices rituals worship\nlex: zoroastrian fire temple prayer\nlex: zoroastrian navjote purity rituals\nvec: what are the core religious practices and rituals observed in Zoroastrianism?\nvec: how do Zoroastrians worship and what daily rituals do they follow?\nhyde: Zoroastrians pray five times daily (the five Gahs) facing a source of light. The sacred fire is maintained in fire temples as a symbol of Ahura Mazda's truth. Key rituals include the Navjote initiation ceremony, wearing the sudreh and kusti, and maintaining ritual purity."}
+{"input": "how to increase daily physical activity", "output": "lex: increase daily physical activity steps\nlex: exercise habits sedentary lifestyle\nlex: walking more daily movement tips\nvec: what are practical ways to add more physical activity to a sedentary daily routine?\nvec: how can someone gradually increase their daily step count and movement throughout the day?\nhyde: Take the stairs instead of the elevator, park farther from entrances, and set a timer to stand and walk every 30 minutes. Aim for 10,000 steps daily by adding short walks after meals. Even 5-minute movement breaks reduce the health risks of prolonged sitting."}
+{"input": "how does bioethics address cloning", "output": "lex: bioethics cloning human reproductive therapeutic\nlex: ethical issues cloning debate\nlex: cloning moral arguments bioethics\nvec: what ethical arguments do bioethicists raise for and against human cloning?\nvec: how does the field of bioethics evaluate therapeutic versus reproductive cloning?\nhyde: Bioethicists distinguish between reproductive cloning, which aims to create a new human being, and therapeutic cloning, which produces embryonic stem cells for medical research. Most bioethicists oppose reproductive cloning due to safety risks, concerns about human dignity, and the commodification of life."}
+{"input": "what is genetic engineering", "output": "lex: genetic engineering DNA modification\nlex: gene editing CRISPR recombinant DNA\nlex: genetically modified organisms GMO\nvec: what is genetic engineering and how does it work to modify an organism's DNA?\nvec: what are the main techniques used in genetic engineering such as CRISPR and recombinant DNA?\nhyde: Genetic engineering is the direct manipulation of an organism's DNA using biotechnology. Scientists can insert, delete, or modify genes to alter traits. Key techniques include recombinant DNA technology, which combines DNA from different sources, and CRISPR-Cas9, which allows precise editing at specific locations in the genome."}
+{"input": "how to test drive a car?", "output": "lex: test drive car checklist\nlex: car test drive tips what to check\nlex: dealership test drive questions\nvec: what should you look for and evaluate during a car test drive?\nvec: how do you properly test drive a vehicle before buying it?\nhyde: During a test drive, check acceleration, braking response, and steering feel. Drive on highways, local roads, and over bumps. Listen for unusual noises. Test the infotainment system, climate control, and visibility from all mirrors. Make sure the seats are comfortable and adjust to your driving position."}
+{"input": "how do philosophers approach death", "output": "lex: philosophy of death mortality\nlex: existentialism death Heidegger Epicurus\nlex: philosophical views afterlife mortality\nvec: how have major philosophers throughout history approached the concept of death and mortality?\nvec: what do existentialist and ancient philosophers say about the meaning of death?\nhyde: Epicurus argued that death is nothing to fear because when death exists, we do not. Heidegger saw death as central to authentic existence, calling it \"Being-toward-death.\" The Stoics taught that meditating on mortality (memento mori) leads to a more purposeful life."}
+{"input": "what is the capital of japan", "output": "lex: capital Japan Tokyo\nlex: Tokyo capital city Japan\nvec: what city is the capital of Japan?\nvec: when did Tokyo become the capital of Japan?\nhyde: Tokyo is the capital city of Japan. It became the capital in 1868 when Emperor Meiji moved the imperial seat from Kyoto. Tokyo, located on the eastern coast of Honshu, is the most populous metropolitan area in the world with over 37 million residents."}
+{"input": "what is the significance of the afterlife in different faiths?", "output": "lex: afterlife beliefs religions Christianity Islam Buddhism\nlex: heaven hell reincarnation afterlife\nlex: religious views life after death\nvec: how do different world religions view the afterlife and what happens after death?\nvec: what role does belief in the afterlife play in Christianity, Islam, Hinduism, and Buddhism?\nhyde: In Christianity, the afterlife involves heaven or hell based on faith and deeds. Islam teaches judgment day followed by paradise (Jannah) or hellfire. Hinduism and Buddhism believe in reincarnation, where the soul is reborn based on karma until achieving moksha or nirvana."}
+{"input": "what is 3d printing and how does it work", "output": "lex: 3D printing additive manufacturing process\nlex: FDM SLA 3D printer filament resin\nlex: 3D printing layer by layer CAD model\nvec: how does 3D printing work to create objects layer by layer from a digital model?\nvec: what are the main types of 3D printing technologies such as FDM and SLA?\nhyde: 3D printing, or additive manufacturing, builds objects layer by layer from a digital CAD file. The most common method, FDM (Fused Deposition Modeling), melts plastic filament and extrudes it through a nozzle. SLA (Stereolithography) uses a UV laser to cure liquid resin into solid layers."}
+{"input": "how do i contact my congressperson", "output": "lex: contact congressperson phone email address\nlex: find elected representative congress\nlex: write letter senator representative\nvec: how can I find and contact my U.S. congressional representative or senator?\nvec: what is the best way to reach out to my congressperson about an issue?\nhyde: Visit house.gov and enter your zip code to find your U.S. Representative. For senators, go to senate.gov. You can call their D.C. or district office, send an email through their website contact form, or mail a letter. Calling the Capitol switchboard at (202) 224-3121 connects you to any member's office."}
+{"input": "what is stream of consciousness writing?", "output": "lex: stream of consciousness writing technique\nlex: stream of consciousness literature Joyce Woolf\nlex: interior monologue narrative style\nvec: what is stream of consciousness as a literary writing technique?\nvec: how did authors like James Joyce and Virginia Woolf use stream of consciousness in their novels?\nhyde: Stream of consciousness is a narrative technique that presents a character's continuous flow of thoughts, feelings, and associations without conventional structure. James Joyce's \"Ulysses\" and Virginia Woolf's \"Mrs Dalloway\" are landmark examples, using long unpunctuated passages to mimic the way the mind actually works."}
+{"input": "how to use a ring light", "output": "lex: ring light setup photography video\nlex: ring light placement distance camera\nlex: ring light selfie video lighting\nvec: how do you set up and position a ring light for video recording or photography?\nvec: what are the best settings and distance for using a ring light for selfies and video calls?\nhyde: Place the ring light directly in front of your face at eye level, with the camera positioned in the center of the ring. Keep the light 12-24 inches from your face for an even, shadow-free glow. Adjust brightness to avoid overexposure. The circular catchlights in the eyes are a signature look."}
+{"input": "how to engage in civic duties", "output": "lex: civic duties voting jury duty community\nlex: civic engagement participation democracy\nlex: citizen responsibilities voting volunteering\nvec: what are the main civic duties citizens should participate in beyond voting?\nvec: how can someone actively engage in civic responsibilities in their local community?\nhyde: Civic duties include voting in elections, serving on a jury when called, staying informed about local issues, attending town hall meetings, volunteering for community organizations, and contacting elected officials about policy concerns. Voting in local elections has the most direct impact on your daily life."}
+{"input": "spain life", "output": "lex: living in Spain expat lifestyle\nlex: Spain cost of living culture daily life\nlex: move to Spain quality of life\nvec: what is daily life like for someone living in Spain as an expat or resident?\nvec: what is the cost of living and quality of life in Spain compared to other European countries?\nhyde: Life in Spain revolves around a later schedule than most of Europe. Lunch is the main meal, typically eaten between 2-3 PM, and dinner is served after 9 PM. The cost of living is lower than in northern Europe, with affordable housing outside Madrid and Barcelona. The climate, healthcare system, and social culture attract many expats."}
+{"input": "ai-driven analytics", "output": "lex: AI-driven analytics machine learning data\nlex: artificial intelligence business analytics platform\nlex: AI predictive analytics tools\nvec: how are AI and machine learning used to power data analytics and business intelligence?\nvec: what AI-driven analytics platforms help businesses make data-driven predictions?\nhyde: AI-driven analytics uses machine learning algorithms to automatically detect patterns, anomalies, and trends in large datasets. Unlike traditional BI tools, AI analytics can generate predictive forecasts, perform natural language queries, and surface insights without manual configuration."}
+{"input": "where to buy vintage home accessories", "output": "lex: vintage home accessories shop online\nlex: retro home decor antique store\nlex: vintage furniture accessories Etsy eBay\nvec: where can I buy vintage and antique home decor accessories online?\nvec: what are the best stores and websites for finding retro and vintage home furnishings?\nhyde: Shop vintage home accessories on Etsy, Chairish, and 1stDibs for curated antique finds. Local estate sales and flea markets often have unique pieces at lower prices. Ruby Lane specializes in antiques, while eBay offers a wide selection of retro decor from various eras."}
+{"input": "how to join a political party", "output": "lex: join political party registration\nlex: register Democrat Republican party membership\nlex: political party membership sign up\nvec: how do you officially join or register with a political party in the United States?\nvec: what is the process for becoming a member of a political party?\nhyde: To join a political party in the U.S., register with your state's election office by selecting a party affiliation on your voter registration form. You can register online, by mail, or at your local DMV. Some states allow you to change party affiliation at any time, while others have deadlines before primary elections."}
+{"input": "how to quit smoking?", "output": "lex: quit smoking methods nicotine\nlex: stop smoking cessation plan\nlex: nicotine replacement therapy patches gum\nvec: what are the most effective methods and strategies to quit smoking permanently?\nvec: how do nicotine replacement therapies and medications help people stop smoking?\nhyde: The most effective approach combines nicotine replacement therapy (patches, gum, or lozenges) with behavioral support. Prescription medications like varenicline (Chantix) and bupropion can double quit rates. Set a quit date, identify triggers, and call 1-800-QUIT-NOW for free coaching."}
+{"input": "what is phenomenological existentialism", "output": "lex: phenomenological existentialism Heidegger Sartre\nlex: phenomenology existentialism lived experience\nlex: existential phenomenology philosophy\nvec: what is phenomenological existentialism and how does it differ from other branches of existentialism?\nvec: how did Heidegger and Sartre combine phenomenology with existentialist philosophy?\nhyde: Phenomenological existentialism applies Husserl's phenomenological method to existential questions about human existence. Heidegger's \"Being and Time\" analyzes Dasein (being-there) through the structures of lived experience. Sartre extended this in \"Being and Nothingness,\" arguing that consciousness is always directed toward objects and that existence precedes essence."}
+{"input": "how to install car seat covers?", "output": "lex: install car seat covers DIY\nlex: car seat cover fitting instructions\nlex: universal seat covers installation steps\nvec: what is the step-by-step process for installing car seat covers?\nvec: how do you fit universal car seat covers on front and rear seats?\nhyde: Pull the seat cover over the top of the headrest and stretch it down over the backrest. Tuck the excess fabric into the gap between the seat and backrest. Hook the elastic straps underneath the seat and clip them together. For bucket seats, align the cover's seams with the seat contours before securing."}
+{"input": "what is the scientific process for drug development", "output": "lex: drug development process phases clinical trials\nlex: pharmaceutical drug approval FDA pipeline\nlex: preclinical clinical trial Phase 1 2 3\nvec: what are the stages of the scientific process for developing and approving a new pharmaceutical drug?\nvec: how does a drug go from laboratory discovery through clinical trials to FDA approval?\nhyde: Drug development follows a pipeline: discovery and preclinical testing (3-6 years), Phase I trials testing safety in small groups, Phase II trials evaluating efficacy, Phase III large-scale trials confirming effectiveness, and FDA review. The entire process typically takes 10-15 years and costs over $1 billion."}
+{"input": "what is climate change", "output": "lex: climate change global warming greenhouse gases\nlex: climate change causes effects CO2\nlex: global temperature rise fossil fuels\nvec: what is climate change and what are its primary causes and effects on the planet?\nvec: how do greenhouse gas emissions from fossil fuels contribute to global climate change?\nhyde: Climate change refers to long-term shifts in global temperatures and weather patterns. Since the Industrial Revolution, burning fossil fuels has released CO2 and other greenhouse gases that trap heat in the atmosphere, raising the average global temperature by about 1.1°C. This causes rising sea levels, extreme weather, and ecosystem disruption."}
+{"input": "how to sell a car privately?", "output": "lex: sell car privately steps title transfer\nlex: private car sale listing price\nlex: sell used car by owner paperwork\nvec: what are the steps to sell a car privately without a dealer?\nvec: what paperwork and documentation do you need to sell a car to a private buyer?\nhyde: To sell a car privately, first determine a fair price using Kelley Blue Book or Edmunds. Gather the title, maintenance records, and smog certificate. List the car on Craigslist, Facebook Marketplace, or AutoTrader. When meeting buyers, accept cashier's checks or cash. Sign the title over and file a release of liability with your DMV."}
+{"input": "how to analyze a political candidate's stance", "output": "lex: analyze political candidate stance positions\nlex: candidate policy positions voting record\nlex: compare political candidates issues\nvec: how do you research and analyze a political candidate's policy positions and voting record?\nvec: what tools and resources help voters compare political candidates on key issues?\nhyde: Review the candidate's official website for stated policy positions. Check their voting record on congress.gov or VoteSmart.org. Compare their stances on key issues using tools like ISideWith or BallotReady. Look for consistency between their statements and votes, and check campaign finance records on OpenSecrets."}
+{"input": "what is lean startup methodology", "output": "lex: lean startup methodology MVP\nlex: lean startup build measure learn\nlex: Eric Ries lean startup principles\nvec: what is the lean startup methodology and how does the build-measure-learn cycle work?\nvec: how does the lean startup approach use minimum viable products to validate business ideas?\nhyde: The lean startup methodology, developed by Eric Ries, emphasizes rapid iteration through the Build-Measure-Learn feedback loop. Start by building a Minimum Viable Product (MVP), measure how customers respond using actionable metrics, and learn whether to pivot or persevere. The goal is to reduce waste by validating assumptions before investing heavily."}
+{"input": "what is the renaissance", "output": "lex: Renaissance period history art culture\nlex: Renaissance 14th 15th 16th century Italy Europe\nlex: Renaissance art Leonardo Michelangelo humanism\nvec: what was the Renaissance period and what were its major cultural and artistic achievements?\nvec: how did the Renaissance transform European art, science, and intellectual thought?\nhyde: The Renaissance was a cultural movement spanning roughly the 14th to 17th centuries, originating in Florence, Italy. It marked a revival of classical Greek and Roman learning, emphasizing humanism, individualism, and secular inquiry. Major figures include Leonardo da Vinci, Michelangelo, and Galileo."}
+{"input": "faith respect", "output": "lex: interfaith respect tolerance\nlex: respecting different faiths religions\nlex: religious tolerance diversity beliefs\nvec: how can people show respect for different religious faiths and beliefs?\nvec: what does interfaith respect and dialogue look like in diverse communities?\nhyde: Respecting others' faith means listening without judgment, learning about different religious traditions, and recognizing that spiritual beliefs are deeply personal. Interfaith dialogue builds mutual understanding by focusing on shared values like compassion, justice, and community while honoring theological differences."}
+{"input": "where to find heirloom seed suppliers?", "output": "lex: heirloom seed suppliers catalog\nlex: buy heirloom seeds online non-GMO\nlex: heirloom vegetable seed company\nvec: where can I buy heirloom and non-GMO seeds from reputable suppliers?\nvec: what are the best heirloom seed companies that sell open-pollinated vegetable seeds?\nhyde: Top heirloom seed suppliers include Baker Creek Heirloom Seeds, Seed Savers Exchange, and Johnny's Selected Seeds. Baker Creek offers over 1,800 open-pollinated varieties with free shipping. Seed Savers Exchange is a nonprofit dedicated to preserving rare heirloom varieties through their seed bank and catalog."}
+{"input": "how do christians celebrate easter", "output": "lex: Christian Easter celebration traditions\nlex: Easter Sunday church service resurrection\nlex: Holy Week Good Friday Easter customs\nvec: how do Christians celebrate Easter and what are the main traditions of Holy Week?\nvec: what religious services and customs do Christians observe during the Easter season?\nhyde: Christians celebrate Easter as the resurrection of Jesus Christ on the third day after his crucifixion. Holy Week begins with Palm Sunday, followed by Maundy Thursday communion, Good Friday services, and Easter Sunday worship. Many churches hold sunrise services, and traditions include Easter egg hunts, lilies, and special meals."}
+{"input": "what are exchange-traded funds (etfs)", "output": "lex: exchange-traded funds ETFs investing\nlex: ETF index fund stock market\nlex: ETF vs mutual fund comparison\nvec: what are exchange-traded funds (ETFs) and how do they work as an investment?\nvec: how do ETFs differ from mutual funds and what are their advantages for investors?\nhyde: An exchange-traded fund (ETF) is a basket of securities that trades on a stock exchange like a single stock. ETFs typically track an index like the S&P 500 and offer diversification at a low expense ratio. Unlike mutual funds, ETFs can be bought and sold throughout the trading day at market price."}
+{"input": "how to enhance creativity?", "output": "lex: enhance creativity techniques exercises\nlex: boost creative thinking brainstorming\nlex: creativity habits daily practice\nvec: what are proven techniques and exercises to enhance creative thinking?\nvec: how can someone develop daily habits that boost creativity and generate new ideas?\nhyde: To enhance creativity, practice divergent thinking by generating many ideas without judgment. Keep a daily journal, expose yourself to new experiences, and set aside unstructured time for daydreaming. Research shows that walking, adequate sleep, and constraints can all stimulate creative problem-solving."}
+{"input": "what are the key features of taoist philosophy?", "output": "lex: Taoist philosophy Taoism key concepts\nlex: Tao Te Ching wu wei Taoism\nlex: Taoism yin yang natural harmony\nvec: what are the central concepts and key features of Taoist philosophy?\nvec: how does Taoism emphasize living in harmony with the Tao and the concept of wu wei?\nhyde: Taoism centers on the Tao (the Way), an ineffable force that underlies all existence. Key concepts include wu wei (non-action or effortless action), living in harmony with nature, and the balance of yin and yang. The Tao Te Ching by Laozi and the Zhuangzi are the foundational texts."}
+{"input": "how to effectively visualize scientific data", "output": "lex: scientific data visualization charts graphs\nlex: data visualization tools matplotlib Python\nlex: scientific figure plotting techniques\nvec: what are effective techniques for visualizing scientific data in charts and graphs?\nvec: which tools and software are best for creating publication-quality scientific data visualizations?\nhyde: Choose chart types that match your data: scatter plots for correlations, bar charts for comparisons, line plots for time series, and heatmaps for matrices. Use matplotlib or ggplot2 for publication figures. Minimize chart junk, label axes clearly, and use colorblind-friendly palettes like viridis."}
+{"input": "where to watch live nba games?", "output": "lex: watch live NBA games streaming\nlex: NBA League Pass live stream TV\nlex: NBA games broadcast ESPN TNT\nvec: where can I watch live NBA basketball games online or on TV?\nvec: what streaming services and TV channels broadcast live NBA games in 2025-2026?\nhyde: Live NBA games air on ESPN, TNT, and ABC during the regular season. NBA League Pass streams all out-of-market games. Streaming options include Sling TV, YouTube TV, and Hulu + Live TV for cable-free access. The NBA app offers free highlights and select live games on mobile."}
+{"input": "what was the impact of the industrial revolution on society?", "output": "lex: Industrial Revolution impact society economy\nlex: Industrial Revolution social changes urbanization\nlex: Industrial Revolution labor factories 18th 19th century\nvec: how did the Industrial Revolution transform society, economy, and daily life?\nvec: what were the major social and economic impacts of the Industrial Revolution on workers and cities?\nhyde: The Industrial Revolution (1760-1840) shifted economies from agrarian to industrial, triggering mass urbanization as workers moved to factory cities. It created a new working class, child labor, and pollution, but also raised living standards over time, enabled mass production, and spurred technological innovation in transportation and communication."}
+{"input": "wisdom gain", "output": "lex: gaining wisdom life experience\nlex: wisdom philosophy personal growth\nlex: how to become wiser decision making\nvec: how does a person gain wisdom through life experience and reflection?\nvec: what do philosophers and psychologists say about how wisdom is acquired?\nhyde: Wisdom is gained through a combination of diverse life experience, reflective thinking, and learning from mistakes. Psychologist Paul Baltes identified wisdom as expert knowledge about the fundamental pragmatics of life, including understanding uncertainty, managing emotions, and balancing competing interests."}
+{"input": "what is the role of local government", "output": "lex: local government role responsibilities\nlex: city county municipal government services\nlex: local government functions zoning schools police\nvec: what are the main roles and responsibilities of local government in a community?\nvec: how does local city and county government provide public services and manage community affairs?\nhyde: Local governments provide essential services including public schools, police and fire departments, road maintenance, water and sewer systems, zoning and land use planning, parks, and public transit. City councils and county boards set local taxes, pass ordinances, and approve budgets that directly affect residents' daily lives."}
+{"input": "what is metaphysical ethics", "output": "lex: metaphysical ethics philosophy morality\nlex: metaphysics ethics moral realism\nlex: metaethics ontology moral facts\nvec: what is metaphysical ethics and how does it relate to the nature of moral reality?\nvec: how does metaphysics inform ethical theory and questions about whether moral facts exist?\nhyde: Metaphysical ethics, closely related to metaethics, examines the ontological status of moral values. It asks whether moral facts exist independently of human minds (moral realism) or are human constructions (anti-realism). This branch investigates the metaphysical foundations that underlie ethical claims, such as whether \"goodness\" is a real property in the world."}
+{"input": "what is empiricism", "output": "lex: empiricism philosophy knowledge experience\nlex: empiricism Locke Hume sensory evidence\nlex: empiricism vs rationalism epistemology\nvec: what is empiricism in philosophy and how does it claim knowledge is acquired through experience?\nvec: how did philosophers like John Locke and David Hume develop the theory of empiricism?\nhyde: Empiricism is the philosophical theory that all knowledge is derived from sensory experience rather than innate ideas. John Locke argued the mind starts as a \"tabula rasa\" (blank slate), and David Hume extended this by arguing that even causal relationships are known only through observation and habit, not reason alone."}
+{"input": "what is epistemology", "output": "lex: epistemology philosophy knowledge\nlex: epistemology theory of knowledge justified belief\nlex: epistemology truth belief justification\nvec: what is epistemology and what questions does it address about knowledge and belief?\nvec: how does epistemology study the nature, sources, and limits of human knowledge?\nhyde: Epistemology is the branch of philosophy concerned with the nature, scope, and limits of knowledge. It examines questions like: What is knowledge? How is it different from mere belief? What counts as justification? The classic definition from Plato is that knowledge is justified true belief, though this was challenged by Gettier in 1963."}
+{"input": "what is the significance of community in spirituality?", "output": "lex: community spirituality religious fellowship\nlex: spiritual community congregation sangha\nlex: communal worship spiritual practice\nvec: why is community considered important in spiritual and religious practice?\nvec: how does belonging to a spiritual community enhance personal faith and practice?\nhyde: Spiritual communities provide shared worship, accountability, and mutual support that deepen individual faith. In Christianity, the church body gathers for fellowship; in Buddhism, the sangha is one of the Three Jewels; in Judaism, a minyan of ten is required for communal prayer. Communal practice reinforces commitment and provides belonging."}
+{"input": "what is the difference between memoir and autobiography?", "output": "lex: memoir vs autobiography difference\nlex: memoir autobiography literary genre\nlex: memoir personal narrative autobiography life story\nvec: what is the difference between a memoir and an autobiography as literary genres?\nvec: how does a memoir's scope and focus differ from a full autobiography?\nhyde: An autobiography covers the author's entire life chronologically, from birth to the present. A memoir focuses on a specific theme, period, or set of experiences from the author's life, emphasizing emotional truth and reflection. Memoirs are often more literary and thematic, while autobiographies are more comprehensive and factual."}
+{"input": "what is the significance of allegory?", "output": "lex: allegory literary device significance\nlex: allegory examples literature symbolism\nlex: allegorical writing Pilgrim's Progress Animal Farm\nvec: what is an allegory in literature and why is it a significant literary device?\nvec: how do authors use allegory to convey deeper moral or political meanings through symbolic narratives?\nhyde: An allegory is a narrative in which characters, events, and settings symbolically represent abstract ideas or moral concepts. Orwell's \"Animal Farm\" allegorizes the Russian Revolution; Bunyan's \"Pilgrim's Progress\" represents the Christian spiritual journey. Allegory allows writers to critique society, explore complex ideas, and engage readers on multiple levels."}
+{"input": "portrait photography tips", "output": "lex: portrait photography tips lighting posing\nlex: portrait photo camera settings lens\nlex: headshot portrait natural light composition\nvec: what are the best tips for taking professional-quality portrait photographs?\nvec: how should you set up lighting, posing, and camera settings for portrait photography?\nhyde: Use an 85mm or 50mm lens at f/1.8-f/2.8 to create a pleasing background blur. Position your subject near a window for soft natural light, or use a reflector to fill shadows. Focus on the nearest eye, shoot at eye level, and direct your subject to angle their body 45 degrees to the camera."}
+{"input": "how to build passive income", "output": "lex: build passive income streams\nlex: passive income ideas investments dividends\nlex: earn passive income rental property online\nvec: what are the most reliable ways to build passive income streams?\nvec: how can someone start generating passive income through investments, rental property, or online businesses?\nhyde: Common passive income sources include dividend stocks yielding 3-5% annually, rental properties generating monthly cash flow, index fund investments, creating digital products or online courses, and building affiliate marketing websites. Start by investing in a low-cost S&P 500 index fund and reinvesting dividends."}
+{"input": "how to choose the right camera", "output": "lex: choose camera DSLR mirrorless beginner\nlex: camera buying guide sensor megapixels\nlex: best camera photography type budget\nvec: how do you choose the right camera for your photography needs and budget?\nvec: what factors should you consider when deciding between DSLR and mirrorless cameras?\nhyde: Decide what you'll shoot most: landscapes, portraits, video, or street photography. Mirrorless cameras are lighter with faster autofocus, while DSLRs offer longer battery life and more lens options. Key specs to compare: sensor size (full-frame vs APS-C), megapixels, autofocus points, and video capabilities. Budget $500-1000 for a capable starter body."}
+{"input": "what is the significance of the great barrier reef?", "output": "lex: Great Barrier Reef significance ecosystem\nlex: Great Barrier Reef coral biodiversity Australia\nlex: Great Barrier Reef marine life conservation\nvec: why is the Great Barrier Reef ecologically significant and important to protect?\nvec: what makes the Great Barrier Reef the world's largest coral reef system and why is it under threat?\nhyde: The Great Barrier Reef, stretching over 2,300 km along Australia's northeast coast, is the world's largest coral reef system and is visible from space. It supports over 1,500 fish species, 400 coral species, and countless marine organisms. It's a UNESCO World Heritage Site threatened by coral bleaching from rising ocean temperatures."}
+{"input": "how to celebrate holi festival", "output": "lex: Holi festival celebration traditions India\nlex: Holi festival of colors powder\nlex: how to celebrate Holi customs food\nvec: how is the Holi festival celebrated and what are its main traditions and customs?\nvec: what are the traditional ways to celebrate Holi with colors, food, and bonfires?\nhyde: Holi is celebrated over two days: Holika Dahan (bonfire night) and Rangwali Holi (color day). On the morning of Holi, people gather outdoors to throw colored powders (gulal) and spray colored water at each other. Traditional foods include gujiya (sweet dumplings), thandai (spiced milk drink), and puran poli."}
+{"input": "how to negotiate a salary?", "output": "lex: negotiate salary offer tips\nlex: salary negotiation techniques counter offer\nlex: job offer salary negotiation script\nvec: what are effective strategies for negotiating a higher salary during a job offer?\nvec: how do you prepare for and conduct a successful salary negotiation?\nhyde: Research the market rate for your role on Glassdoor, Levels.fyi, or Payscale before negotiating. When you receive an offer, express enthusiasm, then say \"I was hoping for something closer to [target].\" Always negotiate based on market data and your value, not personal needs. Aim 10-20% above the initial offer."}
+{"input": "what is sacred geometry?", "output": "lex: sacred geometry patterns symbols\nlex: sacred geometry golden ratio Fibonacci\nlex: sacred geometry Flower of Life Metatron\nvec: what is sacred geometry and what mathematical patterns are considered sacred?\nvec: how do sacred geometry concepts like the golden ratio and Flower of Life appear in nature and architecture?\nhyde: Sacred geometry assigns symbolic and spiritual meaning to geometric shapes and proportions found in nature. Key patterns include the Flower of Life (overlapping circles), Metatron's Cube, the golden ratio (1.618), and the Fibonacci spiral. These patterns appear in sunflower seeds, nautilus shells, and ancient temple architecture."}
+{"input": "what is political corruption", "output": "lex: political corruption bribery abuse of power\nlex: government corruption examples types\nlex: political corruption embezzlement nepotism\nvec: what is political corruption and what forms does it take in government?\nvec: how does political corruption such as bribery and embezzlement undermine democratic governance?\nhyde: Political corruption is the abuse of public office for private gain. Forms include bribery (accepting payments for favorable decisions), embezzlement of public funds, nepotism (appointing relatives to positions), patronage, and vote-buying. Transparency International's Corruption Perceptions Index ranks countries by perceived levels of public sector corruption."}
+{"input": "what are the rituals of islam", "output": "lex: Islam rituals Five Pillars worship\nlex: Islamic prayer salat fasting Ramadan\nlex: Muslim rituals hajj pilgrimage zakat\nvec: what are the main rituals and religious practices in Islam?\nvec: how do Muslims observe the Five Pillars of Islam including prayer, fasting, and pilgrimage?\nhyde: The Five Pillars of Islam form the core rituals: Shahada (declaration of faith), Salat (five daily prayers facing Mecca), Zakat (annual charitable giving of 2.5% of wealth), Sawm (fasting during Ramadan from dawn to sunset), and Hajj (pilgrimage to Mecca at least once in a lifetime)."}
+{"input": "neural networks", "output": "lex: neural networks deep learning artificial\nlex: neural network architecture layers neurons\nlex: convolutional recurrent neural network CNN RNN\nvec: how do artificial neural networks work and what are the different types of architectures?\nvec: what are the basic components of a neural network including layers, weights, and activation functions?\nhyde: A neural network consists of layers of interconnected nodes (neurons). Input data passes through hidden layers where each connection has a weight. Each neuron applies an activation function (like ReLU or sigmoid) to the weighted sum of its inputs. During training, backpropagation adjusts weights to minimize the loss function."}
+{"input": "what is the trolley problem", "output": "lex: trolley problem ethics thought experiment\nlex: trolley problem utilitarianism moral dilemma\nlex: trolley problem Philippa Foot\nvec: what is the trolley problem and why is it important in ethical philosophy?\nvec: how does the trolley problem illustrate the conflict between utilitarian and deontological ethics?\nhyde: The trolley problem, introduced by Philippa Foot in 1967, asks: a runaway trolley will kill five people unless you pull a lever to divert it onto a track where it will kill one person. Do you pull the lever? Utilitarians say yes (saving more lives), while deontologists argue that actively causing someone's death is morally different from allowing deaths to occur."}
+{"input": "digital transformation in businesses", "output": "lex: digital transformation business strategy\nlex: digital transformation enterprise technology cloud\nlex: business digitization automation workflows\nvec: how are businesses implementing digital transformation to modernize their operations and strategy?\nvec: what technologies drive digital transformation in enterprises, including cloud computing and automation?\nhyde: Digital transformation involves integrating digital technology into all areas of a business, changing how it operates and delivers value. Key components include migrating to cloud infrastructure, automating manual processes, adopting data analytics for decision-making, and building digital customer experiences. McKinsey reports that 70% of transformation efforts fall short of their goals."}
+{"input": "how to protect business data", "output": "lex: protect business data security cybersecurity\nlex: data protection encryption backup strategy\nlex: business data security firewall access control\nvec: what are the most important steps to protect sensitive business data from breaches and loss?\nvec: how should a business implement data protection measures including encryption, backups, and access controls?\nhyde: Protect business data with layered security: encrypt data at rest and in transit using AES-256, implement role-based access controls, enable multi-factor authentication for all accounts, maintain automated offsite backups with the 3-2-1 rule, and train employees on phishing awareness. Conduct regular security audits and penetration testing."}
+{"input": "what is cellular respiration", "output": "lex: cellular respiration ATP glucose\nlex: cellular respiration glycolysis Krebs cycle\nlex: aerobic respiration mitochondria electron transport\nvec: what is cellular respiration and how do cells convert glucose into ATP energy?\nvec: what are the three stages of cellular respiration: glycolysis, the Krebs cycle, and the electron transport chain?\nhyde: Cellular respiration is the metabolic process by which cells break down glucose (C6H12O6) to produce ATP. It occurs in three stages: glycolysis (in the cytoplasm, producing 2 ATP), the Krebs cycle (in the mitochondrial matrix, producing 2 ATP), and the electron transport chain (on the inner mitochondrial membrane, producing 34 ATP)."}
+{"input": "how technology impacts scientific research", "output": "lex: technology impact scientific research tools\nlex: technology advances science instruments computing\nlex: AI machine learning scientific discovery\nvec: how has modern technology transformed the way scientific research is conducted?\nvec: what role do computing, AI, and advanced instruments play in accelerating scientific discovery?\nhyde: Technology has transformed scientific research through high-throughput sequencing (enabling genomics), electron microscopy (revealing molecular structures), supercomputers (running complex simulations), and machine learning (identifying patterns in massive datasets). AI tools like AlphaFold have predicted protein structures that took decades to solve experimentally."}
+{"input": "how wearable technology is evolving", "output": "lex: wearable technology evolution smartwatch fitness\nlex: wearable tech health monitoring sensors 2025 2026\nlex: wearable devices Apple Watch Garmin health tracking\nvec: how is wearable technology evolving in terms of health monitoring and smart features?\nvec: what are the latest advances in wearable devices for fitness tracking and medical diagnostics?\nhyde: Wearable technology has evolved from basic step counters to sophisticated health monitors. Modern smartwatches track heart rate, blood oxygen, ECG, sleep stages, and skin temperature. Emerging features include continuous glucose monitoring, blood pressure sensing, and AI-powered health alerts that can detect atrial fibrillation and sleep apnea."}
+{"input": "what is the significance of compassion in ethics?", "output": "lex: compassion ethics moral philosophy\nlex: compassion morality empathy ethical theory\nlex: ethics of care compassion Schopenhauer\nvec: why is compassion considered a central virtue in ethical philosophy?\nvec: how do ethical theories incorporate compassion as a foundation for moral behavior?\nhyde: Schopenhauer argued that compassion (Mitleid) is the foundation of all morality, as it allows us to recognize the suffering of others as our own. The ethics of care, developed by Carol Gilligan and Nel Noddings, places compassionate relationships at the center of moral reasoning, contrasting with abstract rule-based approaches like Kantianism."}
+{"input": "what is the principle of double effect", "output": "lex: principle of double effect ethics\nlex: double effect doctrine Aquinas moral philosophy\nlex: double effect intended foreseen consequences\nvec: what is the principle of double effect and how does it apply in moral philosophy?\nvec: how does the doctrine of double effect distinguish between intended and foreseen consequences of an action?\nhyde: The principle of double effect, originating from Thomas Aquinas, holds that an action with both good and bad effects is morally permissible if: (1) the action itself is not wrong, (2) the bad effect is not intended, (3) the bad effect is not the means to the good effect, and (4) the good effect outweighs the bad. It's commonly applied in medical ethics and just war theory."}
+{"input": "what are the latest trends in interior design", "output": "lex: interior design trends 2025 2026\nlex: interior design trends colors materials\nlex: home decor trends furniture styles\nvec: what are the newest interior design trends for homes in 2025 and 2026?\nvec: which colors, materials, and furniture styles are trending in interior design right now?\nhyde: Top interior design trends for 2025-2026 include warm earth tones replacing cool grays, curved furniture and organic shapes, bold textured walls, sustainable and natural materials like rattan and stone, statement lighting, and maximalist layering. Warm woods, bouclé fabrics, and vintage-inspired pieces continue to dominate living spaces."}
+{"input": "how to research candidates before voting", "output": "lex: research candidates before voting election\nlex: voter guide candidate positions issues\nlex: candidate research voting record platform\nvec: how can voters research political candidates and their positions before an election?\nvec: what resources help voters compare candidates' platforms and voting records before casting a ballot?\nhyde: Before voting, check nonpartisan voter guides from Vote411.org (League of Women Voters) or BallotReady. Review candidates' official websites for policy positions, and check voting records on VoteSmart.org. Read local newspaper endorsements, watch candidate debates, and verify claims on fact-checking sites like PolitiFact."}
+{"input": "how did the roman empire impact culture?", "output": "lex: Roman Empire cultural impact legacy\nlex: Roman Empire influence law language architecture\nlex: Rome culture art Latin Western civilization\nvec: how did the Roman Empire shape Western culture, law, and language?\nvec: what lasting cultural impacts did the Roman Empire have on architecture, government, and society?\nhyde: The Roman Empire's cultural legacy includes Latin (the root of Romance languages), Roman law (the basis of civil law systems worldwide), architectural innovations like arches, aqueducts, and concrete, republican government concepts, road networks, and the spread of Christianity. Roman art, literature, and engineering influenced Western civilization for centuries."}
+{"input": "explain monotheism", "output": "lex: monotheism one God religion\nlex: monotheism Christianity Islam Judaism\nlex: monotheism definition history theology\nvec: what is monotheism and which major world religions practice the belief in one God?\nvec: how did monotheism develop historically and what distinguishes it from polytheism?\nhyde: Monotheism is the belief in a single, all-powerful God. The three major monotheistic religions are Judaism, Christianity, and Islam, all tracing their roots to Abraham. Judaism was among the earliest monotheistic faiths, emerging around 2000 BCE. Monotheism contrasts with polytheism (many gods) and differs from henotheism (one chief god among many)."}
+{"input": "how to replace windshield wipers?", "output": "lex: replace windshield wipers installation\nlex: change wiper blades car DIY\nlex: windshield wiper replacement size\nvec: how do you replace windshield wiper blades on a car step by step?\nvec: what size windshield wipers does my car need and how do I install them?\nhyde: Lift the wiper arm away from the windshield. Press the small tab where the blade meets the arm and slide the old blade off the hook. Slide the new blade onto the J-hook until it clicks into place. Lower the arm back gently. Check your owner's manual or an auto parts store's fit guide for the correct blade size."}
+{"input": "what are tectonic plates", "output": "lex: tectonic plates Earth crust geology\nlex: plate tectonics continental drift boundaries\nlex: tectonic plates earthquake volcano subduction\nvec: what are tectonic plates and how does plate tectonics explain earthquakes and volcanic activity?\nvec: how do tectonic plates move and interact at convergent, divergent, and transform boundaries?\nhyde: Tectonic plates are massive slabs of Earth's lithosphere that float on the semi-fluid asthenosphere. There are 15 major plates that move 1-10 cm per year. At convergent boundaries, plates collide causing mountains and subduction zones; at divergent boundaries, plates separate creating mid-ocean ridges; at transform boundaries, plates slide past each other causing earthquakes."}
+{"input": "airbnb bookings", "output": "lex: Airbnb bookings reservations how to\nlex: Airbnb book rental property listing\nlex: Airbnb booking tips cancellation policy\nvec: how do you book a rental property on Airbnb and what should you know before reserving?\nvec: what are the Airbnb booking policies including cancellation, fees, and payment?\nhyde: To book on Airbnb, search by destination and dates, filter by price, type, and amenities, and review photos and guest reviews. Request to book or use Instant Book listings for immediate confirmation. Airbnb charges a service fee of 14-16%. Check the cancellation policy (Flexible, Moderate, or Strict) before confirming."}
+{"input": "how do you develop a writing voice?", "output": "lex: develop writing voice style\nlex: writing voice tone author style\nlex: find unique writing voice techniques\nvec: how does a writer develop their own unique writing voice and style?\nvec: what exercises and practices help writers find and strengthen their authentic voice?\nhyde: Developing a writing voice requires reading widely, writing consistently, and paying attention to what feels natural. Write the way you think and speak. Experiment with sentence length, word choice, and rhythm. Read your work aloud to hear your voice. Imitate writers you admire, then gradually let your own patterns emerge through regular practice."}
+{"input": "what is devotion in religious context", "output": "lex: devotion religion religious worship\nlex: devotion faith prayer bhakti piety\nlex: religious devotion spiritual practice\nvec: what does devotion mean in a religious context and how is it practiced across faiths?\nvec: how do different religions express devotion through prayer, worship, and spiritual discipline?\nhyde: Religious devotion refers to profound love, loyalty, and dedication to God or a divine reality, expressed through prayer, worship, and spiritual practice. In Hinduism, bhakti (devotion) is a path to liberation through loving surrender to a deity. In Christianity, devotion involves daily prayer, scripture reading, and sacramental participation."}
+{"input": "what is skepticism in philosophy", "output": "lex: skepticism philosophy epistemology doubt\nlex: philosophical skepticism Pyrrhonism Descartes\nlex: skepticism knowledge certainty questioning\nvec: what is philosophical skepticism and how does it question the possibility of knowledge?\nvec: how did Pyrrhonian skepticism and Cartesian doubt influence Western philosophical thought?\nhyde: Philosophical skepticism questions whether certain knowledge is possible. Pyrrhonian skepticism (from Pyrrho of Elis) suspends judgment on all claims, arguing that for every argument there is an equally strong counterargument. Descartes used methodological doubt—doubting everything that could be doubted—to arrive at \"cogito ergo sum\" as an indubitable foundation."}
+{"input": "fix teeth", "output": "lex: fix teeth dental repair options\nlex: broken chipped teeth treatment dentist\nlex: dental restoration crowns veneers bonding\nvec: what are the options for fixing damaged, chipped, or broken teeth?\nvec: how do dentists repair teeth using crowns, veneers, bonding, and other dental treatments?\nhyde: Common dental repairs include bonding (composite resin applied to chipped teeth, $100-400), porcelain veneers (thin shells covering the front surface, $500-2500 per tooth), crowns (caps covering the entire tooth, $800-1500), and dental implants for missing teeth ($3000-5000). Treatment depends on the extent of damage."}
+{"input": "what are social media photography tips?", "output": "lex: social media photography tips Instagram\nlex: phone photography social media lighting composition\nlex: Instagram photo tips editing filters\nvec: what are the best photography tips for creating engaging social media content?\nvec: how do you take better photos for Instagram and other social media platforms using a phone?\nhyde: Shoot during golden hour (the hour after sunrise or before sunset) for warm, flattering light. Use the rule of thirds grid on your phone camera. Keep backgrounds clean and uncluttered. Edit consistently using the same preset or filter for a cohesive feed. Shoot in natural light whenever possible and avoid using flash."}
+{"input": "what is gerrymandering", "output": "lex: gerrymandering redistricting electoral districts\nlex: gerrymandering political manipulation voting\nlex: gerrymandering packing cracking congressional\nvec: what is gerrymandering and how does it manipulate electoral district boundaries?\nvec: how does gerrymandering use techniques like packing and cracking to influence election outcomes?\nhyde: Gerrymandering is the manipulation of electoral district boundaries to favor a particular political party. Two main techniques are \"packing\" (concentrating opposition voters into a few districts) and \"cracking\" (spreading them across many districts to dilute their vote). The term dates to 1812 when Governor Elbridge Gerry approved a district shaped like a salamander."}
+{"input": "how do the arts contribute to moral understanding?", "output": "lex: arts moral understanding ethics\nlex: art literature ethics empathy\nlex: arts moral education philosophical perspective\nvec: how do the arts such as literature, film, and visual art contribute to moral understanding?\nvec: in what ways do artistic works cultivate empathy and ethical awareness in audiences?\nhyde: Literature, theater, and film place audiences in the shoes of characters facing moral dilemmas, cultivating empathy and ethical reflection. Martha Nussbaum argues that novels develop moral imagination by exposing readers to lives unlike their own. Art invites us to confront injustice, question assumptions, and feel the weight of ethical choices."}
+{"input": "what are the main beliefs of jainism?", "output": "lex: Jainism beliefs principles religion\nlex: Jainism ahimsa non-violence karma\nlex: Jain philosophy anekantavada moksha\nvec: what are the core beliefs and principles of Jainism as a religion?\nvec: how does Jainism emphasize non-violence (ahimsa) and what are its main philosophical tenets?\nhyde: Jainism's core beliefs include ahimsa (non-violence toward all living beings), anekantavada (many-sidedness of truth), and aparigraha (non-attachment). Jains believe the soul (jiva) accumulates karma through actions and must purify itself through ethical living, asceticism, and meditation to achieve moksha (liberation from the cycle of rebirth)."}
+{"input": "how do philosophers define happiness", "output": "lex: philosophers define happiness philosophy\nlex: happiness eudaimonia Aristotle hedonism\nlex: philosophical theories happiness well-being\nvec: how have major philosophers throughout history defined happiness and well-being?\nvec: what is the difference between Aristotle's eudaimonia and hedonistic views of happiness?\nhyde: Aristotle defined happiness (eudaimonia) as flourishing through virtuous activity over a complete life, not mere pleasure. Epicurus identified happiness with ataraxia (tranquility) and the absence of pain. Utilitarians like Mill equated happiness with pleasure but distinguished higher (intellectual) from lower (bodily) pleasures. Modern positive psychology studies happiness as subjective well-being."}
+{"input": "how to train a dog to sit", "output": "lex: train dog sit command\nlex: dog training sit positive reinforcement\nlex: teach puppy sit treat method\nvec: what is the step-by-step method for training a dog to sit on command?\nvec: how do you use positive reinforcement to teach a dog or puppy the sit command?\nhyde: Hold a treat close to your dog's nose, then slowly move your hand up so the dog's head follows the treat and their bottom lowers. The moment they sit, say \"sit,\" give the treat, and praise them. Repeat 5-10 times per session, 2-3 sessions daily. Within a week, most dogs learn to sit on verbal command alone."}
+{"input": "how to choose a family-friendly restaurant?", "output": "lex: family-friendly restaurant kids menu\nlex: choose restaurant families children\nlex: kid-friendly dining options reviews\nvec: how do you find and choose a family-friendly restaurant suitable for dining with children?\nvec: what features make a restaurant good for families with young kids?\nhyde: Look for restaurants with a dedicated kids' menu, high chairs, and a casual atmosphere that tolerates noise. Check Google or Yelp reviews filtered for \"family-friendly.\" Booth seating, crayons or activity sheets, and an early dinner option are good signs. Fast-casual restaurants often work well since kids don't have to wait long for food."}
+{"input": "what is historical context in literature?", "output": "lex: historical context literature analysis\nlex: historical context literary criticism period\nlex: literature historical background social conditions\nvec: what does historical context mean when analyzing and interpreting a work of literature?\nvec: how does understanding the historical period and social conditions help interpret literary texts?\nhyde: Historical context in literature refers to the social, political, economic, and cultural conditions during the time a work was written. Understanding that \"1984\" was written in 1948 during the rise of totalitarian states deepens its meaning. Historical context helps readers interpret themes, character motivations, and the author's intent within their time period."}
+{"input": "where to buy mid-century modern furniture", "output": "lex: buy mid-century modern furniture store\nlex: mid-century modern furniture online vintage\nlex: MCM furniture West Elm Design Within Reach\nvec: where can I buy authentic or reproduction mid-century modern furniture?\nvec: what are the best stores and websites for purchasing mid-century modern style furniture?\nhyde: Shop mid-century modern furniture at West Elm, Design Within Reach (DWR), and Article for contemporary reproductions. For vintage originals, check Chairish, 1stDibs, and local estate sales. IKEA offers affordable MCM-inspired pieces. Facebook Marketplace and Craigslist often have authentic Eames, Knoll, and Herman Miller pieces at lower prices."}
+{"input": "how to transition kids to new schools?", "output": "lex: transition kids new school tips\nlex: children changing schools adjustment\nlex: help child new school anxiety transfer\nvec: how can parents help their children transition smoothly to a new school?\nvec: what strategies help kids adjust emotionally and socially when changing schools?\nhyde: Visit the new school together before the first day so the building feels familiar. Meet the teacher and tour the classroom. Maintain routines at home for stability. Encourage your child to talk about their feelings and validate their anxiety. Arrange playdates with new classmates early on, and stay in contact with teachers during the first few weeks."}
+{"input": "what is graphic design?", "output": "lex: graphic design visual communication\nlex: graphic design typography layout color\nlex: graphic design tools Adobe Figma\nvec: what is graphic design and what skills and tools does a graphic designer use?\nvec: how does graphic design combine typography, color, and layout to communicate visually?\nhyde: Graphic design is the craft of creating visual content to communicate messages. Designers use typography, color theory, layout, and imagery to create logos, websites, posters, packaging, and more. Key tools include Adobe Photoshop, Illustrator, InDesign, and Figma. The field spans print design, web/UI design, branding, and motion graphics."}
+{"input": "what is the latest iphone model", "output": "lex: latest iPhone model 2025 2026\nlex: newest iPhone Apple release\nlex: iPhone 17 features specs\nvec: what is the latest iPhone model released by Apple and what are its key features?\nvec: what are the specs and improvements in the newest iPhone compared to previous models?\nhyde: The iPhone 16 series launched in September 2024 with the A18 chip, a dedicated Camera Control button, and Apple Intelligence features. The iPhone 16 Pro and Pro Max feature a 48MP main camera, titanium design, and improved battery life. The iPhone 17 lineup is expected in September 2025."}
+{"input": "where to find open access research papers", "output": "lex: open access research papers free\nlex: open access journals articles database\nlex: free academic papers PubMed arXiv\nvec: where can I find free open access research papers and academic articles?\nvec: what databases and websites provide open access to peer-reviewed scientific papers?\nhyde: Access free research papers through PubMed Central (biomedical), arXiv (physics, math, CS), SSRN (social sciences), and DOAJ (Directory of Open Access Journals). Google Scholar often links to free PDF versions. Unpaywall is a browser extension that finds legal free versions of paywalled papers. Many universities also maintain institutional repositories."}
+{"input": "how to improve interpersonal skills", "output": "lex: improve interpersonal skills communication\nlex: interpersonal skills active listening empathy\nlex: people skills social interaction workplace\nvec: what are effective ways to improve interpersonal and communication skills?\nvec: how can someone develop better listening, empathy, and social skills in personal and professional settings?\nhyde: Improve interpersonal skills by practicing active listening: maintain eye contact, avoid interrupting, and paraphrase what you heard. Ask open-ended questions to show genuine interest. Develop empathy by considering others' perspectives before responding. Practice assertive communication—express your needs clearly while respecting others. Seek feedback on how you come across."}
+{"input": "math model", "output": "lex: mathematical model equations simulation\nlex: math modeling real-world applications\nlex: mathematical model differential equations optimization\nvec: what is a mathematical model and how is it used to represent real-world systems?\nvec: how do mathematicians build models using equations to simulate and predict outcomes?\nhyde: A mathematical model uses equations and formulas to represent the behavior of a real-world system. For example, the SIR model uses differential equations to predict disease spread: dS/dt = -βSI, dI/dt = βSI - γI, dR/dt = γI. Models are validated by comparing predictions to observed data and refined iteratively."}
+{"input": "what is digital transformation", "output": "lex: digital transformation definition strategy\nlex: digital transformation technology business process\nlex: digital transformation cloud automation data-driven\nvec: what is digital transformation and how does it change how organizations operate?\nvec: what are the key components and stages of digital transformation in a business?\nhyde: Digital transformation is the process of using digital technologies to fundamentally change how an organization operates and delivers value. It goes beyond digitizing existing processes—it involves rethinking business models, customer experiences, and operational workflows using cloud computing, AI, data analytics, and automation."}
+{"input": "how to improve project outcomes", "output": "lex: improve project outcomes management\nlex: project success factors planning execution\nlex: project management methodology agile results\nvec: what strategies and practices improve project outcomes and increase the chance of success?\nvec: how can project managers improve delivery, stakeholder satisfaction, and results?\nhyde: Improve project outcomes by defining clear objectives and success criteria upfront, engaging stakeholders early and often, breaking work into short iterations with regular checkpoints, and managing risks proactively. Use retrospectives to learn from each phase. Projects with clear scope, executive sponsorship, and empowered teams are 2-3x more likely to succeed."}
+{"input": "what is the relationship between ethics and happiness?", "output": "lex: ethics happiness philosophy relationship\nlex: virtue ethics happiness eudaimonia Aristotle\nlex: morality well-being ethical living\nvec: what is the philosophical relationship between living ethically and being happy?\nvec: how does Aristotle argue that virtue and ethics are connected to happiness and human flourishing?\nhyde: Aristotle argued that happiness (eudaimonia) is achieved through virtuous living—not pleasure alone, but the active exercise of reason and moral virtue over a lifetime. The Stoics similarly held that virtue is sufficient for happiness. Utilitarianism inverts this: moral actions are those that maximize total happiness. The question of whether being moral makes you happy remains debated."}
+{"input": "how does philosophy explore the nature of truth?", "output": "lex: philosophy truth nature theories\nlex: correspondence coherence pragmatic theory truth\nlex: truth philosophy epistemology logic\nvec: how do philosophical theories explain the nature of truth and what makes a statement true?\nvec: what are the main theories of truth in philosophy such as correspondence, coherence, and pragmatic theories?\nhyde: Philosophy examines truth through several theories. The correspondence theory holds that truth is agreement between a proposition and reality. The coherence theory says a statement is true if it fits consistently within a system of beliefs. The pragmatic theory (James, Dewey) defines truth as what works in practice. Deflationary theories argue that \"true\" adds nothing beyond the assertion itself."}
+{"input": "rain drop", "output": "lex: raindrop formation size shape\nlex: raindrop water cycle precipitation\nlex: rain droplet physics terminal velocity\nvec: how do raindrops form and what determines their size and shape as they fall?\nvec: what is the science behind raindrop formation in the water cycle and precipitation?\nhyde: Raindrops form when water vapor condenses around tiny particles (condensation nuclei) in clouds. As droplets collide and merge, they grow heavy enough to fall. Contrary to the teardrop image, falling raindrops are actually shaped like hamburger buns—flattened on the bottom by air resistance. Average raindrops are 1-2mm in diameter and fall at about 20 mph."}
+{"input": "what is magical realism?", "output": "lex: magical realism literary genre\nlex: magical realism Garcia Marquez literature\nlex: magical realism Latin American fiction examples\nvec: what is magical realism as a literary genre and what are its defining characteristics?\nvec: how do authors like Gabriel Garcia Marquez blend the magical and mundane in magical realism?\nhyde: Magical realism is a literary genre in which supernatural elements appear in an otherwise realistic setting, treated as ordinary by the characters. Gabriel Garcia Marquez's \"One Hundred Years of Solitude\" is the quintessential example, where events like a character ascending to heaven while hanging laundry are narrated matter-of-factly alongside everyday life in Macondo."}
+{"input": "how to write a film review", "output": "lex: write film review movie critique\nlex: film review structure format examples\nlex: movie review writing tips analysis\nvec: how do you write a well-structured and engaging film review?\nvec: what elements should be included in a film review such as plot summary, analysis, and rating?\nhyde: Start with a hook—a striking observation about the film. Provide a brief, spoiler-free plot summary (2-3 sentences). Evaluate the directing, acting, cinematography, screenplay, and score. Support your opinion with specific scenes or examples. Address who would enjoy the film and rate it on your chosen scale. Keep the review between 400-800 words."}
+{"input": "what is the current inflation rate", "output": "lex: current inflation rate CPI 2025 2026\nlex: inflation rate United States economy\nlex: consumer price index inflation percentage\nvec: what is the current U.S. inflation rate and how is it measured by the CPI?\nvec: what is the latest consumer price index data showing the annual inflation rate?\nhyde: The U.S. Bureau of Labor Statistics measures inflation through the Consumer Price Index (CPI), which tracks the average change in prices paid by consumers for goods and services. The annual inflation rate is calculated by comparing the current CPI to the same month one year prior. Check bls.gov/cpi for the latest monthly release."}
+{"input": "what is the function of dialogue?", "output": "lex: dialogue function purpose communication\nlex: dialogue conversation role\nvec: what purpose does dialogue serve in communication and storytelling\nvec: how does dialogue function in literature and everyday interaction\nhyde: Dialogue serves multiple functions: it conveys information between characters, reveals personality and motivation, advances the plot, and creates tension. In everyday communication, dialogue enables mutual understanding and negotiation of meaning."}
+{"input": "what is the importance of peer review", "output": "lex: peer review importance scientific publishing\nlex: peer review process academic research\nvec: why is peer review important in academic and scientific publishing\nvec: how does the peer review process ensure quality in research papers\nhyde: Peer review is the cornerstone of scientific publishing. Before a paper is accepted, independent experts evaluate the methodology, data analysis, and conclusions. This process catches errors, prevents fraudulent claims, and maintains the credibility of published research."}
+{"input": "what is the impact of the printing press", "output": "lex: printing press impact history Gutenberg\nlex: printing press effects literacy knowledge\nvec: how did the invention of the printing press change society and the spread of knowledge\nvec: what were the historical consequences of Gutenberg's printing press\nhyde: Gutenberg's printing press, invented around 1440, revolutionized the production of books. By making texts affordable and widely available, it increased literacy rates, enabled the Protestant Reformation, and accelerated the Scientific Revolution across Europe."}
+{"input": "what is open science", "output": "lex: open science definition principles\nlex: open access open data research transparency\nvec: what does open science mean and what are its core principles\nvec: how does open science promote transparency and accessibility in research\nhyde: Open science is a movement to make scientific research, data, and dissemination accessible to all. It encompasses open access publishing, open data sharing, open-source software, and transparent methodologies, aiming to accelerate discovery through collaboration."}
+{"input": "swim class", "output": "lex: swimming classes lessons beginner\nlex: swim class schedule enrollment\nvec: where can I find swimming classes for beginners or children\nvec: what should I expect from a swimming lesson and how to enroll\nhyde: Our swim classes are available for all ages and skill levels. Beginner classes focus on water safety, floating, and basic strokes. Intermediate classes cover freestyle, backstroke, and treading water. Sessions run 30-45 minutes with certified instructors."}
+{"input": "what is the bhagavad gita", "output": "lex: Bhagavad Gita Hindu scripture meaning\nlex: Bhagavad Gita Krishna Arjuna teachings\nvec: what is the Bhagavad Gita and what are its central teachings\nvec: what role does the Bhagavad Gita play in Hindu philosophy and practice\nhyde: The Bhagavad Gita is a 700-verse Hindu scripture that forms part of the Mahabharata epic. It is a dialogue between Prince Arjuna and the god Krishna, addressing duty (dharma), devotion (bhakti), knowledge (jnana), and selfless action (karma yoga)."}
+{"input": "how does plant photosynthesis work", "output": "lex: photosynthesis process plants chlorophyll\nlex: light reactions Calvin cycle carbon dioxide\nvec: how do plants convert sunlight into energy through photosynthesis\nvec: what are the steps of photosynthesis in plant cells\nhyde: Photosynthesis occurs in chloroplasts. In the light reactions, chlorophyll absorbs sunlight to split water molecules, producing ATP and NADPH. In the Calvin cycle, these molecules drive the fixation of CO2 into glucose, releasing oxygen as a byproduct."}
+{"input": "what is a black hole", "output": "lex: black hole definition physics space\nlex: black hole event horizon singularity\nvec: what is a black hole and how does it form in space\nvec: how do black holes work according to general relativity\nhyde: A black hole is a region in space where gravity is so intense that nothing, not even light, can escape. It forms when a massive star collapses at the end of its life. The boundary is called the event horizon, beyond which lies the singularity."}
+{"input": "how ecosystems function", "output": "lex: ecosystem function energy flow nutrient cycling\nlex: ecosystems trophic levels food web\nvec: how do ecosystems function through energy flow and nutrient cycling\nvec: what are the key processes that keep ecosystems balanced and healthy\nhyde: Ecosystems function through interconnected processes: producers capture solar energy via photosynthesis, consumers transfer energy through food webs, and decomposers recycle nutrients back into the soil. Water, carbon, and nitrogen cycle continuously through biotic and abiotic components."}
+{"input": "how to increase home resale value", "output": "lex: increase home resale value renovations\nlex: home improvement ROI property value\nvec: what home improvements increase resale value the most\nvec: how can I boost my home's market price before selling\nhyde: Kitchen and bathroom remodels offer the highest ROI, typically recovering 60-80% of costs. Other high-value improvements include replacing the front door, adding a deck, and upgrading to energy-efficient windows. Fresh paint and curb appeal landscaping are low-cost, high-impact upgrades."}
+{"input": "how to design an effective scientific study", "output": "lex: scientific study design methodology\nlex: research design controls variables sample size\nvec: how do you design a rigorous and effective scientific study\nvec: what steps are involved in planning a well-controlled research experiment\nhyde: An effective study begins with a clear hypothesis and defined variables. Choose an appropriate design (randomized controlled trial, cohort, etc.), calculate the required sample size for statistical power, establish controls, and pre-register your protocol to reduce bias."}
+{"input": "how to set up a campfire", "output": "lex: campfire setup build fire outdoors\nlex: campfire fire pit kindling tinder logs\nvec: how do you properly build and start a campfire outdoors\nvec: what materials and steps are needed to set up a safe campfire\nhyde: To build a campfire, clear a fire ring down to bare soil. Place a tinder bundle of dry leaves or paper in the center. Stack small kindling sticks in a teepee shape around it. Light the tinder and gradually add larger logs as the fire grows. Keep water nearby to extinguish."}
+{"input": "where to learn digital marketing", "output": "lex: digital marketing courses online training\nlex: learn digital marketing SEO social media\nvec: where can I take courses to learn digital marketing skills\nvec: what are the best online platforms for learning SEO, social media, and digital advertising\nhyde: Google Digital Garage offers a free Fundamentals of Digital Marketing course with certification. HubSpot Academy covers inbound marketing and content strategy. Coursera and Udemy feature paid courses on SEO, PPC, email marketing, and social media advertising."}
+{"input": "how to remove car dents?", "output": "lex: car dent removal DIY repair\nlex: paintless dent repair PDR technique\nvec: how can I remove dents from my car at home without repainting\nvec: what are the methods for fixing small dents on a car body\nhyde: For small dents, try the boiling water method on plastic bumpers or use a suction cup dent puller. Paintless dent repair (PDR) uses metal rods to push dents out from behind the panel. For deeper dents, apply body filler, sand smooth, and repaint."}
+{"input": "what is a moral code", "output": "lex: moral code definition ethics principles\nlex: moral code rules behavior right wrong\nvec: what is a moral code and how does it guide human behavior\nvec: how do societies and individuals develop a set of moral principles\nhyde: A moral code is a set of principles or rules that define right and wrong conduct. It may be derived from religious teachings, cultural traditions, philosophical reasoning, or personal reflection. Examples include the Ten Commandments, Kantian ethics, and utilitarianism."}
+{"input": "what is cloud computing", "output": "lex: cloud computing definition services\nlex: cloud computing IaaS PaaS SaaS\nvec: what is cloud computing and how do cloud services work\nvec: what are the different types of cloud computing services like IaaS, PaaS, and SaaS\nhyde: Cloud computing delivers computing resources—servers, storage, databases, networking, and software—over the internet on a pay-as-you-go basis. The three main service models are Infrastructure as a Service (IaaS), Platform as a Service (PaaS), and Software as a Service (SaaS)."}
+{"input": "how to practice meditation", "output": "lex: meditation practice techniques beginners\nlex: mindfulness meditation breathing focus\nvec: how do I start a daily meditation practice as a beginner\nvec: what are simple meditation techniques for reducing stress and improving focus\nhyde: Start with 5-10 minutes daily. Sit comfortably, close your eyes, and focus on your breath. When thoughts arise, notice them without judgment and gently return attention to breathing. Guided meditation apps like Headspace or Insight Timer can help beginners build consistency."}
+{"input": "what is xeriscaping?", "output": "lex: xeriscaping drought-tolerant landscaping water conservation\nlex: xeriscape garden design dry climate plants\nvec: what is xeriscaping and how does it reduce water usage in landscaping\nvec: how do you design a xeriscape garden with drought-resistant plants\nhyde: Xeriscaping is a landscaping approach that minimizes water use by selecting drought-tolerant native plants, improving soil with compost, using efficient drip irrigation, applying mulch to retain moisture, and reducing lawn area. It originated in arid regions of the western United States."}
+{"input": "what are the main beliefs of buddhism", "output": "lex: Buddhism beliefs Four Noble Truths Eightfold Path\nlex: Buddhist teachings karma dharma nirvana\nvec: what are the core beliefs and teachings of Buddhism\nvec: what do Buddhists believe about suffering, enlightenment, and the path to nirvana\nhyde: Buddhism is founded on the Four Noble Truths: life involves suffering (dukkha), suffering arises from craving (tanha), suffering can end (nirodha), and the path to its end is the Noble Eightfold Path. Key concepts include karma, rebirth, impermanence (anicca), and non-self (anatta)."}
+{"input": "how to reduce carbon footprint?", "output": "lex: reduce carbon footprint emissions tips\nlex: lower carbon footprint energy transportation diet\nvec: what are effective ways to reduce my personal carbon footprint\nvec: how can individuals lower their greenhouse gas emissions in daily life\nhyde: The biggest personal reductions come from driving less or switching to an EV, flying less frequently, eating less red meat, improving home insulation, and switching to renewable energy. A plant-rich diet can cut food-related emissions by up to 50%."}
+{"input": "how to save for a child's education?", "output": "lex: save child education fund college\nlex: 529 plan education savings account\nvec: how should I save money for my child's college education\nvec: what are the best investment accounts for saving for a child's education\nhyde: A 529 plan is one of the most tax-advantaged ways to save for education. Contributions grow tax-free, and withdrawals for qualified expenses (tuition, books, room and board) are also tax-free. Many states offer additional tax deductions for contributions."}
+{"input": "what is the best way to learn python programming?", "output": "lex: learn Python programming beginner tutorial\nlex: Python programming course exercises projects\nvec: what is the most effective way to learn Python programming from scratch\nvec: which Python courses and resources are best for beginners learning to code\nhyde: Start with an interactive tutorial like Python.org's official tutorial or Codecademy's Python course. Practice daily on sites like LeetCode or HackerRank. Build small projects—a calculator, web scraper, or to-do app—to solidify concepts. Read \"Automate the Boring Stuff with Python\" for practical applications."}
+{"input": "how to grow roses from cuttings?", "output": "lex: grow roses cuttings propagation\nlex: rose cutting rooting hormone planting\nvec: how do you propagate roses from stem cuttings at home\nvec: what is the step-by-step process for rooting rose cuttings\nhyde: Take a 6-8 inch cutting from a healthy rose stem just below a leaf node. Remove lower leaves, dip the cut end in rooting hormone, and insert into moist potting mix. Cover with a plastic bag to maintain humidity. Roots typically form in 4-8 weeks. Transplant once established."}
+{"input": "sustainable architecture", "output": "lex: sustainable architecture green building design\nlex: sustainable building materials energy efficient\nvec: what is sustainable architecture and what design principles does it follow\nvec: how do architects design energy-efficient and environmentally friendly buildings\nhyde: Sustainable architecture minimizes environmental impact through passive solar design, natural ventilation, high-performance insulation, and renewable energy integration. Materials like cross-laminated timber, recycled steel, and low-VOC finishes reduce embodied carbon."}
+{"input": "what is the concept of moral luck", "output": "lex: moral luck philosophy concept\nlex: moral luck Thomas Nagel Bernard Williams\nvec: what is the philosophical concept of moral luck and why is it controversial\nvec: how does moral luck challenge our ideas about responsibility and blame\nhyde: Moral luck, introduced by Thomas Nagel and Bernard Williams in 1976, refers to situations where moral judgment depends on factors beyond a person's control. A drunk driver who arrives home safely is judged differently from one who kills a pedestrian, despite identical recklessness."}
+{"input": "task wait", "output": "lex: async task wait await\nlex: task wait timeout concurrency\nvec: how to wait for an asynchronous task to complete in programming\nvec: how to use await or task wait for concurrent operations\nhyde: Use `await task` in async/await patterns to wait for completion. In C#, `Task.Wait()` blocks synchronously while `await` yields control. In Python, `await asyncio.gather(*tasks)` waits for multiple coroutines. Use timeouts to prevent indefinite blocking."}
+{"input": "latest findings in climate science", "output": "lex: climate science research findings 2025 2026\nlex: climate change latest studies temperature emissions\nvec: what are the most recent scientific findings about climate change in 2025-2026\nvec: what do the latest climate science studies reveal about global warming trends\nhyde: Recent studies in 2025 confirm that global average temperatures have exceeded 1.5°C above pre-industrial levels. Ocean heat content reached record highs, and Arctic sea ice extent continued its decline. New research links accelerated ice sheet loss in Greenland and Antarctica to rising sea levels."}
+{"input": "how to lose weight fast?", "output": "lex: lose weight fast safe methods\nlex: weight loss diet exercise calorie deficit\nvec: what are safe and effective methods to lose weight quickly\nvec: how can I create a calorie deficit to lose weight without harming my health\nhyde: Safe weight loss is 1-2 pounds per week through a calorie deficit of 500-1000 calories daily. Combine a protein-rich diet with strength training and cardio. Avoid crash diets—they cause muscle loss and metabolic slowdown. Drink water, sleep 7-9 hours, and track food intake for accountability."}
+{"input": "ukraine", "output": "lex: Ukraine country history conflict\nlex: Ukraine war geopolitics Kyiv\nvec: what is the current situation in Ukraine and the ongoing conflict\nvec: what is the history and geopolitical context of Ukraine\nhyde: Ukraine is a country in Eastern Europe with a population of approximately 44 million. Since February 2022, it has been engaged in a full-scale war following Russia's invasion. Kyiv is the capital. Ukraine has deep historical ties to both European and post-Soviet geopolitics."}
+{"input": "http client", "output": "lex: HTTP client library request\nlex: HTTP client fetch API REST\nvec: how to make HTTP requests using an HTTP client library\nvec: which HTTP client libraries are available for making API calls in different languages\nhyde: An HTTP client sends requests to web servers and processes responses. In JavaScript, use `fetch()` or `axios`. In Python, use `requests` or `httpx`. In Go, use `net/http`. Typical methods include GET, POST, PUT, DELETE. Set headers, handle timeouts, and parse JSON responses."}
+{"input": "how to vlog with a smartphone", "output": "lex: vlog smartphone video recording tips\nlex: smartphone vlogging equipment setup\nvec: how do I start vlogging using only my smartphone\nvec: what equipment and techniques make smartphone vlogs look professional\nhyde: To vlog with a smartphone, use the rear camera for higher quality. Invest in a small tripod or gimbal for stability, a clip-on microphone for clear audio, and a ring light for indoor filming. Shoot in 1080p or 4K, frame at eye level, and edit with apps like CapCut or InShot."}
+{"input": "what are the elements of short stories?", "output": "lex: short story elements plot character setting\nlex: short story structure literary elements\nvec: what are the key literary elements that make up a short story\nvec: how are plot, character, setting, and theme used in short story writing\nhyde: The essential elements of a short story are plot (the sequence of events), character (the people involved), setting (time and place), conflict (the central struggle), theme (the underlying message), and point of view (the narrative perspective). Short stories typically focus on a single incident."}
+{"input": "how to fix car key fob?", "output": "lex: car key fob fix repair battery replacement\nlex: key fob not working reprogram\nvec: how do I fix a car key fob that stopped working\nvec: how to replace the battery or reprogram a car key fob\nhyde: If your key fob stops working, replace the battery first—open the case with a flat screwdriver and swap in a new CR2032 or CR2025 coin cell. If it still fails, reprogram it: consult your owner's manual for the key-turn sequence or visit a dealer for re-pairing."}
+{"input": "how to grow orchids indoors?", "output": "lex: grow orchids indoors care guide\nlex: orchid indoor growing light water humidity\nvec: how do you care for orchids when growing them indoors\nvec: what light, water, and humidity conditions do indoor orchids need\nhyde: Phalaenopsis orchids thrive indoors with bright indirect light, such as an east-facing window. Water once a week by soaking the roots, then draining completely. Maintain 50-70% humidity with a pebble tray. Fertilize biweekly with diluted orchid fertilizer. Repot every 1-2 years in bark medium."}
+{"input": "how to prepare a scientific presentation", "output": "lex: scientific presentation preparation slides\nlex: research talk conference presentation tips\nvec: how do you prepare and deliver an effective scientific presentation\nvec: what are tips for creating clear slides for a research conference talk\nhyde: Structure your talk as: introduction with context, methods, key results, and conclusions. Use one main idea per slide. Minimize text—use figures and graphs. Practice timing (typically 12 minutes for a 15-minute slot). Anticipate questions about methodology and limitations."}
+{"input": "ai", "output": "lex: artificial intelligence AI machine learning\nlex: AI deep learning neural networks LLM\nvec: what is artificial intelligence and how does modern AI technology work\nvec: what are the main branches and applications of artificial intelligence\nhyde: Artificial intelligence (AI) refers to computer systems that perform tasks typically requiring human intelligence, such as recognizing speech, making decisions, and translating languages. Modern AI relies on machine learning, particularly deep neural networks and large language models (LLMs)."}
+{"input": "how to write a research proposal", "output": "lex: research proposal writing guide\nlex: research proposal structure sections\nvec: how do you write a strong research proposal for a grant or thesis\nvec: what sections and elements should a research proposal include\nhyde: A research proposal typically includes: title, abstract, introduction with background and significance, literature review, research questions or hypotheses, methodology, timeline, budget, and references. Clearly state the gap your research will fill and justify the chosen methods."}
+{"input": "how to stop negative self-talk?", "output": "lex: stop negative self-talk techniques\nlex: negative self-talk cognitive behavioral therapy\nvec: how can I stop negative self-talk and replace it with positive thinking\nvec: what psychological techniques help overcome critical inner dialogue\nhyde: Cognitive behavioral therapy (CBT) teaches you to identify and challenge negative automatic thoughts. When you catch yourself thinking \"I always fail,\" reframe it: \"I struggled this time, but I've succeeded before.\" Keep a thought journal, practice self-compassion, and label thoughts as thoughts, not facts."}
+{"input": "how scientific collaboration advances research", "output": "lex: scientific collaboration research advancement\nlex: interdisciplinary research teamwork co-authorship\nvec: how does collaboration between scientists accelerate research progress\nvec: why is interdisciplinary teamwork important in advancing scientific discovery\nhyde: Multi-institutional collaboration allows researchers to share equipment, data, and expertise across disciplines. The Human Genome Project involved 20 institutions across six countries. Studies show that co-authored papers receive more citations and have higher reproducibility than single-author work."}
+{"input": "how to measure business performance", "output": "lex: business performance metrics KPIs\nlex: measure business performance revenue profit\nvec: what key performance indicators are used to measure business success\nvec: how do companies track and evaluate their business performance\nhyde: Key business performance metrics include revenue growth rate, net profit margin, customer acquisition cost (CAC), customer lifetime value (CLV), employee productivity, and return on investment (ROI). Use dashboards and quarterly reviews to track KPIs against targets."}
+{"input": "how to volunteer for a political campaign", "output": "lex: volunteer political campaign election\nlex: campaign volunteering canvassing phone banking\nvec: how can I sign up to volunteer for a political campaign\nvec: what kinds of volunteer work are available on political campaigns\nhyde: To volunteer, visit the candidate's website and fill out the volunteer form. Common roles include canvassing door-to-door, phone banking, text banking, organizing events, and driving voters to polls on election day. Most campaigns welcome volunteers of all experience levels."}
+{"input": "how to bake a chocolate cake?", "output": "lex: chocolate cake recipe bake from scratch\nlex: baking chocolate cake ingredients instructions\nvec: how do I bake a moist chocolate cake from scratch at home\nvec: what is a simple recipe for homemade chocolate cake\nhyde: Preheat oven to 350°F. Mix 2 cups flour, 2 cups sugar, 3/4 cup cocoa powder, 2 tsp baking soda, and 1 tsp salt. Add 2 eggs, 1 cup buttermilk, 1 cup hot coffee, and 1/2 cup oil. Pour into greased pans and bake 30-35 minutes. Frost with chocolate ganache."}
+{"input": "how do mystics approach spirituality?", "output": "lex: mystics spirituality mystical experience\nlex: mysticism spiritual practice contemplation\nvec: how do mystics across traditions approach spiritual experience and union with the divine\nvec: what practices and beliefs characterize mystical approaches to spirituality\nhyde: Mystics seek direct, personal experience of the divine through contemplation, prayer, and meditation. Christian mystics like Meister Eckhart pursued union with God; Sufi mystics practice dhikr (remembrance of God); and Hindu mystics use yoga and devotion to experience Brahman."}
+{"input": "how cultural festivals affect community bonding", "output": "lex: cultural festivals community bonding social cohesion\nlex: festivals community identity traditions\nvec: how do cultural festivals strengthen community bonds and social cohesion\nvec: what role do cultural celebrations play in bringing communities together\nhyde: Cultural festivals create shared experiences that reinforce collective identity. Studies show communities with regular festivals report higher levels of social trust and neighborly interaction. Events like Diwali, Carnival, and Lunar New Year bring together diverse groups through food, music, and ritual."}
+{"input": "how to follow election results", "output": "lex: follow election results live tracking\nlex: election night results coverage 2026\nvec: how can I follow live election results on election night\nvec: what websites and apps provide real-time election result tracking\nhyde: Follow live election results on the Associated Press (AP) election page, which aggregates official county-level results. Major outlets like CNN, NYT, and BBC offer interactive maps. Sign up for push notifications from news apps. Official state election websites post certified results."}
+{"input": "how to sell a car to a dealership?", "output": "lex: sell car dealership trade-in value\nlex: selling car dealer offer negotiation\nvec: how do I sell my used car to a dealership and get a fair price\nvec: what steps should I follow when trading in or selling a car to a dealer\nhyde: Get your car's market value from Kelley Blue Book or Edmunds before visiting a dealer. Clean the car, gather maintenance records, and bring the title. Get quotes from multiple dealers. The dealer will inspect the car, run a vehicle history report, and make an offer based on condition and mileage."}
+{"input": "what is a conductor in physics", "output": "lex: conductor physics electrical conductivity\nlex: electrical conductor materials electrons\nvec: what is an electrical conductor and how does it work in physics\nvec: what makes certain materials good conductors of electricity\nhyde: An electrical conductor is a material that allows electric current to flow freely through it. Metals like copper, silver, and aluminum are excellent conductors because they have free electrons in their outer shells that move easily when a voltage is applied. Conductivity depends on temperature and material structure."}
+{"input": "what is the significance of civil disobedience?", "output": "lex: civil disobedience significance history\nlex: civil disobedience Thoreau MLK Gandhi nonviolent protest\nvec: why is civil disobedience significant in political and social movements\nvec: how have acts of civil disobedience changed laws and society throughout history\nhyde: Civil disobedience—the deliberate, nonviolent refusal to obey unjust laws—has driven major social change. Thoreau coined the term in 1849; Gandhi used it to help end British rule in India; and Martin Luther King Jr. employed it during the American civil rights movement to challenge segregation."}
+{"input": "how to understand research articles", "output": "lex: understand research articles reading papers\nlex: read scientific journal article structure\nvec: how do I read and understand scientific research articles effectively\nvec: what strategy helps beginners comprehend academic journal papers\nhyde: Start by reading the abstract for the main findings. Then read the introduction for context and the conclusion for takeaways. Next, examine figures and tables. Finally, read methods and results in detail. Look up unfamiliar terms. Read the paper multiple times—comprehension improves with each pass."}
+{"input": "how to start a 401(k)", "output": "lex: 401k start retirement plan employer\nlex: 401k enrollment contribution match\nvec: how do I set up and start contributing to a 401(k) retirement plan\nvec: what are the steps to enroll in my employer's 401(k) plan\nhyde: Enroll through your employer's HR or benefits portal. Choose a contribution percentage—aim for at least enough to get the full employer match (typically 3-6% of salary). Select investment funds based on your retirement timeline. For 2026, the contribution limit is $23,500 ($31,000 if over 50)."}
+{"input": "how to organize a grassroots campaign", "output": "lex: grassroots campaign organizing strategy\nlex: grassroots organizing community mobilization\nvec: how do you organize a grassroots political or community campaign from scratch\nvec: what are the key steps in building a grassroots movement for a cause\nhyde: Start by defining your goal and identifying your base—who cares about this issue? Build a leadership team, create a volunteer database, and develop talking points. Use door-to-door canvassing, community meetings, social media, and petitions to grow support. Track commitments and follow up consistently."}
+{"input": "what are the fundamental teachings of sikhism?", "output": "lex: Sikhism fundamental teachings beliefs\nlex: Sikh Guru Nanak five articles of faith\nvec: what are the core beliefs and teachings of Sikhism\nvec: what did Guru Nanak and the Sikh Gurus teach about God and living\nhyde: Sikhism, founded by Guru Nanak in the 15th century Punjab, teaches belief in one God (Ik Onkar), equality of all people, honest living (kirat karni), sharing with others (vand chakko), and remembrance of God (naam japna). The Guru Granth Sahib is the eternal Guru and holy scripture."}
+{"input": "what are aboriginal dreamtime stories", "output": "lex: Aboriginal Dreamtime stories Australian Indigenous\nlex: Dreamtime creation mythology Aboriginal culture\nvec: what are Aboriginal Australian Dreamtime stories and what do they represent\nvec: how do Dreamtime stories explain creation and law in Aboriginal culture\nhyde: Dreamtime (or Dreaming) stories are the foundational narratives of Aboriginal Australian peoples. They describe how ancestral beings shaped the land, created animals and plants, and established laws and customs. These stories are passed down through oral tradition, song, dance, and art, and remain central to Indigenous identity."}
+{"input": "how do philosophers approach the meaning of life", "output": "lex: meaning of life philosophy existentialism\nlex: philosophers purpose existence meaning\nvec: how have different philosophers addressed the question of life's meaning\nvec: what do existentialist and other philosophical traditions say about the purpose of life\nhyde: Existentialists like Sartre argued life has no inherent meaning—we must create it through our choices. Aristotle proposed eudaimonia (flourishing) as life's purpose. Camus explored the absurd, suggesting we must find meaning despite an indifferent universe. Eastern philosophy often points to liberation from suffering."}
+{"input": "how to make compost at home?", "output": "lex: compost home DIY composting bin\nlex: composting kitchen scraps yard waste\nvec: how do I start composting food scraps and yard waste at home\nvec: what is the step-by-step process for making compost in a backyard bin\nhyde: Layer brown materials (dried leaves, cardboard) and green materials (kitchen scraps, grass clippings) in a 3:1 ratio. Keep the pile moist like a wrung-out sponge. Turn it every 1-2 weeks with a pitchfork. Avoid meat, dairy, and oils. Finished compost is dark, crumbly, and earthy-smelling in 2-6 months."}
+{"input": "how to reduce food waste?", "output": "lex: reduce food waste tips prevention\nlex: food waste reduction meal planning storage\nvec: how can I reduce food waste at home through planning and storage\nvec: what strategies help households throw away less food\nhyde: Plan meals weekly and shop with a list to avoid overbuying. Store produce properly—leafy greens in airtight containers, herbs in water. Use FIFO (first in, first out) in your fridge. Freeze leftovers and overripe fruit. Compost scraps you can't eat. The average household wastes 30% of purchased food."}
+{"input": "how to learn about native american culture", "output": "lex: Native American culture history learn\nlex: Indigenous peoples traditions tribal nations\nvec: how can I respectfully learn about Native American culture and history\nvec: what are good resources for understanding Indigenous peoples' traditions and heritage\nhyde: Visit the National Museum of the American Indian (Smithsonian) or local tribal cultural centers. Read works by Native authors like Joy Harjo, Tommy Orange, and Robin Wall Kimmerer. Attend powwows and cultural events when open to the public. Learn which tribal nations are indigenous to your area."}
+{"input": "how to participate in a town hall meeting", "output": "lex: town hall meeting participate attend\nlex: town hall public meeting local government\nvec: how do I attend and participate in a local town hall meeting\nvec: what should I know before speaking at a town hall meeting\nhyde: Check your local government website or social media for upcoming town hall schedules. Arrive early and sign up to speak if required. Prepare a concise statement (usually 2-3 minutes). Stay respectful and on-topic. Bring supporting data or personal stories to strengthen your point."}
+{"input": "how to choose a photo backdrop", "output": "lex: photo backdrop choose background photography\nlex: photography backdrop portrait studio\nvec: how do I choose the right backdrop for portrait or studio photography\nvec: what factors should I consider when selecting a photo backdrop\nhyde: Choose a backdrop that complements your subject without competing for attention. Solid colors (white, gray, black) are versatile for portraits. Muslin provides a painterly texture. For outdoor shoots, look for uncluttered backgrounds with good depth. Consider the color of your subject's clothing to avoid clashing."}
+{"input": "what is the nature of god in christianity", "output": "lex: nature of God Christianity Trinity\nlex: Christian God attributes Father Son Holy Spirit\nvec: how does Christianity describe the nature and attributes of God\nvec: what is the doctrine of the Trinity in Christian theology\nhyde: Christianity teaches that God is one being existing as three persons: the Father, the Son (Jesus Christ), and the Holy Spirit. This is the doctrine of the Trinity. God is described as omniscient, omnipotent, omnipresent, eternal, and perfectly good. God is both transcendent and personally involved in creation."}
+{"input": "how to scale a business", "output": "lex: scale business growth strategies\nlex: business scaling operations revenue expansion\nvec: how do you scale a business effectively while managing growth challenges\nvec: what strategies help companies expand operations and increase revenue\nhyde: Scaling requires repeatable processes, automation, and a strong team. Standardize operations with SOPs, invest in technology to reduce manual work, and hire ahead of demand. Monitor unit economics—ensure customer acquisition cost stays below lifetime value. Secure funding for growth through revenue, debt, or equity."}
+{"input": "what is yoga and its benefits", "output": "lex: yoga benefits health practice\nlex: yoga physical mental health flexibility stress\nvec: what is yoga and what physical and mental health benefits does it provide\nvec: how does regular yoga practice improve flexibility, strength, and well-being\nhyde: Yoga is an ancient practice combining physical postures (asanas), breathing techniques (pranayama), and meditation. Regular practice improves flexibility, builds strength, reduces stress and anxiety, lowers blood pressure, and enhances sleep quality. Styles range from gentle Hatha to vigorous Vinyasa and Ashtanga."}
+{"input": "how to get rid of self-limiting beliefs?", "output": "lex: self-limiting beliefs overcome remove\nlex: limiting beliefs mindset change techniques\nvec: how can I identify and overcome self-limiting beliefs that hold me back\nvec: what techniques help replace self-limiting beliefs with empowering ones\nhyde: Identify limiting beliefs by noticing recurring thoughts like \"I'm not smart enough\" or \"I don't deserve success.\" Challenge each belief: what evidence supports it? What evidence contradicts it? Replace it with a realistic affirmation. Take small actions that disprove the belief to build new neural pathways."}
+{"input": "how are seasons determined by geography", "output": "lex: seasons geography Earth axial tilt\nlex: seasons latitude hemisphere climate\nvec: how does geography and Earth's axial tilt determine the seasons\nvec: why do different parts of the world experience different seasons at the same time\nhyde: Seasons result from Earth's 23.5° axial tilt. As Earth orbits the Sun, the Northern and Southern Hemispheres alternately tilt toward or away from the Sun, varying the angle and duration of sunlight. Near the equator, seasons are minimal; at higher latitudes, seasonal variation is extreme."}
+{"input": "how to create a scalable business model", "output": "lex: scalable business model design\nlex: business model scalability revenue growth\nvec: how do you design a business model that scales efficiently with growth\nvec: what makes a business model scalable and what are common scalable model types\nhyde: A scalable business model increases revenue without proportional increases in costs. SaaS, marketplace, and platform models are inherently scalable. Key elements: low marginal cost per customer, automation of delivery, network effects, and recurring revenue. Test with a minimum viable product before scaling."}
+{"input": "can pets help reduce kids' anxiety?", "output": "lex: pets children anxiety reduction\nlex: pet therapy kids stress mental health\nvec: can having pets help reduce anxiety and stress in children\nvec: what research shows about the effect of pets on children's mental health\nhyde: Studies show that children with pets exhibit lower cortisol levels and reduced anxiety. A 2015 study in Preventing Chronic Disease found that children living with dogs had significantly lower rates of childhood anxiety. Petting an animal for 10 minutes reduces cortisol and increases oxytocin levels."}
+{"input": "date parse", "output": "lex: date parse string format\nlex: date parsing datetime library\nvec: how to parse date strings into date objects in programming\nvec: which libraries handle date parsing and formatting in JavaScript or Python\nhyde: In JavaScript, use `new Date('2025-01-15')` or `Date.parse()` for ISO strings. For complex formats, use `date-fns` parse function or `dayjs('12/25/2025', 'MM/DD/YYYY')`. In Python, use `datetime.strptime('2025-01-15', '%Y-%m-%d')` or the `dateutil.parser.parse()` function for flexible parsing."}
+{"input": "how do christians observe lent?", "output": "lex: Christians observe Lent fasting prayer\nlex: Lent Christian observance Ash Wednesday Easter\nvec: how do Christians observe the season of Lent before Easter\nvec: what are the traditional Lenten practices of fasting, prayer, and almsgiving\nhyde: Lent is a 40-day period before Easter beginning on Ash Wednesday. Christians observe it through fasting (abstaining from certain foods or luxuries), increased prayer, and almsgiving (charitable giving). Many give up a habit or take on a spiritual discipline. Catholic tradition requires abstaining from meat on Fridays."}
+{"input": "what are literary short stories?", "output": "lex: literary short stories fiction genre\nlex: short story literary fiction writers\nvec: what defines literary short stories as distinct from other fiction genres\nvec: what are the characteristics of literary short fiction and who are notable writers in the genre\nhyde: Literary short stories prioritize character development, thematic depth, and prose style over plot-driven entertainment. They often explore the human condition through interior conflict and ambiguity. Notable practitioners include Anton Chekhov, Alice Munro, Raymond Carver, and Jorge Luis Borges."}
+{"input": "thailand", "output": "lex: Thailand country travel Southeast Asia\nlex: Thailand Bangkok culture tourism\nvec: what should I know about Thailand as a travel destination or country\nvec: what are the key facts about Thailand's culture, geography, and tourist attractions\nhyde: Thailand is a Southeast Asian country known for tropical beaches, ornate temples, and rich cuisine. Bangkok is the capital. Popular destinations include Chiang Mai, Phuket, and the islands of Koh Samui and Phi Phi. Thai food staples include pad thai, green curry, and tom yum soup."}
+{"input": "how to do a flip on a trampoline", "output": "lex: trampoline flip backflip technique\nlex: trampoline flip tutorial safety\nvec: how do I safely learn to do a backflip on a trampoline\nvec: what is the proper technique for doing flips on a trampoline\nhyde: Start by mastering high, controlled bounces. Practice tucking your knees to your chest mid-air. For a backflip, bounce high, throw your arms back, tuck tightly, and spot your landing. Always practice on a trampoline with safety nets and a spotter. Progress from seat drops to back drops before attempting flips."}
+{"input": "how to efficiently use time at work?", "output": "lex: time management work productivity\nlex: efficient time work techniques scheduling\nvec: how can I manage my time more efficiently at work to increase productivity\nvec: what time management techniques help get more done during the workday\nhyde: Use time-blocking to schedule focused work in 90-minute intervals. Prioritize with the Eisenhower Matrix: do urgent-important tasks first, schedule important-not-urgent ones, delegate urgent-not-important tasks, and eliminate the rest. Batch similar tasks, limit meetings, and turn off notifications during deep work."}
+{"input": "what is venture capital funding", "output": "lex: venture capital funding investment startups\nlex: VC funding rounds Series A seed\nvec: what is venture capital and how does VC funding work for startups\nvec: what are the different stages of venture capital funding from seed to Series C\nhyde: Venture capital is equity financing provided to high-growth startups in exchange for ownership stakes. Funding stages include pre-seed, seed ($500K-$2M), Series A ($2-15M), Series B ($15-50M), and later rounds. VCs evaluate the team, market size, traction, and scalability before investing."}
+{"input": "app build", "output": "lex: app build compile deploy\nlex: mobile app build process configuration\nvec: how to build and compile a mobile or web application for deployment\nvec: what are the steps in the app build process and common build tools\nhyde: For mobile apps, use `xcodebuild` (iOS) or `./gradlew assembleRelease` (Android). For web apps, run `npm run build` or `vite build` to bundle and optimize assets. Configure environment variables, set the build target, and use CI/CD pipelines (GitHub Actions, CircleCI) for automated builds."}
+{"input": "how to build strong relationships?", "output": "lex: build strong relationships communication trust\nlex: healthy relationships skills connection\nvec: how do you build and maintain strong personal relationships\nvec: what habits and communication skills help strengthen relationships\nhyde: Strong relationships are built on trust, open communication, and mutual respect. Practice active listening—give full attention without planning your response. Express appreciation regularly. Handle conflicts by addressing issues directly without blame. Invest quality time and show up consistently during both good and hard times."}
+{"input": "when to start prenatal classes?", "output": "lex: prenatal classes start when pregnancy\nlex: childbirth education classes timing\nvec: when during pregnancy should I start taking prenatal classes\nvec: what is the recommended timing for beginning childbirth education classes\nhyde: Most experts recommend starting prenatal classes during the second trimester, around weeks 20-24, and completing them by week 36. Early classes cover nutrition, exercise, and fetal development. Later classes focus on labor stages, breathing techniques, pain management options, breastfeeding, and newborn care."}
+{"input": "how to choose kitchen cabinet hardware", "output": "lex: kitchen cabinet hardware handles knobs\nlex: cabinet hardware style finish selection\nvec: how do I choose the right handles and knobs for kitchen cabinets\nvec: what styles and finishes of kitchen cabinet hardware work with different designs\nhyde: Match hardware to your kitchen style: brushed nickel or stainless for modern kitchens, oil-rubbed bronze for traditional, brass for transitional. Use pulls (3-4 inches) on drawers and knobs on doors. Test ergonomics before buying in bulk. Standard mounting holes are 3 or 3.75 inches apart."}
+{"input": "what is the significance of the torah?", "output": "lex: Torah significance Judaism sacred text\nlex: Torah five books Moses Jewish law\nvec: what is the Torah and why is it significant in Judaism\nvec: what role does the Torah play in Jewish religious life and law\nhyde: The Torah comprises the five books of Moses (Genesis, Exodus, Leviticus, Numbers, Deuteronomy) and is the most sacred text in Judaism. It contains the 613 commandments (mitzvot), the creation narrative, and the covenant between God and the Israelites. It is read publicly in synagogue every week."}
+{"input": "test mock", "output": "lex: test mock unit testing\nlex: mock object stub spy testing\nvec: how to use mocks and stubs in unit testing\nvec: what are mock objects and how do they help isolate components in tests\nhyde: Mocks replace real dependencies with controlled objects during testing. In Python, use `unittest.mock.patch()` to replace a function. In JavaScript, use `jest.fn()` or `jest.spyOn()`. Mocks verify that methods were called with expected arguments. Stubs return fixed values; spies track calls without replacing behavior."}
+{"input": "how does culture influence identity?", "output": "lex: culture influence identity formation\nlex: cultural identity socialization values\nvec: how does culture shape a person's sense of identity\nvec: in what ways do cultural values and traditions influence who we become\nhyde: Culture shapes identity through language, traditions, values, and social norms internalized from childhood. Family, community, religion, and media all transmit cultural frameworks. Identity is constructed through negotiation between personal experiences and cultural expectations, creating a sense of belonging and self-understanding."}
+{"input": "how to be a good listener", "output": "lex: good listener active listening skills\nlex: listening skills empathy communication\nvec: how can I become a better and more active listener in conversations\nvec: what techniques improve listening skills and show empathy\nhyde: Active listening means giving full attention: maintain eye contact, put away distractions, and don't interrupt. Reflect back what you heard (\"It sounds like you're saying...\"). Ask open-ended questions to show interest. Avoid jumping to advice—sometimes people just need to feel heard. Validate their emotions."}
+{"input": "how to improve public speaking skills", "output": "lex: public speaking skills improve presentation\nlex: public speaking confidence practice tips\nvec: how can I improve my public speaking and overcome stage fright\nvec: what techniques help deliver confident and engaging presentations\nhyde: Join Toastmasters for regular practice in a supportive environment. Record yourself speaking and review for filler words and pacing. Structure talks with a clear opening hook, three key points, and a memorable close. Practice in front of friends. Manage nerves through deep breathing and visualization beforehand."}
+{"input": "log debug", "output": "lex: log debug logging level\nlex: debug logging output configuration\nvec: how to configure debug-level logging in an application\nvec: how to use log debug statements for troubleshooting code\nhyde: Set the log level to DEBUG to capture detailed diagnostic output. In Python: `logging.basicConfig(level=logging.DEBUG)`. In Node.js with winston: `logger.level = 'debug'`. In Java with SLF4J: configure logback.xml with `<root level=\"DEBUG\">`. Use debug logs for variable values, flow tracing, and conditional paths."}
+{"input": "what is the large hadron collider", "output": "lex: Large Hadron Collider LHC CERN\nlex: LHC particle accelerator Higgs boson\nvec: what is the Large Hadron Collider and what has it discovered\nvec: how does the LHC at CERN work to study particle physics\nhyde: The Large Hadron Collider (LHC) at CERN near Geneva is the world's largest and most powerful particle accelerator. It accelerates protons to near light speed in a 27-kilometer ring and collides them to study fundamental particles. In 2012, it confirmed the existence of the Higgs boson."}
+{"input": "what is the significance of worship practices?", "output": "lex: worship practices significance religion\nlex: worship rituals prayer spiritual meaning\nvec: what is the significance of worship practices across different religions\nvec: why do religious communities engage in rituals, prayer, and worship\nhyde: Worship practices—prayer, ritual, song, and meditation—serve to connect individuals with the divine, reinforce communal identity, and express gratitude and devotion. In Christianity, worship centers on liturgy and sacraments; in Islam, the five daily prayers (salat); in Hinduism, puja and temple ceremonies."}
+{"input": "what are fair trade products?", "output": "lex: fair trade products certification\nlex: fair trade coffee chocolate ethical\nvec: what are fair trade products and how does fair trade certification work\nvec: what does the fair trade label mean for farmers and consumers\nhyde: Fair trade products are goods certified to meet standards ensuring producers in developing countries receive fair prices, safe working conditions, and sustainable practices. Common fair trade products include coffee, chocolate, tea, bananas, and cotton. Look for the Fairtrade International or Fair Trade USA label."}
+{"input": "what is the significance of community in ethics", "output": "lex: community ethics significance moral philosophy\nlex: communitarian ethics social responsibility\nvec: what role does community play in ethical theory and moral life\nvec: how does communitarian philosophy view the relationship between community and ethics\nhyde: Communitarian ethics argues that moral reasoning is rooted in community values and shared traditions, not just individual rights. Philosophers like Alasdair MacIntyre and Charles Taylor emphasize that virtues and moral identity are shaped by the communities in which we participate."}
+{"input": "what are index funds", "output": "lex: index funds investing passive\nlex: index fund S&P 500 ETF low cost\nvec: what are index funds and why are they popular for investing\nvec: how do index funds work and what are their advantages over actively managed funds\nhyde: An index fund is a type of mutual fund or ETF that tracks a market index like the S&P 500. It holds all (or a representative sample of) the stocks in that index. Index funds offer broad diversification, low expense ratios (typically 0.03-0.20%), and historically outperform most actively managed funds."}
+{"input": "what is hinduism", "output": "lex: Hinduism religion beliefs practices\nlex: Hindu dharma gods Vedas karma reincarnation\nvec: what is Hinduism and what are its main beliefs and practices\nvec: what do Hindus believe about God, karma, and the cycle of rebirth\nhyde: Hinduism is one of the world's oldest religions, originating in the Indian subcontinent. It encompasses diverse beliefs but key concepts include dharma (duty), karma (action and consequence), samsara (cycle of rebirth), and moksha (liberation). Sacred texts include the Vedas, Upanishads, and Bhagavad Gita."}
+{"input": "what is sufism?", "output": "lex: Sufism Islamic mysticism spiritual\nlex: Sufi practices dhikr whirling dervishes\nvec: what is Sufism and how does it relate to Islam\nvec: what are the spiritual practices and beliefs of Sufi mystics\nhyde: Sufism is the mystical dimension of Islam, emphasizing the inward search for God and the purification of the soul. Sufis practice dhikr (repetitive remembrance of God), meditation, and poetry to achieve closeness to the divine. Rumi and Al-Ghazali are among the most famous Sufi masters."}
+{"input": "how to outline a novel", "output": "lex: outline novel plot structure\nlex: novel outline writing planning chapters\nvec: how do I create an outline for writing a novel\nvec: what methods do authors use to plan and structure a novel before writing\nhyde: Start with a one-sentence premise, then expand to a paragraph summary. Use the three-act structure: setup, confrontation, resolution. Create character profiles with goals and arcs. Write a chapter-by-chapter outline with scene goals. Methods include the Snowflake Method, Save the Cat beat sheet, or index cards on a corkboard."}
+{"input": "what is the role of the who in pandemics", "output": "lex: WHO World Health Organization pandemic role\nlex: WHO pandemic response disease outbreak\nvec: what role does the World Health Organization play during pandemics\nvec: how does the WHO coordinate international responses to disease outbreaks\nhyde: The World Health Organization (WHO) coordinates international pandemic response by issuing health guidelines, declaring Public Health Emergencies of International Concern (PHEIC), distributing vaccines through COVAX, providing technical assistance to countries, and monitoring disease surveillance data from member states."}
+{"input": "how are glaciers formed", "output": "lex: glacier formation process ice\nlex: glaciers formed snow compaction accumulation\nvec: how do glaciers form from accumulated snow and ice over time\nvec: what is the process of glacier formation and movement\nhyde: Glaciers form when annual snowfall exceeds snowmelt over many years. The accumulated snow compresses into firn (granular ice) and eventually into dense glacial ice. When the ice mass becomes thick enough, gravity causes it to flow slowly downhill. This process takes decades to centuries."}
+{"input": "how to ensure research reproducibility", "output": "lex: research reproducibility replication methods\nlex: reproducible research data sharing protocols\nvec: how do researchers ensure their studies are reproducible by others\nvec: what practices improve the reproducibility and replication of scientific research\nhyde: Ensure reproducibility by pre-registering your study, sharing raw data and analysis code in public repositories (e.g., GitHub, Zenodo), documenting every methodological step, using version control, and providing computational environments (Docker containers). Report all results, including null findings."}
+{"input": "how do different religions view angels?", "output": "lex: angels religions Christianity Islam Judaism\nlex: angels religious beliefs spiritual beings\nvec: how do different religions like Christianity, Islam, and Judaism view angels\nvec: what roles do angels play across major world religions\nhyde: In Christianity, angels are messengers of God (e.g., Gabriel, Michael) who serve as protectors and intermediaries. Islam teaches that angels (mala'ika) are created from light and include Jibril (Gabriel) who delivered the Quran. Judaism describes angels as divine agents carrying out God's will in the Hebrew Bible."}
+{"input": "how does the social contract theory explain governance", "output": "lex: social contract theory governance political philosophy\nlex: social contract Hobbes Locke Rousseau\nvec: how does social contract theory explain the legitimacy of government\nvec: what did Hobbes, Locke, and Rousseau argue about the social contract and governance\nhyde: Social contract theory holds that governments derive legitimacy from the consent of the governed. Hobbes argued people surrender freedoms to a sovereign for security. Locke emphasized natural rights to life, liberty, and property, with government protecting them. Rousseau proposed the general will as the basis for collective governance."}
+{"input": "how to use trekking poles", "output": "lex: trekking poles hiking technique\nlex: trekking poles adjustment grip walking\nvec: how do you properly use trekking poles while hiking\nvec: what is the correct technique for adjusting and using trekking poles on trails\nhyde: Adjust pole length so your elbow is at 90° on flat ground. Shorten poles for uphill, lengthen for downhill. Plant the pole opposite your stepping foot. Use wrist straps for support—push down through the strap, not the grip. On steep descents, poles reduce knee impact by up to 25%."}
+{"input": "how does blockchain technology work", "output": "lex: blockchain technology distributed ledger\nlex: blockchain cryptography decentralized consensus\nvec: how does blockchain technology work at a technical level\nvec: what are the key components of blockchain like blocks, hashing, and consensus mechanisms\nhyde: A blockchain is a distributed ledger where transactions are grouped into blocks. Each block contains a cryptographic hash of the previous block, creating an immutable chain. Nodes validate transactions through consensus mechanisms like Proof of Work or Proof of Stake. No central authority controls the network."}
+{"input": "how to plant a wildflower meadow?", "output": "lex: wildflower meadow planting seeds\nlex: plant wildflower meadow soil preparation native\nvec: how do I plant and establish a wildflower meadow in my yard\nvec: what steps are needed to create a wildflower meadow from seed\nhyde: Clear existing vegetation by mowing low and raking away debris. Loosen the top inch of soil. Mix wildflower seeds with sand for even distribution and scatter in fall or early spring. Press seeds into soil but don't cover them—most need light to germinate. Water gently until established. Avoid fertilizer, which favors grasses."}
+{"input": "how to engage in civil political discussions", "output": "lex: civil political discussion respectful debate\nlex: political conversation etiquette disagreement\nvec: how can I have respectful and productive political discussions with people who disagree\nvec: what strategies help keep political conversations civil and constructive\nhyde: Start by listening to understand, not to rebut. Ask questions like \"What experiences led you to that view?\" Avoid personal attacks and generalizations. Find common ground before addressing differences. Use \"I\" statements instead of \"you always\" accusations. Accept that changing minds takes time and repeated respectful engagement."}
+{"input": "where to watch super bowl 2024", "output": "lex: super bowl 2024 streaming channel\nlex: super bowl LVIII broadcast network\nlex: watch super bowl 2024 live\nvec: what channel or streaming service is broadcasting Super Bowl 2024\nvec: where can I watch the 2024 Super Bowl LVIII game live online\nhyde: Super Bowl LVIII airs on CBS on February 11, 2024. You can stream it live on Paramount+ or through the CBS Sports app. Kickoff is at 6:30 PM ET from Allegiant Stadium in Las Vegas."}
+{"input": "what is the mind-body problem", "output": "lex: mind-body problem philosophy\nlex: dualism consciousness physicalism\nlex: mental states physical brain\nvec: what is the philosophical mind-body problem and why is it difficult to solve\nvec: how do philosophers explain the relationship between consciousness and the physical brain\nhyde: The mind-body problem asks how mental states like thoughts, feelings, and consciousness relate to physical states of the brain. Descartes proposed substance dualism, arguing mind and body are fundamentally different substances."}
+{"input": "how to report scientific findings", "output": "lex: scientific findings report writing\nlex: research results publication format\nlex: academic paper methodology results\nvec: how should scientists structure and report their research findings in a paper\nvec: what is the standard format for reporting results in a scientific publication\nhyde: When reporting scientific findings, organize your paper into Introduction, Methods, Results, and Discussion (IMRaD). Present results with tables and figures, include statistical analyses, and state findings objectively before interpreting them."}
+{"input": "code test", "output": "lex: software unit testing framework\nlex: code testing automated tests\nlex: test-driven development TDD\nvec: how to write and run automated tests for software code\nvec: what are the common approaches to testing code including unit tests and integration tests\nhyde: Unit tests verify individual functions in isolation. Use a testing framework like Jest, pytest, or JUnit to write assertions that check expected outputs against actual results. Run tests with `npm test` or `pytest`."}
+{"input": "what is human rights", "output": "lex: human rights definition universal declaration\nlex: fundamental human rights UDHR\nlex: civil political economic social rights\nvec: what are human rights and what does the Universal Declaration of Human Rights guarantee\nvec: what fundamental freedoms and protections are considered universal human rights\nhyde: Human rights are inherent rights belonging to every person regardless of nationality, sex, ethnicity, or religion. The Universal Declaration of Human Rights (1948) established 30 articles covering civil, political, economic, social, and cultural rights."}
+{"input": "what is the function of dna", "output": "lex: DNA function genetic information\nlex: deoxyribonucleic acid protein synthesis\nlex: DNA replication transcription translation\nvec: what role does DNA play in storing and transmitting genetic information in cells\nvec: how does DNA encode instructions for building proteins in living organisms\nhyde: DNA stores the genetic instructions needed for the development and functioning of all living organisms. It encodes genes as sequences of nucleotide bases (A, T, G, C) that are transcribed into RNA and translated into proteins."}
+{"input": "how to advocate for a cause", "output": "lex: cause advocacy strategies campaigning\nlex: grassroots advocacy organizing\nlex: political advocacy lobbying petition\nvec: what are effective ways to advocate and campaign for a social or political cause\nvec: how can individuals organize and mobilize support for a cause they care about\nhyde: Start by clearly defining your cause and goals. Build a coalition of supporters, create a compelling message, and use multiple channels: social media, petitions, letters to legislators, public events, and media outreach to amplify your message."}
+{"input": "how to grow blueberries at home?", "output": "lex: grow blueberries home garden\nlex: blueberry bush planting acidic soil\nlex: container blueberry growing care\nvec: how do I plant and care for blueberry bushes in my home garden\nvec: what soil pH and conditions do blueberries need to grow well at home\nhyde: Blueberries thrive in acidic soil with a pH of 4.5-5.5. Plant in full sun with well-drained soil amended with peat moss. Space bushes 4-6 feet apart and mulch with pine needles. Water regularly and prune dead wood in late winter."}
+{"input": "what causes market volatility", "output": "lex: stock market volatility causes\nlex: financial market fluctuations economic factors\nlex: market volatility interest rates inflation\nvec: what economic and geopolitical factors cause stock market volatility\nvec: why do financial markets experience sudden price swings and instability\nhyde: Market volatility is driven by economic data releases, interest rate changes, geopolitical events, earnings surprises, and investor sentiment. High uncertainty about inflation, central bank policy, or political instability increases price fluctuations across asset classes."}
+{"input": "what is the importance of spiritual leadership?", "output": "lex: spiritual leadership organizations values\nlex: spiritual leadership workplace meaning purpose\nvec: how does spiritual leadership influence organizations and their members\nvec: what role does spiritual leadership play in providing meaning and purpose at work\nhyde: Spiritual leadership theory proposes that leaders who foster a sense of calling, meaning, and membership create more engaged and productive organizations. It emphasizes vision, altruistic love, and hope as core values that transcend traditional management."}
+{"input": "what is the paris agreement", "output": "lex: Paris Agreement climate change 2015\nlex: Paris climate accord greenhouse gas emissions\nlex: Paris Agreement temperature goals\nvec: what is the Paris Agreement and what are its goals for addressing climate change\nvec: what commitments did countries make under the 2015 Paris climate accord\nhyde: The Paris Agreement is a legally binding international treaty on climate change adopted in 2015. Its goal is to limit global warming to well below 2°C, preferably 1.5°C, above pre-industrial levels. Countries submit nationally determined contributions (NDCs) outlining emission reduction targets."}
+{"input": "how to enhance customer engagement", "output": "lex: customer engagement strategies retention\nlex: increase customer interaction loyalty\nlex: customer engagement marketing personalization\nvec: what strategies can businesses use to improve customer engagement and loyalty\nvec: how can companies create more meaningful interactions with their customers\nhyde: Personalize communications using customer data and segmentation. Implement loyalty programs, respond promptly on social media, send targeted email campaigns, and gather feedback through surveys. Omnichannel engagement ensures consistent experience across touchpoints."}
+{"input": "how to encourage children to read?", "output": "lex: encourage children reading habits\nlex: kids reading motivation tips\nlex: children literacy books engagement\nvec: what strategies help encourage children to develop a love of reading\nvec: how can parents motivate reluctant children to read more books\nhyde: Read aloud to children daily from an early age. Let them choose their own books based on interests. Create a cozy reading nook, visit the library regularly, and set a family reading time. Avoid using reading as punishment; make it enjoyable."}
+{"input": "what is base jumping?", "output": "lex: base jumping extreme sport parachute\nlex: BASE jump fixed object skydiving\nlex: base jumping wingsuit cliff\nvec: what is BASE jumping and how does it differ from skydiving\nvec: what does BASE stand for and what are the risks of base jumping\nhyde: BASE jumping involves parachuting from fixed objects: Buildings, Antennas, Spans (bridges), and Earth (cliffs). Unlike skydiving from aircraft, BASE jumps occur at much lower altitudes, giving jumpers only seconds to deploy their parachute."}
+{"input": "how to clean car engine bay?", "output": "lex: clean car engine bay degreaser\nlex: engine bay detailing wash\nlex: engine compartment cleaning steps\nvec: what is the safest way to clean and degrease a car engine bay\nvec: step by step process to clean under the hood of a car\nhyde: Cover sensitive electrical components with plastic bags. Apply engine degreaser to the entire bay, let it sit 5-10 minutes, then agitate with a brush. Rinse with low-pressure water, avoiding direct spray on the alternator, fuse box, and air intake."}
+{"input": "how to manage sibling rivalry?", "output": "lex: sibling rivalry management parenting\nlex: brothers sisters fighting conflict\nlex: sibling jealousy fairness strategies\nvec: how can parents effectively manage fighting and rivalry between siblings\nvec: what are proven strategies to reduce sibling conflict and jealousy\nhyde: Avoid comparing siblings to each other. Give each child individual attention and acknowledge their unique strengths. Teach conflict resolution skills rather than always intervening. Set clear family rules about respectful behavior and let children solve minor disputes themselves."}
+{"input": "how to build a raised garden bed?", "output": "lex: build raised garden bed DIY\nlex: raised bed construction lumber soil\nlex: raised garden bed plans dimensions\nvec: how do I build a raised garden bed from wood step by step\nvec: what materials and dimensions work best for a DIY raised garden bed\nhyde: Cut four boards of untreated cedar or redwood to size: two at 4 feet and two at 8 feet for a standard 4x8 bed. Screw corners together with deck screws. Place on level ground, line the bottom with cardboard, and fill with a mix of topsoil, compost, and peat moss."}
+{"input": "what is the g7", "output": "lex: G7 group of seven nations\nlex: G7 summit member countries\nlex: G7 economic political alliance\nvec: what is the G7 and which countries are members of this international group\nvec: what role does the Group of Seven play in global economic and political governance\nhyde: The G7 (Group of Seven) is an intergovernmental forum of seven major advanced economies: Canada, France, Germany, Italy, Japan, the United Kingdom, and the United States. The EU also participates. Members meet annually to discuss global economic policy, security, and trade."}
+{"input": "what is the role of choice in ethics?", "output": "lex: choice ethics moral philosophy\nlex: free will moral responsibility\nlex: ethical decision-making autonomy\nvec: what role does personal choice play in moral philosophy and ethical responsibility\nvec: how do ethicists view free will and autonomous choice in determining moral accountability\nhyde: Choice is central to ethics because moral responsibility presupposes the ability to choose freely. Aristotle argued that virtuous action requires deliberate choice (prohairesis). Without genuine alternatives, praise and blame lose their foundation."}
+{"input": "home fix", "output": "lex: home repair DIY fix\nlex: house maintenance common repairs\nlex: home improvement handyman tasks\nvec: how to do common home repairs and fixes yourself\nvec: what are typical household problems and how to fix them without a professional\nhyde: Common DIY home repairs include fixing leaky faucets, patching drywall holes, unclogging drains, replacing light switches, re-caulking bathrooms, and fixing squeaky doors. Most require only basic tools: screwdriver, pliers, wrench, and putty knife."}
+{"input": "what should i wear hiking?", "output": "lex: hiking clothing layers gear\nlex: hiking outfit shoes weather\nlex: what to wear hiking trail\nvec: what is the best clothing to wear for a day hike in different weather conditions\nvec: how should I layer my clothes for hiking to stay comfortable\nhyde: Dress in moisture-wicking layers: a synthetic or merino wool base layer, an insulating mid layer like fleece, and a waterproof shell. Wear sturdy hiking boots or trail shoes with wool socks. Avoid cotton, which retains moisture and causes chafing."}
+{"input": "what are the main tenets of jainism?", "output": "lex: Jainism main tenets principles\nlex: Jain beliefs ahimsa non-violence\nlex: Jainism five vows anekantavada\nvec: what are the core beliefs and principles of the Jain religion\nvec: what are the five main vows and philosophical tenets of Jainism\nhyde: Jainism centers on three jewels: right faith, right knowledge, and right conduct. Its five vows are ahimsa (non-violence), satya (truth), asteya (non-stealing), brahmacharya (chastity), and aparigraha (non-attachment). Jains believe in karma and the soul's liberation through self-discipline."}
+{"input": "what is universal healthcare", "output": "lex: universal healthcare single payer system\nlex: universal health coverage public insurance\nlex: universal healthcare countries policy\nvec: what is universal healthcare and how do different countries implement it\nvec: how does a universal healthcare system provide coverage to all citizens\nhyde: Universal healthcare ensures all residents have access to medical services without financial hardship. Models vary: single-payer systems (Canada), national health services (UK's NHS), and mandatory insurance systems (Germany). Funding comes through taxes or mandatory premiums."}
+{"input": "where to buy rare plant seeds?", "output": "lex: buy rare plant seeds online\nlex: rare exotic seed suppliers shop\nlex: unusual heirloom seeds catalog\nvec: where can I purchase rare and exotic plant seeds online\nvec: what are reputable suppliers for hard-to-find and unusual plant seeds\nhyde: Specialty seed suppliers for rare plants include Baker Creek Heirloom Seeds, Chiltern Seeds, Plant World Seeds, and Rare Seeds. Online marketplaces like Etsy also have independent growers selling unusual varieties. Check import regulations for international orders."}
+{"input": "how to kayak for the first time", "output": "lex: beginner kayaking first time tips\nlex: kayak basics paddling technique\nlex: learn kayaking beginner guide\nvec: what should a beginner know before going kayaking for the first time\nvec: how do I paddle and balance a kayak as a first-time kayaker\nhyde: For your first kayak outing, choose calm, flat water like a lake or slow river. Adjust the foot pegs so your knees are slightly bent. Hold the paddle with hands shoulder-width apart, knuckles aligned with the blade edge. Use torso rotation, not just arms, for each stroke."}
+{"input": "what are the major teachings in rumi's poetry?", "output": "lex: Rumi poetry teachings themes\nlex: Rumi Sufi mysticism divine love\nlex: Rumi Masnavi spiritual wisdom\nvec: what are the central spiritual and philosophical themes in Rumi's poems\nvec: what does Rumi teach about love, the soul, and union with the divine\nhyde: Rumi's poetry centers on divine love as the path to spiritual union with God. His Masnavi explores themes of longing, surrender, and the dissolution of the ego. He uses metaphors of wine, the beloved, and the reed flute to express the soul's yearning for its source."}
+{"input": "what is the purpose of a pilgrimage", "output": "lex: pilgrimage purpose religious spiritual\nlex: pilgrimage meaning journey sacred site\nvec: what is the spiritual purpose of making a pilgrimage to a sacred site\nvec: why do people of different religions undertake pilgrimages\nhyde: A pilgrimage is a sacred journey to a holy site undertaken for spiritual renewal, penance, or devotion. In Islam, Hajj to Mecca is obligatory. Christians walk the Camino de Santiago. Hindus visit Varanasi. The journey itself is seen as transformative, not just the destination."}
+{"input": "craigslist ads", "output": "lex: Craigslist ads posting classified\nlex: Craigslist listings buy sell\nlex: Craigslist marketplace local ads\nvec: how to post and browse classified ads on Craigslist\nvec: how does Craigslist work for buying, selling, and listing items locally\nhyde: To post a Craigslist ad, go to craigslist.org, select your city, and click \"create a posting.\" Choose a category (for sale, housing, jobs, services), write a clear title and description, add photos, and set your price. Most postings are free for individuals."}
+{"input": "what is a primary election", "output": "lex: primary election definition process\nlex: primary election presidential nomination\nlex: open closed primary voting\nvec: what is a primary election and how does it determine party nominees\nvec: how do primary elections work in the United States political system\nhyde: A primary election is a vote held by a political party to choose its candidates for the general election. In a closed primary, only registered party members can vote. In an open primary, any registered voter may participate regardless of party affiliation."}
+{"input": "what was the role of the catholic church in the middle ages?", "output": "lex: Catholic Church Middle Ages role\nlex: medieval church political power papacy\nlex: Catholic Church feudalism education medieval\nvec: what political, social, and cultural role did the Catholic Church play during the Middle Ages\nvec: how did the Catholic Church influence governance, education, and daily life in medieval Europe\nhyde: The Catholic Church was the dominant institution in medieval Europe. It controlled vast lands, collected tithes, and wielded political power through the papacy. The Church ran schools and universities, preserved classical texts in monasteries, and regulated moral life through canon law and sacraments."}
+{"input": "what to pack in a hospital bag for labor?", "output": "lex: hospital bag labor delivery packing list\nlex: what to bring hospital birth bag\nlex: labor bag essentials mother baby\nvec: what items should I pack in my hospital bag before going into labor\nvec: what is a complete packing checklist for the hospital for giving birth\nhyde: Hospital bag essentials for labor: ID and insurance card, birth plan, comfortable robe or gown, slippers, toiletries, phone charger, going-home outfit for you and baby, car seat, nursing bra, newborn diapers, snacks, and a pillow from home."}
+{"input": "how international trade agreements affect local economies", "output": "lex: international trade agreements local economy impact\nlex: trade deal tariff local jobs wages\nlex: free trade agreement economic effects\nvec: how do international trade agreements impact jobs and economies at the local level\nvec: what are the positive and negative effects of free trade agreements on local industries\nhyde: Trade agreements lower tariffs and open markets, which can reduce consumer prices and expand exports. However, local industries that cannot compete with cheaper imports may shrink, leading to job losses in manufacturing regions. The net effect depends on the economy's structure and adjustment policies."}
+{"input": "what is the ring of fire", "output": "lex: Ring of Fire Pacific Ocean volcanoes\nlex: Pacific Ring of Fire earthquakes tectonic\nlex: ring of fire map plate boundaries\nvec: what is the Pacific Ring of Fire and why does it have so many earthquakes and volcanoes\nvec: which tectonic plates form the Ring of Fire around the Pacific Ocean\nhyde: The Ring of Fire is a 40,000 km horseshoe-shaped zone around the Pacific Ocean where about 75% of the world's volcanoes and 90% of earthquakes occur. It follows boundaries of tectonic plates including the Pacific, Nazca, and Philippine Sea plates."}
+{"input": "how does relativism differ from absolutism", "output": "lex: moral relativism absolutism difference\nlex: ethical relativism vs moral absolutism\nlex: relativism absolutism philosophy comparison\nvec: what is the philosophical difference between moral relativism and moral absolutism\nvec: how do relativists and absolutists disagree about the nature of moral truth\nhyde: Moral absolutism holds that certain actions are universally right or wrong regardless of context or culture. Moral relativism argues that moral judgments are not universal but depend on cultural, social, or personal frameworks. Absolutists point to human rights; relativists emphasize cultural diversity."}
+{"input": "how to harvest rainwater for gardening?", "output": "lex: rainwater harvesting garden setup\nlex: rain barrel collection irrigation\nlex: harvest rainwater system DIY\nvec: how can I set up a rainwater collection system to water my garden\nvec: what equipment do I need to harvest rainwater for garden irrigation\nhyde: Install a rain barrel or cistern under a downspout to collect roof runoff. Use a first-flush diverter to discard initial dirty water. A screen keeps debris and mosquitoes out. Connect a spigot or hose at the bottom for gravity-fed garden irrigation. A 1,000 sq ft roof yields ~600 gallons per inch of rain."}
+{"input": "what is the significance of the sacred tree in various faiths?", "output": "lex: sacred tree symbolism religion\nlex: tree of life world tree spiritual traditions\nlex: sacred trees Buddhism Hinduism Christianity Norse\nvec: what role do sacred trees play in the religious symbolism of different faiths\nvec: how are trees like the Bodhi tree and Yggdrasil significant in world religions\nhyde: Sacred trees appear across religions: the Bodhi tree where Buddha attained enlightenment, the Tree of Life in Genesis, Yggdrasil in Norse mythology connecting the nine worlds, and the banyan in Hinduism symbolizing eternal life. Trees represent growth, connection between earth and heaven, and renewal."}
+{"input": "code dep", "output": "lex: code dependency management\nlex: software dependency package manager\nlex: dependency resolution version conflicts\nvec: how to manage code dependencies and packages in a software project\nvec: what tools help resolve and manage dependencies in programming\nhyde: Dependency management tools track and install external libraries your code relies on. Package managers like npm (JavaScript), pip (Python), and cargo (Rust) resolve version conflicts, maintain lock files, and ensure reproducible builds across environments."}
+{"input": "what is the concept of rebirth in buddhism?", "output": "lex: rebirth Buddhism reincarnation concept\nlex: Buddhist rebirth samsara karma cycle\nlex: rebirth reincarnation Buddhism difference\nvec: how does Buddhism explain the concept of rebirth and the cycle of samsara\nvec: what is the difference between rebirth in Buddhism and reincarnation in Hinduism\nhyde: In Buddhism, rebirth is not the transmigration of a fixed soul but the continuation of a stream of consciousness shaped by karma. Beings cycle through samsara—the realms of existence—until achieving nirvana. Unlike Hindu reincarnation, Buddhism denies a permanent self (anatta) that transfers between lives."}
+{"input": "cultural iconography", "output": "lex: cultural iconography symbols art\nlex: iconographic symbols meaning culture\nlex: visual symbolism iconography history\nvec: what is cultural iconography and how are visual symbols used to convey meaning across cultures\nvec: how do art historians study and interpret iconographic symbols in different cultural traditions\nhyde: Cultural iconography studies the identification and interpretation of visual symbols in art and media. Icons like the Christian cross, Buddhist lotus, or American bald eagle carry layered meanings shaped by history, religion, and politics. Erwin Panofsky formalized iconographic analysis in three levels."}
+{"input": "current trends in ai research", "output": "lex: AI research trends 2025 2026\nlex: artificial intelligence latest developments\nlex: machine learning LLM multimodal research\nvec: what are the most important current trends and breakthroughs in AI research in 2025-2026\nvec: what directions is artificial intelligence research heading in areas like large language models and multimodal AI\nhyde: Key AI research trends in 2025-2026 include scaling reasoning models, multimodal foundation models combining text, image, and video, AI agents that use tools autonomously, efficient fine-tuning methods like LoRA, and alignment research on safety and interpretability."}
+{"input": "how artificial intelligence is used in healthcare", "output": "lex: AI healthcare applications medical\nlex: artificial intelligence diagnosis treatment\nlex: machine learning medical imaging drug discovery\nvec: how is artificial intelligence being applied in healthcare for diagnosis and treatment\nvec: what are the main uses of AI and machine learning in the medical field\nhyde: AI in healthcare is used for medical image analysis (detecting tumors in radiology scans), drug discovery (predicting molecular interactions), clinical decision support, electronic health record analysis, robotic surgery assistance, and predicting patient outcomes in intensive care."}
+{"input": "what is gothic literature?", "output": "lex: gothic literature definition genre\nlex: gothic fiction horror romance 18th century\nlex: gothic novel characteristics examples\nvec: what defines gothic literature as a genre and what are its key characteristics\nvec: what are the origins and major works of gothic fiction\nhyde: Gothic literature is a genre that combines horror, romance, and mystery, originating with Horace Walpole's The Castle of Otranto (1764). Characteristics include gloomy settings (castles, ruins), supernatural elements, heightened emotion, and themes of decay, madness, and the sublime."}
+{"input": "how to foster inclusivity in interactions?", "output": "lex: foster inclusivity interactions communication\nlex: inclusive language behavior workplace\nlex: diversity inclusion interpersonal skills\nvec: how can I be more inclusive in my daily interactions with diverse people\nvec: what communication strategies foster inclusivity and make everyone feel welcome\nhyde: Use people's correct names and pronouns. Practice active listening without interrupting. Avoid assumptions based on appearance. Invite quieter voices into conversations. Be aware of cultural differences in communication styles. Acknowledge and address microaggressions when they occur."}
+{"input": "how to prune hydrangeas?", "output": "lex: prune hydrangeas when how\nlex: hydrangea pruning guide timing\nlex: cut back hydrangea old new wood\nvec: when and how should I prune different types of hydrangeas\nvec: what is the correct pruning technique for hydrangeas that bloom on old versus new wood\nhyde: Pruning depends on the hydrangea type. Bigleaf (H. macrophylla) and oakleaf hydrangeas bloom on old wood—prune just after flowering in summer. Panicle (H. paniculata) and smooth (H. arborescens) bloom on new wood—prune in late winter. Remove dead stems to the base and cut back to a pair of healthy buds."}
+{"input": "how do philosophers address moral ambiguity", "output": "lex: moral ambiguity philosophy ethics\nlex: ethical dilemma moral uncertainty philosophers\nlex: moral gray area philosophical perspectives\nvec: how do different philosophical traditions deal with situations of moral ambiguity\nvec: what do philosophers say about making ethical decisions when right and wrong are unclear\nhyde: Philosophers address moral ambiguity through competing frameworks. Utilitarians weigh outcomes, deontologists look to duties and rules, and virtue ethicists ask what a person of good character would do. Moral particularists argue each situation is unique and cannot be reduced to universal principles."}
+{"input": "what is a bildungsroman", "output": "lex: bildungsroman definition coming-of-age novel\nlex: bildungsroman literary genre examples\nlex: bildungsroman character development growth\nvec: what is a bildungsroman and what are the defining features of this literary genre\nvec: what are famous examples of bildungsroman or coming-of-age novels in literature\nhyde: A bildungsroman is a novel that follows the psychological and moral growth of a protagonist from youth to adulthood. The genre originated in German literature with Goethe's Wilhelm Meister's Apprenticeship. Classic examples include Jane Eyre, David Copperfield, and The Catcher in the Rye."}
+{"input": "thai cooking classes online", "output": "lex: Thai cooking class online course\nlex: learn Thai cuisine virtual cooking\nlex: Thai food cooking lesson video\nvec: where can I take online Thai cooking classes to learn authentic Thai cuisine\nvec: what are the best virtual courses for learning to cook Thai food at home\nhyde: Online Thai cooking classes teach dishes like pad thai, green curry, tom yum soup, and mango sticky rice. Platforms include Udemy, Skillshare, and dedicated sites like Hot Thai Kitchen. Live Zoom classes with Thai chefs offer real-time guidance on techniques and ingredient sourcing."}
+{"input": "how automation affects employment", "output": "lex: automation employment impact jobs\nlex: automation job displacement workforce\nlex: robots AI replacing workers labor market\nvec: how does increasing automation and robotics affect employment and job availability\nvec: what impact does workplace automation have on different types of jobs and wages\nhyde: Automation displaces routine manual and cognitive tasks but creates new roles in technology maintenance, programming, and oversight. Studies estimate 14% of jobs are highly automatable. Workers in manufacturing, data entry, and transportation face the highest displacement risk, while creative and interpersonal roles are less affected."}
+{"input": "what is a moral compass", "output": "lex: moral compass definition ethics\nlex: moral compass inner sense right wrong\nlex: personal values moral guidance\nvec: what does it mean to have a moral compass and how does it guide ethical behavior\nvec: how do people develop an internal sense of right and wrong known as a moral compass\nhyde: A moral compass is a person's internal sense of right and wrong that guides their decisions and behavior. It is shaped by upbringing, culture, religious beliefs, education, and personal experience. It acts as an ethical guide when facing difficult choices without clear external rules."}
+{"input": "how to set financial goals", "output": "lex: set financial goals planning budget\nlex: financial goal setting SMART savings\nlex: personal finance goals short long term\nvec: how do I set effective short-term and long-term financial goals\nvec: what is a step-by-step process for creating and achieving personal financial goals\nhyde: Set SMART financial goals: Specific (save $10,000), Measurable (track monthly), Achievable (based on income), Relevant (emergency fund), Time-bound (within 12 months). Categorize into short-term (under 1 year), medium-term (1-5 years), and long-term (5+ years) goals. Automate savings to stay on track."}
+{"input": "how to improve car gas mileage?", "output": "lex: improve car gas mileage fuel economy\nlex: better fuel efficiency driving tips\nlex: increase MPG car maintenance\nvec: what are the best ways to improve a car's gas mileage and fuel efficiency\nvec: what driving habits and car maintenance steps help reduce fuel consumption\nhyde: Keep tires inflated to the recommended PSI—underinflation increases rolling resistance. Drive at steady speeds using cruise control, avoid rapid acceleration, and reduce idling. Remove excess weight and roof racks. Replace air filters and spark plugs on schedule. Properly inflated tires alone can improve MPG by 3%."}
+{"input": "how to embrace change positively?", "output": "lex: embrace change positive mindset\nlex: adapting change personal growth resilience\nlex: coping with change acceptance\nvec: how can I learn to embrace change in life with a positive attitude\nvec: what psychological strategies help people adapt to change instead of resisting it\nhyde: Reframe change as an opportunity for growth rather than a threat. Practice mindfulness to stay present instead of worrying about the unknown. Set small, manageable goals during transitions. Build a support network and reflect on past changes you navigated successfully to build confidence."}
+{"input": "how to develop patience?", "output": "lex: develop patience self-control techniques\nlex: building patience mindfulness practice\nlex: patience skills emotional regulation\nvec: what techniques can help a person develop more patience in daily life\nvec: how do you train yourself to be more patient and less reactive\nhyde: Practice the pause: when you feel impatient, take three deep breaths before responding. Mindfulness meditation trains present-moment awareness and reduces reactivity. Reframe waiting as an opportunity. Set realistic expectations and practice delaying gratification with small exercises."}
+{"input": "how to design surveys for scientific research", "output": "lex: design survey scientific research methodology\nlex: research questionnaire design validity\nlex: survey instrument Likert scale sampling\nvec: how should researchers design valid and reliable surveys for scientific studies\nvec: what are the principles of good questionnaire design in scientific research\nhyde: Design surveys by first defining clear research questions. Use validated scales where available. Write neutral, unambiguous items avoiding leading questions. Include a mix of Likert-scale and open-ended questions. Pilot test with a small sample, assess reliability (Cronbach's alpha), and use random sampling for generalizability."}
+{"input": "how to get rid of garden pests naturally?", "output": "lex: natural garden pest control organic\nlex: garden pests organic remedies\nlex: beneficial insects companion planting pest\nvec: what are natural and organic methods to get rid of garden pests without chemicals\nvec: how can I control insects and pests in my garden using companion planting and beneficial insects\nhyde: Introduce beneficial insects like ladybugs and lacewings to eat aphids. Plant marigolds and basil as companion plants to repel pests. Spray diluted neem oil or insecticidal soap on affected leaves. Use diatomaceous earth around plant bases. Hand-pick slugs and caterpillars in the evening."}
+{"input": "how to build a green roof", "output": "lex: green roof construction installation\nlex: build living roof layers materials\nlex: green roof waterproof membrane substrate plants\nvec: how do you build a green roof on a residential or commercial building\nvec: what are the structural layers and materials needed for a green roof installation\nhyde: A green roof consists of layers: waterproof membrane, root barrier, drainage layer (gravel or drainage mat), filter fabric, lightweight growing substrate (4-6 inches for extensive, 6-24 for intensive), and drought-tolerant plants like sedums. The roof must support 15-30 lbs/sqft when saturated."}
+{"input": "what are the sacred texts of judaism", "output": "lex: sacred texts Judaism Torah Talmud\nlex: Jewish scripture Hebrew Bible Tanakh\nlex: Judaism holy books Mishnah\nvec: what are the main sacred texts and scriptures in the Jewish religious tradition\nvec: what is the Torah and what other texts are considered holy in Judaism\nhyde: The primary sacred text of Judaism is the Torah (Five Books of Moses), part of the Tanakh (Hebrew Bible), which also includes Nevi'im (Prophets) and Ketuvim (Writings). The Talmud, comprising the Mishnah and Gemara, contains rabbinic commentary and Jewish law (halakha)."}
+{"input": "how technology has impacted communication", "output": "lex: technology impact communication changes\nlex: digital communication evolution internet social media\nlex: technology transformed how people communicate\nvec: how has technology changed the way people communicate over the last few decades\nvec: what are the major effects of digital technology and the internet on human communication\nhyde: Technology has transformed communication from letters and landlines to instant messaging, video calls, and social media. Email replaced postal mail for business. Smartphones made communication continuous. Social media platforms enabled global, public conversations but also raised concerns about misinformation and reduced face-to-face interaction."}
+{"input": "what are the voting rights", "output": "lex: voting rights law history\nlex: Voting Rights Act suffrage amendments\nlex: voter rights eligibility protection\nvec: what are voting rights in the United States and how have they evolved over time\nvec: what laws protect citizens' right to vote and prevent voter discrimination\nhyde: Voting rights in the US expanded through constitutional amendments: the 15th (race, 1870), 19th (women, 1920), and 26th (age 18, 1971). The Voting Rights Act of 1965 prohibited racial discrimination in voting, including literacy tests and poll taxes, and required federal oversight of elections in certain jurisdictions."}
+{"input": "wedding photography package", "output": "lex: wedding photography package pricing\nlex: wedding photographer booking services\nlex: wedding photo package hours albums\nvec: what is typically included in a wedding photography package and how much does it cost\nvec: how to choose the right wedding photographer and package for your budget\nhyde: Our wedding photography packages start at $2,500 for 6 hours of coverage with one photographer, 300+ edited digital images, and an online gallery. Premium packages include a second shooter, engagement session, 10x10 album, and 8-10 hours of coverage for $4,500."}
+{"input": "how to address political division in communities", "output": "lex: political division community healing\nlex: political polarization bridging divides dialogue\nlex: community political disagreement civil discourse\nvec: how can communities address political divisions and find common ground\nvec: what strategies help reduce political polarization and promote civil dialogue at the local level\nhyde: Host structured community dialogues where participants follow ground rules: listen without interrupting, speak from personal experience, and seek understanding over agreement. Focus on shared local issues—schools, infrastructure, safety—rather than national partisan topics. Train facilitators in conflict mediation techniques."}
+{"input": "how to clean car headlights?", "output": "lex: clean car headlights restore foggy\nlex: headlight restoration oxidation yellowing\nlex: headlight lens cleaning toothpaste sanding\nvec: how do I clean and restore foggy or yellowed car headlights\nvec: what is the best method for removing oxidation from plastic headlight lenses\nhyde: Sand the headlight lens with wet sandpaper, starting at 800 grit and progressing to 2000 and 3000 grit. Polish with a rubbing compound or plastic polish. Apply a UV-resistant clear coat to prevent future yellowing. Toothpaste works as a mild abrasive for light haze."}
+{"input": "what defines gothic literature", "output": "lex: gothic literature characteristics define\nlex: gothic fiction genre elements tropes\nlex: gothic novel dark romantic supernatural\nvec: what are the defining features and conventions of gothic literature as a literary genre\nvec: what themes, settings, and narrative techniques characterize gothic fiction\nhyde: Gothic literature is defined by dark, atmospheric settings (ruined castles, monasteries), supernatural or uncanny events, psychological terror, and themes of isolation, decay, and transgression. Protagonists often face hidden secrets and tyrannical figures. Key works include Frankenstein, Dracula, and The Turn of the Screw."}
+{"input": "what is the importance of cultural heritage in photography?", "output": "lex: cultural heritage photography documentation\nlex: photography preserving culture traditions\nlex: cultural heritage visual documentation ethnographic\nvec: why is photography important for preserving and documenting cultural heritage\nvec: how has photography been used to record and protect cultural traditions and historical sites\nhyde: Photography plays a vital role in documenting cultural heritage—recording endangered architectural sites, traditional crafts, ceremonies, and oral traditions before they disappear. Organizations like UNESCO use photographic archives to catalog World Heritage Sites and support restoration efforts."}
+{"input": "what is logical positivism", "output": "lex: logical positivism Vienna Circle philosophy\nlex: logical positivism verification principle\nlex: logical empiricism analytic philosophy\nvec: what is logical positivism and what did the Vienna Circle philosophers argue\nvec: how does the verification principle define meaningful statements in logical positivism\nhyde: Logical positivism, developed by the Vienna Circle in the 1920s-30s, holds that only statements verifiable through empirical observation or logical proof are meaningful. Metaphysical, ethical, and aesthetic claims are considered cognitively meaningless. Key figures include Carnap, Schlick, and Ayer."}
+{"input": "how to create a self-improvement plan?", "output": "lex: self-improvement plan personal development\nlex: personal growth plan goals habits\nlex: self-improvement roadmap steps\nvec: how do I create an effective self-improvement plan with clear goals and actionable steps\nvec: what steps should I follow to build a personal development plan that I can stick to\nhyde: Start by assessing your current strengths and weaknesses across life areas: health, career, relationships, finances, and personal growth. Set 2-3 SMART goals per area. Break each goal into weekly habits and milestones. Track progress in a journal and review monthly. Adjust the plan based on what's working."}
+{"input": "how robotics is transforming industries", "output": "lex: robotics industry transformation manufacturing\nlex: industrial robots automation sectors\nlex: robotics applications logistics healthcare agriculture\nvec: how is robotics transforming industries like manufacturing, healthcare, and logistics\nvec: what impact are advanced robots and automation having on different industrial sectors\nhyde: Robotics is transforming manufacturing with collaborative robots (cobots) that work alongside humans on assembly lines. In logistics, warehouse robots from companies like Amazon Robotics sort and move packages. Surgical robots like da Vinci enable minimally invasive procedures. Agricultural robots handle harvesting and weeding autonomously."}
+{"input": "famous photographers", "output": "lex: famous photographers history notable\nlex: iconic photographers Ansel Adams Cartier-Bresson\nlex: renowned photographers influential works\nvec: who are the most famous and influential photographers in history\nvec: which photographers are known for iconic images that shaped the art of photography\nhyde: Ansel Adams is known for dramatic black-and-white landscapes of the American West. Henri Cartier-Bresson pioneered street photography and the decisive moment. Dorothea Lange documented the Great Depression. Annie Leibovitz is renowned for celebrity portraiture. Sebastião Salgado captures powerful social documentary images."}
+{"input": "how does climate change affect global politics", "output": "lex: climate change global politics geopolitics\nlex: climate change international relations policy\nlex: climate politics diplomacy conflict resources\nvec: how does climate change influence international relations and global political dynamics\nvec: what are the geopolitical consequences of climate change including resource conflicts and migration\nhyde: Climate change reshapes global politics through resource competition (water, arable land), climate-driven migration, and diplomatic tensions over emissions targets. Arctic ice melt opens new shipping routes and territorial disputes. Island nations face existential threats, driving climate justice advocacy at the UN."}
+{"input": "how to organize a scientific conference", "output": "lex: organize scientific conference planning\nlex: academic conference logistics program committee\nlex: scientific meeting venue call for papers\nvec: what are the steps to organizing a successful scientific conference\nvec: how do you plan an academic conference including call for papers, venue, and scheduling\nhyde: Start 12-18 months ahead. Form a program committee, select a venue, set dates, and issue a call for papers. Use a submission system like EasyChair. Arrange keynote speakers, peer review, and session scheduling. Handle registration, catering, AV equipment, and proceedings publication."}
+{"input": "how to fix a leaking faucet", "output": "lex: fix leaking faucet repair dripping\nlex: faucet leak washer cartridge replacement\nlex: kitchen bathroom faucet drip fix\nvec: how do I fix a dripping faucet in my kitchen or bathroom\nvec: what are the steps to repair a leaking faucet by replacing the washer or cartridge\nhyde: Turn off the water supply valves under the sink. Remove the faucet handle by unscrewing the decorative cap and handle screw. Pull out the cartridge or stem and inspect the rubber washer or O-ring. Replace worn parts, reassemble, and turn the water back on. Most leaks are caused by a degraded washer."}
+{"input": "how social media influences behavior", "output": "lex: social media influence behavior psychology\nlex: social media impact mental health habits\nlex: social media behavioral effects users\nvec: how does social media use influence people's behavior, opinions, and mental health\nvec: what psychological effects does regular social media use have on user behavior\nhyde: Social media influences behavior through social comparison, echo chambers, and dopamine-driven feedback loops. Users curate idealized self-presentations, leading to anxiety and low self-esteem in viewers. Algorithmic content feeds reinforce existing beliefs and can radicalize opinions through filter bubbles."}
+{"input": "how does intertextuality work?", "output": "lex: intertextuality literary theory texts\nlex: intertextuality allusion reference literature\nlex: Kristeva Barthes intertextuality meaning\nvec: how does intertextuality work as a concept in literary theory and criticism\nvec: what does intertextuality mean and how do texts reference and build on other texts\nhyde: Intertextuality, coined by Julia Kristeva, describes how every text is shaped by and references other texts. Meaning is not contained in a single work but emerges from its relationships with prior texts through allusion, quotation, parody, and genre conventions. Roland Barthes argued the reader constructs meaning from these textual connections."}
+{"input": "how does stoicism inspire inner peace", "output": "lex: Stoicism inner peace philosophy\nlex: Stoic philosophy tranquility Marcus Aurelius Epictetus\nlex: Stoic practices equanimity calm\nvec: how do Stoic philosophical principles help achieve inner peace and tranquility\nvec: what Stoic practices and teachings from Marcus Aurelius and Epictetus promote emotional calm\nhyde: Stoicism teaches inner peace through the dichotomy of control: focus only on what you can influence (your thoughts and actions) and accept what you cannot (external events). Marcus Aurelius wrote in Meditations that disturbance comes not from things themselves but from our judgments about them."}
+{"input": "how to install a car stereo?", "output": "lex: install car stereo aftermarket head unit\nlex: car stereo replacement wiring harness\nlex: car radio installation dash kit\nvec: how do I install an aftermarket car stereo and connect the wiring\nvec: what tools and adapters do I need to replace a factory car radio with a new head unit\nhyde: Disconnect the battery. Remove the factory stereo using DIN removal tools or dash panel screws. Connect the aftermarket wiring harness adapter to the car's plug—match wire colors (red=accessory, yellow=battery, black=ground). Mount the new head unit in a dash kit, slide it in, and reconnect the battery."}
+{"input": "art class", "output": "lex: art class painting drawing course\nlex: art classes beginners local online\nlex: learn art lessons studio workshop\nvec: where can I find art classes for beginners to learn painting or drawing\nvec: what types of art classes are available online and in person for adults\nhyde: Beginner art classes cover fundamentals like drawing, color theory, and composition. Options include community college courses, local studio workshops, and online platforms like Skillshare and Domestika. Classes range from watercolor and acrylic painting to charcoal drawing and digital illustration."}
+{"input": "what is the concept of ahimsa", "output": "lex: ahimsa non-violence concept Hinduism Jainism Buddhism\nlex: ahimsa meaning Indian philosophy\nlex: ahimsa Gandhi non-harm\nvec: what is the concept of ahimsa and how is non-violence practiced in Indian religions\nvec: how did Gandhi apply the principle of ahimsa in his philosophy and political movement\nhyde: Ahimsa means non-violence or non-harm and is a central principle in Hinduism, Jainism, and Buddhism. In Jainism, ahimsa extends to all living beings, including insects. Gandhi adopted ahimsa as the foundation of his political resistance, using nonviolent civil disobedience against British colonial rule."}
+{"input": "what was the byzantine empire", "output": "lex: Byzantine Empire history Eastern Roman\nlex: Byzantine Empire Constantinople medieval\nlex: Byzantine Empire culture government fall 1453\nvec: what was the Byzantine Empire and how did it continue from the Roman Empire\nvec: what were the major achievements and eventual fall of the Byzantine Empire\nhyde: The Byzantine Empire was the continuation of the Eastern Roman Empire, centered on Constantinople (modern Istanbul). It lasted from 330 CE to 1453 CE when it fell to the Ottoman Turks. It preserved Greek and Roman culture, developed Eastern Orthodox Christianity, and Justinian's legal code influenced European law."}
+{"input": "how to run for public office", "output": "lex: run for public office campaign steps\nlex: running for election candidate requirements\nlex: political campaign filing candidacy\nvec: what are the steps to running for public office in the United States\nvec: how do I start a political campaign and file as a candidate for local or state office\nhyde: To run for public office, first research eligibility requirements (age, residency, citizenship) for your target seat. File candidacy paperwork with the local election office by the deadline. Build a campaign team, set a budget, raise funds, and collect any required petition signatures. Develop a platform and begin voter outreach."}
+{"input": "how to contact local government officials", "output": "lex: contact local government officials representatives\nlex: reach city council county officials email phone\nlex: local elected officials contact information\nvec: how can I find contact information for and reach out to my local government representatives\nvec: what is the best way to contact city council members or county officials about local issues\nhyde: Find your local officials through your city or county website's \"elected officials\" page or use usa.gov's elected officials lookup tool. Contact methods include email, phone calls to their office, attending public town hall meetings, and submitting comments during city council sessions."}
+{"input": "what is the metaphysics of morality", "output": "lex: metaphysics of morality moral philosophy\nlex: metaethics moral realism anti-realism\nlex: metaphysical foundations ethics moral facts\nvec: what is the metaphysics of morality and how does it address the nature of moral facts\nvec: how do metaethicists debate whether moral truths exist objectively or are constructed\nhyde: The metaphysics of morality examines whether moral facts exist independently of human minds (moral realism) or are constructed by societies and individuals (anti-realism). Moral realists argue that \"murder is wrong\" is objectively true. Constructivists and expressivists argue moral claims express attitudes or social agreements, not metaphysical truths."}
+{"input": "latest research on climate change", "output": "lex: latest climate change research 2025 2026\nlex: recent climate science findings studies\nlex: climate change new research global warming\nvec: what are the latest scientific findings and research on climate change in 2025-2026\nvec: what do recent climate studies say about global warming trends and projections\nhyde: Recent research in 2025 shows global temperatures exceeded 1.5°C above pre-industrial levels for a full calendar year. Studies in Nature Climate Change report accelerating ice sheet loss in Greenland and West Antarctica. New modeling suggests tipping points for the Amazon rainforest may be closer than previously estimated."}
+{"input": "where to find eco-friendly furniture", "output": "lex: eco-friendly furniture sustainable shop\nlex: sustainable furniture store green materials\nlex: eco furniture reclaimed wood organic\nvec: where can I buy eco-friendly and sustainably made furniture\nvec: what brands and stores sell furniture made from sustainable or recycled materials\nhyde: Eco-friendly furniture brands include West Elm (FSC-certified wood), Medley (organic fabrics, solid wood), and Sabai (recycled and recyclable materials). Thrift stores and Habitat for Humanity ReStores sell secondhand furniture. Look for FSC certification, non-toxic finishes, and reclaimed or recycled materials."}
+{"input": "how to stay informed about politics", "output": "lex: stay informed politics news sources\nlex: follow political news reliable media\nlex: political awareness current events tracking\nvec: how can I stay well-informed about politics and current political events\nvec: what are reliable sources and strategies for keeping up with political news\nhyde: Read multiple news sources across the political spectrum: AP News and Reuters for wire reporting, then compare coverage from different outlets. Subscribe to newsletters like The Morning (NYT) or Axios AM. Follow legislative trackers like Congress.gov. Attend local government meetings and candidate forums."}
+{"input": "what is the tao te ching", "output": "lex: Tao Te Ching Laozi Taoism text\nlex: Tao Te Ching Daodejing philosophy\nlex: Tao Te Ching teachings Dao virtue\nvec: what is the Tao Te Ching and what does it teach about the Dao and living wisely\nvec: who wrote the Tao Te Ching and what are its main philosophical ideas\nhyde: The Tao Te Ching, attributed to Laozi (6th century BCE), is the foundational text of Taoism. Its 81 short chapters describe the Dao (the Way)—an ineffable cosmic principle—and De (virtue/power). It advocates wu wei (effortless action), simplicity, humility, and living in harmony with nature."}
+{"input": "what is the ethics of ai", "output": "lex: AI ethics artificial intelligence ethical issues\nlex: ethics of AI bias fairness accountability\nlex: AI ethics alignment safety\nvec: what are the major ethical issues and concerns surrounding artificial intelligence\nvec: how do ethicists address bias, fairness, transparency, and safety in AI systems\nhyde: AI ethics addresses bias in training data that leads to discriminatory outputs, lack of transparency in black-box models, accountability when AI causes harm, privacy concerns from mass data collection, and the alignment problem of ensuring AI systems act according to human values. Frameworks include fairness, accountability, and transparency (FAccT)."}
+{"input": "what is the difference between realism and idealism", "output": "lex: realism idealism philosophy difference\nlex: realism vs idealism metaphysics epistemology\nlex: philosophical realism idealism comparison\nvec: what is the philosophical difference between realism and idealism in metaphysics\nvec: how do realists and idealists disagree about the nature of reality and perception\nhyde: Realism holds that an external world exists independently of our minds and perceptions. Idealism argues that reality is fundamentally mental or mind-dependent. Plato's Forms represent a kind of realism about abstract objects, while Berkeley argued that to exist is to be perceived (esse est percipi)."}
+{"input": "how to prevent garden soil erosion?", "output": "lex: prevent garden soil erosion methods\nlex: soil erosion control garden mulch ground cover\nlex: garden erosion prevention retaining wall\nvec: how can I prevent soil erosion in my garden or yard\nvec: what methods and ground covers help stop soil from washing away in a garden\nhyde: Prevent soil erosion by mulching garden beds with 2-3 inches of wood chips or straw. Plant ground covers like creeping thyme or clover on slopes. Install retaining walls or terraces on steep grades. Use rain gardens to absorb runoff. Avoid leaving soil bare between seasons—plant cover crops like rye or clover."}
+{"input": "how to write a scientific research paper", "output": "lex: write scientific research paper structure\nlex: scientific paper writing IMRaD format\nlex: academic research paper methodology results discussion\nvec: how do you write a scientific research paper following the standard academic format\nvec: what is the structure and process for writing a research paper for journal publication\nhyde: A scientific research paper follows the IMRaD structure: Introduction (background, hypothesis, objectives), Methods (detailed procedures for reproducibility), Results (data presented with figures and tables), and Discussion (interpretation, limitations, implications). Include an abstract, references in the journal's required citation style, and acknowledgments."}
+{"input": "how to diversify investment portfolio", "output": "lex: diversify investment portfolio strategy\nlex: portfolio diversification asset allocation\nlex: investment diversification stocks bonds ETFs\nvec: how should I diversify my investment portfolio across different asset classes\nvec: what is a good strategy for spreading risk through portfolio diversification\nhyde: Diversify across asset classes: stocks, bonds, real estate, and commodities. Within stocks, spread across sectors (tech, healthcare, energy) and geographies (US, international, emerging markets). Use index funds or ETFs for broad exposure. A common allocation is 60% stocks, 30% bonds, 10% alternatives, adjusted by age and risk tolerance."}
+{"input": "how to use social media for business", "output": "lex: social media business marketing strategy\nlex: social media marketing business growth\nlex: business social media content engagement\nvec: how can small businesses effectively use social media platforms for marketing and growth\nvec: what strategies work best for using social media to promote a business and attract customers\nhyde: Choose platforms where your target audience is active: Instagram for visual products, LinkedIn for B2B, TikTok for younger demographics. Post consistently, mix promotional content with value-added posts (tips, behind-the-scenes). Use analytics to track engagement. Run targeted ads with clear CTAs and A/B test creative assets."}
+{"input": "what is zero waste?", "output": "lex: zero waste lifestyle definition\nlex: zero waste reduce reuse recycle\nlex: zero waste living tips practices\nvec: what is the zero waste movement and how do people reduce waste in daily life\nvec: what does zero waste mean and what are practical ways to minimize household waste\nhyde: Zero waste is a philosophy and lifestyle aiming to send nothing to landfills by reducing consumption, reusing items, recycling, and composting. Practical steps include using reusable bags, bottles, and containers, buying in bulk, composting food scraps, and choosing products with minimal or recyclable packaging."}
+{"input": "what is the role of civil society in governance", "output": "lex: civil society governance role function\nlex: civil society organizations NGOs democratic governance\nlex: civil society accountability transparency\nvec: what role does civil society play in democratic governance and government accountability\nvec: how do non-governmental organizations and civic groups contribute to governance\nhyde: Civil society organizations—NGOs, advocacy groups, media, and community organizations—serve as intermediaries between citizens and government. They monitor government transparency, advocate for policy changes, provide public services, and mobilize civic participation. A strong civil society holds government accountable and strengthens democracy."}
+{"input": "what is the meaning of diwali", "output": "lex: Diwali meaning festival of lights\nlex: Diwali Hindu celebration significance\nlex: Diwali traditions Lakshmi Rama\nvec: what is Diwali and what does the festival of lights celebrate in Hindu tradition\nvec: what is the religious and cultural significance of the Diwali festival\nhyde: Diwali, the festival of lights, is celebrated by Hindus, Jains, and Sikhs over five days in autumn. It symbolizes the victory of light over darkness and good over evil. Hindus celebrate Lord Rama's return to Ayodhya and honor Lakshmi, goddess of prosperity. Traditions include lighting diyas, fireworks, rangoli art, and sharing sweets."}
+{"input": "what is a political debate", "output": "lex: political debate definition election\nlex: political debate format candidates issues\nlex: political debate presidential election\nvec: what is a political debate and how do candidates discuss issues in structured debates\nvec: how are political debates organized and what role do they play in elections\nhyde: A political debate is a structured event where candidates for elected office discuss policy positions and respond to questions from moderators and sometimes the audience. Debates follow agreed-upon formats with time limits for responses and rebuttals. They allow voters to compare candidates' positions on key issues directly."}
+{"input": "macro photography", "output": "lex: macro photography techniques close-up\nlex: macro photography lens equipment\nlex: macro photography insects flowers detail\nvec: what is macro photography and what equipment and techniques does it require\nvec: how do I take high-quality macro photographs of small subjects like insects and flowers\nhyde: Macro photography captures subjects at 1:1 magnification or greater, revealing details invisible to the naked eye. Use a dedicated macro lens (100mm is popular) or extension tubes. Shoot at f/8-f/16 for sufficient depth of field. Use a tripod and focus stacking to get the entire subject sharp."}
+{"input": "what was the enlightenment", "output": "lex: Enlightenment 18th century intellectual movement\nlex: Age of Enlightenment reason philosophy\nlex: Enlightenment thinkers Voltaire Locke Kant\nvec: what was the Enlightenment and how did it change Western philosophy and politics\nvec: who were the key Enlightenment thinkers and what ideas did they promote\nhyde: The Enlightenment was an 18th-century intellectual movement emphasizing reason, science, individual liberty, and skepticism of authority. Key thinkers include John Locke (natural rights), Voltaire (free speech), Montesquieu (separation of powers), and Kant (\"dare to know\"). It directly influenced the American and French Revolutions."}
+{"input": "how do philosophers interpret free will", "output": "lex: free will philosophy determinism\nlex: philosophers free will debate libertarian compatibilist\nlex: free will hard determinism compatibilism\nvec: how do different philosophers interpret the problem of free will and determinism\nvec: what are the main philosophical positions on whether humans have free will\nhyde: Three main positions dominate: hard determinism (all events are causally determined, free will is an illusion), libertarianism (genuine free will exists and is incompatible with determinism), and compatibilism (free will and determinism can coexist—you act freely when acting on your own desires without external coercion). Hume and Frankfurt defend compatibilism."}
+{"input": "how to stay engaged in local politics", "output": "lex: engaged local politics civic participation\nlex: local politics involvement community\nlex: civic engagement local government attend meetings\nvec: how can I stay actively engaged and involved in local politics and government\nvec: what are practical ways to participate in local political decision-making\nhyde: Attend city council and school board meetings, which are open to the public. Subscribe to your local government's agenda notifications. Join neighborhood associations or civic groups. Vote in every local election—municipal and school board elections often have low turnout, amplifying each vote's impact."}
+{"input": "how to paint abstract landscapes?", "output": "lex: paint abstract landscape technique\nlex: abstract landscape painting acrylic oil\nlex: abstract landscape art color composition\nvec: how do I paint abstract landscape art using acrylic or oil paints\nvec: what techniques and approaches do artists use when painting abstract landscapes\nhyde: Start with a loose underpainting to block in the horizon and major shapes. Use a palette knife or large brush for expressive marks. Simplify landscape elements—hills, sky, water—into geometric shapes and bold color fields. Layer transparent glazes over opaque areas. Let the painting suggest the landscape rather than depict it literally."}
+{"input": "how to decorate a small apartment", "output": "lex: small apartment decorating ideas\nlex: tiny apartment interior design\nlex: space-saving furniture small rooms\nvec: what are the best ways to decorate and furnish a small apartment to maximize space?\nvec: interior design tips for making a compact apartment look bigger and more stylish\nhyde: Use mirrors and light colors to make a small apartment feel larger. Choose multi-functional furniture like a storage ottoman or a fold-down desk. Vertical shelving frees up floor space while adding display areas."}
+{"input": "what is an allegory", "output": "lex: allegory literary device definition\nlex: allegory examples literature\nvec: what does allegory mean as a literary device and how is it used in storytelling?\nvec: how do authors use allegory to convey hidden meanings through characters and events?\nhyde: An allegory is a narrative in which characters, events, and settings represent abstract ideas or moral qualities. For example, George Orwell's Animal Farm is an allegory for the Russian Revolution, with farm animals standing in for political figures."}
+{"input": "what is wildlife photography?", "output": "lex: wildlife photography techniques\nlex: wildlife photography camera gear\nlex: photographing animals in nature\nvec: what is wildlife photography and what skills and equipment does it require?\nvec: how do photographers capture images of wild animals in their natural habitats?\nhyde: Wildlife photography involves capturing images of animals in their natural environments. Photographers typically use long telephoto lenses (300mm-600mm) and fast shutter speeds to freeze motion. Patience and knowledge of animal behavior are essential for getting close without disturbing subjects."}
+{"input": "what is chaos theory", "output": "lex: chaos theory mathematics\nlex: butterfly effect deterministic systems\nlex: nonlinear dynamics sensitive dependence\nvec: what is chaos theory and how does it explain unpredictable behavior in deterministic systems?\nvec: how does the butterfly effect relate to chaos theory in mathematics and physics?\nhyde: Chaos theory studies deterministic systems that are highly sensitive to initial conditions. A tiny change in starting values can produce vastly different outcomes over time — the so-called butterfly effect. The Lorenz attractor, discovered in 1963, was one of the first examples of chaotic behavior in weather modeling."}
+{"input": "what is the role of ethics in scientific research", "output": "lex: research ethics scientific integrity\nlex: ethical guidelines human subjects research\nlex: scientific misconduct fraud prevention\nvec: why are ethical standards important in conducting scientific research?\nvec: how do ethics committees and institutional review boards regulate scientific experiments?\nhyde: Ethics in scientific research ensures the integrity of findings and the protection of human and animal subjects. Researchers must obtain informed consent, avoid fabrication or falsification of data, and disclose conflicts of interest. Institutional Review Boards (IRBs) review proposed studies before they begin."}
+{"input": "how to shoot video in low light", "output": "lex: low light video settings camera\nlex: filming dark environments ISO aperture\nlex: low light videography tips\nvec: what camera settings and techniques produce the best video quality in low light conditions?\nvec: how do filmmakers shoot usable footage in dark or dimly lit environments?\nhyde: For low light video, open your aperture to f/1.4–f/2.8 and lower your shutter speed to 1/50 for 24fps footage. Raise ISO gradually — modern cameras handle ISO 3200–6400 with acceptable noise. Use a fast prime lens and add practical lights in the scene when possible."}
+{"input": "what is compositional balance?", "output": "lex: compositional balance art design\nlex: symmetrical asymmetrical balance visual\nlex: balance principles composition photography\nvec: what does compositional balance mean in art, photography, and graphic design?\nvec: how do artists achieve visual balance through symmetrical and asymmetrical arrangements?\nhyde: Compositional balance refers to the distribution of visual weight within an image or artwork. Symmetrical balance places equal elements on both sides of a central axis, while asymmetrical balance uses contrasting elements — such as a large shape offset by a smaller, brighter one — to create dynamic equilibrium."}
+{"input": "what is the impact of lobbyists on legislation", "output": "lex: lobbyists influence legislation policy\nlex: lobbying congress lawmaking\nlex: corporate lobbying political spending\nvec: how do lobbyists influence the legislative process and shape laws passed by government?\nvec: what impact does corporate and special interest lobbying have on policy outcomes?\nhyde: Lobbyists meet with lawmakers, draft model legislation, and organize campaign contributions to influence policy outcomes. In the U.S., spending on lobbying exceeded $4 billion annually. Critics argue this gives wealthy interests disproportionate power, while proponents say lobbyists provide expertise legislators need."}
+{"input": "how to navigate with a compass", "output": "lex: compass navigation orienteering\nlex: magnetic compass bearing map reading\nlex: compass declination true north\nvec: how do you use a magnetic compass and topographic map to navigate outdoors?\nvec: what are the steps for taking a bearing with a compass and following it in the field?\nhyde: Hold the compass flat and rotate the bezel until the orienting arrow aligns with the magnetic needle pointing north. Place the compass on your map, align the edge with your start and destination, and rotate the bezel to match the map's grid lines. Adjust for magnetic declination, then follow the bearing."}
+{"input": "what is genetic drift", "output": "lex: genetic drift population genetics\nlex: bottleneck effect founder effect allele frequency\nvec: what is genetic drift and how does it cause random changes in allele frequencies in small populations?\nvec: how do the bottleneck effect and founder effect relate to genetic drift in evolution?\nhyde: Genetic drift is a mechanism of evolution where allele frequencies change randomly from one generation to the next due to chance sampling. Its effects are strongest in small populations. The bottleneck effect occurs when a population is drastically reduced, and the founder effect occurs when a small group colonizes a new area."}
+{"input": "what is the significance of the alhambra?", "output": "lex: Alhambra palace Granada Spain\nlex: Alhambra Islamic architecture Nasrid\nlex: Alhambra historical significance\nvec: why is the Alhambra in Granada, Spain considered a masterpiece of Islamic architecture?\nvec: what is the cultural and historical significance of the Alhambra palace?\nhyde: The Alhambra is a palace and fortress complex in Granada, Spain, built primarily by the Nasrid dynasty in the 13th and 14th centuries. Its intricate stucco work, muqarnas ceilings, and geometric tile patterns represent the pinnacle of Moorish art in Europe. The Court of the Lions features 124 marble columns surrounding a central fountain."}
+{"input": "how the human brain functions", "output": "lex: human brain function neuroscience\nlex: brain regions neurons synapses\nlex: cerebral cortex brain anatomy\nvec: how does the human brain process information through neurons and different brain regions?\nvec: what are the major parts of the brain and their roles in cognition, memory, and movement?\nhyde: The human brain contains approximately 86 billion neurons that communicate via electrical and chemical signals across synapses. The cerebral cortex handles higher-order functions like reasoning and language. The hippocampus is critical for forming new memories, while the cerebellum coordinates movement and balance."}
+{"input": "how is love viewed in different religions?", "output": "lex: love religion Christianity Islam Buddhism\nlex: divine love spiritual traditions\nlex: religious teachings about love\nvec: how do different world religions like Christianity, Islam, Hinduism, and Buddhism define and teach about love?\nvec: what role does love play in the spiritual teachings of major religions?\nhyde: In Christianity, love (agape) is the highest virtue — \"God is love\" (1 John 4:8). Islam teaches that Allah is Al-Wadud, the Loving, and compassion toward others is a core duty. In Buddhism, metta (loving-kindness) is cultivated through meditation. Hinduism describes divine love (bhakti) as devotion to God."}
+{"input": "what is literary symbolism?", "output": "lex: literary symbolism examples\nlex: symbolism in literature meaning\nlex: symbolic imagery fiction poetry\nvec: what is symbolism as a literary device and how do authors use symbols to convey deeper meaning?\nvec: how do readers identify and interpret symbols in novels, poems, and short stories?\nhyde: Literary symbolism is the use of objects, characters, or events to represent abstract ideas beyond their literal meaning. In The Great Gatsby, the green light symbolizes Gatsby's unattainable dream. The conch shell in Lord of the Flies represents order and democratic authority."}
+{"input": "what is the relationship between ethics and law?", "output": "lex: ethics versus law differences\nlex: morality legality relationship\nlex: ethical standards legal requirements\nvec: how do ethics and law relate to each other, and where do they diverge?\nvec: can something be legal but unethical, or illegal but morally justified?\nhyde: Ethics and law overlap but are distinct. Laws are formal rules enforced by the state, while ethics are moral principles guiding individual conduct. Something can be legal yet unethical — such as exploitative pricing — or illegal yet ethically defensible, as in acts of civil disobedience against unjust laws."}
+{"input": "json load", "output": "lex: JSON parse load file\nlex: JSON.parse read file\nlex: json load Python JavaScript\nvec: how do you load and parse a JSON file in Python or JavaScript?\nvec: what functions are used to read JSON data from a file or string?\nhyde: In Python, use json.load(f) to read from a file object and json.loads(s) to parse a string. In JavaScript, use JSON.parse(str) to convert a JSON string into an object, or fetch a file and call response.json() to parse the result."}
+{"input": "how to remove oil stains from clothes", "output": "lex: remove oil stains clothing\nlex: grease stain removal fabric\nlex: oil stain laundry treatment\nvec: what is the best method for removing oil and grease stains from clothing fabric?\nvec: how do you get cooking oil or motor oil stains out of clothes at home?\nhyde: Apply dish soap or liquid detergent directly to the oil stain and gently rub it in. Let it sit for 10-15 minutes, then wash in the hottest water safe for the fabric. For stubborn stains, sprinkle baking soda or cornstarch on the spot to absorb excess oil before treating."}
+{"input": "where to buy greenhouse supplies?", "output": "lex: greenhouse supplies store online\nlex: buy greenhouse panels heaters shelving\nlex: greenhouse gardening equipment\nvec: where can I purchase greenhouse supplies like panels, heaters, ventilation, and shelving?\nvec: what are the best online and local stores for buying greenhouse building materials and accessories?\nhyde: Greenhouse supplies are available at garden centers like Home Depot and Lowe's, as well as specialty retailers like Greenhouse Megastore and Bootstrap Farmer. Online, Amazon carries polycarbonate panels, shade cloth, heating mats, and ventilation fans. For commercial-grade supplies, contact manufacturers like Rimol Greenhouses directly."}
+{"input": "how to support climbing roses?", "output": "lex: climbing roses trellis support\nlex: train climbing roses wall fence\nlex: rose arbor lattice structure\nvec: what structures and techniques are used to support and train climbing roses?\nvec: how do you attach and guide climbing roses along a trellis, wall, or arbor?\nhyde: Install a sturdy trellis, arbor, or wire system at least 3 inches from the wall to allow air circulation. Tie canes horizontally with soft plant ties to encourage lateral growth and more blooms. Prune in late winter, removing dead wood and shortening side shoots to 2-3 buds."}
+{"input": "how to manage debt", "output": "lex: debt management repayment plan\nlex: pay off debt strategies snowball avalanche\nlex: credit card debt consolidation\nvec: what are the most effective strategies for managing and paying off personal debt?\nvec: how does the debt snowball versus debt avalanche method work for debt repayment?\nhyde: List all debts with their balances, interest rates, and minimum payments. With the avalanche method, pay extra toward the highest-interest debt first to save the most money. With the snowball method, pay off the smallest balance first for psychological momentum. Consider consolidation loans if you qualify for a lower rate."}
+{"input": "sailing adventures", "output": "lex: sailing adventure trips voyages\nlex: sailing vacation destinations cruises\nlex: ocean sailing expedition\nvec: what are some popular sailing adventure destinations and voyages around the world?\nvec: how do people plan and prepare for multi-day sailing trips and ocean crossings?\nhyde: Popular sailing adventures include island-hopping in the Greek Cyclades, crossing the Atlantic via the trade winds from the Canary Islands to the Caribbean, and navigating the fjords of Norway. Charter companies offer bareboat and crewed options for all experience levels, from weekend coastal cruises to month-long blue water passages."}
+{"input": "paint flow", "output": "lex: paint flow viscosity consistency\nlex: acrylic paint flow medium pouring\nlex: paint flow rate spray gun\nvec: how do you control paint flow and viscosity for acrylic pouring or spray application?\nvec: what is a flow medium and how does it affect paint consistency?\nhyde: Paint flow refers to how freely paint moves and levels on a surface. For acrylic pouring, mix paint with a flow medium like Floetrol at a 2:1 ratio to achieve a honey-like consistency. For spray guns, thin paint to the manufacturer's recommended viscosity using a flow cup to measure."}
+{"input": "how to create a budget plan", "output": "lex: budget plan personal monthly\nlex: create budget spreadsheet expenses income\nlex: 50/30/20 budgeting rule\nvec: how do you create a personal monthly budget plan to track income and expenses?\nvec: what steps are involved in building a budget and sticking to it?\nhyde: Start by listing your monthly after-tax income. Track all expenses for one month, categorizing them as needs, wants, and savings. Apply the 50/30/20 rule: 50% to necessities, 30% to discretionary spending, and 20% to savings and debt repayment. Use a spreadsheet or app like YNAB to monitor progress."}
+{"input": "how to apply for research funding", "output": "lex: research funding application grant\nlex: apply grant NIH NSF proposal\nlex: research grant writing tips\nvec: what is the process for applying for academic or scientific research funding grants?\nvec: how do researchers write successful grant proposals for agencies like NIH and NSF?\nhyde: Identify funding agencies that match your research area — NIH for biomedical, NSF for science and engineering, NEH for humanities. Read the request for proposals (RFP) carefully. Write a clear specific aims page, include preliminary data, and describe your methodology in detail. Submit through the agency's online portal before the deadline."}
+{"input": "how to improve credit score", "output": "lex: improve credit score FICO\nlex: raise credit score fast tips\nlex: credit score factors payment history\nvec: what are the most effective ways to raise your credit score quickly?\nvec: which factors affect your FICO credit score the most and how can you improve them?\nhyde: Pay all bills on time — payment history accounts for 35% of your FICO score. Keep credit utilization below 30% of your total credit limit. Avoid opening too many new accounts at once. Check your credit report for errors and dispute inaccuracies. Keeping old accounts open increases your average account age."}
+{"input": "what is literary criticism?", "output": "lex: literary criticism theory analysis\nlex: literary criticism schools formalism structuralism\nlex: literary analysis methods approaches\nvec: what is literary criticism and what are its major schools of thought?\nvec: how do literary critics analyze and interpret works of literature using different theoretical frameworks?\nhyde: Literary criticism is the study, evaluation, and interpretation of literature. Major approaches include formalism (focusing on the text itself), structuralism (analyzing underlying structures), feminist criticism (examining gender representation), and post-colonialism (exploring power dynamics). Each lens offers a different way to interpret a work's meaning."}
+{"input": "how do ethical theories apply to social issues", "output": "lex: ethical theories social issues applied ethics\nlex: utilitarianism deontology social justice\nlex: ethics poverty inequality healthcare\nvec: how are ethical theories like utilitarianism and deontology applied to real-world social issues?\nvec: what ethical frameworks do philosophers use to analyze problems like poverty, inequality, and healthcare?\nhyde: Utilitarian ethics evaluates social policies by their overall consequences — a policy is just if it maximizes well-being for the greatest number. Deontological ethics focuses on rights and duties regardless of outcome. Applying these frameworks to issues like healthcare access reveals tensions between collective welfare and individual rights."}
+{"input": "where to buy affordable art prints", "output": "lex: buy affordable art prints online\nlex: cheap art prints posters wall decor\nlex: art print shops Etsy Society6\nvec: where can I buy affordable and high-quality art prints for home decoration?\nvec: what are the best online stores for purchasing inexpensive art prints and posters?\nhyde: Affordable art prints are available on Society6, Redbubble, and Etsy, where independent artists sell prints starting at $15–$30. IKEA offers framed prints under $20. For museum-quality reproductions, check Artsy or Saatchi Art's prints section. King & McGaw specializes in licensed fine art reproductions at mid-range prices."}
+{"input": "how do you critique a literary work?", "output": "lex: critique literary work analysis\nlex: literary critique essay writing\nlex: evaluate novel poem fiction\nvec: what steps do you follow to write a literary critique of a novel or poem?\nvec: how do you analyze and evaluate the strengths and weaknesses of a literary work?\nhyde: To critique a literary work, start by reading it closely and noting your initial reactions. Identify the theme, narrative structure, character development, and use of literary devices. Evaluate how effectively the author conveys their message. Support your assessment with specific textual evidence and quotations from the work."}
+{"input": "what are the principles of democracy", "output": "lex: principles democracy government\nlex: democratic principles rule of law elections\nlex: democracy separation of powers rights\nvec: what are the fundamental principles that define a democratic system of government?\nvec: how do free elections, rule of law, and separation of powers form the foundation of democracy?\nhyde: The core principles of democracy include popular sovereignty (power derives from the people), free and fair elections, rule of law, separation of powers among branches of government, protection of individual rights and civil liberties, and majority rule with minority rights. An independent judiciary ensures laws are applied equally."}
+{"input": "how to grow tomatoes at home?", "output": "lex: grow tomatoes home garden\nlex: tomato plant care watering sunlight\nlex: container tomatoes growing tips\nvec: how do you grow tomato plants at home in a garden bed or container?\nvec: what soil, sunlight, and watering conditions do tomato plants need to produce fruit?\nhyde: Plant tomato seedlings after the last frost in a spot receiving 6-8 hours of direct sunlight. Use well-draining soil amended with compost. Water deeply at the base 1-2 inches per week. Stake or cage plants for support. Feed with a balanced fertilizer every two weeks once fruit begins to set."}
+{"input": "how to fix a loud exhaust?", "output": "lex: fix loud exhaust car muffler\nlex: exhaust leak repair pipe\nlex: muffler replacement noisy exhaust\nvec: how do you diagnose and fix a loud or rattling car exhaust system?\nvec: what causes a car exhaust to become loud and how do you repair or replace the muffler?\nhyde: A loud exhaust is usually caused by a hole in the muffler, a cracked exhaust pipe, or a failed gasket at the manifold. For small holes, apply exhaust repair tape or paste as a temporary fix. For larger damage, replace the affected section. A rusted-through muffler should be replaced entirely — bolt-on universal mufflers cost $30–$80."}
+{"input": "what is kinetic art?", "output": "lex: kinetic art sculpture movement\nlex: kinetic art artists Calder Tinguely\nlex: moving art installation mechanical\nvec: what is kinetic art and how do artists create sculptures and installations that move?\nvec: who are the most famous kinetic artists and what are their notable works?\nhyde: Kinetic art is a genre of art that incorporates real or apparent movement. Alexander Calder pioneered the mobile — hanging sculptures that move with air currents. Jean Tinguely built complex mechanical assemblages that rattled and spun. Modern kinetic artists use motors, wind, and magnets to create motion."}
+{"input": "async web", "output": "lex: async web framework server\nlex: asynchronous HTTP request JavaScript Python\nlex: async await web API\nvec: how do asynchronous programming patterns work in web development and API requests?\nvec: what are the best async web frameworks for building non-blocking HTTP servers?\nhyde: Asynchronous web programming allows a server to handle multiple requests concurrently without blocking. In Python, frameworks like FastAPI and aiohttp use async/await syntax with an event loop. In JavaScript, Express with async handlers or Fastify process requests non-blockingly. This improves throughput for I/O-bound workloads."}
+{"input": "what is the philosophy of nonviolence", "output": "lex: philosophy nonviolence ahimsa pacifism\nlex: nonviolence Gandhi King civil disobedience\nvec: what is the philosophical basis for nonviolence as practiced by Gandhi and Martin Luther King Jr.?\nvec: how does the concept of ahimsa relate to the broader philosophy of nonviolent resistance?\nhyde: Nonviolence (ahimsa) as a philosophy holds that physical force is never justified as a means of conflict resolution. Mahatma Gandhi developed satyagraha — truth-force — as a method of nonviolent resistance against British colonial rule. Martin Luther King Jr. adapted these principles to the American civil rights movement."}
+{"input": "what are the main sects of islam?", "output": "lex: sects of Islam Sunni Shia Sufi\nlex: Islamic denominations branches\nlex: Sunni Shia differences beliefs\nvec: what are the major sects and branches within Islam and how do they differ?\nvec: what caused the split between Sunni and Shia Muslims and what are their key theological differences?\nhyde: The two main sects of Islam are Sunni (approximately 85-90% of Muslims) and Shia (10-15%). The split originated from a disagreement over succession after Prophet Muhammad's death in 632 CE. Sunnis accepted Abu Bakr as caliph, while Shia believed leadership belonged to Ali, Muhammad's cousin and son-in-law. Sufism is a mystical tradition found within both branches."}
+{"input": "how to use charcoal for drawing?", "output": "lex: charcoal drawing techniques\nlex: vine compressed charcoal sketching\nlex: charcoal shading blending paper\nvec: what are the techniques for drawing and shading with charcoal on paper?\nvec: what types of charcoal are used for drawing and how do they differ in effect?\nhyde: Vine charcoal is soft and ideal for light sketching and easy erasing. Compressed charcoal is denser, producing darker, richer marks. Hold the charcoal on its side for broad strokes and use the tip for fine lines. Blend with a tortillon or chamois cloth. Fix finished drawings with spray fixative to prevent smudging."}
+{"input": "what is mindfulness", "output": "lex: mindfulness meditation practice\nlex: mindfulness definition awareness present moment\nlex: mindfulness stress reduction MBSR\nvec: what is mindfulness and how is it practiced as a form of meditation?\nvec: what are the psychological and health benefits of practicing mindfulness regularly?\nhyde: Mindfulness is the practice of paying attention to the present moment without judgment. It involves observing thoughts, feelings, and sensations as they arise and letting them pass. Jon Kabat-Zinn developed Mindfulness-Based Stress Reduction (MBSR), an eight-week program shown to reduce anxiety, depression, and chronic pain."}
+{"input": "latest updates on the ukraine conflict", "output": "lex: Ukraine conflict war 2025 2026 updates\nlex: Ukraine Russia war latest news\nlex: Ukraine ceasefire negotiations frontline\nvec: what are the most recent developments in the Russia-Ukraine war as of 2025-2026?\nvec: what is the current status of the Ukraine conflict including ceasefire talks and territorial changes?\nhyde: As fighting continues along the eastern front, diplomatic efforts have intensified with multiple rounds of negotiations. Ukraine's forces have focused on defensive operations in the Donetsk region while maintaining pressure on supply lines. International support continues with new aid packages and sanctions enforcement."}
+{"input": "git push", "output": "lex: git push remote origin\nlex: git push branch upstream\nlex: git push force rejected\nvec: how do you push commits to a remote repository using git push?\nvec: what do you do when git push is rejected and how do you set upstream tracking branches?\nhyde: Use `git push origin main` to push your local main branch to the remote. For a new branch, use `git push -u origin feature-branch` to set the upstream tracking reference. If the push is rejected because the remote has new commits, run `git pull --rebase` first, then push again."}
+{"input": "what is hedonism", "output": "lex: hedonism philosophy pleasure\nlex: hedonism Epicurus ethical theory\nlex: hedonistic ethics pleasure pain\nvec: what is hedonism as a philosophical doctrine about pleasure and the good life?\nvec: how did Epicurus define hedonism and how does it differ from popular conceptions of pleasure-seeking?\nhyde: Hedonism is the philosophical view that pleasure is the highest good and the proper aim of human life. Epicurus distinguished between kinetic pleasures (active enjoyment) and katastematic pleasures (the absence of pain). He argued that simple pleasures, friendship, and tranquility produce the most lasting happiness — not excess or indulgence."}
+{"input": "what is a mathematical model", "output": "lex: mathematical model definition\nlex: mathematical modeling equations simulation\nlex: applied mathematics modeling real world\nvec: what is a mathematical model and how is it used to represent real-world systems?\nvec: how do scientists and engineers build mathematical models to simulate and predict phenomena?\nhyde: A mathematical model uses equations and variables to represent a real-world system. For example, the SIR model uses differential equations to predict infectious disease spread: dS/dt = -βSI, dI/dt = βSI - γI, dR/dt = γI. Models are validated by comparing predictions against observed data and refined iteratively."}
+{"input": "how to grow an herb garden", "output": "lex: grow herb garden home indoor outdoor\nlex: herb garden planting basil cilantro thyme\nlex: container herb garden windowsill\nvec: how do you start and maintain an herb garden at home, indoors or outdoors?\nvec: which herbs grow best together and what soil and light conditions do they need?\nhyde: Start with easy herbs like basil, parsley, mint, rosemary, and thyme. Plant in well-draining soil with 6+ hours of sunlight. Herbs in containers need pots with drainage holes and regular watering when the top inch of soil is dry. Harvest regularly by pinching stems above leaf nodes to encourage bushy growth."}
+{"input": "how to evaluate a scientific claim", "output": "lex: evaluate scientific claim evidence\nlex: critical thinking scientific evidence peer review\nlex: assess scientific study credibility\nvec: how do you critically evaluate whether a scientific claim is supported by credible evidence?\nvec: what criteria should you use to judge the reliability of a scientific study or finding?\nhyde: Check if the claim is published in a peer-reviewed journal. Look at the sample size, methodology, and whether results have been replicated independently. Consider whether the source has conflicts of interest. Distinguish between correlation and causation. Evaluate the statistical significance and effect size reported in the study."}
+{"input": "what is virtue signaling?", "output": "lex: virtue signaling definition examples\nlex: virtue signaling social media politics\nvec: what does virtue signaling mean and how is the term used in political and social discourse?\nvec: how do people use virtue signaling to publicly express moral values without substantive action?\nhyde: Virtue signaling refers to the public expression of moral values or opinions primarily intended to demonstrate one's good character rather than to effect change. The term is often used critically to describe performative displays on social media — such as posting a hashtag or changing a profile picture — without taking meaningful action on the issue."}
+{"input": "what is impact investing?", "output": "lex: impact investing ESG social return\nlex: impact investing funds sustainable\nlex: socially responsible investing SRI\nvec: what is impact investing and how does it generate both financial returns and social or environmental benefit?\nvec: how does impact investing differ from traditional investing and ESG strategies?\nhyde: Impact investing directs capital toward companies and projects that generate measurable social or environmental benefits alongside financial returns. Unlike ESG screening, which excludes harmful sectors, impact investing actively targets positive outcomes — such as affordable housing, renewable energy, or microfinance. The Global Impact Investing Network (GIIN) estimates the market at over $1 trillion."}
+{"input": "stellar cartography", "output": "lex: stellar cartography star mapping\nlex: star chart celestial mapping catalog\nlex: astronomical survey stellar positions\nvec: what is stellar cartography and how do astronomers map the positions and movements of stars?\nvec: what tools and surveys are used to create detailed maps of stars in the galaxy?\nhyde: Stellar cartography is the science of mapping the positions, distances, and motions of stars. The ESA's Gaia mission has cataloged over 1.8 billion stars with precise positions and parallax measurements. Stellar maps use right ascension and declination coordinates, with distances measured in parsecs from trigonometric parallax."}
+{"input": "what are hedge funds?", "output": "lex: hedge funds investment strategy\nlex: hedge fund accredited investors returns\nlex: hedge fund management fee structure\nvec: what are hedge funds and how do they differ from mutual funds and other investment vehicles?\nvec: what strategies do hedge funds use to generate returns and manage risk?\nhyde: A hedge fund is a pooled investment fund that employs diverse strategies — including long/short equity, arbitrage, and derivatives trading — to generate returns for accredited investors. Unlike mutual funds, hedge funds face fewer regulatory restrictions and typically charge a 2% management fee plus 20% of profits (the \"2 and 20\" model)."}
+{"input": "github repository", "output": "lex: GitHub repository create manage\nlex: GitHub repo clone push pull\nlex: git repository hosting GitHub\nvec: how do you create and manage a repository on GitHub for version control?\nvec: what are the basic operations for working with a GitHub repository including cloning, pushing, and pull requests?\nhyde: To create a GitHub repository, click \"New repository\" on github.com, name it, and choose public or private visibility. Clone it locally with `git clone https://github.com/user/repo.git`. Add files, commit changes, and push with `git push origin main`. Collaborate through pull requests and code reviews."}
+{"input": "how to enhance positive social impact?", "output": "lex: enhance social impact community\nlex: positive social impact strategies nonprofit\nlex: social change community engagement\nvec: what are effective strategies for individuals and organizations to create positive social impact?\nvec: how can nonprofits and businesses measure and increase their social impact in communities?\nhyde: To enhance social impact, define clear measurable goals aligned with community needs. Use a theory of change to map how activities lead to outcomes. Partner with local organizations for culturally informed approaches. Measure results with both quantitative metrics (people served, outcomes achieved) and qualitative feedback from beneficiaries."}
+{"input": "how to negotiate rent prices", "output": "lex: negotiate rent price landlord\nlex: rent negotiation apartment lease\nlex: lower rent strategies tenant\nvec: how do you negotiate a lower rent price with your landlord when signing or renewing a lease?\nvec: what tactics and arguments can tenants use to get a better deal on apartment rent?\nhyde: Research comparable rents in your area on Zillow or Apartments.com before negotiating. Highlight your strengths as a tenant: stable income, good credit, long tenure, or willingness to sign a longer lease. Negotiate during off-peak months (November-February) when demand is lower. Offer to prepay several months or handle minor maintenance in exchange for a reduction."}
+{"input": "how to propagate succulents from leaves", "output": "lex: propagate succulents leaves cuttings\nlex: succulent leaf propagation rooting\nlex: grow succulents from leaf\nvec: how do you propagate new succulent plants from individual leaf cuttings?\nvec: what is the step-by-step process for rooting succulent leaves to grow new plants?\nhyde: Gently twist a healthy leaf from the stem, ensuring a clean break with the base intact. Let it callous over for 2-3 days in indirect light. Place on top of well-draining cactus soil and mist every few days. Roots and a tiny rosette will appear in 2-4 weeks. Avoid direct sunlight until established."}
+{"input": "what is the role of non-governmental organizations", "output": "lex: NGO non-governmental organization role\nlex: NGOs humanitarian aid development\nlex: nonprofit organizations international advocacy\nvec: what roles do non-governmental organizations (NGOs) play in humanitarian aid, development, and advocacy?\nvec: how do NGOs influence government policy and deliver services in developing countries?\nhyde: Non-governmental organizations (NGOs) operate independently from government to address social, environmental, and humanitarian issues. They deliver aid in crisis zones, advocate for policy changes, monitor human rights, and provide services like healthcare and education. Major NGOs include Médecins Sans Frontières, Amnesty International, and the Red Cross."}
+{"input": "what is pentecost in christian faith", "output": "lex: Pentecost Christian Holy Spirit\nlex: Pentecost Acts apostles church\nlex: Pentecost feast day Christianity\nvec: what is the meaning and significance of Pentecost in the Christian faith?\nvec: what happened on the day of Pentecost according to the Book of Acts in the Bible?\nhyde: Pentecost commemorates the descent of the Holy Spirit upon the apostles fifty days after Easter, as described in Acts 2. The apostles began speaking in tongues and Peter preached to a crowd, leading to about 3,000 conversions. It is often called the birthday of the Christian Church and is celebrated as a major feast day."}
+{"input": "how to pay off student loans faster", "output": "lex: pay off student loans faster\nlex: student loan repayment strategies\nlex: student loan refinance extra payments\nvec: what are the most effective strategies for paying off student loans ahead of schedule?\nvec: how can refinancing or making extra payments help you pay off student loans faster?\nhyde: Make payments above the minimum and specify that extra goes toward the principal. Refinance at a lower interest rate if your credit has improved. Use the avalanche method to target the highest-rate loan first. Set up biweekly payments instead of monthly to make one extra payment per year. Allocate windfalls like tax refunds directly to loans."}
+{"input": "what are the characteristics of gothic literature?", "output": "lex: gothic literature characteristics elements\nlex: gothic fiction dark romantic horror\nlex: gothic novel atmosphere supernatural\nvec: what are the defining characteristics and common elements of gothic literature?\nvec: how do gothic novels use setting, atmosphere, and the supernatural to create suspense and dread?\nhyde: Gothic literature features dark, brooding settings like castles, ruins, and isolated mansions. Common elements include supernatural events, madness, secrets, and heightened emotion. The atmosphere is oppressive and foreboding. Key works include Horace Walpole's The Castle of Otranto, Mary Shelley's Frankenstein, and Bram Stoker's Dracula."}
+{"input": "how to register a political party", "output": "lex: register political party requirements\nlex: form new political party ballot access\nlex: political party registration petition signatures\nvec: what is the legal process for registering a new political party in the United States?\nvec: what requirements must be met to officially form and register a political party for elections?\nhyde: Requirements to register a political party vary by state. Generally, you must file organizational documents with the secretary of state, collect a minimum number of petition signatures (often 1-5% of registered voters), adopt a party platform and bylaws, and hold a founding convention. Some states also require fielding candidates in a certain number of races."}
+{"input": "leather reclining lounge chairs", "output": "lex: leather reclining lounge chair\nlex: leather recliner chair buy\nlex: reclining lounge chair living room\nvec: what are the best leather reclining lounge chairs for comfort and durability?\nvec: where can I buy a high-quality leather recliner chair for my living room?\nhyde: The La-Z-Boy Kirkwood leather recliner features top-grain leather upholstery, a power reclining mechanism, and lumbar support. At $1,200, it's a mid-range option with a 10-year warranty. For premium choices, the Ekornes Stressless recliner offers ergonomic design with adjustable headrest and glide function starting at $2,500."}
+{"input": "how to write a scientific research proposal", "output": "lex: write scientific research proposal\nlex: research proposal template structure\nlex: grant proposal methodology aims\nvec: how do you write a compelling scientific research proposal with clear aims and methodology?\nvec: what sections and structure should a scientific research proposal include?\nhyde: A scientific research proposal typically includes: title, abstract, specific aims, background and significance, preliminary data, research design and methods, timeline, budget and justification, and references. The specific aims page is the most critical — state the problem, your hypothesis, and 2-3 measurable objectives clearly in one page."}
+{"input": "how to open a savings account", "output": "lex: open savings account bank\nlex: savings account requirements documents\nlex: high yield savings account online\nvec: what is the process for opening a savings account at a bank or online institution?\nvec: what documents and minimum deposit do you need to open a savings account?\nhyde: To open a savings account, choose a bank or credit union and compare interest rates (high-yield online accounts often offer 4-5% APY). You'll need a government-issued ID, Social Security number, and an initial deposit (often $25-$100). Apply online or in person. Link a checking account for easy transfers and set up automatic deposits."}
+{"input": "what is the role of e-commerce in modern business", "output": "lex: e-commerce business online retail\nlex: e-commerce sales growth digital\nlex: online shopping platform business model\nvec: how has e-commerce transformed the way businesses sell products and reach customers?\nvec: what role does e-commerce play in business strategy including direct-to-consumer and marketplace models?\nhyde: E-commerce enables businesses to sell products globally without physical storefronts. Companies use platforms like Shopify, Amazon Marketplace, and WooCommerce to reach customers online. In 2024, global e-commerce sales exceeded $6 trillion. Direct-to-consumer (DTC) brands cut out middlemen, while marketplaces aggregate sellers for one-stop shopping."}
+{"input": "tree climb", "output": "lex: tree climbing techniques equipment\nlex: recreational tree climbing arborist\nlex: tree climbing harness rope\nvec: what techniques and equipment are used for recreational or professional tree climbing?\nvec: how do arborists safely climb trees using ropes, harnesses, and climbing spurs?\nhyde: Recreational tree climbing uses a doubled-rope technique (DRT) with a throw line to set the rope over a branch. Climbers wear a saddle harness and ascend using mechanical ascenders or friction hitches like the Blake's hitch. Arborists use single-rope technique (SRT) for efficiency and may use climbing spurs for removals only."}
+{"input": "how to upgrade car headlights?", "output": "lex: upgrade car headlights LED HID\nlex: replace headlight bulbs brighter\nlex: headlight upgrade installation\nvec: how do you upgrade your car's headlights to brighter LED or HID bulbs?\nvec: what are the steps for replacing stock halogen headlights with aftermarket LED headlights?\nhyde: To upgrade from halogen to LED headlights, find your bulb size in the owner's manual (e.g., H11, 9005). Purchase a quality LED kit from brands like Hikari or Fahren. Remove the old bulb by twisting the retaining ring, insert the LED bulb, and connect the driver/ballast. Aim the headlights after installation to avoid blinding oncoming traffic."}
+{"input": "what are the themes of to kill a mockingbird?", "output": "lex: To Kill a Mockingbird themes\nlex: To Kill a Mockingbird racial injustice innocence\nlex: Harper Lee themes moral courage\nvec: what are the major themes explored in Harper Lee's To Kill a Mockingbird?\nvec: how does To Kill a Mockingbird address racial injustice, moral courage, and the loss of innocence?\nhyde: The central themes of To Kill a Mockingbird include racial injustice in the American South, as shown through Tom Robinson's trial. Moral courage is embodied by Atticus Finch, who defends Robinson despite social pressure. The loss of innocence is traced through Scout's growing awareness of prejudice and cruelty in Maycomb, Alabama."}
+{"input": "how to install a car roof rack?", "output": "lex: install car roof rack\nlex: roof rack mounting crossbars\nlex: car roof rack installation guide\nvec: how do you install a roof rack on a car with or without factory roof rails?\nvec: what are the steps for mounting crossbars and a roof rack system on a vehicle?\nhyde: For cars with factory side rails, slide the crossbar feet onto the rails and tighten the clamps at your desired spacing. For bare roofs, use a fit kit with clips that hook into the door frame. Torque the mounting hardware to the manufacturer's specification (usually 6-8 Nm). Test by pushing firmly on the bars to confirm they don't shift."}
+{"input": "why is deforestation a concern?", "output": "lex: deforestation environmental impact\nlex: deforestation climate change biodiversity loss\nlex: tropical rainforest destruction causes\nvec: why is deforestation considered a serious environmental problem and what are its consequences?\nvec: how does deforestation contribute to climate change, biodiversity loss, and soil erosion?\nhyde: Deforestation removes trees that absorb CO2, releasing stored carbon and accelerating climate change. Tropical forests hold over 50% of Earth's species — clearing them drives mass extinction. Deforested land loses topsoil to erosion, reducing agricultural productivity. The Amazon alone lost 10,000 square kilometers of forest in a single year."}
+{"input": "how do philosophers explore the nature of reality", "output": "lex: philosophy nature of reality metaphysics\nlex: metaphysics ontology existence\nlex: philosophical realism idealism\nvec: how have philosophers historically explored and debated the nature of reality and existence?\nvec: what are the main metaphysical positions on whether reality is fundamentally material, mental, or something else?\nhyde: Metaphysics, the branch of philosophy concerned with the nature of reality, asks questions like: What exists? Is the physical world all there is? Plato argued that true reality consists of abstract Forms. Descartes proposed mind-body dualism. Materialists hold that only physical matter exists, while idealists like Berkeley argued that reality is fundamentally mental."}
+{"input": "how to build a writing routine", "output": "lex: writing routine daily habit\nlex: build writing practice discipline\nlex: writing schedule productivity\nvec: how do you establish a consistent daily writing routine and maintain discipline?\nvec: what strategies do professional writers use to build and sustain a writing habit?\nhyde: Set a specific time each day for writing — morning works best for many writers because willpower is highest. Start with a modest goal of 300-500 words and increase gradually. Write in the same place to create environmental cues. Track your word count daily. Don't edit while drafting — the first draft's only job is to exist."}
+{"input": "what are public sentiments on immigration", "output": "lex: public opinion immigration polls\nlex: immigration attitudes survey sentiment\nlex: immigration policy public views 2025 2026\nvec: what do recent polls and surveys reveal about public sentiment on immigration policy?\nvec: how do public attitudes toward immigration vary by country, political affiliation, and demographics?\nhyde: A 2025 Gallup poll found that 28% of Americans wanted immigration increased, 36% wanted it decreased, and 33% wanted it kept at current levels. Views split sharply along party lines: 55% of Democrats favored more immigration versus 11% of Republicans. In Europe, surveys showed rising concern about integration alongside recognition of labor market needs."}
+{"input": "how do people practice meditation in buddhism", "output": "lex: Buddhist meditation practice techniques\nlex: Vipassana Zen meditation Buddhism\nlex: mindfulness meditation Buddhist traditions\nvec: what are the main forms of meditation practiced in Buddhism and how are they performed?\nvec: how do Vipassana, Zen, and Tibetan Buddhist meditation techniques differ from each other?\nhyde: Buddhist meditation includes two main types: samatha (calm abiding) and vipassana (insight). In Vipassana, practitioners observe bodily sensations and mental events with equanimity. Zen meditation (zazen) involves sitting with awareness of breath, often facing a wall. Tibetan Buddhism adds visualization practices and mantra recitation. All traditions emphasize mindful awareness."}
+{"input": "how to edit in lightroom", "output": "lex: edit photos Adobe Lightroom\nlex: Lightroom editing tutorial sliders\nlex: Lightroom develop module adjustments\nvec: how do you edit and enhance photos using Adobe Lightroom's develop module?\nvec: what are the essential Lightroom editing steps for exposure, color, and tone adjustments?\nhyde: In Lightroom's Develop module, start with the Basic panel: adjust Exposure for overall brightness, then Highlights and Shadows to recover detail. Set White Balance using the eyedropper or Temperature/Tint sliders. Increase Clarity for midtone contrast and Vibrance for subtle color boost. Use the HSL panel to fine-tune individual colors."}
+{"input": "how does the philosophy of education explore learning", "output": "lex: philosophy of education learning theory\nlex: educational philosophy Dewey Montessori\nlex: epistemology education pedagogy\nvec: how do educational philosophers like Dewey and Montessori theorize about the nature of learning?\nvec: what are the major philosophical approaches to education and how do they shape teaching methods?\nhyde: John Dewey's pragmatism views learning as experiential — students learn by doing and reflecting. Montessori emphasizes self-directed activity and hands-on learning in prepared environments. Constructivism holds that learners build knowledge actively rather than passively receiving it. Each philosophy leads to different classroom structures and teaching practices."}
+{"input": "how to make a family budget?", "output": "lex: family budget plan household\nlex: family budget spreadsheet expenses\nlex: household budgeting categories\nvec: how do you create a family budget that accounts for all household income and expenses?\nvec: what categories and tools should you use when building a family budget?\nhyde: List all family income sources including salaries, freelance work, and benefits. Categorize expenses into fixed (mortgage, insurance, utilities), variable (groceries, gas, clothing), and discretionary (dining out, entertainment). Allocate funds using the envelope method or a budgeting app like Mint or YNAB. Review spending together monthly."}
+{"input": "what is the significance of the ten commandments", "output": "lex: Ten Commandments significance Bible\nlex: Ten Commandments Moses Judaism Christianity\nlex: Decalogue moral law religious\nvec: what is the religious and historical significance of the Ten Commandments in Judaism and Christianity?\nvec: how have the Ten Commandments influenced Western law, ethics, and moral codes?\nhyde: The Ten Commandments (Decalogue) were given by God to Moses on Mount Sinai, as recorded in Exodus 20 and Deuteronomy 5. They form the foundational moral code of Judaism and Christianity, covering duties to God (no other gods, no idols, keep the Sabbath) and duties to others (honor parents, do not murder, steal, or lie)."}
+{"input": "what is creative non-fiction?", "output": "lex: creative non-fiction genre writing\nlex: creative nonfiction memoir essay narrative\nlex: literary nonfiction storytelling\nvec: what is creative non-fiction and how does it differ from traditional journalism or academic writing?\nvec: what techniques do creative non-fiction writers use to tell true stories in a literary way?\nhyde: Creative non-fiction uses literary techniques — narrative arc, scene-setting, dialogue, and vivid description — to tell true stories. Subgenres include memoir, personal essay, literary journalism, and nature writing. Unlike standard reporting, the writer's voice and perspective are central. Examples include Truman Capote's In Cold Blood and Joan Didion's essays."}
+{"input": "air filter", "output": "lex: air filter replacement HVAC\nlex: car engine air filter\nlex: home air purifier HEPA filter\nvec: how often should you replace an air filter in your car engine or home HVAC system?\nvec: what types of air filters are available for home air purifiers and what do HEPA ratings mean?\nhyde: Replace your car's engine air filter every 15,000-30,000 miles depending on driving conditions. Home HVAC filters should be changed every 1-3 months. HEPA filters capture 99.97% of particles 0.3 microns or larger. MERV ratings from 1-16 indicate filtration efficiency — MERV 13+ is recommended for allergy sufferers."}
+{"input": "what is the periodic table", "output": "lex: periodic table elements chemistry\nlex: periodic table groups periods atomic number\nlex: Mendeleev periodic table organization\nvec: what is the periodic table and how are chemical elements organized within it?\nvec: how did Mendeleev create the periodic table and what patterns does it reveal about element properties?\nhyde: The periodic table organizes all known chemical elements by increasing atomic number into rows (periods) and columns (groups). Elements in the same group share similar chemical properties because they have the same number of valence electrons. Dmitri Mendeleev published the first widely recognized periodic table in 1869, predicting undiscovered elements."}
+{"input": "how to use green screen", "output": "lex: green screen chroma key setup\nlex: green screen video editing background\nlex: green screen lighting technique\nvec: how do you set up and use a green screen for video production and chroma key compositing?\nvec: what lighting and camera settings are needed for clean green screen footage?\nhyde: Set up an evenly lit green screen with no wrinkles or shadows. Place the subject at least 6 feet in front of the screen to avoid green spill. Use two softbox lights at 45-degree angles on the screen and separate lights for the subject. In post-production, apply chroma key in software like DaVinci Resolve or After Effects to replace the green background."}
+{"input": "what are the latest fashion trends 2023?", "output": "lex: fashion trends 2023 2024 2025\nlex: latest fashion trends clothing style\nlex: 2023 fashion runway trends\nvec: what were the top fashion trends in 2023 and how have they evolved into 2024-2025?\nvec: what clothing styles, colors, and silhouettes defined fashion trends in recent years?\nhyde: Key fashion trends in 2023 included quiet luxury with understated neutral tones and premium fabrics, oversized blazers and tailored wide-leg trousers, sheer fabrics, ballet flats, and the revival of denim-on-denim. Barbiecore pink carried over from 2022, while earth tones and burgundy gained momentum heading into 2024."}
+{"input": "how to conduct field research", "output": "lex: field research methods data collection\nlex: conduct field study observation interview\nlex: ethnographic fieldwork techniques\nvec: how do researchers plan and conduct field research including observation and interviews?\nvec: what are the methods and ethical considerations involved in conducting ethnographic field research?\nhyde: Field research involves collecting data in natural settings through observation, interviews, and surveys. Begin with a clear research question and ethical approval. Use participant observation to immerse yourself in the environment. Take detailed field notes immediately after each session. Triangulate data from multiple sources to strengthen validity."}
+{"input": "digital currencies", "output": "lex: digital currency cryptocurrency Bitcoin\nlex: digital currency CBDC blockchain\nlex: cryptocurrency exchange trading\nvec: what are digital currencies including cryptocurrencies and central bank digital currencies (CBDCs)?\nvec: how do digital currencies like Bitcoin and Ethereum work using blockchain technology?\nhyde: Digital currencies exist only in electronic form and include cryptocurrencies like Bitcoin and Ethereum, which use decentralized blockchain networks, and central bank digital currencies (CBDCs) issued by governments. Bitcoin uses proof-of-work consensus while Ethereum moved to proof-of-stake. Over 130 countries are exploring or piloting CBDCs as of 2025."}
+{"input": "tree grow", "output": "lex: tree growth rate species\nlex: grow trees planting care\nlex: tree growth stages seedling mature\nvec: how fast do different tree species grow and what conditions promote healthy tree growth?\nvec: what are the stages of tree growth from seedling to mature tree and how do you care for young trees?\nhyde: Tree growth rates vary widely by species. Fast-growing trees like hybrid poplar and willow can add 3-5 feet per year, while oaks grow 1-2 feet annually. For healthy growth, plant in appropriate soil with adequate drainage, water deeply during the first two years, mulch around the base (not touching the trunk), and prune to establish strong structure."}
+{"input": "sail set", "output": "lex: sail set trim sailing\nlex: setting sails rigging sailboat\nlex: sail trim wind angle\nvec: how do you properly set and trim sails on a sailboat for different wind conditions?\nvec: what is the correct technique for setting a mainsail and jib when sailing upwind or downwind?\nhyde: To set the mainsail, head into the wind and raise the halyard while feeding the luff into the mast track. Tension the outhaul and cunningham based on wind strength. When sailing upwind, trim the mainsheet until the telltales flow evenly. Ease the sheet when reaching or running. Adjust the jib sheet so the luff telltales break evenly."}
+{"input": "how to apply the scientific method", "output": "lex: scientific method steps process\nlex: apply scientific method experiment hypothesis\nlex: scientific method observation data analysis\nvec: what are the steps of the scientific method and how do you apply them to an experiment?\nvec: how do scientists use the scientific method to test hypotheses and draw conclusions?\nhyde: The scientific method follows these steps: (1) Observe a phenomenon, (2) Ask a question, (3) Form a testable hypothesis, (4) Design and conduct an experiment with controlled variables, (5) Collect and analyze data, (6) Draw conclusions — does the evidence support or refute the hypothesis? (7) Communicate results and invite replication."}
+{"input": "what is the role of the holy spirit in christianity?", "output": "lex: Holy Spirit Christianity role\nlex: Holy Spirit Trinity Christian theology\nlex: Holy Spirit gifts fruits Bible\nvec: what role does the Holy Spirit play in Christian theology and the life of believers?\nvec: how is the Holy Spirit understood within the doctrine of the Trinity in Christianity?\nhyde: In Christian theology, the Holy Spirit is the third person of the Trinity — coequal with the Father and the Son. The Spirit convicts of sin, regenerates believers at conversion, indwells Christians as a guide and comforter, and empowers them with spiritual gifts (1 Corinthians 12). At Pentecost, the Spirit descended on the apostles, enabling them to preach."}
+{"input": "code review", "output": "lex: code review pull request\nlex: code review checklist guidelines\nlex: peer code review feedback\nvec: what are the best practices for conducting an effective code review on a pull request?\nvec: what should reviewers look for during a code review including bugs, readability, and architecture?\nhyde: During a code review, check for correctness, readability, and maintainability. Look for edge cases, error handling, and potential security issues. Verify that naming conventions are clear and tests cover the new code. Provide constructive feedback with specific suggestions rather than vague criticism. Approve only when the code is production-ready."}
+{"input": "how to manage personal finances", "output": "lex: personal finance management\nlex: manage money budgeting saving investing\nlex: personal financial planning\nvec: what are the key steps for managing your personal finances including budgeting, saving, and investing?\nvec: how should you organize your personal finances to build wealth and avoid debt?\nhyde: Start with a budget tracking all income and expenses. Build an emergency fund covering 3-6 months of expenses. Pay off high-interest debt aggressively. Contribute enough to your 401(k) to get the employer match, then fund a Roth IRA. Automate savings and investments. Review your financial plan quarterly and adjust as income or goals change."}
+{"input": "how to understand legislative documents", "output": "lex: read legislative documents bills statutes\nlex: understand legislation legal language\nlex: interpreting bills acts laws\nvec: how do you read and interpret legislative documents such as bills, statutes, and regulations?\nvec: what techniques help non-lawyers understand the language and structure of legislative texts?\nhyde: Legislative documents follow a standard structure: the title, enacting clause, definitions section, substantive provisions, and effective date. Start with the definitions section — legal terms often have specific meanings different from everyday use. Read the \"findings\" or \"purpose\" section for context. Track cross-references to other statutes. Legislative summaries from CRS or CBO can provide plain-language explanations."}
+{"input": "how to participate in public policy discussions", "output": "lex: participate public policy discussion civic\nlex: public policy engagement town hall\nlex: citizen participation policy advocacy\nvec: how can citizens effectively participate in public policy discussions and influence government decisions?\nvec: what are the ways individuals can engage in public policy debates at the local, state, and federal level?\nhyde: Attend town hall meetings and public comment sessions held by local and state government bodies. Submit written comments during rulemaking periods — federal agencies post proposed rules on regulations.gov. Contact your elected representatives by phone or email. Join advocacy organizations that align with your policy priorities and participate in their campaigns."}
+{"input": "what is the role of philosophy in religion?", "output": "lex: philosophy of religion theology\nlex: philosophical arguments God existence\nlex: religion philosophy relationship faith reason\nvec: what role does philosophy play in examining and understanding religious beliefs and concepts?\nvec: how do philosophers analyze religious claims about God, the soul, and the meaning of existence?\nhyde: Philosophy of religion examines fundamental questions that religions address: Does God exist? What is the nature of the soul? How can evil exist if God is omnipotent? Philosophers evaluate arguments for God's existence (cosmological, teleological, ontological) and critique them. The field also explores the relationship between faith and reason, asking whether religious belief can be rationally justified."}
+{"input": "what is outdoor survival training?", "output": "lex: outdoor survival training wilderness\nlex: survival skills shelter fire water\nlex: wilderness survival course\nvec: what does outdoor survival training involve and what skills does it teach?\nvec: how do wilderness survival courses teach people to find shelter, water, fire, and food in the wild?\nhyde: Outdoor survival training teaches skills needed to stay alive in wilderness emergencies. Core topics include building emergency shelters from natural materials, finding and purifying water, starting fire without matches using a ferro rod or bow drill, signaling for rescue, and basic navigation without GPS. Courses range from weekend workshops to multi-week immersive programs."}
+{"input": "what is the history of the jazz age", "output": "lex: Jazz Age history 1920s\nlex: Jazz Age Harlem Renaissance Roaring Twenties\nlex: jazz music history Louis Armstrong\nvec: what was the Jazz Age and how did jazz music shape American culture in the 1920s?\nvec: how did the Jazz Age connect to the Harlem Renaissance and the social changes of the Roaring Twenties?\nhyde: The Jazz Age, spanning roughly 1920-1929, was a cultural movement defined by the rise of jazz music, loosened social mores, and economic prosperity. Jazz originated in New Orleans and spread to Chicago and New York. The Harlem Renaissance saw Black artists, musicians, and writers flourish. Louis Armstrong, Duke Ellington, and Bessie Smith became icons. The era ended with the stock market crash of 1929."}
+{"input": "how to analyze government budgets", "output": "lex: analyze government budget fiscal\nlex: government budget analysis revenue expenditure\nlex: federal state budget breakdown\nvec: how do you read and analyze a government budget to understand spending priorities and fiscal health?\nvec: what tools and frameworks are used to evaluate government budget allocations and deficits?\nhyde: To analyze a government budget, start with the summary tables showing total revenue, total expenditure, and the deficit or surplus. Compare allocations across categories: defense, healthcare, education, infrastructure. Track year-over-year changes to identify spending trends. Examine revenue sources (income tax, sales tax, borrowing) and assess whether projected growth assumptions are realistic."}
+{"input": "how to learn python programming?", "output": "lex: learn Python programming beginner\nlex: Python tutorial course exercises\nlex: Python programming fundamentals syntax\nvec: what is the best way for a beginner to learn Python programming from scratch?\nvec: what resources, courses, and projects should someone use to learn Python programming?\nhyde: Start with Python's official tutorial at docs.python.org. Learn the basics: variables, data types, loops, conditionals, and functions. Practice on sites like LeetCode or HackerRank. Build small projects — a calculator, a to-do list, or a web scraper using requests and BeautifulSoup. Automate the Boring Stuff with Python is a popular free book for beginners."}
+{"input": "what is the gospel of wealth", "output": "lex: Gospel of Wealth Andrew Carnegie\nlex: Gospel of Wealth philanthropy gilded age\nvec: what is the Gospel of Wealth written by Andrew Carnegie and what does it argue about the duty of the rich?\nvec: how did Andrew Carnegie's Gospel of Wealth influence philanthropy and attitudes toward wealth in America?\nhyde: The Gospel of Wealth is an 1889 essay by Andrew Carnegie arguing that the wealthy have a moral obligation to distribute their surplus wealth for the public good. Carnegie believed that rich individuals were better suited than government to direct resources toward education, libraries, and civic institutions. He practiced this philosophy by funding over 2,500 public libraries."}
+{"input": "how do various religions interpret the concept of god?", "output": "lex: concept of God religions monotheism polytheism\nlex: God Christianity Islam Hinduism Judaism\nlex: religious interpretations divine nature\nvec: how do different world religions like Christianity, Islam, Hinduism, and Buddhism understand the concept of God?\nvec: what are the key differences between monotheistic, polytheistic, and non-theistic religious views of God?\nhyde: Christianity, Islam, and Judaism are monotheistic — they worship one God, though Christianity distinguishes three persons in the Trinity. Hinduism includes both monotheistic and polytheistic traditions: Brahman is the ultimate reality, while deities like Vishnu and Shiva represent aspects of it. Buddhism is non-theistic, focusing on awakening rather than worship of a creator God."}
+{"input": "what is satire", "output": "lex: satire literary device definition\nlex: satire examples humor criticism\nlex: satirical writing Swift Orwell\nvec: what is satire as a literary form and how does it use humor to criticize people, institutions, or society?\nvec: what are famous examples of satire in literature, television, and political commentary?\nhyde: Satire uses irony, exaggeration, and ridicule to expose and criticize foolishness or corruption. Jonathan Swift's A Modest Proposal satirized British policy toward Ireland by suggesting the poor sell their children as food. George Orwell's Animal Farm satirized Soviet totalitarianism. Modern satire appears in shows like The Daily Show and publications like The Onion."}
+{"input": "json serial", "output": "lex: JSON serialization deserialization\nlex: JSON serialize object string\nlex: JSON stringify parse encoding\nvec: how do you serialize objects to JSON and deserialize JSON strings back to objects in programming?\nvec: what functions are used for JSON serialization in Python, JavaScript, and other languages?\nhyde: JSON serialization converts an object into a JSON string for storage or transmission. In JavaScript, JSON.stringify(obj) serializes and JSON.parse(str) deserializes. In Python, json.dumps(obj) converts to a string and json.loads(str) parses back. Custom serialization for dates or complex types requires encoder/decoder overrides."}
+{"input": "how to fix car air conditioning?", "output": "lex: fix car air conditioning AC repair\nlex: car AC not blowing cold recharge\nlex: automotive AC compressor refrigerant\nvec: how do you diagnose and fix a car air conditioning system that is not blowing cold air?\nvec: what are the common causes of car AC failure and how do you recharge the refrigerant?\nhyde: If your car AC blows warm air, check the refrigerant level first — low refrigerant is the most common cause. Use a recharge kit with R-134a (or R-1234yf for newer cars) and a pressure gauge. If the compressor clutch doesn't engage, check the fuse and relay. A leak requires UV dye detection and repair before recharging. Cabin filter clogs can also reduce airflow."}
+{"input": "what is moral absolutism", "output": "lex: moral absolutism ethics definition\nlex: moral absolutism versus relativism\nlex: absolute moral principles deontology\nvec: what is moral absolutism and how does it differ from moral relativism in ethical philosophy?\nvec: what are the arguments for and against the view that some moral rules are universally true?\nhyde: Moral absolutism holds that certain actions are intrinsically right or wrong regardless of context, culture, or consequences. For example, an absolutist would say lying is always wrong, even to protect someone. This view aligns with Kantian deontology and natural law theory. Critics argue it fails to account for moral dilemmas where absolute rules conflict."}

+ 12 - 0
finetune/data/train/dataset_info.json

@@ -0,0 +1,12 @@
+{
+  "dataset_name": "qmd-query-expansion",
+  "train_samples": 5562,
+  "val_samples": 618,
+  "short_query_pct": 26.5,
+  "columns": [
+    "prompt",
+    "completion",
+    "text",
+    "messages"
+  ]
+}

+ 12 - 0
finetune/data/train_v2/dataset_info.json

@@ -0,0 +1,12 @@
+{
+  "dataset_name": "qmd-query-expansion",
+  "train_samples": 1145,
+  "val_samples": 128,
+  "short_query_pct": 29.3,
+  "columns": [
+    "prompt",
+    "completion",
+    "text",
+    "messages"
+  ]
+}

+ 595 - 0
finetune/dataset/clean_data.py

@@ -0,0 +1,595 @@
+#!/usr/bin/env python3
+"""
+Data Quality Reviewer for Query Expansion Training Dataset
+
+This script identifies and flags/fixes semantic errors where technical terms
+are misunderstood. For example:
+- "gem find" expanded as "mineral hunt" instead of "ruby gem search"
+- "yarn spin" expanded as "wool twist" instead of "yarn package manager"
+
+The script uses contextual analysis to detect when technical terms
+are likely being used in a programming context vs. their everyday meaning.
+"""
+
+import json
+import re
+from pathlib import Path
+from dataclasses import dataclass, field
+from typing import Optional
+from collections import defaultdict
+
+
+@dataclass
+class TechnicalTerm:
+    """Definition of a technical term that might be misunderstood."""
+    term: str  # The ambiguous term (e.g., "liquid", "gem", "yarn")
+    context_indicators: list[str]  # Words that suggest tech context
+    wrong_expansions: list[str]  # Patterns that indicate wrong interpretation
+    correct_domain: str  # What domain this belongs to when technical
+    correct_lex: list[str]  # Correct lex expansions
+    correct_vec: list[str]  # Correct vec expansions
+
+
+# Known technical terms that are commonly misunderstood
+KNOWN_TECHNICAL_TERMS = [
+    TechnicalTerm(
+        term="liquid",
+        context_indicators=["shopify", "template", "filter", "tag", "theme", "jekyll"],
+        wrong_expansions=["fluid", "water", "pour", "drink", "beverage", "h2o", "wet"],
+        correct_domain="Shopify/Jekyll templating language",
+        correct_lex=["shopify template syntax", "liquid template filter"],
+        correct_vec=["shopify liquid templating language", "liquid template engine filters"],
+    ),
+    TechnicalTerm(
+        term="gem",
+        context_indicators=["ruby", "bundler", "install", "gemfile", "rails", "require"],
+        wrong_expansions=["mineral", "crystal", "jewel", "stone", "diamond", "jewelry", "precious"],
+        correct_domain="Ruby package manager",
+        correct_lex=["ruby gem package", "gem install command"],
+        correct_vec=["ruby gem package manager", "rubygems library installation"],
+    ),
+    TechnicalTerm(
+        term="yarn",
+        context_indicators=["npm", "package", "install", "node", "javascript", "react", "webpack"],
+        wrong_expansions=["thread", "wool", "knit", "spin", "textile", "fabric", "sew", "twist"],
+        correct_domain="JavaScript package manager",
+        correct_lex=["yarn package manager", "yarn install dependencies"],
+        correct_vec=["yarn javascript package manager", "yarn npm alternative"],
+    ),
+    TechnicalTerm(
+        term="hook",
+        context_indicators=["react", "use", "state", "effect", "component", "callback", "git"],
+        wrong_expansions=["fish", "fishing", "bait", "catch", "hang", "pirate"],
+        correct_domain="React hooks or Git hooks",
+        correct_lex=["react hooks api", "usestate useeffect"],
+        correct_vec=["react hooks state management", "react functional component hooks"],
+    ),
+    TechnicalTerm(
+        term="container",
+        context_indicators=["docker", "kubernetes", "k8s", "image", "orchestration", "pod"],
+        wrong_expansions=["box", "storage", "shipping", "cargo", "tupperware", "jar", "vessel"],
+        correct_domain="Docker/Kubernetes containers",
+        correct_lex=["docker container", "container image"],
+        correct_vec=["docker container virtualization", "container orchestration platform"],
+    ),
+    TechnicalTerm(
+        term="branch",
+        context_indicators=["git", "merge", "checkout", "commit", "main", "master", "repo"],
+        wrong_expansions=["tree", "limb", "wood", "leaf", "twig", "forest"],
+        correct_domain="Git version control",
+        correct_lex=["git branch", "git checkout branch"],
+        correct_vec=["git branch version control", "git branching workflow"],
+    ),
+    TechnicalTerm(
+        term="decorator",
+        context_indicators=["python", "@", "function", "wrapper", "class", "def"],
+        wrong_expansions=["interior", "design", "paint", "furniture", "decor", "ornament"],
+        correct_domain="Python decorators",
+        correct_lex=["python decorator function", "@decorator syntax"],
+        correct_vec=["python function decorators", "python decorator pattern"],
+    ),
+    TechnicalTerm(
+        term="bean",
+        context_indicators=["java", "spring", "injection", "dependency", "servlet", "ejb"],
+        wrong_expansions=["coffee", "food", "vegetable", "legume", "plant", "soy"],
+        correct_domain="Java Beans / Spring Beans",
+        correct_lex=["java bean class", "spring bean injection"],
+        correct_vec=["java enterprise beans", "spring dependency injection beans"],
+    ),
+    TechnicalTerm(
+        term="shell",
+        context_indicators=["bash", "script", "terminal", "command", "linux", "unix", "zsh"],
+        wrong_expansions=["seashell", "ocean", "beach", "clam", "oyster", "egg"],
+        correct_domain="Unix/Linux shell scripting",
+        correct_lex=["bash shell script", "shell command"],
+        correct_vec=["unix shell scripting", "bash command line shell"],
+    ),
+    TechnicalTerm(
+        term="rust",
+        context_indicators=["cargo", "crate", "ownership", "borrow", "lifetime", "unsafe"],
+        wrong_expansions=["oxidation", "metal", "corrosion", "decay", "iron", "orange"],
+        correct_domain="Rust programming language",
+        correct_lex=["rust programming language", "rust cargo package"],
+        correct_vec=["rust systems programming", "rust memory safety"],
+    ),
+    TechnicalTerm(
+        term="go",
+        context_indicators=["golang", "goroutine", "channel", "defer", "gofmt", "module"],
+        wrong_expansions=["travel", "move", "walk", "game", "board game", "leave", "depart"],
+        correct_domain="Go programming language",
+        correct_lex=["golang programming", "go language syntax"],
+        correct_vec=["go programming language", "golang concurrent programming"],
+    ),
+    TechnicalTerm(
+        term="swift",
+        context_indicators=["ios", "xcode", "apple", "uikit", "swiftui", "cocoa"],
+        wrong_expansions=["fast", "quick", "bird", "speed", "rapid", "taylor"],
+        correct_domain="Swift programming language",
+        correct_lex=["swift ios development", "swift programming language"],
+        correct_vec=["swift apple programming language", "swift ios app development"],
+    ),
+    TechnicalTerm(
+        term="pod",
+        context_indicators=["kubernetes", "k8s", "deployment", "service", "cluster", "node"],
+        wrong_expansions=["pea", "seed", "plant", "vegetable", "legume", "whale"],
+        correct_domain="Kubernetes pods",
+        correct_lex=["kubernetes pod", "k8s pod deployment"],
+        correct_vec=["kubernetes pod container group", "k8s pod orchestration"],
+    ),
+    TechnicalTerm(
+        term="redis",
+        context_indicators=["cache", "database", "key-value", "memory", "pub/sub", "queue"],
+        wrong_expansions=[],  # "redis" doesn't have common wrong meanings
+        correct_domain="Redis in-memory database",
+        correct_lex=["redis cache", "redis database"],
+        correct_vec=["redis in-memory data store", "redis caching solution"],
+    ),
+    TechnicalTerm(
+        term="kafka",
+        context_indicators=["message", "stream", "queue", "broker", "topic", "producer", "consumer"],
+        wrong_expansions=["franz", "author", "writer", "novel", "metamorphosis", "literature"],
+        correct_domain="Apache Kafka message queue",
+        correct_lex=["apache kafka", "kafka message broker"],
+        correct_vec=["apache kafka streaming platform", "kafka message queue"],
+    ),
+    TechnicalTerm(
+        term="elastic",
+        context_indicators=["elasticsearch", "search", "index", "kibana", "logstash", "query"],
+        wrong_expansions=["stretch", "rubber", "flexible", "band", "bouncy"],
+        correct_domain="Elasticsearch",
+        correct_lex=["elasticsearch", "elastic search index"],
+        correct_vec=["elasticsearch full-text search", "elastic stack"],
+    ),
+    TechnicalTerm(
+        term="spark",
+        context_indicators=["apache", "hadoop", "data", "rdd", "dataframe", "pyspark"],
+        wrong_expansions=["fire", "ignite", "flame", "plug", "electricity"],
+        correct_domain="Apache Spark",
+        correct_lex=["apache spark", "spark data processing"],
+        correct_vec=["apache spark big data processing", "spark cluster computing"],
+    ),
+    TechnicalTerm(
+        term="flask",
+        context_indicators=["python", "web", "route", "api", "jinja", "werkzeug"],
+        wrong_expansions=["bottle", "container", "lab", "chemistry", "drink", "thermos"],
+        correct_domain="Flask web framework",
+        correct_lex=["flask python web framework", "flask api"],
+        correct_vec=["flask python web development", "flask microframework"],
+    ),
+    TechnicalTerm(
+        term="django",
+        context_indicators=["python", "web", "orm", "model", "view", "template"],
+        wrong_expansions=["jazz", "music", "reinhardt", "guitar", "movie", "western"],
+        correct_domain="Django web framework",
+        correct_lex=["django python framework", "django web development"],
+        correct_vec=["django python web framework", "django orm models"],
+    ),
+    TechnicalTerm(
+        term="rails",
+        context_indicators=["ruby", "gem", "activerecord", "model", "controller", "migration"],
+        wrong_expansions=["train", "track", "railroad", "railway", "metal"],
+        correct_domain="Ruby on Rails",
+        correct_lex=["ruby on rails", "rails web framework"],
+        correct_vec=["ruby on rails framework", "rails mvc architecture"],
+    ),
+    TechnicalTerm(
+        term="node",
+        context_indicators=["javascript", "npm", "express", "async", "require", "module"],
+        wrong_expansions=["lump", "knot", "bump", "growth", "junction"],
+        correct_domain="Node.js",
+        correct_lex=["node.js javascript", "nodejs runtime"],
+        correct_vec=["node.js javascript runtime", "nodejs server-side javascript"],
+    ),
+    TechnicalTerm(
+        term="maven",
+        context_indicators=["java", "pom", "dependency", "build", "artifact", "repository"],
+        wrong_expansions=["expert", "specialist", "connoisseur"],
+        correct_domain="Apache Maven",
+        correct_lex=["apache maven", "maven build tool"],
+        correct_vec=["apache maven java build", "maven dependency management"],
+    ),
+    TechnicalTerm(
+        term="gradle",
+        context_indicators=["java", "kotlin", "android", "build", "groovy", "task"],
+        wrong_expansions=["grade", "slope", "hill", "incline"],
+        correct_domain="Gradle build tool",
+        correct_lex=["gradle build tool", "gradle android"],
+        correct_vec=["gradle java build automation", "gradle kotlin dsl"],
+    ),
+    TechnicalTerm(
+        term="ant",
+        context_indicators=["java", "build", "xml", "target", "task"],
+        wrong_expansions=["insect", "bug", "colony", "hill", "picnic"],
+        correct_domain="Apache Ant build tool",
+        correct_lex=["apache ant", "ant build xml"],
+        correct_vec=["apache ant java build", "ant build automation"],
+    ),
+]
+
+
+@dataclass
+class Issue:
+    """Represents an issue found in a dataset example."""
+    line_number: int
+    input_text: str
+    output_text: str
+    issue_type: str
+    technical_term: str
+    wrong_expansion_found: str
+    suggested_fix: Optional[str] = None
+
+
+@dataclass
+class AnalysisResult:
+    """Results of analyzing the dataset."""
+    total_examples: int = 0
+    issues_found: list[Issue] = field(default_factory=list)
+    examples_with_correct_tech_terms: list[tuple[int, str]] = field(default_factory=list)
+    term_statistics: dict = field(default_factory=lambda: defaultdict(int))
+
+
+def check_for_wrong_expansion(output_text: str, term: TechnicalTerm) -> Optional[str]:
+    """Check if the output contains wrong expansions for a technical term."""
+    output_lower = output_text.lower()
+    for wrong in term.wrong_expansions:
+        if wrong.lower() in output_lower:
+            return wrong
+    return None
+
+
+def has_tech_context(input_text: str, term: TechnicalTerm) -> bool:
+    """Check if the input has indicators of a technical context."""
+    input_lower = input_text.lower()
+    for indicator in term.context_indicators:
+        if indicator.lower() in input_lower:
+            return True
+    return False
+
+
+def is_likely_tech_query(input_text: str) -> bool:
+    """
+    Heuristic to determine if a short query is likely tech-related.
+    Short queries like "gem find" or "yarn spin" are ambiguous.
+    """
+    tech_patterns = [
+        r'\b(install|config|setup|build|run|debug|test|deploy|compile)\b',
+        r'\b(api|cli|sdk|lib|pkg|npm|pip|cargo)\b',
+        r'\b(func|class|method|var|const|let|def)\b',
+        r'\b(http|https|url|port|host|server|client)\b',
+        r'\b(json|xml|yaml|csv|sql|html|css|js)\b',
+    ]
+    input_lower = input_text.lower()
+    for pattern in tech_patterns:
+        if re.search(pattern, input_lower):
+            return True
+    return False
+
+
+def has_non_tech_context(input_text: str, term: TechnicalTerm) -> bool:
+    """
+    Check if the input clearly indicates a non-technical context.
+    This helps avoid false positives for words like "car rust", "yarn spin", etc.
+    """
+    input_lower = input_text.lower()
+    term_lower = term.term.lower()
+
+    # Define non-tech context indicators for each ambiguous term
+    non_tech_contexts = {
+        "rust": ["car", "metal", "iron", "steel", "corrosion", "prevention", "remove", "body"],
+        "gem": ["gemstone", "jewelry", "jewel", "diamond", "precious", "stone", "cut", "shop", "buy", "wear"],
+        "yarn": ["knit", "crochet", "spin", "wool", "thread", "textile", "fabric", "sew", "weave"],
+        "hook": ["fishing", "crochet", "hang", "coat", "wall", "ceiling"],
+        "container": ["storage", "plastic", "food", "shipping", "cargo", "kitchen", "box"],
+        "branch": ["tree", "bank", "library", "store", "office", "organization"],
+        "decorator": ["interior", "home", "room", "house", "design", "party", "cake", "wedding"],
+        "bean": ["coffee", "soy", "kidney", "black", "green", "garden", "cooking", "food", "plant", "grow"],
+        "shell": ["sea", "beach", "egg", "nut", "turtle", "snail", "crab", "clam", "oyster"],
+        "spark": ["plug", "fire", "ignite", "car", "engine", "electric", "romance"],
+        "go": ["travel", "vacation", "trip", "walk", "run", "leave", "visit", "tour"],
+        "swift": ["taylor", "concert", "music", "singer", "speed", "fast", "bird"],
+        "pod": ["pea", "whale", "orca", "dolphin", "vegetable", "seed", "plant"],
+        "ant": ["insect", "colony", "fire", "carpenter", "pest", "bug", "picnic"],
+        "node": ["lymph", "medical", "body", "tree", "network point"],
+        "rails": ["train", "railroad", "railway", "track", "transit", "fence"],
+        "flask": ["lab", "chemistry", "drink", "hip", "thermos", "bottle", "water"],
+        "django": ["jazz", "music", "reinhardt", "guitar", "movie", "western", "unchained"],
+        "maven": ["expert", "connoisseur", "specialist", "guru"],
+        "gradle": ["grade", "school", "slope"],
+        "kafka": ["franz", "author", "novel", "metamorphosis", "literature", "writer", "book"],
+        "elastic": ["band", "rubber", "stretch", "flexible", "waist", "fabric"],
+    }
+
+    if term_lower in non_tech_contexts:
+        for context_word in non_tech_contexts[term_lower]:
+            if context_word.lower() in input_lower:
+                return True
+
+    return False
+
+
+def analyze_example(line_num: int, input_text: str, output_text: str) -> list[Issue]:
+    """Analyze a single example for potential issues."""
+    issues = []
+    input_lower = input_text.lower()
+
+    for term in KNOWN_TECHNICAL_TERMS:
+        term_lower = term.term.lower()
+
+        # Check if the input contains this technical term
+        if term_lower not in input_lower:
+            continue
+
+        # Check if output has wrong expansion
+        wrong_expansion = check_for_wrong_expansion(output_text, term)
+        if wrong_expansion is None:
+            continue
+
+        # Skip if the context clearly indicates non-technical usage
+        if has_non_tech_context(input_text, term):
+            continue
+
+        # Determine if this is likely a technical context
+        is_tech = has_tech_context(input_text, term) or is_likely_tech_query(input_text)
+
+        # For very short inputs that contain ONLY the tech term (like "gem find"),
+        # these are ambiguous and could be tech-related
+        word_count = len(input_text.split())
+        words = [w.lower() for w in input_text.split()]
+
+        # Only flag if it's clearly a tech context OR a very short query
+        # where the term appears prominently (e.g., "gem find", "yarn add")
+        if is_tech:
+            # Create suggested fix for definite tech issues
+            suggested_output = f"lex: {term.correct_lex[0]}\nlex: {term.correct_lex[1] if len(term.correct_lex) > 1 else term.correct_lex[0]}\nvec: {term.correct_vec[0]}\nvec: {term.correct_vec[1] if len(term.correct_vec) > 1 else term.correct_vec[0]}\nhyde: {term.correct_domain} is a concept that provides functionality for software development."
+
+            issue = Issue(
+                line_number=line_num,
+                input_text=input_text,
+                output_text=output_text[:200] + "..." if len(output_text) > 200 else output_text,
+                issue_type="wrong_tech_expansion",
+                technical_term=term.term,
+                wrong_expansion_found=wrong_expansion,
+                suggested_fix=suggested_output
+            )
+            issues.append(issue)
+        elif word_count <= 2 and term_lower in words:
+            # Very short query with the term as a primary word - truly ambiguous
+            issue = Issue(
+                line_number=line_num,
+                input_text=input_text,
+                output_text=output_text[:200] + "..." if len(output_text) > 200 else output_text,
+                issue_type="ambiguous_term",
+                technical_term=term.term,
+                wrong_expansion_found=wrong_expansion,
+                suggested_fix=None
+            )
+            issues.append(issue)
+
+    return issues
+
+
+def analyze_dataset(filepath: Path) -> AnalysisResult:
+    """Analyze the entire dataset for issues."""
+    result = AnalysisResult()
+
+    with open(filepath, 'r', encoding='utf-8') as f:
+        for line_num, line in enumerate(f, 1):
+            line = line.strip()
+            if not line:
+                continue
+
+            try:
+                example = json.loads(line)
+                input_text = example.get('input', '')
+                output_text = example.get('output', '')
+
+                result.total_examples += 1
+
+                # Analyze for issues
+                issues = analyze_example(line_num, input_text, output_text)
+                result.issues_found.extend(issues)
+
+                # Track term statistics
+                for term in KNOWN_TECHNICAL_TERMS:
+                    if term.term.lower() in input_text.lower():
+                        result.term_statistics[term.term] += 1
+
+            except json.JSONDecodeError as e:
+                print(f"Warning: Could not parse line {line_num}: {e}")
+
+    return result
+
+
+def fix_example(example: dict, issues: list[Issue]) -> Optional[dict]:
+    """
+    Attempt to fix an example based on identified issues.
+    Returns None if no fix is needed or possible.
+    """
+    # Only fix examples with definite tech context issues
+    tech_issues = [i for i in issues if i.issue_type == "wrong_tech_expansion" and i.suggested_fix]
+
+    if not tech_issues:
+        return None
+
+    # Use the first tech issue's fix (they should be similar)
+    issue = tech_issues[0]
+
+    fixed = example.copy()
+    fixed['output'] = issue.suggested_fix
+    fixed['_fixed'] = True
+    fixed['_original_output'] = example['output']
+    fixed['_fix_reason'] = f"Technical term '{issue.technical_term}' was incorrectly expanded as '{issue.wrong_expansion_found}'"
+
+    return fixed
+
+
+def generate_report(result: AnalysisResult) -> str:
+    """Generate a human-readable report of the analysis."""
+    lines = []
+    lines.append("=" * 70)
+    lines.append("QUERY EXPANSION DATASET QUALITY REPORT")
+    lines.append("=" * 70)
+    lines.append("")
+    lines.append(f"Total examples analyzed: {result.total_examples}")
+    lines.append(f"Issues found: {len(result.issues_found)}")
+    lines.append("")
+
+    # Group issues by type
+    by_type = defaultdict(list)
+    for issue in result.issues_found:
+        by_type[issue.issue_type].append(issue)
+
+    lines.append("-" * 70)
+    lines.append("ISSUES BY TYPE:")
+    lines.append("-" * 70)
+
+    for issue_type, issues in by_type.items():
+        lines.append(f"\n{issue_type.upper()}: {len(issues)} issues")
+        lines.append("-" * 40)
+
+        # Show up to 10 examples per type
+        for issue in issues[:10]:
+            lines.append(f"\n  Line {issue.line_number}:")
+            lines.append(f"    Input: {issue.input_text}")
+            lines.append(f"    Technical term: '{issue.technical_term}'")
+            lines.append(f"    Wrong expansion found: '{issue.wrong_expansion_found}'")
+            if issue.suggested_fix:
+                lines.append(f"    Suggested fix available: Yes")
+
+        if len(issues) > 10:
+            lines.append(f"\n  ... and {len(issues) - 10} more")
+
+    # Term statistics
+    lines.append("\n" + "-" * 70)
+    lines.append("TECHNICAL TERM OCCURRENCES IN DATASET:")
+    lines.append("-" * 70)
+
+    for term, count in sorted(result.term_statistics.items(), key=lambda x: -x[1]):
+        if count > 0:
+            lines.append(f"  {term}: {count} occurrences")
+
+    lines.append("\n" + "=" * 70)
+
+    return "\n".join(lines)
+
+
+def save_cleaned_dataset(filepath: Path, output_path: Path, result: AnalysisResult):
+    """Save a cleaned version of the dataset."""
+    issues_by_line = defaultdict(list)
+    for issue in result.issues_found:
+        issues_by_line[issue.line_number].append(issue)
+
+    fixed_count = 0
+    flagged_count = 0
+
+    with open(filepath, 'r', encoding='utf-8') as f_in, \
+         open(output_path, 'w', encoding='utf-8') as f_out:
+
+        for line_num, line in enumerate(f_in, 1):
+            line = line.strip()
+            if not line:
+                continue
+
+            try:
+                example = json.loads(line)
+
+                if line_num in issues_by_line:
+                    issues = issues_by_line[line_num]
+                    fixed = fix_example(example, issues)
+
+                    if fixed:
+                        f_out.write(json.dumps(fixed) + '\n')
+                        fixed_count += 1
+                    else:
+                        # Flag but don't fix ambiguous cases
+                        example['_flagged'] = True
+                        example['_flag_reason'] = f"Ambiguous term '{issues[0].technical_term}' may need review"
+                        f_out.write(json.dumps(example) + '\n')
+                        flagged_count += 1
+                else:
+                    f_out.write(json.dumps(example) + '\n')
+
+            except json.JSONDecodeError:
+                # Keep problematic lines as-is
+                f_out.write(line + '\n')
+
+    return fixed_count, flagged_count
+
+
+def main():
+    """Main entry point."""
+    # Paths
+    script_dir = Path(__file__).parent
+    input_path = script_dir / "data" / "qmd_expansion.jsonl"
+    output_path = script_dir / "data" / "qmd_expansion_cleaned.jsonl"
+    report_path = script_dir / "data" / "quality_report.txt"
+
+    print(f"Analyzing dataset: {input_path}")
+    print("-" * 50)
+
+    if not input_path.exists():
+        print(f"Error: Input file not found: {input_path}")
+        return 1
+
+    # Analyze the dataset
+    result = analyze_dataset(input_path)
+
+    # Generate and print report
+    report = generate_report(result)
+    print(report)
+
+    # Save report to file
+    with open(report_path, 'w', encoding='utf-8') as f:
+        f.write(report)
+    print(f"\nReport saved to: {report_path}")
+
+    # Save cleaned dataset
+    fixed_count, flagged_count = save_cleaned_dataset(input_path, output_path, result)
+
+    print(f"\nCleaned dataset saved to: {output_path}")
+    print(f"  - Examples fixed: {fixed_count}")
+    print(f"  - Examples flagged for review: {flagged_count}")
+    print(f"  - Examples unchanged: {result.total_examples - fixed_count - flagged_count}")
+
+    # Summary statistics
+    print("\n" + "=" * 50)
+    print("SUMMARY")
+    print("=" * 50)
+    print(f"Total examples: {result.total_examples}")
+    print(f"Total issues found: {len(result.issues_found)}")
+
+    tech_issues = [i for i in result.issues_found if i.issue_type == "wrong_tech_expansion"]
+    ambig_issues = [i for i in result.issues_found if i.issue_type == "ambiguous_term"]
+
+    print(f"  - Definite tech term errors: {len(tech_issues)}")
+    print(f"  - Ambiguous terms needing review: {len(ambig_issues)}")
+
+    if len(result.issues_found) > 0:
+        error_rate = len(result.issues_found) / result.total_examples * 100
+        print(f"\nError rate: {error_rate:.2f}%")
+
+    return 0
+
+
+if __name__ == "__main__":
+    exit(main())

+ 221 - 0
finetune/dataset/generate_data.py

@@ -0,0 +1,221 @@
+#!/usr/bin/env python3
+"""Generate synthetic training data for QMD query expansion using Claude API."""
+
+import argparse
+import json
+import os
+import random
+from pathlib import Path
+
+try:
+    import anthropic
+except ImportError:
+    print("Install anthropic: pip install anthropic")
+    exit(1)
+
+# Sample query templates for diverse training data
+QUERY_TEMPLATES = [
+    # Technical documentation
+    "how to {action} {technology}",
+    "{technology} {concept} example",
+    "configure {technology} for {use_case}",
+    "{error_type} error in {technology}",
+    "best practices for {concept}",
+
+    # Personal notes / journals
+    "meeting notes {topic}",
+    "ideas for {project}",
+    "{date} journal entry",
+    "thoughts on {topic}",
+
+    # Research / learning
+    "what is {concept}",
+    "difference between {thing1} and {thing2}",
+    "{topic} tutorial",
+    "learn {skill}",
+
+    # Short queries
+    "{keyword}",
+    "{keyword} {modifier}",
+]
+
+ACTIONS = ["install", "configure", "setup", "debug", "deploy", "test", "optimize", "migrate"]
+TECHNOLOGIES = ["python", "typescript", "react", "docker", "kubernetes", "postgres", "redis", "nginx", "git", "linux"]
+CONCEPTS = ["authentication", "caching", "logging", "testing", "deployment", "API", "database", "security"]
+USE_CASES = ["production", "development", "CI/CD", "local", "cloud"]
+ERROR_TYPES = ["connection", "timeout", "permission", "memory", "syntax"]
+TOPICS = ["productivity", "workflow", "architecture", "design", "performance"]
+KEYWORDS = ["auth", "config", "setup", "api", "data", "cache", "log", "test"]
+MODIFIERS = ["best", "fast", "simple", "advanced", "secure"]
+
+SYSTEM_PROMPT = """You are a search query optimization expert for a markdown document search system called QMD.
+
+Your task is to transform user queries into retrieval-optimized outputs with THREE distinct types:
+
+1. **lex** lines: Keyword variations optimized for BM25 full-text search
+   - Short, keyword-focused
+   - Good for exact term matching
+   - 1-3 lines
+
+2. **vec** lines: Semantic reformulations for vector/embedding search
+   - Complete phrases or questions
+   - Capture semantic meaning
+   - 1-3 lines
+
+3. **hyde** line: A hypothetical document passage (HyDE technique)
+   - A realistic passage that would answer the query
+   - Contains domain-specific terminology
+   - Written as if it's FROM a document, not ABOUT the query
+   - MAX 1 line
+
+Output format (STRICT - follow exactly):
+```
+lex: keyword1
+lex: keyword2
+vec: semantic query reformulation
+hyde: A passage that would appear in a document answering this query.
+```
+
+Rules:
+- Each line must start with "lex:", "vec:", or "hyde:"
+- No blank lines
+- No repetition between lines
+- hyde should be a realistic document excerpt, not a question
+- Stay focused on the original query intent"""
+
+USER_PROMPT_TEMPLATE = """Generate query expansion outputs for this search query:
+
+Query: {query}
+
+Respond with ONLY the lex/vec/hyde lines, nothing else."""
+
+
+def generate_random_query() -> str:
+    """Generate a random query from templates."""
+    template = random.choice(QUERY_TEMPLATES)
+
+    replacements = {
+        "{action}": random.choice(ACTIONS),
+        "{technology}": random.choice(TECHNOLOGIES),
+        "{concept}": random.choice(CONCEPTS),
+        "{use_case}": random.choice(USE_CASES),
+        "{error_type}": random.choice(ERROR_TYPES),
+        "{topic}": random.choice(TOPICS),
+        "{project}": random.choice(["website", "app", "CLI tool", "API", "library"]),
+        "{date}": random.choice(["2024-01", "2024-06", "yesterday", "today"]),
+        "{thing1}": random.choice(CONCEPTS[:4]),
+        "{thing2}": random.choice(CONCEPTS[4:]),
+        "{skill}": random.choice(TECHNOLOGIES),
+        "{keyword}": random.choice(KEYWORDS),
+        "{modifier}": random.choice(MODIFIERS),
+    }
+
+    query = template
+    for key, value in replacements.items():
+        query = query.replace(key, value)
+
+    return query
+
+
+def generate_expansion(client: anthropic.Anthropic, query: str) -> str | None:
+    """Generate expansion using Claude API."""
+    try:
+        response = client.messages.create(
+            model="claude-sonnet-4-20250514",
+            max_tokens=300,
+            system=SYSTEM_PROMPT,
+            messages=[
+                {"role": "user", "content": USER_PROMPT_TEMPLATE.format(query=query)}
+            ]
+        )
+        return response.content[0].text.strip()
+    except Exception as e:
+        print(f"Error generating expansion for '{query}': {e}")
+        return None
+
+
+def validate_output(output: str) -> bool:
+    """Validate that output follows the expected format."""
+    lines = output.strip().split("\n")
+    if not lines:
+        return False
+
+    has_lex = False
+    has_vec = False
+
+    for line in lines:
+        line = line.strip()
+        if not line:
+            continue
+        if line.startswith("lex:"):
+            has_lex = True
+        elif line.startswith("vec:"):
+            has_vec = True
+        elif line.startswith("hyde:"):
+            pass
+        else:
+            return False  # Invalid line type
+
+    return has_lex and has_vec
+
+
+def main():
+    parser = argparse.ArgumentParser(description="Generate QMD query expansion training data")
+    parser.add_argument("--count", type=int, default=100, help="Number of examples to generate")
+    parser.add_argument("--output", type=str, default="data/qmd_expansion.jsonl", help="Output file path")
+    parser.add_argument("--queries", type=str, help="Optional file with custom queries (one per line)")
+    args = parser.parse_args()
+
+    api_key = os.environ.get("ANTHROPIC_API_KEY")
+    if not api_key:
+        print("Error: ANTHROPIC_API_KEY environment variable not set")
+        exit(1)
+
+    client = anthropic.Anthropic(api_key=api_key)
+    output_path = Path(args.output)
+    output_path.parent.mkdir(parents=True, exist_ok=True)
+
+    # Load custom queries if provided
+    custom_queries = []
+    if args.queries and Path(args.queries).exists():
+        custom_queries = Path(args.queries).read_text().strip().split("\n")
+        print(f"Loaded {len(custom_queries)} custom queries")
+
+    examples = []
+    seen_queries = set()
+
+    print(f"Generating {args.count} examples...")
+
+    i = 0
+    while len(examples) < args.count:
+        # Use custom query or generate random one
+        if custom_queries and i < len(custom_queries):
+            query = custom_queries[i].strip()
+        else:
+            query = generate_random_query()
+
+        i += 1
+
+        # Skip duplicates
+        if query in seen_queries:
+            continue
+        seen_queries.add(query)
+
+        # Generate expansion
+        output = generate_expansion(client, query)
+        if output and validate_output(output):
+            examples.append({"input": query, "output": output})
+            print(f"[{len(examples)}/{args.count}] {query[:50]}...")
+        else:
+            print(f"  Skipped invalid output for: {query[:50]}...")
+
+    # Write output
+    with open(output_path, "w") as f:
+        for example in examples:
+            f.write(json.dumps(example) + "\n")
+
+    print(f"\nGenerated {len(examples)} examples to {output_path}")
+
+
+if __name__ == "__main__":
+    main()

+ 192 - 0
finetune/dataset/generate_data_offline.py

@@ -0,0 +1,192 @@
+#!/usr/bin/env python3
+# /// script
+# requires-python = ">=3.10"
+# dependencies = [
+#     "datasets",
+# ]
+# ///
+"""
+Generate QMD training data by transforming s-emanuilov/query-expansion dataset
+and adding synthetic hyde passages. No API calls needed.
+"""
+
+import json
+import random
+from pathlib import Path
+
+# HyDE passage templates for different query types
+HYDE_TEMPLATES = {
+    "how_to": [
+        "To {action}, you need to {steps}. This can be done by {method}.",
+        "The recommended way to {action} is to first {step1}, then {step2}.",
+        "{Topic} can be achieved by {method}. Make sure to {consideration}.",
+    ],
+    "what_is": [
+        "{Topic} is a {category} that {description}. It is commonly used for {use_case}.",
+        "{Topic} refers to {definition}. Key features include {features}.",
+    ],
+    "config": [
+        "To configure {topic}, set the {setting} option to {value}. You can also customize {other}.",
+        "Configuration for {topic} is done in the {file} file. Key settings include {settings}.",
+    ],
+    "error": [
+        "The {error} error occurs when {cause}. To fix this, {solution}.",
+        "If you encounter {error}, check that {check}. Common solutions include {solutions}.",
+    ],
+    "general": [
+        "{Topic} provides {benefit} for {use_case}. It works by {mechanism}.",
+        "When working with {topic}, consider {considerations}. Best practices include {practices}.",
+    ],
+}
+
+def classify_query(query: str) -> str:
+    """Classify query type for hyde template selection."""
+    q = query.lower()
+    if any(w in q for w in ["how to", "how do", "setup", "install", "configure", "create"]):
+        return "how_to"
+    if any(w in q for w in ["what is", "what are", "definition", "meaning"]):
+        return "what_is"
+    if any(w in q for w in ["config", "setting", "option"]):
+        return "config"
+    if any(w in q for w in ["error", "issue", "problem", "fix", "debug"]):
+        return "error"
+    return "general"
+
+
+def extract_topic(query: str) -> str:
+    """Extract main topic from query."""
+    # Remove common prefixes
+    for prefix in ["how to ", "how do i ", "what is ", "what are ", "configure ", "setup "]:
+        if query.lower().startswith(prefix):
+            return query[len(prefix):].strip()
+    return query
+
+
+def generate_hyde(query: str, expansions: list[str]) -> str:
+    """Generate a hypothetical document passage by combining expansions naturally."""
+    topic = extract_topic(query)
+    query_type = classify_query(query)
+
+    # Use the longest, most descriptive expansion as the base
+    sorted_exp = sorted(expansions, key=len, reverse=True)
+    main_exp = sorted_exp[0] if sorted_exp else topic
+
+    # Build a natural passage based on query type
+    if query_type == "how_to":
+        templates = [
+            f"To {topic}, start by reviewing the requirements and dependencies. {main_exp.capitalize()} is the recommended approach. Make sure all prerequisites are met before proceeding.",
+            f"The process of {topic} involves several steps. First, {main_exp}. Follow the official documentation for detailed instructions.",
+            f"When you need to {topic}, the most effective method is to {main_exp}. This ensures compatibility and follows best practices.",
+        ]
+    elif query_type == "what_is":
+        templates = [
+            f"{topic.capitalize()} refers to {main_exp}. It is widely used in various applications and provides significant benefits.",
+            f"The concept of {topic} encompasses {main_exp}. Understanding this is essential for effective implementation.",
+            f"{topic.capitalize()} is defined as {main_exp}. This plays a crucial role in modern development practices.",
+        ]
+    elif query_type == "config":
+        templates = [
+            f"Configuration for {topic} requires setting the appropriate parameters. {main_exp.capitalize()} should be adjusted based on your specific requirements.",
+            f"To configure {topic}, modify the settings in your configuration file. Key options include those related to {main_exp}.",
+            f"The {topic} configuration can be customized by {main_exp}. Default values work for most use cases.",
+        ]
+    elif query_type == "error":
+        templates = [
+            f"The {topic} issue typically occurs when dependencies are misconfigured. To resolve this, {main_exp}. Check your environment settings.",
+            f"If you encounter problems with {topic}, verify that {main_exp}. Common solutions include updating dependencies and checking permissions.",
+            f"Debugging {topic} requires understanding the root cause. Often, {main_exp} resolves the issue. Review logs for details.",
+        ]
+    else:
+        templates = [
+            f"{topic.capitalize()} is an important concept that relates to {main_exp}. It provides functionality for various use cases in software development.",
+            f"Understanding {topic} is essential for modern development. Key aspects include {main_exp}. This knowledge helps in building robust applications.",
+            f"The topic of {topic} covers {main_exp}. Proper implementation follows established patterns and best practices.",
+        ]
+
+    return random.choice(templates)
+
+
+def transform_to_qmd_format(query: str, expansions: list[str]) -> str:
+    """Transform s-emanuilov format to QMD lex/vec/hyde format."""
+    lines = []
+
+    # Generate lex lines (keyword-focused, shorter)
+    lex_candidates = []
+    for exp in expansions:
+        # Shorter versions for lex
+        words = exp.split()
+        if len(words) <= 4:
+            lex_candidates.append(exp)
+        else:
+            # Take key phrases
+            lex_candidates.append(" ".join(words[:3]))
+
+    # Add 1-2 lex lines
+    for lex in lex_candidates[:2]:
+        if lex.lower() != query.lower():
+            lines.append(f"lex: {lex}")
+
+    # Generate vec lines (semantic, complete phrases)
+    vec_candidates = [exp for exp in expansions if len(exp.split()) >= 3]
+    if not vec_candidates:
+        vec_candidates = expansions
+
+    # Add 1-2 vec lines
+    for vec in vec_candidates[:2]:
+        if vec.lower() != query.lower():
+            lines.append(f"vec: {vec}")
+
+    # Generate hyde line
+    hyde = generate_hyde(query, expansions)
+    lines.append(f"hyde: {hyde}")
+
+    return "\n".join(lines)
+
+
+def main():
+    try:
+        from datasets import load_dataset
+    except ImportError:
+        print("Installing datasets...")
+        import subprocess
+        subprocess.run(["uv", "pip", "install", "datasets"], check=True)
+        from datasets import load_dataset
+
+    print("Loading s-emanuilov/query-expansion dataset...")
+    dataset = load_dataset("s-emanuilov/query-expansion", split="train")
+
+    print(f"Loaded {len(dataset)} examples")
+
+    # Transform each example
+    output_path = Path("data/qmd_expansion.jsonl")
+    output_path.parent.mkdir(parents=True, exist_ok=True)
+
+    examples = []
+    for item in dataset:
+        query = item["query"]
+        expansions = item["expansions"]
+
+        output = transform_to_qmd_format(query, expansions)
+        examples.append({"input": query, "output": output})
+
+    # Shuffle
+    random.seed(42)
+    random.shuffle(examples)
+
+    # Write output
+    with open(output_path, "w") as f:
+        for ex in examples:
+            f.write(json.dumps(ex) + "\n")
+
+    print(f"Generated {len(examples)} examples to {output_path}")
+
+    # Show sample
+    print("\nSample output:")
+    print("-" * 50)
+    sample = examples[0]
+    print(f"Input: {sample['input']}")
+    print(f"Output:\n{sample['output']}")
+
+
+if __name__ == "__main__":
+    main()

+ 272 - 0
finetune/dataset/prepare_data.py

@@ -0,0 +1,272 @@
+#!/usr/bin/env python3
+# /// script
+# requires-python = ">=3.10"
+# dependencies = [
+#     "transformers>=4.45.0",
+#     "jinja2",
+# ]
+# ///
+"""Prepare QMD query expansion data for training.
+
+See PROMPT_FORMAT.md for format specification.
+"""
+
+import argparse
+import json
+import random
+from pathlib import Path
+
+from transformers import AutoTokenizer
+
+_tokenizer = None
+
+def get_tokenizer():
+    global _tokenizer
+    if _tokenizer is None:
+        _tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen3-0.6B")
+    return _tokenizer
+
+# Short single-word queries that need proper expansion examples
+SHORT_QUERIES = [
+    # Technical keywords
+    "auth", "config", "setup", "api", "cache", "log", "test", "debug",
+    "deploy", "build", "lint", "format", "migrate", "backup", "restore",
+    "docker", "git", "npm", "pip", "brew", "curl", "ssh", "ssl", "tls",
+    "cors", "csrf", "jwt", "oauth", "saml", "ldap", "rbac", "acl",
+    "crud", "rest", "graphql", "grpc", "websocket", "sse", "http",
+    "redis", "mongo", "postgres", "mysql", "sqlite", "elastic", "kafka",
+    "nginx", "apache", "caddy", "traefik", "haproxy", "envoy",
+    "react", "vue", "angular", "svelte", "solid", "htmx", "alpine",
+    "node", "deno", "bun", "python", "rust", "golang", "java", "kotlin",
+    "webpack", "vite", "esbuild", "rollup", "parcel", "turbopack",
+    "jest", "vitest", "pytest", "mocha", "cypress", "playwright",
+    # Common short phrases
+    "env vars", "api keys", "error handling", "rate limiting",
+    "file upload", "user auth", "db connection", "query params",
+    "hot reload", "code split", "tree shake", "lazy load",
+]
+
+# Templates for generating short query expansions
+# IMPORTANT: All lex lines MUST include {q} to preserve key terms
+SHORT_TEMPLATES = [
+    {
+        "lex": ["{q} configuration", "{q} settings", "{q} setup"],
+        "vec": ["how to configure {q} in my project", "{q} setup and configuration tutorial"],
+        "hyde": "To set up {q}, first install the required dependencies. Then configure the settings in your project configuration file.",
+    },
+    {
+        "lex": ["{q} tutorial", "{q} guide", "{q} basics"],
+        "vec": ["beginner guide to {q}", "how to get started with {q}"],
+        "hyde": "This guide covers the basics of {q}. Follow the steps below to get started with your first implementation.",
+    },
+    {
+        "lex": ["{q} best practices", "{q} patterns", "{q} tips"],
+        "vec": ["best practices for using {q}", "recommended patterns for {q}"],
+        "hyde": "When working with {q}, follow these best practices: use consistent naming, handle errors properly, and document your code.",
+    },
+    {
+        "lex": ["{q} troubleshooting", "{q} fix", "{q} errors"],
+        "vec": ["how to fix {q} errors", "troubleshooting common {q} problems"],
+        "hyde": "If you encounter {q} issues, check your configuration first. Common problems include missing dependencies and incorrect settings.",
+    },
+    {
+        "lex": ["{q} examples", "{q} code", "{q} usage"],
+        "vec": ["code examples for {q}", "practical {q} implementation examples"],
+        "hyde": "Here are some practical examples of {q} in action. Each example demonstrates a common use case with working code.",
+    },
+]
+
+
+def truncate_hyde(hyde_text: str, max_len: int = 150) -> str:
+    """Truncate hyde to max length, ending at sentence boundary."""
+    if len(hyde_text) <= max_len:
+        return hyde_text
+
+    truncated = hyde_text[:max_len]
+    last_period = truncated.rfind(". ")
+    if last_period > max_len // 2:
+        return truncated[:last_period + 1]
+
+    last_space = truncated.rfind(" ")
+    if last_space > max_len // 2:
+        return truncated[:last_space] + "."
+
+    return truncated[:max_len-1] + "."
+
+
+def clean_output(output: str) -> str:
+    """Clean output: truncate hyde, remove invalid lines."""
+    lines = []
+    for line in output.strip().split("\n"):
+        line = line.strip()
+        if not line:
+            continue
+        if line.startswith("hyde:"):
+            hyde_text = line[5:].strip()
+            hyde_text = truncate_hyde(hyde_text)
+            lines.append(f"hyde: {hyde_text}")
+        elif line.startswith(("lex:", "vec:")):
+            lines.append(line)
+    return "\n".join(lines)
+
+
+def generate_short_example(query: str) -> dict:
+    """Generate a training example for a short query."""
+    template = random.choice(SHORT_TEMPLATES)
+
+    lex_lines = random.sample(template["lex"], 2)
+    vec_lines = random.sample(template["vec"], 2)
+    hyde_line = template["hyde"]
+
+    output_lines = []
+    for lex in lex_lines:
+        output_lines.append(f"lex: {lex.format(q=query)}")
+    for vec in vec_lines:
+        output_lines.append(f"vec: {vec.format(q=query)}")
+    output_lines.append(f"hyde: {hyde_line.format(q=query)}")
+
+    return {"input": query, "output": "\n".join(output_lines)}
+
+
+def format_for_training(input_text: str, output_text: str) -> dict:
+    """Format a single example for SFT training using Qwen chat format."""
+    tokenizer = get_tokenizer()
+
+    # Use /no_think to disable thinking mode - we want direct output
+    messages = [
+        {"role": "user", "content": f"/no_think Expand this search query: {input_text}"},
+        {"role": "assistant", "content": output_text}
+    ]
+
+    # Use tokenizer to generate proper chat format with special tokens
+    text = tokenizer.apply_chat_template(
+        messages,
+        tokenize=False,
+        add_generation_prompt=False,
+    )
+
+    # Strip empty <think> tags - we don't want thinking mode
+    # The template adds "<think>\n\n</think>\n\n" which we remove
+    text = text.replace("<think>\n\n</think>\n\n", "")
+
+    return {
+        "text": text,
+        "messages": messages,
+    }
+
+
+def main():
+    parser = argparse.ArgumentParser(description="Prepare data for training")
+    parser.add_argument("--input", type=str, default="data/qmd_expansion.jsonl", help="Input JSONL file")
+    parser.add_argument("--output", type=str, default="data/train", help="Output directory")
+    parser.add_argument("--split", type=float, default=0.1, help="Validation split ratio")
+    parser.add_argument("--add-short", type=int, default=3, help="Variations per short query to add")
+    args = parser.parse_args()
+
+    input_path = Path(args.input)
+    output_dir = Path(args.output)
+    output_dir.mkdir(parents=True, exist_ok=True)
+
+    if not input_path.exists():
+        print(f"Error: Input file not found: {input_path}")
+        exit(1)
+
+    # Load and clean existing examples
+    examples = []
+    seen_queries = set()
+    long_hyde_count = 0
+
+    with open(input_path) as f:
+        for line in f:
+            if line.strip():
+                ex = json.loads(line)
+
+                # Clean output (truncate hyde, remove invalid lines)
+                original_output = ex["output"]
+                ex["output"] = clean_output(ex["output"])
+
+                # Track hyde truncation
+                if "hyde:" in original_output:
+                    for orig_line in original_output.split("\n"):
+                        if orig_line.strip().startswith("hyde:"):
+                            if len(orig_line) > 160:
+                                long_hyde_count += 1
+
+                # Validate cleaned output
+                has_lex = "lex:" in ex["output"]
+                has_vec = "vec:" in ex["output"]
+
+                if has_lex and has_vec:
+                    examples.append(ex)
+                    seen_queries.add(ex["input"].lower())
+
+    print(f"Loaded and cleaned {len(examples)} examples")
+    print(f"Truncated {long_hyde_count} long hyde sections")
+
+    # Count existing short queries
+    short_existing = sum(1 for ex in examples if len(ex["input"].split()) <= 2)
+    print(f"Existing short queries (1-2 words): {short_existing}")
+
+    # Generate additional short query examples
+    new_short = []
+    for query in SHORT_QUERIES:
+        if query.lower() not in seen_queries:
+            for _ in range(args.add_short):
+                new_short.append(generate_short_example(query))
+            seen_queries.add(query.lower())
+
+    print(f"Generated {len(new_short)} new short query examples")
+
+    # Combine and shuffle
+    all_examples = examples + new_short
+    random.shuffle(all_examples)
+
+    # Format for training
+    formatted = [format_for_training(ex["input"], ex["output"]) for ex in all_examples]
+
+    # Split into train/val
+    split_idx = int(len(formatted) * (1 - args.split))
+    train_data = formatted[:split_idx]
+    val_data = formatted[split_idx:]
+
+    # Write train set
+    train_path = output_dir / "train.jsonl"
+    with open(train_path, "w") as f:
+        for item in train_data:
+            f.write(json.dumps(item) + "\n")
+
+    # Write validation set
+    val_path = output_dir / "val.jsonl"
+    with open(val_path, "w") as f:
+        for item in val_data:
+            f.write(json.dumps(item) + "\n")
+
+    # Write chat format (for TRL)
+    chat_path = output_dir / "train_chat.jsonl"
+    with open(chat_path, "w") as f:
+        for item in train_data:
+            f.write(json.dumps({"messages": item["messages"]}) + "\n")
+
+    # Stats
+    short_final = sum(1 for ex in all_examples if len(ex["input"].split()) <= 2)
+
+    print(f"\n=== Summary ===")
+    print(f"Total examples: {len(all_examples)}")
+    print(f"Short queries: {short_final} ({100*short_final/len(all_examples):.1f}%)")
+    print(f"Train: {len(train_data)}, Val: {len(val_data)}")
+    print(f"Output: {output_dir}")
+
+    # Dataset info
+    dataset_info = {
+        "dataset_name": "qmd-query-expansion",
+        "train_samples": len(train_data),
+        "val_samples": len(val_data),
+        "short_query_pct": round(100*short_final/len(all_examples), 1),
+        "columns": ["prompt", "completion", "text", "messages"],
+    }
+    with open(output_dir / "dataset_info.json", "w") as f:
+        json.dump(dataset_info, f, indent=2)
+
+
+if __name__ == "__main__":
+    main()

+ 272 - 0
finetune/eval.py

@@ -0,0 +1,272 @@
+# /// script
+# requires-python = ">=3.10"
+# dependencies = [
+#     "transformers>=4.45.0",
+#     "peft>=0.7.0",
+#     "torch",
+#     "huggingface_hub",
+#     "accelerate",
+# ]
+# ///
+"""
+Evaluate a QMD query expansion model.
+
+Generates expansions from a model and scores them using the reward function.
+Replaces the old two-step evals/run.py + evals/score.py workflow.
+
+Usage:
+    uv run eval.py --model tobil/qmd-query-expansion-1.7B-sft
+    uv run eval.py --model tobil/qmd-query-expansion-1.7B-grpo \
+                   --sft-model tobil/qmd-query-expansion-1.7B-sft \
+                   --base-model Qwen/Qwen3-1.7B
+    uv run eval.py --model ./local-checkpoint --verbose
+    uv run eval.py --score-only results.jsonl
+"""
+
+import argparse
+import json
+import os
+import sys
+from collections import Counter
+from datetime import datetime
+from pathlib import Path
+
+# Import reward module
+sys.path.insert(0, os.path.dirname(__file__))
+from reward import score_expansion_detailed
+
+
+def load_queries(path: str) -> list[str]:
+    """Load queries from file, one per line, ignoring comments and blanks."""
+    queries = []
+    with open(path) as f:
+        for line in f:
+            line = line.strip()
+            if line and not line.startswith("#"):
+                queries.append(line)
+    return queries
+
+
+def load_model(model_path: str, base_model: str = None, sft_model: str = None):
+    """Load model with optional SFT stacking for GRPO models."""
+    import torch
+    from peft import PeftModel
+    from transformers import AutoModelForCausalLM, AutoTokenizer
+
+    is_local = Path(model_path).exists()
+
+    # Auto-detect adapter vs full model
+    is_adapter = True
+    if is_local:
+        is_adapter = (Path(model_path) / "adapter_config.json").exists()
+        if is_adapter and not base_model:
+            with open(Path(model_path) / "adapter_config.json") as f:
+                config = json.load(f)
+                base_model = config.get("base_model_name_or_path", "Qwen/Qwen3-1.7B")
+
+    if not base_model:
+        base_model = "Qwen/Qwen3-1.7B"
+
+    print(f"Loading tokenizer from {base_model}...", file=sys.stderr)
+    tokenizer = AutoTokenizer.from_pretrained(base_model)
+    if tokenizer.pad_token is None:
+        tokenizer.pad_token = tokenizer.eos_token
+
+    print(f"Loading base model {base_model}...", file=sys.stderr)
+    model = AutoModelForCausalLM.from_pretrained(
+        base_model, torch_dtype=torch.bfloat16, device_map="auto",
+    )
+
+    if sft_model:
+        print(f"Loading and merging SFT adapter {sft_model}...", file=sys.stderr)
+        model = PeftModel.from_pretrained(model, sft_model)
+        model = model.merge_and_unload()
+
+    if is_adapter:
+        print(f"Loading adapter from {model_path}...", file=sys.stderr)
+        model = PeftModel.from_pretrained(model, model_path)
+
+    model.eval()
+    return model, tokenizer
+
+
+def generate_expansion(model, tokenizer, query: str, max_new_tokens: int = 200) -> str:
+    """Generate a query expansion using Qwen3 chat template with /no_think."""
+    import torch
+
+    messages = [{"role": "user", "content": f"/no_think Expand this search query: {query}"}]
+    prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
+    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
+
+    with torch.no_grad():
+        outputs = model.generate(
+            **inputs,
+            max_new_tokens=max_new_tokens,
+            temperature=0.7,
+            do_sample=True,
+            pad_token_id=tokenizer.pad_token_id,
+            eos_token_id=tokenizer.eos_token_id,
+        )
+
+    full_output = tokenizer.decode(outputs[0], skip_special_tokens=True)
+
+    # Extract assistant response
+    if "\nassistant\n" in full_output:
+        expansion = full_output.split("\nassistant\n")[-1].strip()
+    elif "assistant\n" in full_output:
+        expansion = full_output.split("assistant\n")[-1].strip()
+    else:
+        expansion = full_output[len(prompt):].strip()
+
+    # Strip leftover <think> blocks
+    import re
+    if "<think>" in expansion:
+        expansion = re.sub(r'<think>.*?</think>', '', expansion, flags=re.DOTALL).strip()
+
+    return expansion
+
+
+def print_result(query: str, expansion: str, scores: dict, verbose: bool = False):
+    """Print a single scored result."""
+    print(f"\n{'='*60}")
+    print(f"Query: {query}")
+    print(f"{'~'*60}")
+    print(expansion)
+    print(f"{'~'*60}")
+    print(f"Score: {scores['percentage']:.0f}% ({scores['rating']})")
+    print(f"  Format: {scores['format']}/30  Diversity: {scores['diversity']}/30  "
+          f"Hyde: {scores['hyde']}/20  Quality: {scores['quality']}/20  "
+          f"Entity: {scores['entity']}/20  Think: {scores['think_bonus']}/20")
+    if verbose and scores["deductions"]:
+        print(f"  Issues: {', '.join(scores['deductions'][:5])}")
+    if verbose and scores["entities_detected"]:
+        print(f"  Entities: {scores['entities_detected']}")
+
+
+def print_summary(scored_results: list):
+    """Print aggregate summary."""
+    print(f"\n{'='*60}")
+    print("SUMMARY")
+    print(f"{'='*60}")
+
+    avg_score = sum(r["scores"]["percentage"] for r in scored_results) / len(scored_results)
+    ratings = Counter(r["scores"]["rating"] for r in scored_results)
+
+    print(f"  Total queries: {len(scored_results)}")
+    print(f"  Average score: {avg_score:.1f}%")
+    print(f"  Ratings:")
+    for rating in ["Excellent", "Good", "Acceptable", "Poor", "Failed"]:
+        count = ratings.get(rating, 0)
+        print(f"    {rating:10s}: {count:2d} {'#' * count}")
+
+
+def cmd_generate_and_score(args):
+    """Generate expansions from a model and score them."""
+    queries = load_queries(args.queries)
+    print(f"Loaded {len(queries)} queries from {args.queries}", file=sys.stderr)
+
+    model, tokenizer = load_model(args.model, args.base_model, args.sft_model)
+
+    scored_results = []
+    for i, query in enumerate(queries, 1):
+        print(f"[{i}/{len(queries)}] {query}", file=sys.stderr)
+        expansion = generate_expansion(model, tokenizer, query, args.max_tokens)
+        scores = score_expansion_detailed(query, expansion)
+
+        if not args.summary_only:
+            print_result(query, expansion, scores, args.verbose)
+
+        scored_results.append({
+            "query": query,
+            "expansion": expansion,
+            "scores": {k: v for k, v in scores.items() if k not in ("parsed", "deductions", "entities_detected")},
+            "deductions": scores["deductions"],
+            "entities_detected": scores["entities_detected"],
+        })
+
+    print_summary(scored_results)
+
+    if args.output:
+        output_data = {
+            "metadata": {"model": args.model, "timestamp": datetime.now().isoformat()},
+            "summary": {
+                "total": len(scored_results),
+                "average_score": round(sum(r["scores"]["percentage"] for r in scored_results) / len(scored_results), 1),
+            },
+            "results": scored_results,
+        }
+        with open(args.output, "w") as f:
+            json.dump(output_data, f, indent=2)
+        print(f"\nScores saved to: {args.output}")
+
+
+def cmd_score_only(args):
+    """Score an existing JSONL file (from the old run.py format)."""
+    results = []
+    with open(args.input) as f:
+        for line in f:
+            data = json.loads(line)
+            if not data.get("_meta"):
+                results.append(data)
+
+    scored_results = []
+    for result in results:
+        query = result["query"]
+        expansion = result["expansion"]
+        scores = score_expansion_detailed(query, expansion)
+
+        if not args.summary_only:
+            print_result(query, expansion, scores, args.verbose)
+
+        scored_results.append({
+            "query": query,
+            "expansion": expansion,
+            "scores": {k: v for k, v in scores.items() if k not in ("parsed", "deductions", "entities_detected")},
+            "deductions": scores["deductions"],
+            "entities_detected": scores["entities_detected"],
+        })
+
+    print_summary(scored_results)
+
+
+def main():
+    parser = argparse.ArgumentParser(
+        description="Evaluate QMD query expansion models",
+        formatter_class=argparse.RawDescriptionHelpFormatter,
+        epilog="""
+Examples:
+  uv run eval.py --model tobil/qmd-query-expansion-1.7B-sft
+  uv run eval.py --model tobil/qmd-query-expansion-1.7B-grpo \\
+                 --sft-model tobil/qmd-query-expansion-1.7B-sft
+  uv run eval.py --score-only evals/results.jsonl
+        """,
+    )
+
+    # Model evaluation mode
+    parser.add_argument("--model", help="Model path (HF Hub or local)")
+    parser.add_argument("--base-model", default=None, help="Base model for tokenizer (default: Qwen/Qwen3-1.7B)")
+    parser.add_argument("--sft-model", default=None, help="SFT adapter to merge first (for GRPO models)")
+    parser.add_argument("--queries", default="evals/queries.txt", help="Queries file")
+    parser.add_argument("--max-tokens", type=int, default=200, help="Max tokens per generation")
+
+    # Score-only mode
+    parser.add_argument("--score-only", metavar="JSONL", help="Score existing JSONL file instead of generating")
+
+    # Output options
+    parser.add_argument("--output", "-o", help="Save detailed scores to JSON file")
+    parser.add_argument("--verbose", "-v", action="store_true")
+    parser.add_argument("--summary-only", action="store_true")
+
+    args = parser.parse_args()
+
+    if args.score_only:
+        args.input = args.score_only
+        cmd_score_only(args)
+    elif args.model:
+        cmd_generate_and_score(args)
+    else:
+        parser.error("Either --model or --score-only is required")
+
+
+if __name__ == "__main__":
+    main()

+ 3 - 0
finetune/evals/.gitignore

@@ -0,0 +1,3 @@
+# Generated results (re-run evals locally)
+results_*.jsonl
+scores_*.json

+ 48 - 0
finetune/evals/queries.txt

@@ -0,0 +1,48 @@
+# Test queries for QMD query expansion evaluation
+# One query per line, comments start with #
+
+# Technical documentation
+how to configure authentication
+typescript async await
+docker compose networking
+git rebase vs merge
+react useEffect cleanup
+
+# Short/ambiguous queries
+auth
+config
+setup
+api
+
+# Named entities (critical for entity preservation testing)
+who is TDS motorsports
+React hooks tutorial
+Docker container networking
+Kubernetes pod deployment
+AWS Lambda functions
+
+# Personal notes / journals style
+meeting notes project kickoff
+ideas for new feature
+todo list app architecture
+
+# Research / learning
+what is dependency injection
+difference between sql and nosql
+kubernetes vs docker swarm
+
+# Error/debugging
+connection timeout error
+memory leak debugging
+cors error fix
+
+# Temporal / recency queries (should expand with years, "recent", "latest")
+recent news about Shopify
+latest AI developments
+best laptops right now
+what changed in kubernetes latest version
+
+# Complex queries
+how to implement caching with redis in nodejs
+best practices for api rate limiting
+setting up ci cd pipeline with github actions

+ 490 - 0
finetune/jobs/eval.py

@@ -0,0 +1,490 @@
+# /// script
+# requires-python = ">=3.10"
+# dependencies = [
+#     "transformers>=4.45.0",
+#     "peft>=0.7.0",
+#     "torch",
+#     "huggingface_hub>=0.20.0",
+#     "accelerate",
+# ]
+# ///
+"""
+Evaluate QMD query expansion models on HuggingFace Jobs.
+
+Self-contained script — inlines the reward function and test queries.
+
+    hf jobs uv run --flavor a10g-small --secrets HF_TOKEN --timeout 30m jobs/eval.py
+    hf jobs uv run --flavor a10g-small --secrets HF_TOKEN --timeout 30m jobs/eval.py -- --sft-only
+"""
+
+import argparse
+import csv
+import io
+import json
+import os
+import re
+import sys
+from collections import Counter
+
+import torch
+from huggingface_hub import HfApi, login
+from peft import PeftModel
+from transformers import AutoModelForCausalLM, AutoTokenizer
+
+# --- Config ---
+BASE_MODEL = "Qwen/Qwen3-1.7B"
+SFT_MODEL = "tobil/qmd-query-expansion-1.7B-sft"
+GRPO_MODEL = "tobil/qmd-query-expansion-1.7B-grpo"
+
+# --- Test queries (inlined from evals/queries.txt) ---
+QUERIES = [
+    # Technical documentation
+    "how to configure authentication",
+    "typescript async await",
+    "docker compose networking",
+    "git rebase vs merge",
+    "react useEffect cleanup",
+    # Short/ambiguous
+    "auth",
+    "config",
+    "setup",
+    "api",
+    # Named entities
+    "who is TDS motorsports",
+    "React hooks tutorial",
+    "Docker container networking",
+    "Kubernetes pod deployment",
+    "AWS Lambda functions",
+    # Personal notes / journals
+    "meeting notes project kickoff",
+    "ideas for new feature",
+    "todo list app architecture",
+    # Research / learning
+    "what is dependency injection",
+    "difference between sql and nosql",
+    "kubernetes vs docker swarm",
+    # Error/debugging
+    "connection timeout error",
+    "memory leak debugging",
+    "cors error fix",
+    # Temporal / recency
+    "recent news about Shopify",
+    "latest AI developments",
+    "best laptops right now",
+    "what changed in kubernetes latest version",
+    # Complex
+    "how to implement caching with redis in nodejs",
+    "best practices for api rate limiting",
+    "setting up ci cd pipeline with github actions",
+]
+
+# =============================================================================
+# Reward function (inlined from reward.py)
+# =============================================================================
+
+STOPWORDS = frozenset({
+    'the', 'a', 'an', 'is', 'are', 'to', 'for', 'of', 'in',
+    'and', 'or', 'it', 'this', 'that', 'be', 'with', 'as', 'on', 'by',
+})
+
+KEY_TERM_STOPWORDS = frozenset({
+    'what', 'is', 'how', 'to', 'the', 'a', 'an', 'in', 'on', 'for', 'of',
+    'and', 'or', 'with', 'my', 'your', 'do', 'does', 'can', 'i', 'me', 'we',
+    'who', 'where', 'when', 'why', 'which', 'find', 'get', 'show', 'tell',
+})
+
+GENERIC_LEX_PHRASES = frozenset({
+    'find information about', 'search for', 'look up', 'get information',
+    'learn about', 'information on', 'details about', 'find out about',
+    'what is', 'how to', 'guide to', 'help with',
+})
+
+CHAT_TEMPLATE_TOKENS = frozenset({
+    '<|im_start|>', '<|im_end|>', '<|endoftext|>',
+    '\nassistant\n', '\nuser\n',
+})
+
+
+def parse_expansion(text):
+    result = {"lex": [], "vec": [], "hyde": [], "invalid": []}
+    for line in text.strip().split("\n"):
+        line = line.strip()
+        if not line:
+            continue
+        if line.startswith("lex:"):
+            result["lex"].append(line[4:].strip())
+        elif line.startswith("vec:"):
+            result["vec"].append(line[4:].strip())
+        elif line.startswith("hyde:"):
+            result["hyde"].append(line[5:].strip())
+        else:
+            result["invalid"].append(line)
+    return result
+
+
+def clean_model_output(text):
+    text = text.replace('<|im_end|>', '').strip()
+    used_thinking = '<think>' in text and '</think>' in text
+    if used_thinking:
+        text = re.sub(r'<think>.*?</think>', '', text, flags=re.DOTALL).strip()
+    return text, used_thinking
+
+
+def extract_named_entities(query):
+    entities = set()
+    words = query.split()
+    prev_was_entity = False
+    for i, word in enumerate(words):
+        clean = word.strip('.,!?:;()[]"\'')
+        if not clean:
+            prev_was_entity = False
+            continue
+        is_entity = False
+        if clean.isupper() and len(clean) >= 2:
+            entities.add(clean.lower()); is_entity = True
+        elif i > 0 and clean[0].isupper() and clean.lower() not in KEY_TERM_STOPWORDS:
+            entities.add(clean.lower()); is_entity = True
+        elif any(c in clean for c in '.+-#@') and len(clean) >= 2:
+            entities.add(clean.lower()); is_entity = True
+        elif len(clean) > 1 and any(c.isupper() for c in clean[1:]) and clean[0].isupper():
+            entities.add(clean.lower()); is_entity = True
+        elif prev_was_entity and clean.lower() not in KEY_TERM_STOPWORDS:
+            entities.add(clean.lower()); is_entity = True
+        prev_was_entity = is_entity
+    return entities
+
+
+def get_key_terms(query):
+    return set(query.lower().split()) - KEY_TERM_STOPWORDS
+
+
+def lex_preserves_key_terms(lex_line, query):
+    key_terms = get_key_terms(query)
+    return not key_terms or bool(key_terms & set(lex_line.lower().split()))
+
+
+def lex_preserves_entities(line, entities):
+    if not entities: return True
+    return any(e in line.lower() for e in entities)
+
+
+def lex_is_generic(lex_line):
+    lower = lex_line.lower().strip()
+    for phrase in GENERIC_LEX_PHRASES:
+        if phrase in lower or lower.startswith(phrase.split()[0]):
+            remaining = lower
+            for word in phrase.split():
+                remaining = remaining.replace(word, '', 1).strip()
+            if len(remaining) < 3:
+                return True
+    return False
+
+
+def word_set_distance(a, b):
+    return len(set(a.lower().split()) ^ set(b.lower().split()))
+
+
+def is_diverse(a, b, min_distance=2):
+    a, b = a.lower().strip(), b.lower().strip()
+    if a == b or a in b or b in a: return False
+    return word_set_distance(a, b) >= min_distance
+
+
+def echoes_query(expansion, query):
+    exp, q = expansion.lower().strip(), query.lower().strip()
+    return exp == q or (q in exp and len(exp) < len(q) + 10)
+
+
+def word_repetition_penalty(text):
+    counts = Counter(re.findall(r'\b\w+\b', text.lower()))
+    return sum((c - 2) * 2 for w, c in counts.items()
+               if c >= 3 and w not in STOPWORDS and len(w) > 2)
+
+
+def score_expansion_detailed(query, expansion):
+    text, used_thinking = clean_model_output(expansion.strip())
+    deductions = []
+
+    def _fail(reason):
+        return {
+            "format": 0, "diversity": 0, "hyde": 0, "quality": 0, "entity": 0,
+            "think_bonus": 0, "total": 0, "max_possible": 100,
+            "percentage": 0.0, "rating": "Failed", "deductions": [reason],
+        }
+
+    if any(tok in text for tok in CHAT_TEMPLATE_TOKENS):
+        return _fail("CHAT TEMPLATE LEAKAGE")
+    for line in text.split("\n"):
+        line = line.strip()
+        if line and not line.startswith(("lex:", "vec:", "hyde:")):
+            return _fail(f"INVALID LINE: {line[:50]}")
+
+    parsed = parse_expansion(text)
+
+    format_score = 10
+    if parsed["lex"]: format_score += 10
+    else: deductions.append("missing lex:")
+    if parsed["vec"]: format_score += 10
+    else: deductions.append("missing vec:")
+
+    diversity_score = 0
+    types_present = sum(1 for t in ("lex", "vec") if parsed[t])
+    if types_present >= 2: diversity_score += 10
+    if len(parsed["lex"]) + len(parsed["vec"]) >= 2: diversity_score += 5
+    lex_div = 5
+    for i, a in enumerate(parsed["lex"]):
+        for b in parsed["lex"][i+1:]:
+            if not is_diverse(a, b, 2): lex_div -= 2
+    diversity_score += max(0, lex_div)
+    vec_div = 5
+    for i, a in enumerate(parsed["vec"]):
+        for b in parsed["vec"][i+1:]:
+            if not is_diverse(a, b, 3): vec_div -= 2
+    diversity_score += max(0, vec_div)
+    echo = 5
+    for exp in parsed["lex"] + parsed["vec"]:
+        if echoes_query(exp, query): echo -= 3
+    diversity_score += max(0, echo)
+
+    hyde_score = 0
+    if parsed["hyde"]:
+        hyde_text = parsed["hyde"][0]
+        hyde_score += 5
+        hyde_len = len(hyde_text)
+        if 50 <= hyde_len <= 200: hyde_score += 5
+        elif hyde_len < 50: hyde_score += 2
+        if "\n" not in hyde_text: hyde_score += 5
+        hyde_score += max(0, 5 - word_repetition_penalty(hyde_text))
+
+    quality_score = 5
+    if parsed["lex"] and parsed["vec"]:
+        avg_lex = sum(len(l) for l in parsed["lex"]) / len(parsed["lex"])
+        avg_vec = sum(len(v) for v in parsed["vec"]) / len(parsed["vec"])
+        if avg_lex <= avg_vec: quality_score += 5
+    if parsed["vec"]:
+        natural = sum(1 for v in parsed["vec"] if " " in v and len(v) > 15)
+        quality_score += 5 if natural == len(parsed["vec"]) else 2
+    if parsed["lex"]:
+        with_terms = sum(1 for l in parsed["lex"] if lex_preserves_key_terms(l, query))
+        if with_terms == len(parsed["lex"]): quality_score += 5
+        elif with_terms > 0: quality_score += 2
+
+    entity_score = 0
+    entities = extract_named_entities(query)
+    if entities and parsed["lex"]:
+        with_entities = sum(1 for l in parsed["lex"] if lex_preserves_entities(l, entities))
+        if with_entities == len(parsed["lex"]): entity_score += 15
+        elif with_entities > 0: entity_score += 5
+        else: entity_score -= 30
+        generic_count = sum(1 for l in parsed["lex"] if lex_is_generic(l))
+        if generic_count: entity_score -= generic_count * 15
+        if parsed["vec"]:
+            vec_with = sum(1 for v in parsed["vec"] if lex_preserves_entities(v, entities))
+            if vec_with > 0: entity_score += 5
+    elif not entities:
+        entity_score = 10
+
+    think_bonus = 0 if used_thinking else 20
+    total = format_score + diversity_score + hyde_score + quality_score + entity_score + think_bonus
+    max_possible = 140 if parsed["hyde"] else 120
+    percentage = max(0.0, min(100.0, total / max_possible * 100))
+
+    if percentage >= 80: rating = "Excellent"
+    elif percentage >= 60: rating = "Good"
+    elif percentage >= 40: rating = "Acceptable"
+    elif percentage >= 20: rating = "Poor"
+    else: rating = "Failed"
+
+    return {
+        "format": format_score, "diversity": diversity_score, "hyde": hyde_score,
+        "quality": quality_score, "entity": max(0, entity_score),
+        "think_bonus": think_bonus, "total": max(0, total),
+        "max_possible": max_possible, "percentage": round(percentage, 1),
+        "rating": rating, "deductions": deductions,
+        "entities_detected": list(entities) if entities else [],
+    }
+
+
+# =============================================================================
+# Model loading and generation
+# =============================================================================
+
+def load_model(base, sft=None, grpo=None):
+    print(f"Loading tokenizer from {base}...")
+    tokenizer = AutoTokenizer.from_pretrained(base)
+    if tokenizer.pad_token is None:
+        tokenizer.pad_token = tokenizer.eos_token
+
+    print(f"Loading base model {base}...")
+    model = AutoModelForCausalLM.from_pretrained(
+        base, torch_dtype=torch.bfloat16, device_map="auto",
+    )
+
+    if sft:
+        print(f"Loading and merging SFT adapter {sft}...")
+        model = PeftModel.from_pretrained(model, sft)
+        model = model.merge_and_unload()
+
+    if grpo:
+        print(f"Loading GRPO adapter {grpo}...")
+        model = PeftModel.from_pretrained(model, grpo)
+
+    model.eval()
+    return model, tokenizer
+
+
+def generate_expansion(model, tokenizer, query, max_new_tokens=200):
+    messages = [{"role": "user", "content": f"/no_think Expand this search query: {query}"}]
+    prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
+    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
+
+    with torch.no_grad():
+        outputs = model.generate(
+            **inputs, max_new_tokens=max_new_tokens,
+            temperature=0.7, do_sample=True,
+            pad_token_id=tokenizer.pad_token_id,
+            eos_token_id=tokenizer.eos_token_id,
+        )
+
+    full_output = tokenizer.decode(outputs[0], skip_special_tokens=True)
+    if "\nassistant\n" in full_output:
+        expansion = full_output.split("\nassistant\n")[-1].strip()
+    elif "assistant\n" in full_output:
+        expansion = full_output.split("assistant\n")[-1].strip()
+    else:
+        expansion = full_output[len(prompt):].strip()
+
+    if "<think>" in expansion:
+        expansion = re.sub(r'<think>.*?</think>', '', expansion, flags=re.DOTALL).strip()
+    return expansion
+
+
+# =============================================================================
+# Main
+# =============================================================================
+
+def results_to_csv(results, label):
+    """Convert eval results to CSV string."""
+    buf = io.StringIO()
+    writer = csv.writer(buf)
+    writer.writerow([
+        "model", "query", "expansion", "score_pct", "rating",
+        "format", "diversity", "hyde", "quality", "entity", "think_bonus",
+        "total", "max_possible", "deductions",
+    ])
+    for r in results:
+        s = r["scores"]
+        writer.writerow([
+            label, r["query"], r["expansion"], s["percentage"], s["rating"],
+            s["format"], s["diversity"], s["hyde"], s["quality"], s["entity"],
+            s["think_bonus"], s["total"], s["max_possible"],
+            "; ".join(s.get("deductions", [])),
+        ])
+    return buf.getvalue()
+
+
+def upload_csv(results, label, repo_id, api):
+    """Upload eval results CSV to HuggingFace Hub."""
+    csv_data = results_to_csv(results, label)
+    tag = label.split("/")[-1].replace(" ", "_").lower()
+    filename = f"eval_{tag}.csv"
+    print(f"  Uploading {filename} to {repo_id}...")
+    api.upload_file(
+        path_or_fileobj=csv_data.encode("utf-8"),
+        path_in_repo=filename,
+        repo_id=repo_id,
+        repo_type="model",
+    )
+    print(f"  Uploaded: https://huggingface.co/{repo_id}/blob/main/{filename}")
+
+
+def evaluate_model(model, tokenizer, label):
+    print(f"\n{'='*70}")
+    print(f"  EVALUATING: {label}")
+    print(f"{'='*70}")
+
+    results = []
+    for i, query in enumerate(QUERIES, 1):
+        expansion = generate_expansion(model, tokenizer, query)
+        scores = score_expansion_detailed(query, expansion)
+        results.append({"query": query, "expansion": expansion, "scores": scores})
+
+        marker = "+" if scores["percentage"] >= 80 else "-" if scores["percentage"] < 60 else "~"
+        print(f"  [{marker}] {i:2d}/{len(QUERIES)} {scores['percentage']:5.1f}% {scores['rating']:10s}  {query}")
+
+    avg = sum(r["scores"]["percentage"] for r in results) / len(results)
+    ratings = Counter(r["scores"]["rating"] for r in results)
+
+    print(f"\n  {'─'*50}")
+    print(f"  Average score: {avg:.1f}%")
+    print(f"  Ratings:")
+    for rating in ["Excellent", "Good", "Acceptable", "Poor", "Failed"]:
+        count = ratings.get(rating, 0)
+        if count > 0:
+            print(f"    {rating:10s}: {count:2d}  {'█' * count}")
+
+    # Show worst queries
+    worst = sorted(results, key=lambda r: r["scores"]["percentage"])[:5]
+    print(f"\n  Bottom 5:")
+    for r in worst:
+        print(f"    {r['scores']['percentage']:5.1f}%  {r['query']}")
+        if r["scores"]["deductions"]:
+            print(f"           {', '.join(r['scores']['deductions'][:3])}")
+
+    return results, avg
+
+
+def main():
+    parser = argparse.ArgumentParser()
+    parser.add_argument("--sft-only", action="store_true", help="Only evaluate SFT model")
+    parser.add_argument("--upload-repo", default="tobil/qmd-query-expansion-evals",
+                        help="HF repo to upload CSV results")
+    args = parser.parse_args()
+
+    hf_token = os.environ.get("HF_TOKEN")
+    if hf_token:
+        login(token=hf_token)
+
+    api = HfApi()
+    api.create_repo(repo_id=args.upload_repo, repo_type="model", exist_ok=True)
+
+    # Evaluate SFT
+    model, tokenizer = load_model(BASE_MODEL, sft=SFT_MODEL)
+    sft_results, sft_avg = evaluate_model(model, tokenizer, f"SFT: {SFT_MODEL}")
+    upload_csv(sft_results, "sft", args.upload_repo, api)
+
+    if not args.sft_only:
+        # For GRPO: reload base, merge SFT, then load GRPO adapter
+        del model
+        torch.cuda.empty_cache()
+        model, tokenizer = load_model(BASE_MODEL, sft=SFT_MODEL, grpo=GRPO_MODEL)
+        grpo_results, grpo_avg = evaluate_model(model, tokenizer, f"GRPO: {GRPO_MODEL}")
+        upload_csv(grpo_results, "grpo", args.upload_repo, api)
+
+        # Upload combined comparison CSV
+        combined = results_to_csv(sft_results, "sft") + results_to_csv(grpo_results, "grpo").split("\n", 1)[1]
+        api.upload_file(
+            path_or_fileobj=combined.encode("utf-8"),
+            path_in_repo="eval_comparison.csv",
+            repo_id=args.upload_repo,
+            repo_type="model",
+        )
+        print(f"  Uploaded: eval_comparison.csv")
+
+        # Comparison
+        print(f"\n{'='*70}")
+        print(f"  COMPARISON")
+        print(f"{'='*70}")
+        print(f"  SFT  average: {sft_avg:.1f}%")
+        print(f"  GRPO average: {grpo_avg:.1f}%")
+        print(f"  Delta:        {grpo_avg - sft_avg:+.1f}%")
+
+        improved = sum(1 for s, g in zip(sft_results, grpo_results)
+                       if g["scores"]["percentage"] > s["scores"]["percentage"])
+        regressed = sum(1 for s, g in zip(sft_results, grpo_results)
+                        if g["scores"]["percentage"] < s["scores"]["percentage"])
+        print(f"  Improved: {improved}/{len(QUERIES)}, Regressed: {regressed}/{len(QUERIES)}")
+
+
+if __name__ == "__main__":
+    main()

+ 354 - 0
finetune/jobs/eval_common.py

@@ -0,0 +1,354 @@
+"""
+Common evaluation and reward scoring for QMD query expansion models.
+
+Shared by sft.py and grpo.py for post-training evaluation.
+"""
+
+import csv
+import io
+import re
+from collections import Counter
+
+import torch
+from huggingface_hub import HfApi
+
+# =============================================================================
+# Reward function (single source of truth)
+# =============================================================================
+
+STOPWORDS = frozenset({
+    'the', 'a', 'an', 'is', 'are', 'to', 'for', 'of', 'in',
+    'and', 'or', 'it', 'this', 'that', 'be', 'with', 'as', 'on', 'by',
+})
+
+KEY_TERM_STOPWORDS = frozenset({
+    'what', 'is', 'how', 'to', 'the', 'a', 'an', 'in', 'on', 'for', 'of',
+    'and', 'or', 'with', 'my', 'your', 'do', 'does', 'can', 'i', 'me', 'we',
+    'who', 'where', 'when', 'why', 'which', 'find', 'get', 'show', 'tell',
+})
+
+GENERIC_LEX_PHRASES = frozenset({
+    'find information about', 'search for', 'look up', 'get information',
+    'learn about', 'information on', 'details about', 'find out about',
+    'what is', 'how to', 'guide to', 'help with',
+})
+
+CHAT_TEMPLATE_TOKENS = frozenset({
+    '<|im_start|>', '<|im_end|>', '<|endoftext|>',
+    '\nassistant\n', '\nuser\n',
+})
+
+
+def parse_expansion(text):
+    result = {"lex": [], "vec": [], "hyde": [], "invalid": []}
+    for line in text.strip().split("\n"):
+        line = line.strip()
+        if not line:
+            continue
+        if line.startswith("lex:"):
+            result["lex"].append(line[4:].strip())
+        elif line.startswith("vec:"):
+            result["vec"].append(line[4:].strip())
+        elif line.startswith("hyde:"):
+            result["hyde"].append(line[5:].strip())
+        else:
+            result["invalid"].append(line)
+    return result
+
+
+def clean_model_output(text):
+    text = text.replace('<|im_end|>', '').strip()
+    used_thinking = '<think>' in text and '</think>' in text
+    if used_thinking:
+        text = re.sub(r'<think>.*?</think>', '', text, flags=re.DOTALL).strip()
+    return text, used_thinking
+
+
+def extract_named_entities(query):
+    entities = set()
+    words = query.split()
+    prev_was_entity = False
+    for i, word in enumerate(words):
+        clean = word.strip('.,!?:;()[]"\'')
+        if not clean:
+            prev_was_entity = False
+            continue
+        is_entity = False
+        if clean.isupper() and len(clean) >= 2:
+            entities.add(clean.lower()); is_entity = True
+        elif i > 0 and clean[0].isupper() and clean.lower() not in KEY_TERM_STOPWORDS:
+            entities.add(clean.lower()); is_entity = True
+        elif any(c in clean for c in '.+-#@') and len(clean) >= 2:
+            entities.add(clean.lower()); is_entity = True
+        elif len(clean) > 1 and any(c.isupper() for c in clean[1:]) and clean[0].isupper():
+            entities.add(clean.lower()); is_entity = True
+        elif prev_was_entity and clean.lower() not in KEY_TERM_STOPWORDS:
+            entities.add(clean.lower()); is_entity = True
+        prev_was_entity = is_entity
+    return entities
+
+
+def get_key_terms(query):
+    return set(query.lower().split()) - KEY_TERM_STOPWORDS
+
+
+def lex_preserves_key_terms(lex_line, query):
+    key_terms = get_key_terms(query)
+    return not key_terms or bool(key_terms & set(lex_line.lower().split()))
+
+
+def lex_preserves_entities(line, entities):
+    if not entities:
+        return True
+    return any(e in line.lower() for e in entities)
+
+
+def lex_is_generic(lex_line):
+    lower = lex_line.lower().strip()
+    for phrase in GENERIC_LEX_PHRASES:
+        if phrase in lower or lower.startswith(phrase.split()[0]):
+            remaining = lower
+            for word in phrase.split():
+                remaining = remaining.replace(word, '', 1).strip()
+            if len(remaining) < 3:
+                return True
+    return False
+
+
+def word_set_distance(a, b):
+    return len(set(a.lower().split()) ^ set(b.lower().split()))
+
+
+def is_diverse(a, b, min_distance=2):
+    a, b = a.lower().strip(), b.lower().strip()
+    if a == b or a in b or b in a:
+        return False
+    return word_set_distance(a, b) >= min_distance
+
+
+def echoes_query(expansion, query):
+    exp, q = expansion.lower().strip(), query.lower().strip()
+    return exp == q or (q in exp and len(exp) < len(q) + 10)
+
+
+def word_repetition_penalty(text):
+    counts = Counter(re.findall(r'\b\w+\b', text.lower()))
+    return sum((c - 2) * 2 for w, c in counts.items()
+               if c >= 3 and w not in STOPWORDS and len(w) > 2)
+
+
+def score_expansion(query, expansion):
+    """Score expansion as float in [0.0, 1.0] for RL reward."""
+    text, used_thinking = clean_model_output(expansion.strip())
+
+    if any(tok in text for tok in CHAT_TEMPLATE_TOKENS):
+        return 0.0
+    for line in text.split("\n"):
+        line = line.strip()
+        if line and not line.startswith(("lex:", "vec:", "hyde:")):
+            return 0.0
+
+    parsed = parse_expansion(text)
+
+    format_score = 10
+    if parsed["lex"]: format_score += 10
+    if parsed["vec"]: format_score += 10
+
+    diversity_score = 0
+    if sum(1 for t in ("lex", "vec") if parsed[t]) >= 2: diversity_score += 10
+    if len(parsed["lex"]) + len(parsed["vec"]) >= 2: diversity_score += 5
+    lex_div = 5
+    for i, a in enumerate(parsed["lex"]):
+        for b in parsed["lex"][i+1:]:
+            if not is_diverse(a, b, 2): lex_div -= 2
+    diversity_score += max(0, lex_div)
+    vec_div = 5
+    for i, a in enumerate(parsed["vec"]):
+        for b in parsed["vec"][i+1:]:
+            if not is_diverse(a, b, 3): vec_div -= 2
+    diversity_score += max(0, vec_div)
+    echo = 5
+    for exp in parsed["lex"] + parsed["vec"]:
+        if echoes_query(exp, query): echo -= 3
+    diversity_score += max(0, echo)
+
+    hyde_score = 0
+    if parsed["hyde"]:
+        hyde_text = parsed["hyde"][0]
+        hyde_score += 5
+        if 50 <= len(hyde_text) <= 200: hyde_score += 5
+        elif len(hyde_text) < 50: hyde_score += 2
+        if "\n" not in hyde_text: hyde_score += 5
+        hyde_score += max(0, 5 - word_repetition_penalty(hyde_text))
+
+    quality_score = 5
+    if parsed["lex"] and parsed["vec"]:
+        avg_lex = sum(len(l) for l in parsed["lex"]) / len(parsed["lex"])
+        avg_vec = sum(len(v) for v in parsed["vec"]) / len(parsed["vec"])
+        if avg_lex <= avg_vec: quality_score += 5
+    if parsed["vec"]:
+        natural = sum(1 for v in parsed["vec"] if " " in v and len(v) > 15)
+        quality_score += 5 if natural == len(parsed["vec"]) else 2
+    if parsed["lex"]:
+        with_terms = sum(1 for l in parsed["lex"] if lex_preserves_key_terms(l, query))
+        if with_terms == len(parsed["lex"]): quality_score += 5
+        elif with_terms > 0: quality_score += 2
+
+    entity_score = 0
+    entities = extract_named_entities(query)
+    if entities and parsed["lex"]:
+        with_entities = sum(1 for l in parsed["lex"] if lex_preserves_entities(l, entities))
+        if with_entities == len(parsed["lex"]): entity_score += 15
+        elif with_entities > 0: entity_score += 5
+        else: entity_score -= 30
+        generic_count = sum(1 for l in parsed["lex"] if lex_is_generic(l))
+        if generic_count: entity_score -= generic_count * 15
+        if parsed["vec"]:
+            vec_with = sum(1 for v in parsed["vec"] if lex_preserves_entities(v, entities))
+            if vec_with > 0: entity_score += 5
+    elif not entities:
+        entity_score = 10
+
+    think_bonus = 0 if used_thinking else 20
+    total = format_score + diversity_score + hyde_score + quality_score + entity_score + think_bonus
+    max_possible = 140 if parsed["hyde"] else 120
+    return max(0.0, min(1.0, total / max_possible))
+
+
+def extract_query_from_prompt(prompt):
+    """Extract the search query from a formatted prompt string."""
+    if "Expand this search query:" in prompt:
+        query = prompt.split("Expand this search query:")[-1].strip()
+        if "<|im_end|>" in query:
+            query = query.split("<|im_end|>")[0].strip()
+        return query
+    return prompt.strip()
+
+
+class QMDRewardFunction:
+    """Reward function wrapper for TRL's GRPOTrainer."""
+    __name__ = "qmd_scoring_reward"
+
+    def __call__(self, completions, prompts=None, **kwargs):
+        rewards = []
+        for i, completion in enumerate(completions):
+            query = ""
+            if prompts and i < len(prompts):
+                query = extract_query_from_prompt(prompts[i])
+            rewards.append(score_expansion(query, completion))
+        return rewards
+
+
+# =============================================================================
+# Evaluation
+# =============================================================================
+
+EVAL_QUERIES = [
+    # Technical documentation
+    "how to configure authentication",
+    "typescript async await",
+    "docker compose networking",
+    "git rebase vs merge",
+    "react useEffect cleanup",
+    # Short/ambiguous
+    "auth", "config", "setup", "api",
+    # Named entities
+    "who is TDS motorsports",
+    "React hooks tutorial",
+    "Docker container networking",
+    "Kubernetes pod deployment",
+    "AWS Lambda functions",
+    # Personal notes / journals
+    "meeting notes project kickoff",
+    "ideas for new feature",
+    "todo list app architecture",
+    # Research / learning
+    "what is dependency injection",
+    "difference between sql and nosql",
+    "kubernetes vs docker swarm",
+    # Error/debugging
+    "connection timeout error",
+    "memory leak debugging",
+    "cors error fix",
+    # Temporal / recency
+    "recent news about Shopify",
+    "latest AI developments",
+    "best laptops right now",
+    "what changed in kubernetes latest version",
+    # Complex
+    "how to implement caching with redis in nodejs",
+    "best practices for api rate limiting",
+    "setting up ci cd pipeline with github actions",
+]
+
+
+def generate_expansion(model, tokenizer, query, max_new_tokens=200):
+    """Generate a query expansion using the model."""
+    messages = [{"role": "user", "content": f"/no_think Expand this search query: {query}"}]
+    prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
+    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
+    with torch.no_grad():
+        outputs = model.generate(
+            **inputs, max_new_tokens=max_new_tokens,
+            temperature=0.7, do_sample=True,
+            pad_token_id=tokenizer.pad_token_id,
+            eos_token_id=tokenizer.eos_token_id,
+        )
+    full_output = tokenizer.decode(outputs[0], skip_special_tokens=True)
+    if "\nassistant\n" in full_output:
+        return full_output.split("\nassistant\n")[-1].strip()
+    elif "assistant\n" in full_output:
+        return full_output.split("assistant\n")[-1].strip()
+    return full_output[len(prompt):].strip()
+
+
+def run_eval(model, tokenizer, label, upload_repo="tobil/qmd-query-expansion-evals"):
+    """Evaluate model on EVAL_QUERIES, print results, upload CSV."""
+    api = HfApi()
+    api.create_repo(repo_id=upload_repo, repo_type="model", exist_ok=True)
+
+    print(f"\n{'='*70}")
+    print(f"  EVALUATING: {label}")
+    print(f"{'='*70}")
+
+    results = []
+    for i, query in enumerate(EVAL_QUERIES, 1):
+        expansion = generate_expansion(model, tokenizer, query)
+        score = score_expansion(query, expansion)
+        pct = round(score * 100, 1)
+        rating = ("Excellent" if pct >= 80 else "Good" if pct >= 60
+                  else "Acceptable" if pct >= 40 else "Poor" if pct >= 20 else "Failed")
+        marker = "+" if pct >= 80 else "-" if pct < 60 else "~"
+        print(f"  [{marker}] {i:2d}/{len(EVAL_QUERIES)} {pct:5.1f}% {rating:10s}  {query}")
+        results.append({"query": query, "expansion": expansion, "score": pct, "rating": rating})
+
+    avg = sum(r["score"] for r in results) / len(results)
+    ratings = Counter(r["rating"] for r in results)
+
+    print(f"\n  {'─'*50}")
+    print(f"  Average score: {avg:.1f}%")
+    for r in ["Excellent", "Good", "Acceptable", "Poor", "Failed"]:
+        c = ratings.get(r, 0)
+        if c:
+            print(f"    {r:10s}: {c:2d}  {'█' * c}")
+
+    worst = sorted(results, key=lambda r: r["score"])[:5]
+    print(f"\n  Bottom 5:")
+    for r in worst:
+        print(f"    {r['score']:5.1f}%  {r['query']}")
+
+    buf = io.StringIO()
+    writer = csv.writer(buf)
+    writer.writerow(["model", "query", "expansion", "score_pct", "rating"])
+    for r in results:
+        writer.writerow([label, r["query"], r["expansion"], r["score"], r["rating"]])
+
+    filename = f"eval_{label}.csv"
+    print(f"\n  Uploading {filename} to {upload_repo}...")
+    api.upload_file(
+        path_or_fileobj=buf.getvalue().encode("utf-8"),
+        path_in_repo=filename,
+        repo_id=upload_repo,
+        repo_type="model",
+    )
+    print(f"  Done: https://huggingface.co/{upload_repo}/blob/main/{filename}")

+ 113 - 0
finetune/jobs/eval_verbose.py

@@ -0,0 +1,113 @@
+# /// script
+# requires-python = ">=3.10"
+# dependencies = [
+#     "transformers>=4.45.0",
+#     "peft>=0.7.0",
+#     "torch",
+#     "huggingface_hub>=0.20.0",
+#     "accelerate",
+# ]
+# ///
+"""
+Verbose eval: prints the actual expansions for every query.
+
+    hf jobs uv run --flavor a10g-small --secrets HF_TOKEN --timeout 30m jobs/eval_verbose.py
+"""
+
+import os
+import re
+import sys
+from collections import Counter
+
+import torch
+from huggingface_hub import login
+from peft import PeftModel
+from transformers import AutoModelForCausalLM, AutoTokenizer
+
+BASE_MODEL = "Qwen/Qwen3-1.7B"
+SFT_MODEL = "tobil/qmd-query-expansion-1.7B-sft"
+GRPO_MODEL = "tobil/qmd-query-expansion-1.7B-grpo"
+
+QUERIES = [
+    "how to configure authentication",
+    "typescript async await",
+    "docker compose networking",
+    "git rebase vs merge",
+    "react useEffect cleanup",
+    "auth",
+    "config",
+    "setup",
+    "api",
+    "who is TDS motorsports",
+    "React hooks tutorial",
+    "Docker container networking",
+    "Kubernetes pod deployment",
+    "AWS Lambda functions",
+    "meeting notes project kickoff",
+    "ideas for new feature",
+    "todo list app architecture",
+    "what is dependency injection",
+    "difference between sql and nosql",
+    "kubernetes vs docker swarm",
+    "connection timeout error",
+    "memory leak debugging",
+    "cors error fix",
+    "recent news about Shopify",
+    "latest AI developments",
+    "best laptops right now",
+    "what changed in kubernetes latest version",
+    "how to implement caching with redis in nodejs",
+    "best practices for api rate limiting",
+    "setting up ci cd pipeline with github actions",
+]
+
+
+def load_model(base, sft=None, grpo=None):
+    tokenizer = AutoTokenizer.from_pretrained(base)
+    if tokenizer.pad_token is None:
+        tokenizer.pad_token = tokenizer.eos_token
+    model = AutoModelForCausalLM.from_pretrained(base, torch_dtype=torch.bfloat16, device_map="auto")
+    if sft:
+        model = PeftModel.from_pretrained(model, sft)
+        model = model.merge_and_unload()
+    if grpo:
+        model = PeftModel.from_pretrained(model, grpo)
+    model.eval()
+    return model, tokenizer
+
+
+def generate(model, tokenizer, query):
+    messages = [{"role": "user", "content": f"/no_think Expand this search query: {query}"}]
+    prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
+    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
+    with torch.no_grad():
+        out = model.generate(**inputs, max_new_tokens=200, temperature=0.7, do_sample=True,
+                             pad_token_id=tokenizer.pad_token_id, eos_token_id=tokenizer.eos_token_id)
+    text = tokenizer.decode(out[0], skip_special_tokens=True)
+    if "\nassistant\n" in text:
+        text = text.split("\nassistant\n")[-1].strip()
+    elif "assistant\n" in text:
+        text = text.split("assistant\n")[-1].strip()
+    if "<think>" in text:
+        text = re.sub(r'<think>.*?</think>', '', text, flags=re.DOTALL).strip()
+    return text
+
+
+def main():
+    hf_token = os.environ.get("HF_TOKEN")
+    if hf_token:
+        login(token=hf_token)
+
+    print("Loading GRPO model...", file=sys.stderr)
+    model, tokenizer = load_model(BASE_MODEL, sft=SFT_MODEL, grpo=GRPO_MODEL)
+
+    for i, query in enumerate(QUERIES, 1):
+        expansion = generate(model, tokenizer, query)
+        print(f"\n{'='*60}")
+        print(f"[{i}/{len(QUERIES)}] {query}")
+        print(f"{'─'*60}")
+        print(expansion)
+
+
+if __name__ == "__main__":
+    main()

+ 130 - 0
finetune/jobs/grpo.py

@@ -0,0 +1,130 @@
+# /// script
+# requires-python = ">=3.10"
+# dependencies = [
+#     "trl>=0.12.0",
+#     "peft>=0.7.0",
+#     "transformers>=4.45.0",
+#     "accelerate>=0.24.0",
+#     "huggingface_hub>=0.20.0",
+#     "datasets",
+#     "bitsandbytes",
+#     "torch",
+# ]
+# ///
+"""
+GRPO training for QMD query expansion (Qwen3-1.7B).
+
+Runs on top of merged SFT weights. Self-contained for HuggingFace Jobs:
+    hf jobs uv run --flavor a10g-large --secrets HF_TOKEN --timeout 4h jobs/grpo.py
+"""
+
+import os
+import sys
+
+import torch
+from datasets import load_dataset
+from huggingface_hub import login
+from peft import LoraConfig, PeftModel, get_peft_model
+from transformers import AutoModelForCausalLM, AutoTokenizer
+from trl import GRPOTrainer, GRPOConfig
+
+sys.path.insert(0, os.path.dirname(os.path.abspath(__file__)))
+from eval_common import QMDRewardFunction, run_eval
+
+# --- Config (inlined from configs/grpo.yaml) ---
+BASE_MODEL = "Qwen/Qwen3-1.7B"
+SFT_MODEL = "tobil/qmd-query-expansion-1.7B-sft"
+OUTPUT_MODEL = "tobil/qmd-query-expansion-1.7B-grpo"
+DATASET = "tobil/qmd-query-expansion-train-v2"
+
+
+def main():
+    hf_token = os.environ.get("HF_TOKEN")
+    if hf_token:
+        login(token=hf_token)
+
+    print(f"Loading tokenizer from {BASE_MODEL}...")
+    tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)
+    if tokenizer.pad_token is None:
+        tokenizer.pad_token = tokenizer.eos_token
+
+    # Load and format dataset
+    print(f"Loading dataset: {DATASET}...")
+    dataset = load_dataset(DATASET, split="train")
+
+    def extract_prompt(example):
+        content = example["messages"][0]["content"]
+        messages = [{"role": "user", "content": content}]
+        formatted = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
+        return {"prompt": formatted}
+
+    dataset = dataset.map(extract_prompt, remove_columns=dataset.column_names)
+    dataset = dataset.shuffle(seed=42).select(range(min(1000, len(dataset))))
+    print(f"Using {len(dataset)} prompts for GRPO")
+
+    # Load base model, merge SFT adapter
+    print(f"Loading base model {BASE_MODEL}...")
+    base_model = AutoModelForCausalLM.from_pretrained(
+        BASE_MODEL, torch_dtype=torch.bfloat16, device_map="auto",
+    )
+    print(f"Merging SFT adapter {SFT_MODEL}...")
+    model = PeftModel.from_pretrained(base_model, SFT_MODEL)
+    model = model.merge_and_unload()
+    print("SFT adapter merged.")
+
+    # Fresh LoRA for GRPO (small: rank 4, q/v only)
+    grpo_lora = LoraConfig(
+        r=4, lora_alpha=8, lora_dropout=0.05,
+        bias="none", task_type="CAUSAL_LM",
+        target_modules=["q_proj", "v_proj"],
+    )
+    model = get_peft_model(model, grpo_lora)
+    model.print_trainable_parameters()
+
+    config = GRPOConfig(
+        output_dir="qmd-query-expansion-1.7B-grpo",
+        push_to_hub=True,
+        hub_model_id=OUTPUT_MODEL,
+
+        num_generations=4,
+        max_completion_length=200,
+        beta=0.04,  # KL regularization — prevents drift from SFT checkpoint
+
+        num_train_epochs=1,
+        per_device_train_batch_size=2,
+        gradient_accumulation_steps=8,
+        learning_rate=5e-7,
+        max_grad_norm=0.5,
+        max_steps=200,
+
+        logging_steps=10,
+        save_strategy="epoch",
+        bf16=True,
+
+        report_to="none",
+    )
+
+    print("Initializing GRPO trainer...")
+    trainer = GRPOTrainer(
+        model=model,
+        processing_class=tokenizer,
+        args=config,
+        train_dataset=dataset,
+        reward_funcs=[QMDRewardFunction()],
+    )
+
+    print("Starting GRPO training...")
+    trainer.train()
+
+    print("Pushing to Hub...")
+    trainer.push_to_hub()
+    print(f"Done! Model: https://huggingface.co/{OUTPUT_MODEL}")
+
+    # --- Automatic evaluation ---
+    print("\nStarting automatic evaluation...")
+    trainer.model.eval()
+    run_eval(trainer.model, tokenizer, "grpo")
+
+
+if __name__ == "__main__":
+    main()

+ 244 - 0
finetune/jobs/quantize.py

@@ -0,0 +1,244 @@
+# /// script
+# requires-python = ">=3.10"
+# dependencies = [
+#     "transformers>=4.45.0",
+#     "peft>=0.7.0",
+#     "torch",
+#     "huggingface_hub>=0.20.0",
+#     "accelerate",
+#     "sentencepiece>=0.1.99",
+#     "protobuf>=3.20.0",
+#     "numpy",
+#     "gguf",
+# ]
+# ///
+"""
+Merge SFT + GRPO adapters and convert to GGUF with multiple quantizations.
+
+Uploads each quantization to HuggingFace Hub as it's produced, so partial
+results are available even if the job times out.
+
+    hf jobs uv run --flavor a10g-large --secrets HF_TOKEN --timeout 2h jobs/quantize.py
+    hf jobs uv run --flavor a10g-large --secrets HF_TOKEN --timeout 2h jobs/quantize.py -- --size 4B
+"""
+
+import argparse
+import os
+import subprocess
+import sys
+
+import torch
+from huggingface_hub import HfApi, login
+from peft import PeftModel
+from transformers import AutoModelForCausalLM, AutoTokenizer
+
+PRESETS = {
+    "1.7B": {
+        "base": "Qwen/Qwen3-1.7B",
+        "sft": "tobil/qmd-query-expansion-1.7B-sft",
+        "grpo": "tobil/qmd-query-expansion-1.7B-grpo",
+        "output": "tobil/qmd-query-expansion-1.7B-gguf",
+    },
+    "4B": {
+        "base": "Qwen/Qwen3-4B",
+        "sft": "tobil/qmd-query-expansion-4B-sft",
+        "grpo": "tobil/qmd-query-expansion-4B-grpo",
+        "output": "tobil/qmd-query-expansion-4B-gguf",
+    },
+}
+
+QUANT_TYPES = [
+    ("Q4_K_M", "4-bit (recommended for most use)"),
+    ("Q5_K_M", "5-bit (balanced quality/size)"),
+    ("Q8_0", "8-bit (highest quality)"),
+]
+
+
+def run_cmd(cmd, description):
+    print(f"  {description}...")
+    try:
+        result = subprocess.run(cmd, check=True, capture_output=True, text=True)
+        return True
+    except subprocess.CalledProcessError as e:
+        print(f"  FAILED: {' '.join(cmd)}")
+        if e.stderr:
+            print(f"  {e.stderr[:500]}")
+        return False
+    except FileNotFoundError:
+        print(f"  Command not found: {cmd[0]}")
+        return False
+
+
+def main():
+    parser = argparse.ArgumentParser(description="Convert QMD model to GGUF")
+    parser.add_argument("--size", default="1.7B", choices=PRESETS.keys(), help="Model size preset")
+    args = parser.parse_args()
+
+    preset = PRESETS[args.size]
+    base_model = preset["base"]
+    sft_model = preset["sft"]
+    grpo_model = preset["grpo"]
+    output_repo = preset["output"]
+    model_name = output_repo.split("/")[-1].replace("-gguf", "")
+
+    print(f"QMD GGUF Conversion: {model_name}")
+    print("=" * 60)
+
+    hf_token = os.environ.get("HF_TOKEN")
+    if hf_token:
+        login(token=hf_token)
+
+    api = HfApi()
+    api.create_repo(repo_id=output_repo, repo_type="model", exist_ok=True)
+
+    # Step 1: Install build tools
+    print("\nStep 1: Installing build dependencies...")
+    subprocess.run(["apt-get", "update", "-qq"], capture_output=True)
+    subprocess.run(["apt-get", "install", "-y", "-qq", "build-essential", "cmake", "git"], capture_output=True)
+
+    # Step 2: Load and merge
+    print(f"\nStep 2: Loading base model {base_model}...")
+    model = AutoModelForCausalLM.from_pretrained(
+        base_model, torch_dtype=torch.bfloat16, device_map="auto", trust_remote_code=True,
+    )
+
+    print(f"Step 3: Merging SFT adapter {sft_model}...")
+    model = PeftModel.from_pretrained(model, sft_model)
+    model = model.merge_and_unload()
+
+    print(f"Step 4: Merging GRPO adapter {grpo_model}...")
+    model = PeftModel.from_pretrained(model, grpo_model)
+    model = model.merge_and_unload()
+
+    tokenizer = AutoTokenizer.from_pretrained(base_model, trust_remote_code=True)
+
+    # Step 3: Save merged model
+    merged_dir = "/tmp/merged_model"
+    print(f"\nStep 5: Saving merged model to {merged_dir}...")
+    model.save_pretrained(merged_dir, safe_serialization=True)
+    tokenizer.save_pretrained(merged_dir)
+    del model
+    torch.cuda.empty_cache()
+
+    # Step 4: Setup llama.cpp
+    print("\nStep 6: Setting up llama.cpp...")
+    if not os.path.exists("/tmp/llama.cpp"):
+        run_cmd(["git", "clone", "--depth", "1", "https://github.com/ggerganov/llama.cpp.git", "/tmp/llama.cpp"],
+                "Cloning llama.cpp")
+    subprocess.run([sys.executable, "-m", "pip", "install", "-q", "-r", "/tmp/llama.cpp/requirements.txt"],
+                   capture_output=True)
+
+    # Step 5: Convert to FP16 GGUF
+    gguf_dir = "/tmp/gguf_output"
+    os.makedirs(gguf_dir, exist_ok=True)
+    fp16_file = f"{gguf_dir}/{model_name}-f16.gguf"
+
+    print(f"\nStep 7: Converting to FP16 GGUF...")
+    if not run_cmd([sys.executable, "/tmp/llama.cpp/convert_hf_to_gguf.py",
+                    merged_dir, "--outfile", fp16_file, "--outtype", "f16"],
+                   "Converting to FP16"):
+        sys.exit(1)
+
+    size_mb = os.path.getsize(fp16_file) / (1024 * 1024)
+    print(f"  FP16: {size_mb:.1f} MB")
+
+    # Upload FP16 immediately
+    print(f"  Uploading FP16 to {output_repo}...")
+    api.upload_file(path_or_fileobj=fp16_file,
+                    path_in_repo=f"{model_name}-f16.gguf", repo_id=output_repo)
+    print(f"  Uploaded: {model_name}-f16.gguf")
+
+    # Step 6: Build quantize tool
+    print("\nStep 8: Building quantize tool...")
+    os.makedirs("/tmp/llama.cpp/build", exist_ok=True)
+    run_cmd(["cmake", "-B", "/tmp/llama.cpp/build", "-S", "/tmp/llama.cpp", "-DGGML_CUDA=OFF"],
+            "CMake configure")
+    run_cmd(["cmake", "--build", "/tmp/llama.cpp/build", "--target", "llama-quantize", "-j", "4"],
+            "Building llama-quantize")
+    quantize_bin = "/tmp/llama.cpp/build/bin/llama-quantize"
+
+    # Step 7: Quantize and upload each one immediately
+    print("\nStep 9: Quantizing and uploading...")
+    for quant_type, desc in QUANT_TYPES:
+        qfile = f"{gguf_dir}/{model_name}-{quant_type.lower()}.gguf"
+        if run_cmd([quantize_bin, fp16_file, qfile, quant_type], f"{quant_type} ({desc})"):
+            qsize = os.path.getsize(qfile) / (1024 * 1024)
+            print(f"  {quant_type}: {qsize:.1f} MB")
+
+            print(f"  Uploading {quant_type} to {output_repo}...")
+            api.upload_file(path_or_fileobj=qfile,
+                            path_in_repo=f"{model_name}-{quant_type.lower()}.gguf", repo_id=output_repo)
+            print(f"  Uploaded: {model_name}-{quant_type.lower()}.gguf")
+
+            # Remove to save disk
+            os.remove(qfile)
+
+    # Step 8: Upload README
+    ollama_name = "qmd-expand" if args.size == "1.7B" else f"qmd-expand-{args.size.lower()}"
+    readme = f"""---
+base_model: {base_model}
+tags: [gguf, llama.cpp, quantized, query-expansion, qmd]
+---
+# {model_name} (GGUF)
+
+GGUF quantizations of the QMD Query Expansion model for use with
+[Ollama](https://ollama.com), [llama.cpp](https://github.com/ggerganov/llama.cpp),
+or [LM Studio](https://lmstudio.ai).
+
+## Available Quantizations
+
+| File | Quant | Description |
+|------|-------|-------------|
+| `{model_name}-q4_k_m.gguf` | Q4_K_M | 4-bit — smallest, recommended for most use |
+| `{model_name}-q5_k_m.gguf` | Q5_K_M | 5-bit — balanced quality/size |
+| `{model_name}-q8_0.gguf` | Q8_0 | 8-bit — highest quality |
+| `{model_name}-f16.gguf` | FP16 | Full precision (large) |
+
+## Details
+
+- **Base:** {base_model}
+- **SFT:** {sft_model}
+- **GRPO:** {grpo_model}
+- **Task:** Query expansion for hybrid search (lex/vec/hyde format)
+- **Eval score:** 90.7% average (29/30 Excellent)
+
+## Quick Start with Ollama
+
+```bash
+huggingface-cli download {output_repo} \\
+    {model_name}-q4_k_m.gguf --local-dir .
+
+echo 'FROM ./{model_name}-q4_k_m.gguf' > Modelfile
+ollama create {ollama_name} -f Modelfile
+ollama run {ollama_name}
+```
+
+## Prompt Format
+
+```
+<|im_start|>user
+/no_think Expand this search query: your query here<|im_end|>
+<|im_start|>assistant
+```
+
+The model produces structured output:
+```
+lex: keyword expansion for BM25 search
+lex: another keyword variant
+vec: natural language expansion for vector search
+vec: another semantic expansion
+hyde: A hypothetical document passage that might match this query.
+```
+"""
+    api.upload_file(path_or_fileobj=readme.encode(),
+                    path_in_repo="README.md", repo_id=output_repo)
+
+    print(f"\nDone! Repository: https://huggingface.co/{output_repo}")
+    print(f"\nTo use with Ollama:")
+    print(f"  huggingface-cli download {output_repo} {model_name}-q4_k_m.gguf --local-dir .")
+    print(f"  echo 'FROM ./{model_name}-q4_k_m.gguf' > Modelfile")
+    print(f"  ollama create {ollama_name} -f Modelfile")
+
+
+if __name__ == "__main__":
+    main()

+ 111 - 0
finetune/jobs/sft.py

@@ -0,0 +1,111 @@
+# /// script
+# requires-python = ">=3.10"
+# dependencies = [
+#     "trl>=0.12.0",
+#     "peft>=0.7.0",
+#     "transformers>=4.45.0",
+#     "accelerate>=0.24.0",
+#     "huggingface_hub>=0.20.0",
+#     "datasets",
+#     "bitsandbytes",
+#     "torch",
+# ]
+# ///
+"""
+SFT training for QMD query expansion (Qwen3-1.7B).
+
+Self-contained script for HuggingFace Jobs:
+    hf jobs uv run --flavor a10g-large --secrets HF_TOKEN --timeout 2h jobs/sft.py
+"""
+
+import os
+import sys
+from huggingface_hub import login
+
+# --- Config (inlined from configs/sft.yaml) ---
+BASE_MODEL = "Qwen/Qwen3-1.7B"
+OUTPUT_MODEL = "tobil/qmd-query-expansion-1.7B-sft"
+DATASET = "tobil/qmd-query-expansion-train-v2"
+
+hf_token = os.environ.get("HF_TOKEN")
+if hf_token:
+    login(token=hf_token)
+
+from datasets import load_dataset
+from peft import LoraConfig
+from transformers import AutoTokenizer
+from trl import SFTTrainer, SFTConfig
+
+# Load and split dataset
+print(f"Loading dataset: {DATASET}...")
+dataset = load_dataset(DATASET, split="train")
+print(f"Dataset loaded: {len(dataset)} examples")
+
+split = dataset.train_test_split(test_size=0.1, seed=42)
+train_dataset = split["train"]
+eval_dataset = split["test"]
+print(f"  Train: {len(train_dataset)}, Eval: {len(eval_dataset)}")
+
+# SFT config
+config = SFTConfig(
+    output_dir="qmd-query-expansion-1.7B-sft",
+    push_to_hub=True,
+    hub_model_id=OUTPUT_MODEL,
+    hub_strategy="every_save",
+
+    num_train_epochs=5,
+    per_device_train_batch_size=4,
+    gradient_accumulation_steps=4,
+    learning_rate=2e-4,
+    max_length=512,
+
+    logging_steps=10,
+    save_strategy="steps",
+    save_steps=200,
+    save_total_limit=2,
+    eval_strategy="steps",
+    eval_steps=200,
+
+    warmup_ratio=0.03,
+    lr_scheduler_type="cosine",
+    bf16=True,
+
+    report_to="none",
+)
+
+# LoRA: rank 16, all projection layers
+peft_config = LoraConfig(
+    r=16,
+    lora_alpha=32,
+    lora_dropout=0.0,
+    bias="none",
+    task_type="CAUSAL_LM",
+    target_modules=["q_proj", "k_proj", "v_proj", "o_proj", "gate_proj", "up_proj", "down_proj"],
+)
+
+print("Initializing SFT trainer...")
+trainer = SFTTrainer(
+    model=BASE_MODEL,
+    train_dataset=train_dataset,
+    eval_dataset=eval_dataset,
+    args=config,
+    peft_config=peft_config,
+)
+
+print("Starting SFT training...")
+trainer.train()
+
+print("Pushing to Hub...")
+trainer.push_to_hub()
+print(f"Done! Model: https://huggingface.co/{OUTPUT_MODEL}")
+
+# --- Automatic evaluation ---
+sys.path.insert(0, os.path.dirname(os.path.abspath(__file__)))
+from eval_common import run_eval
+
+print("\nStarting automatic evaluation...")
+eval_tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)
+if eval_tokenizer.pad_token is None:
+    eval_tokenizer.pad_token = eval_tokenizer.eos_token
+trainer.model.eval()
+run_eval(trainer.model, eval_tokenizer, "sft")

+ 428 - 0
finetune/reward.py

@@ -0,0 +1,428 @@
+# /// script
+# requires-python = ">=3.10"
+# dependencies = []
+# ///
+"""
+QMD Query Expansion Reward Function
+
+Single source of truth for scoring query expansions. Used by:
+- GRPO training (as the RL reward signal)
+- Evaluation scripts (for scoring model outputs)
+
+Scores expansions on five dimensions:
+  Format (30)   - Has lex/vec lines, no invalid lines
+  Diversity (30) - Multiple types, diverse content, no echoes
+  HyDE (20)      - Optional bonus for hypothetical document passage
+  Quality (20)   - Lex shorter than vec, natural language, key terms
+  Entity (20)    - Named entity preservation in lex/vec lines
+
+Returns 0.0-1.0 for RL rewards, or a detailed breakdown dict for evaluation.
+"""
+
+import re
+from collections import Counter
+
+# =============================================================================
+# Constants
+# =============================================================================
+
+STOPWORDS = frozenset({
+    'the', 'a', 'an', 'is', 'are', 'to', 'for', 'of', 'in',
+    'and', 'or', 'it', 'this', 'that', 'be', 'with', 'as', 'on', 'by',
+})
+
+KEY_TERM_STOPWORDS = frozenset({
+    'what', 'is', 'how', 'to', 'the', 'a', 'an', 'in', 'on', 'for', 'of',
+    'and', 'or', 'with', 'my', 'your', 'do', 'does', 'can', 'i', 'me', 'we',
+    'who', 'where', 'when', 'why', 'which', 'find', 'get', 'show', 'tell',
+})
+
+GENERIC_LEX_PHRASES = frozenset({
+    'find information about', 'search for', 'look up', 'get information',
+    'learn about', 'information on', 'details about', 'find out about',
+    'what is', 'how to', 'guide to', 'help with',
+})
+
+# Chat template tokens that indicate a broken output
+CHAT_TEMPLATE_TOKENS = frozenset({
+    '<|im_start|>', '<|im_end|>', '<|endoftext|>',
+    '\nassistant\n', '\nuser\n',
+})
+
+
+# =============================================================================
+# Parsing
+# =============================================================================
+
+def parse_expansion(text: str) -> dict:
+    """Parse a multi-line expansion into {lex, vec, hyde, invalid} lists."""
+    result = {"lex": [], "vec": [], "hyde": [], "invalid": []}
+    for line in text.strip().split("\n"):
+        line = line.strip()
+        if not line:
+            continue
+        if line.startswith("lex:"):
+            result["lex"].append(line[4:].strip())
+        elif line.startswith("vec:"):
+            result["vec"].append(line[4:].strip())
+        elif line.startswith("hyde:"):
+            result["hyde"].append(line[5:].strip())
+        else:
+            result["invalid"].append(line)
+    return result
+
+
+def clean_model_output(text: str) -> tuple[str, bool]:
+    """Strip chat template artifacts from model output.
+
+    Returns (cleaned_text, used_thinking) where used_thinking is True
+    if the model emitted <think>...</think> blocks.
+    """
+    text = text.replace('<|im_end|>', '').strip()
+
+    used_thinking = '<think>' in text and '</think>' in text
+    if used_thinking:
+        text = re.sub(r'<think>.*?</think>', '', text, flags=re.DOTALL).strip()
+
+    return text, used_thinking
+
+
+# =============================================================================
+# Helpers
+# =============================================================================
+
+def extract_named_entities(query: str) -> set:
+    """Extract named entities using heuristics.
+
+    Detects: ALL-CAPS acronyms (TDS, API), capitalized proper nouns (React),
+    technical terms with special chars (node.js, C++), CamelCase (JavaScript),
+    and compound names (TDS motorsports -> both words).
+    """
+    entities = set()
+    words = query.split()
+    prev_was_entity = False
+
+    for i, word in enumerate(words):
+        clean = word.strip('.,!?:;()[]"\'')
+        if not clean:
+            prev_was_entity = False
+            continue
+
+        is_entity = False
+
+        if clean.isupper() and len(clean) >= 2:
+            entities.add(clean.lower())
+            is_entity = True
+        elif i > 0 and clean[0].isupper() and clean.lower() not in KEY_TERM_STOPWORDS:
+            entities.add(clean.lower())
+            is_entity = True
+        elif any(c in clean for c in '.+-#@') and len(clean) >= 2:
+            entities.add(clean.lower())
+            is_entity = True
+        elif len(clean) > 1 and any(c.isupper() for c in clean[1:]) and clean[0].isupper():
+            entities.add(clean.lower())
+            is_entity = True
+        elif prev_was_entity and clean.lower() not in KEY_TERM_STOPWORDS:
+            entities.add(clean.lower())
+            is_entity = True
+
+        prev_was_entity = is_entity
+
+    return entities
+
+
+def get_key_terms(query: str) -> set:
+    """Get non-stopword terms from a query."""
+    return set(query.lower().split()) - KEY_TERM_STOPWORDS
+
+
+def lex_preserves_key_terms(lex_line: str, query: str) -> bool:
+    """Does the lex line contain at least one key term from the query?"""
+    key_terms = get_key_terms(query)
+    if not key_terms:
+        return True
+    return bool(key_terms & set(lex_line.lower().split()))
+
+
+def lex_preserves_entities(line: str, entities: set) -> bool:
+    """Does the line contain at least one named entity?"""
+    if not entities:
+        return True
+    lower = line.lower()
+    return any(e in lower for e in entities)
+
+
+def lex_is_generic(lex_line: str) -> bool:
+    """Is this lex line a useless generic filler phrase?"""
+    lower = lex_line.lower().strip()
+    for phrase in GENERIC_LEX_PHRASES:
+        if phrase in lower or lower.startswith(phrase.split()[0]):
+            remaining = lower
+            for word in phrase.split():
+                remaining = remaining.replace(word, '', 1).strip()
+            if len(remaining) < 3:
+                return True
+    return False
+
+
+def word_set_distance(a: str, b: str) -> int:
+    """Symmetric difference of word sets (how many words are unique to one)."""
+    return len(set(a.lower().split()) ^ set(b.lower().split()))
+
+
+def is_diverse(a: str, b: str, min_distance: int = 2) -> bool:
+    """Are two strings sufficiently different?"""
+    a, b = a.lower().strip(), b.lower().strip()
+    if a == b or a in b or b in a:
+        return False
+    return word_set_distance(a, b) >= min_distance
+
+
+def echoes_query(expansion: str, query: str) -> bool:
+    """Is this expansion just echoing the original query?"""
+    exp, q = expansion.lower().strip(), query.lower().strip()
+    return exp == q or (q in exp and len(exp) < len(q) + 10)
+
+
+def word_repetition_penalty(text: str) -> int:
+    """Penalty for words repeated 3+ times (excluding stopwords)."""
+    counts = Counter(re.findall(r'\b\w+\b', text.lower()))
+    return sum((c - 2) * 2 for w, c in counts.items()
+               if c >= 3 and w not in STOPWORDS and len(w) > 2)
+
+
+# =============================================================================
+# Scoring
+# =============================================================================
+
+def score_expansion_detailed(query: str, expansion: str) -> dict:
+    """Score an expansion with full breakdown. Returns dict with all dimensions."""
+    text, used_thinking = clean_model_output(expansion.strip())
+    deductions = []
+
+    def _fail(reason):
+        return {
+            "format": 0, "diversity": 0, "hyde": 0, "quality": 0, "entity": 0,
+            "think_bonus": 0, "total": 0, "max_possible": 100,
+            "percentage": 0.0, "rating": "Failed",
+            "deductions": [reason],
+            "parsed": parse_expansion(expansion),
+            "entities_detected": [],
+        }
+
+    # Hard fail: remaining chat template tokens
+    if any(tok in text for tok in CHAT_TEMPLATE_TOKENS):
+        return _fail("CHAT TEMPLATE LEAKAGE")
+
+    # Hard fail: every non-empty line must have a valid prefix
+    for line in text.split("\n"):
+        line = line.strip()
+        if line and not line.startswith(("lex:", "vec:", "hyde:")):
+            return _fail(f"INVALID LINE: {line[:50]}")
+
+    parsed = parse_expansion(text)
+
+    # --- Format (0-30) ---
+    format_score = 10  # no invalid lines (guaranteed by hard fail)
+    if parsed["lex"]:
+        format_score += 10
+    else:
+        deductions.append("missing lex:")
+    if parsed["vec"]:
+        format_score += 10
+    else:
+        deductions.append("missing vec:")
+
+    # --- Diversity (0-30) ---
+    diversity_score = 0
+
+    types_present = sum(1 for t in ("lex", "vec") if parsed[t])
+    if types_present >= 2:
+        diversity_score += 10
+    else:
+        deductions.append("only one type")
+
+    if len(parsed["lex"]) + len(parsed["vec"]) >= 2:
+        diversity_score += 5
+
+    lex_div = 5
+    for i, a in enumerate(parsed["lex"]):
+        for b in parsed["lex"][i+1:]:
+            if not is_diverse(a, b, 2):
+                lex_div -= 2
+                deductions.append(f"lex duplicate: {a[:20]}...")
+    diversity_score += max(0, lex_div)
+
+    vec_div = 5
+    for i, a in enumerate(parsed["vec"]):
+        for b in parsed["vec"][i+1:]:
+            if not is_diverse(a, b, 3):
+                vec_div -= 2
+                deductions.append(f"vec duplicate: {a[:20]}...")
+    diversity_score += max(0, vec_div)
+
+    echo = 5
+    for exp in parsed["lex"] + parsed["vec"]:
+        if echoes_query(exp, query):
+            echo -= 3
+            deductions.append(f"echoes query: {exp[:20]}...")
+    diversity_score += max(0, echo)
+
+    # --- HyDE (0-20, optional bonus) ---
+    hyde_score = 0
+    if parsed["hyde"]:
+        hyde_text = parsed["hyde"][0]
+        hyde_score += 5
+        hyde_len = len(hyde_text)
+        if 50 <= hyde_len <= 200:
+            hyde_score += 5
+        elif hyde_len < 50:
+            hyde_score += 2
+            deductions.append(f"hyde too short ({hyde_len})")
+        else:
+            deductions.append(f"hyde too long ({hyde_len})")
+        if "\n" not in hyde_text:
+            hyde_score += 5
+        hyde_score += max(0, 5 - word_repetition_penalty(hyde_text))
+
+    # --- Quality (0-20) ---
+    quality_score = 5  # base relevance
+    if parsed["lex"] and parsed["vec"]:
+        avg_lex = sum(len(l) for l in parsed["lex"]) / len(parsed["lex"])
+        avg_vec = sum(len(v) for v in parsed["vec"]) / len(parsed["vec"])
+        if avg_lex <= avg_vec:
+            quality_score += 5
+        else:
+            deductions.append("lex longer than vec")
+    if parsed["vec"]:
+        natural = sum(1 for v in parsed["vec"] if " " in v and len(v) > 15)
+        quality_score += 5 if natural == len(parsed["vec"]) else 2
+    if parsed["lex"]:
+        with_terms = sum(1 for l in parsed["lex"] if lex_preserves_key_terms(l, query))
+        if with_terms == len(parsed["lex"]):
+            quality_score += 5
+        elif with_terms > 0:
+            quality_score += 2
+        else:
+            deductions.append("lex missing key terms")
+
+    # --- Entity Preservation (-45 to +20) ---
+    entity_score = 0
+    entities = extract_named_entities(query)
+    if entities and parsed["lex"]:
+        with_entities = sum(1 for l in parsed["lex"] if lex_preserves_entities(l, entities))
+        if with_entities == len(parsed["lex"]):
+            entity_score += 15
+        elif with_entities > 0:
+            entity_score += 5
+        else:
+            entity_score -= 30
+            deductions.append(f"lex missing entities: {entities}")
+
+        generic_count = sum(1 for l in parsed["lex"] if lex_is_generic(l))
+        if generic_count:
+            entity_score -= generic_count * 15
+            deductions.append(f"{generic_count} generic lex phrases")
+
+        if parsed["vec"]:
+            vec_with = sum(1 for v in parsed["vec"] if lex_preserves_entities(v, entities))
+            if vec_with > 0:
+                entity_score += 5
+    elif not entities:
+        entity_score = 10
+
+    # --- Think bonus (0-20): reward NOT using thinking mode ---
+    think_bonus = 0 if used_thinking else 20
+
+    # --- Total ---
+    total = format_score + diversity_score + hyde_score + quality_score + entity_score + think_bonus
+    max_possible = 140 if parsed["hyde"] else 120
+    percentage = max(0.0, min(100.0, total / max_possible * 100))
+
+    if percentage >= 80:
+        rating = "Excellent"
+    elif percentage >= 60:
+        rating = "Good"
+    elif percentage >= 40:
+        rating = "Acceptable"
+    elif percentage >= 20:
+        rating = "Poor"
+    else:
+        rating = "Failed"
+
+    return {
+        "format": format_score,
+        "diversity": diversity_score,
+        "hyde": hyde_score,
+        "quality": quality_score,
+        "entity": max(0, entity_score),
+        "think_bonus": think_bonus,
+        "total": max(0, total),
+        "max_possible": max_possible,
+        "percentage": round(percentage, 1),
+        "rating": rating,
+        "deductions": deductions,
+        "parsed": parsed,
+        "entities_detected": list(entities) if entities else [],
+    }
+
+
+def score_expansion(query: str, expansion: str) -> float:
+    """Score expansion as a float in [0.0, 1.0] for use as RL reward."""
+    result = score_expansion_detailed(query, expansion)
+    return max(0.0, min(1.0, result["total"] / result["max_possible"]))
+
+
+def extract_query_from_prompt(prompt: str) -> str:
+    """Extract the query string from a chat-formatted prompt."""
+    if "Expand this search query:" in prompt:
+        query = prompt.split("Expand this search query:")[-1].strip()
+        if "<|im_end|>" in query:
+            query = query.split("<|im_end|>")[0].strip()
+        return query
+    return prompt.strip()
+
+
+# =============================================================================
+# TRL-compatible reward class
+# =============================================================================
+
+class QMDRewardFunction:
+    """Reward function compatible with TRL's GRPOTrainer."""
+    __name__ = "qmd_scoring_reward"
+
+    def __call__(self, completions: list[str], prompts: list[str] = None, **kwargs) -> list[float]:
+        rewards = []
+        for i, completion in enumerate(completions):
+            query = ""
+            if prompts and i < len(prompts):
+                query = extract_query_from_prompt(prompts[i])
+            rewards.append(score_expansion(query, completion))
+        return rewards
+
+
+# =============================================================================
+# CLI: run standalone to test the reward function
+# =============================================================================
+
+if __name__ == "__main__":
+    print("QMD Reward Function Self-Test")
+    print("=" * 60)
+
+    tests = [
+        ("auth", "lex: auth setup\nlex: authentication config\nvec: how to configure authentication\nhyde: Configure auth by setting AUTH_SECRET."),
+        ("auth", "auth is important for security"),
+        ("who is TDS motorsports", "lex: TDS motorsports history\nlex: TDS motorsports founders\nvec: information about TDS motorsports company"),
+        ("who is TDS motorsports", "lex: find information about\nlex: company details\nvec: who is this company"),
+        ("how to use React hooks", "lex: React hooks tutorial\nlex: useEffect useState\nvec: how to use React hooks in functional components"),
+        ("auth", "<think>Let me think...</think>\nlex: auth"),
+        ("auth", "lex: auth\nThis is some explanation\nvec: more"),
+    ]
+
+    for query, expansion in tests:
+        score = score_expansion(query, expansion)
+        detail = score_expansion_detailed(query, expansion)
+        print(f"\n  Query: '{query}'")
+        print(f"  Score: {score:.2f} ({detail['rating']})")
+        if detail["deductions"]:
+            print(f"  Issues: {', '.join(detail['deductions'][:3])}")

+ 273 - 0
finetune/train.py

@@ -0,0 +1,273 @@
+# /// script
+# requires-python = ">=3.10"
+# dependencies = [
+#     "trl>=0.12.0",
+#     "peft>=0.7.0",
+#     "transformers>=4.45.0",
+#     "accelerate>=0.24.0",
+#     "huggingface_hub>=0.20.0",
+#     "trackio",
+#     "datasets",
+#     "bitsandbytes",
+#     "pyyaml",
+# ]
+# ///
+"""
+Unified training script for QMD query expansion models.
+
+Supports two stages:
+  sft  - Supervised fine-tuning on labeled examples
+  grpo - Group Relative Policy Optimization (RL) on top of merged SFT weights
+
+Usage:
+    uv run train.py sft  --config configs/sft.yaml
+    uv run train.py grpo --config configs/grpo.yaml
+    uv run train.py grpo --config configs/grpo.yaml --dry-run
+"""
+
+import argparse
+import os
+import sys
+
+import yaml
+
+
+def cmd_sft(args):
+    """Run supervised fine-tuning."""
+    import trackio
+    from datasets import load_dataset
+    from peft import LoraConfig
+    from trl import SFTTrainer, SFTConfig
+
+    with open(args.config) as f:
+        cfg = yaml.safe_load(f)
+
+    if args.dry_run:
+        print("SFT Training Configuration:")
+        print(yaml.dump(cfg, default_flow_style=False))
+        return
+
+    print(f"Loading dataset: {cfg['dataset']['name']}...")
+    dataset = load_dataset(cfg["dataset"]["name"], split=cfg["dataset"]["split"])
+    print(f"Dataset loaded: {len(dataset)} examples")
+
+    split = dataset.train_test_split(test_size=cfg["dataset"]["eval_split"], seed=42)
+    train_dataset = split["train"]
+    eval_dataset = split["test"]
+    print(f"  Train: {len(train_dataset)}, Eval: {len(eval_dataset)}")
+
+    config = SFTConfig(
+        output_dir=cfg["model"]["output"].split("/")[-1],
+        push_to_hub=True,
+        hub_model_id=cfg["model"]["output"],
+        hub_strategy="every_save",
+
+        num_train_epochs=cfg["training"]["epochs"],
+        per_device_train_batch_size=cfg["training"]["batch_size"],
+        gradient_accumulation_steps=cfg["training"]["gradient_accumulation_steps"],
+        learning_rate=cfg["training"]["learning_rate"],
+        max_length=cfg["training"]["max_length"],
+
+        logging_steps=10,
+        save_strategy="steps",
+        save_steps=200,
+        save_total_limit=2,
+        eval_strategy="steps",
+        eval_steps=200,
+
+        warmup_ratio=cfg["training"]["warmup_ratio"],
+        lr_scheduler_type=cfg["training"]["lr_scheduler"],
+
+        report_to="trackio",
+        project=cfg["tracking"]["project"],
+        run_name=cfg["tracking"]["run_name"],
+    )
+
+    peft_config = LoraConfig(
+        r=cfg["lora"]["rank"],
+        lora_alpha=cfg["lora"]["alpha"],
+        lora_dropout=cfg["lora"]["dropout"],
+        bias="none",
+        task_type="CAUSAL_LM",
+        target_modules=cfg["lora"]["target_modules"],
+    )
+
+    print("Initializing SFT trainer...")
+    trainer = SFTTrainer(
+        model=cfg["model"]["base"],
+        train_dataset=train_dataset,
+        eval_dataset=eval_dataset,
+        args=config,
+        peft_config=peft_config,
+    )
+
+    print("Starting SFT training...")
+    trainer.train()
+
+    print("Pushing to Hub...")
+    trainer.push_to_hub()
+    trackio.finish()
+    print(f"Done! Model: https://huggingface.co/{cfg['model']['output']}")
+
+
+def cmd_grpo(args):
+    """Run GRPO reinforcement learning on top of merged SFT weights."""
+    import torch
+    import trackio
+    from datasets import load_dataset
+    from huggingface_hub import login
+    from peft import LoraConfig, PeftModel, get_peft_model
+    from transformers import AutoModelForCausalLM, AutoTokenizer
+    from trl import GRPOTrainer, GRPOConfig
+
+    # Import reward from the shared module
+    sys.path.insert(0, os.path.dirname(__file__))
+    from reward import QMDRewardFunction, score_expansion, extract_named_entities
+
+    with open(args.config) as f:
+        cfg = yaml.safe_load(f)
+
+    if args.dry_run:
+        print("GRPO Training Configuration:")
+        print(yaml.dump(cfg, default_flow_style=False))
+        print("\nTesting reward function...")
+
+        tests = [
+            ("auth", "lex: auth setup\nlex: authentication config\nvec: how to configure authentication\nhyde: Configure auth by setting AUTH_SECRET."),
+            ("auth", "auth is important for security"),
+            ("who is TDS motorsports", "lex: TDS motorsports history\nlex: TDS motorsports founders\nvec: information about TDS motorsports company"),
+            ("who is TDS motorsports", "lex: find information about\nlex: company details\nvec: who is this company"),
+        ]
+        for query, expansion in tests:
+            score = score_expansion(query, expansion)
+            print(f"  '{query}' -> {score:.2f}")
+        return
+
+    # Login
+    hf_token = os.environ.get("HF_TOKEN")
+    if hf_token:
+        print("Logging in to HuggingFace Hub...")
+        login(token=hf_token)
+
+    # Load tokenizer
+    base_model_name = cfg["model"]["base"]
+    print(f"Loading tokenizer from {base_model_name}...")
+    tokenizer = AutoTokenizer.from_pretrained(base_model_name)
+    if tokenizer.pad_token is None:
+        tokenizer.pad_token = tokenizer.eos_token
+
+    # Load and format dataset
+    print("Loading dataset...")
+    dataset = load_dataset(cfg["dataset"]["name"], split="train")
+
+    def extract_prompt(example):
+        content = example[cfg["dataset"]["prompt_field"]][0]["content"]
+        messages = [{"role": "user", "content": content}]
+        formatted = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
+        return {"prompt": formatted}
+
+    dataset = dataset.map(extract_prompt, remove_columns=dataset.column_names)
+    max_samples = cfg["dataset"].get("max_samples", len(dataset))
+    dataset = dataset.shuffle(seed=42).select(range(min(max_samples, len(dataset))))
+    print(f"Using {len(dataset)} prompts for GRPO")
+
+    # Load base model, merge SFT adapter
+    sft_model_name = cfg["model"]["sft"]
+    print(f"Loading SFT model from {sft_model_name}...")
+    base_model = AutoModelForCausalLM.from_pretrained(
+        base_model_name,
+        torch_dtype=torch.bfloat16,
+        device_map="auto",
+    )
+    model = PeftModel.from_pretrained(base_model, sft_model_name)
+    model = model.merge_and_unload()
+    print("SFT adapter merged.")
+
+    # Add fresh LoRA for GRPO
+    grpo_lora_config = LoraConfig(
+        r=cfg["lora"]["rank"],
+        lora_alpha=cfg["lora"]["alpha"],
+        lora_dropout=cfg["lora"]["dropout"],
+        bias="none",
+        task_type="CAUSAL_LM",
+        target_modules=cfg["lora"]["target_modules"],
+    )
+    model = get_peft_model(model, grpo_lora_config)
+    model.print_trainable_parameters()
+
+    # Build GRPO config, including beta and temperature
+    grpo_cfg = cfg.get("grpo", {})
+    config = GRPOConfig(
+        output_dir=cfg["model"]["output"].split("/")[-1],
+        push_to_hub=True,
+        hub_model_id=cfg["model"]["output"],
+
+        num_generations=grpo_cfg.get("num_generations", 4),
+        max_completion_length=grpo_cfg.get("max_completion_length", 200),
+        beta=grpo_cfg.get("beta", 0.04),
+
+        num_train_epochs=cfg["training"]["epochs"],
+        per_device_train_batch_size=cfg["training"]["batch_size"],
+        gradient_accumulation_steps=cfg["training"]["gradient_accumulation_steps"],
+        learning_rate=cfg["training"]["learning_rate"],
+        max_grad_norm=cfg["training"]["max_grad_norm"],
+        max_steps=cfg["training"].get("max_steps", -1),
+
+        logging_steps=10,
+        save_strategy="epoch",
+
+        report_to="trackio",
+        project=cfg["tracking"]["project"],
+        run_name=cfg["tracking"]["run_name"],
+    )
+
+    # Train
+    print("Initializing GRPO trainer...")
+    trainer = GRPOTrainer(
+        model=model,
+        processing_class=tokenizer,
+        args=config,
+        train_dataset=dataset,
+        reward_funcs=[QMDRewardFunction()],
+    )
+
+    print("Starting GRPO training...")
+    trainer.train()
+
+    print("Pushing to Hub...")
+    trainer.push_to_hub()
+    trackio.finish()
+    print(f"Done! Model: https://huggingface.co/{cfg['model']['output']}")
+
+
+def main():
+    parser = argparse.ArgumentParser(
+        description="QMD Query Expansion Training",
+        formatter_class=argparse.RawDescriptionHelpFormatter,
+        epilog="""
+Examples:
+  uv run train.py sft  --config configs/sft.yaml
+  uv run train.py grpo --config configs/grpo.yaml
+  uv run train.py grpo --config configs/grpo.yaml --dry-run
+        """,
+    )
+    sub = parser.add_subparsers(dest="stage", required=True)
+
+    sft_parser = sub.add_parser("sft", help="Supervised fine-tuning")
+    sft_parser.add_argument("--config", required=True, help="Path to SFT config YAML")
+    sft_parser.add_argument("--dry-run", action="store_true", help="Print config and exit")
+
+    grpo_parser = sub.add_parser("grpo", help="GRPO reinforcement learning")
+    grpo_parser.add_argument("--config", required=True, help="Path to GRPO config YAML")
+    grpo_parser.add_argument("--dry-run", action="store_true", help="Print config, test reward, and exit")
+
+    args = parser.parse_args()
+
+    if args.stage == "sft":
+        cmd_sft(args)
+    elif args.stage == "grpo":
+        cmd_grpo(args)
+
+
+if __name__ == "__main__":
+    main()

+ 1 - 1
src/llm.ts

@@ -150,7 +150,7 @@ export type RerankDocument = {
 const DEFAULT_EMBED_MODEL = "hf:ggml-org/embeddinggemma-300M-GGUF/embeddinggemma-300M-Q8_0.gguf";
 const DEFAULT_RERANK_MODEL = "hf:ggml-org/Qwen3-Reranker-0.6B-Q8_0-GGUF/qwen3-reranker-0.6b-q8_0.gguf";
 // const DEFAULT_GENERATE_MODEL = "hf:ggml-org/Qwen3-0.6B-GGUF/Qwen3-0.6B-Q8_0.gguf";
-const DEFAULT_GENERATE_MODEL = "hf:ggml-org/Qwen3-1.7B-GGUF/Qwen3-1.7B-Q8_0.gguf";
+const DEFAULT_GENERATE_MODEL = "hf:tobil/qmd-query-expansion-1.7B-gguf/qmd-query-expansion-1.7B-q4_k_m.gguf";
 
 // Local model cache directory
 const MODEL_CACHE_DIR = join(homedir(), ".cache", "qmd", "models");