学术研究图谱_academic-research-mapper

张

张建站

2026/6/18 16:50:35

10分钟阅读

以下为本文档的中文说明该技能用于绘制任何技术或学术主题的研究领域图谱。它通过搜索arXiv、Semantic Scholar等学术数据库系统性地收集和分析相关文献识别研究趋势、关键论文、主要研究者和机构合作关系。该技能自动构建主题的知识结构图谱展示研究方向的分支脉络和演进路径。适用于研究生、科研人员和学术新手需要快速了解一个研究领域的全貌。通过自动化文献检索和分析大大缩短了文献调研的时间周期帮助研究人员在论文撰写、课题立项或研究方向选择时获得全面的文献基础支持。该技能提供了详细的操作指南和最佳实践帮助用户快速上手并深入掌握。通过系统的功能模块划分和丰富的应用场景说明用户可以在实际项目中有效运用该技能提升工作效率。该技能注重实用性和可操作性涵盖从基础配置到高级功能的完整知识体系满足不同层次用户的学习需求。持续更新和优化的内容确保用户始终能够接触到最新的技术发展和行业实践。通过此技能的学习和应用用户可以减少摸索时间快速获得可用的解决方案将精力集中在核心业务逻辑和创新工作上从而在技术快速迭代的环境中保持竞争力。该技能的模块化设计使其易于扩展和定制用户可以根据自身需求灵活调整应用方式实现最大化的价值产出。该技能整合了常见的设计模式和最佳实践提供了清晰的学习路径和参考资料帮助用户在短时间内建立起完整的知识框架并有能力在实际项目中灵活运用所学内容解决问题。Research Landscape Mapper — Understand a Field Before You Build or WriteYou have access to the TinyFish CLI (tinyfish), a tool that runs browser automations from the terminal using natural language goals. This skill uses it to search arXiv, Semantic Scholar, and Google Scholar in parallel, then synthesizes results into a structured landscape report with identified gaps.Pre-flight Check (REQUIRED)Before making any TinyFish call, always run BOTH checks:1. CLI installed?bash/zsh:whichtinyfishtinyfish--version||echoTINYFISH_CLI_NOT_INSTALLEDPowerShell:Get-Commandtinyfish;tinyfish--versionIf not installed, stop and tell the user:Install the TinyFish CLI:npm install -g tiny-fish/cli2. Authenticated?tinyfish auth statusIf not authenticated, stop and tell the user:You need a TinyFish API key. Get one at: https://agent.tinyfish.ai/api-keysThen authenticate:Option 1 — CLI login (interactive):tinyfish auth loginOption 2 — bash/zsh (Mac/Linux, current session):exportTINYFISH_API_KEYyour-api-key-hereOption 3 — bash/zsh (persist across sessions, add to ~/.bashrc or ~/.zshrc):echoexport TINYFISH_API_KEYyour-api-key-here~/.zshrcsource~/.zshrcOption 4 — PowerShell (current session only):$env:TINYFISH_API_KEYyour-api-key-hereOption 5 — Claude Code settings:Add to~/.claude/settings.local.json:{env:{TINYFISH_API_KEY:your-api-key-here}}Do NOT proceed until both checks pass.What This Skill DoesGiven a research topic (e.g.“retrieval-augmented generation”or“protein structure prediction”), this skill:SearchesarXivfor preprints sorted by most recent — capturing what is being worked on right nowSearchesSemantic Scholarfor papers ranked by relevance with citation counts — identifying what the field considers importantSearchesGoogle Scholarfor broad coverage including published venues not yet on arXivIt then deduplicates across all three sources by title similarity, clusters papers into subtopics, and synthesizes findings into a structured landscape report: what is well-studied, what is emerging, and where the gaps are.Core Commandtinyfish agent run--urlurlgoalFlagsFlagPurpose--url urlTarget website URL for the agent to navigate--syncWait for the full result before returning (required when you need output before next step)--asyncSubmit and return a run ID immediately — use when firing parallel agents--prettyHuman-readable formatted output for debuggingKeyword StrategyThe quality of results depends entirely on your search terms. Before running anything, derive 2–3 keyword variants from the topic. Each source has different vocabulary norms — academic terms work best on Semantic Scholar, shorter compressed terms work best on arXiv.TopicPrimary keywordsVariant AVariant BRetrieval-augmented generationretrieval augmented generationRAG language modeldense retrieval QAProtein structure predictionprotein structure predictionAlphaFold protein foldingab initio structure biologyNeural architecture searchneural architecture searchNAS automated machine learninghyperparameter optimization deep learningFederated learning privacyfederated learningfederated learning differential privacydistributed training privacyUse the primary keywords for the first parallel pass. If any source returns fewer than 5 results, run a second pass with the variant keywords on that source only.Step-by-Step WorkflowStep 1 — Derive keywords and build URLsBefore running any agents, construct all three search URLs. Do this in your head or in a scratch note — do not make TinyFish calls yet.arXiv URL pattern:https://arxiv.org/search/?querykeywordssearchtypeallorder-announced_date_firstSemantic Scholar URL pattern:https://www.semanticscholar.org/search?qkeywordssortRelevanceGoogle Scholar URL pattern:https://scholar.google.com/scholar?qkeywordsas_sdt0%2C5hlenReplacekeywordswith URL-encoded primary keywords (spaces become).Step 2 — Search all three sources in parallelFire all three agents simultaneously. Do NOT wait for one to finish before starting the next.arXiv — sorted by most recent:tinyfish agent run--sync\\--urlhttps://arxiv.org/search/?queryretrievalaugmentedgenerationsearchtypeallorder-announced_date_first\\Extract the top 15 search results as JSON: [{\\title\\: str,\\authors\\: [str],\\year\\: str,\\abstract_snippet\\: str (first 150 chars of abstract),\\arxiv_id\\: str,\\url\\: str}]. If a result has no year visible, use the submission date year.Semantic Scholar — sorted by relevance with citation counts:tinyfish agent run--sync\\--urlhttps://www.semanticscholar.org/search?qretrievalaugmentedgenerationsortRelevance\\Extract the top 15 search results as JSON: [{\\title\\: str,\\authors\\: [str],\\year\\: str,\\citation_count\\: str,\\venue\\: str,\\abstract_snippet\\: str (first 150 chars),\\url\\: str}]. Scroll down to load more results if fewer than 10 are visible.Google Scholar — broad coverage:tinyfish agent run--sync\\--urlhttps://scholar.google.com/scholar?qretrievalaugmentedgenerationas_sdt0%2C5hlen\\Extract the top 15 search results as JSON: [{\\title\\: str,\\authors\\: [str],\\year\\: str,\\citation_count\\: str,\\venue\\: str,\\snippet\\: str,\\url\\: str}]. Citation count appears after Cited by — extract that number.Parallel ExecutionAll three source searches are fully independent. Always fire them simultaneously.Good — parallel calls (fire and wait):tinyfish agent run--sync\\--urlhttps://arxiv.org/search/?queryretrievalaugmentedgenerationsearchtypeallorder-announced_date_first\\Extract the top 15 results as JSON: [{\\title\\: str,\\authors\\: [str],\\year\\: str,\\abstract_snippet\\: str,\\arxiv_id\\: str,\\url\\: str}]/tmp/arxiv_results.jsontinyfish agent run--sync\\--urlhttps://www.semanticscholar.org/search?qretrievalaugmentedgenerationsortRelevance\\Extract the top 15 results as JSON: [{\\title\\: str,\\authors\\: [str],\\year\\: str,\\citation_count\\: str,\\venue\\: str,\\abstract_snippet\\: str,\\url\\: str}]/tmp/s2_results.jsontinyfish agent run--sync\\--urlhttps://scholar.google.com/scholar?qretrievalaugmentedgenerationas_sdt0%2C5hlen\\Extract the top 15 results as JSON: [{\\title\\: str,\\authors\\: [str],\\year\\: str,\\citation_count\\: str,\\venue\\: str,\\snippet\\: str,\\url\\: str}]/tmp/scholar_results.jsonwaitechoAll three sources complete.Bad — sequential calls:# Do NOT do this — triples the wait time for no benefittinyfish agent run--urlhttps://arxiv.org/...search arxiv, then also search semantic scholar, then also search google scholarEach source is always its own separate call. Never combine them into one goal.Step 3 — Handle sparse results (if needed)After the parallel run completes, check each result set. If any source returned fewer than 5 papers, run a second pass on that source with variant keywords:# Example: arXiv returned only 3 results for primary keywordstinyfish agent run--sync\\--urlhttps://arxiv.org/search/?queryRAGlanguagemodelsearchtypeallorder-announced_date_first\\Extract the top 15 results as JSON: [{\\title\\: str,\\authors\\: [str],\\year\\: str,\\abstract_snippet\\: str,\\arxiv_id\\: str,\\url\\: str}]Do not run second passes if the primary pass was already rich — this wastes steps.Step 4 — Synthesize into a Landscape ReportOnce all three sources have returned results, synthesize findings into this structure. Use only data that TinyFish actually returned — do not hallucinate paper titles, citation counts, or author names.## Research Landscape: topic ### Volume Coverage - arXiv: N papers found, most recent: year - Semantic Scholar: N papers found, highest citations: N (paper title) - Google Scholar: N papers found - Unique papers after deduplication: N ### Key Papers (sorted by citation count) 1. Title — Authors, Year, Venue if known — citation_count citations one-sentence summary from abstract snippet 2. ... (list top 8–10 unique papers) ### Active Subtopics Cluster the papers by what they are actually about. Label each cluster with a short name. - **Subtopic A**: N papers — 1-sentence description of what this cluster covers - **Subtopic B**: N papers — ... - **Subtopic C**: N papers — ... ### Key Authors Groups - Author name — N papers in results, affiliated with institution if visible - ... (list authors appearing 2 times across the results) ### Recency Signal - Papers from last 12 months: N - Papers from last 3 years: N - Oldest paper in results: year - Trend: accelerating / stable / declining (infer from year distribution) ### Gaps Open Directions Based on what the papers cover and what they do not: - **Gap 1**: specific thing that is missing or underexplored - **Gap 2**: ... - **Gap 3**: ... ### Landscape Verdict 2–3 sentences: is this field crowded or open, mature or nascent, dominated by a few groups or distributed, and what is the single most underexplored angle?Deduplication RulesPapers appear across multiple sources. Before synthesizing, deduplicate using these rules in order:Exact title match(case-insensitive) → keep one, prefer the Semantic Scholar entry (has citation count)Title similarity 85%(same words, different punctuation) → treat as the same paperSame arXiv ID→ always the same paper regardless of title variationIf unsure, keep both and note the possible duplicate in the reportSubtopic Clustering GuideGroup papers by reading their abstract snippets, not just their titles. Common cluster patterns:If papers discuss…Cluster labelBenchmarks, evaluation datasets, metrics“Evaluation benchmarks”New model architectures or training methods“Model architecture”Application to a specific domain (medical, legal, code)“Domain adaptation: ”Efficiency, speed, compression, cost“Efficiency scaling”Safety, alignment, robustness, hallucination“Safety reliability”Surveys, meta-analyses, overviews“Surveys overviews”A paper can belong to at most two clusters. Name the clusters based on what you actually see, not these defaults if the topic warrants different ones.Managing Runs# List recent runs (useful if a run takes longer than expected)tinyfish agent run list# Get the full output of a specific run by IDtinyfish agent run getrun_id# Cancel a run that is taking too longtinyfish agent run cancelrun_idOutput FormatThe CLI streamsdata: {...}SSE lines by default. The final usable result is the event wheretype COMPLETEandstatus COMPLETED— the extracted data is in theresultJsonfield. Read the raw output directly; no script-side parsing is required.When saving to files withredirection as shown in the parallel example, the full SSE stream is saved. Extract the JSON by looking for the last line containingCOMPLETEDand parsing theresultJsonvalue from it.Example: Full Run for “Mixture of Experts”# Step 1 — fire all three in paralleltinyfish agent run--sync\\--urlhttps://arxiv.org/search/?querymixtureofexpertstransformersearchtypeallorder-announced_date_first \\Extract top 15 results as JSON: [{\\title\\: str,\\authors\\: [str],\\year\\: str,\\abstract_snippet\\: str,\\arxiv_id\\: str,\\url\\: str}]\\/tmp/moe_arxiv.jsontinyfish agent run--sync\\--urlhttps://www.semanticscholar.org/search?qmixtureofexpertstransformersortRelevance\\Extract top 15 results as JSON: [{\\title\\: str,\\authors\\: [str],\\year\\: str,\\citation_count\\: str,\\venue\\: str,\\abstract_snippet\\: str,\\url\\: str}]\\/tmp/moe_s2.jsontinyfish agent run--sync\\--urlhttps://scholar.google.com/scholar?qmixtureofexpertsLLMas_sdt0%2C5hlen\\Extract top 15 results as JSON: [{\\title\\: str,\\authors\\: [str],\\year\\: str,\\citation_count\\: str,\\venue\\: str,\\snippet\\: str,\\url\\: str}]\\/tmp/moe_scholar.jsonwait# Step 2 — synthesize# Read /tmp/moe_arxiv.json, /tmp/moe_s2.json, /tmp/moe_scholar.json# Deduplicate → cluster → produce landscape report

嵌入式GUI开发中内存设备的原理、应用与性能优化

1. 项目概述：为什么嵌入式GUI需要内存设备？在嵌入式系统里做图形界面开发，最让人头疼的问题之一就是屏幕闪烁。想象一下，你要在屏幕上画一个带背景的按钮，先画一个圆角矩形，再填充颜色，最后在上…...

2026/6/18 16:48:08 阅读更多 →

ATM网络QoS硬件实现：MC92520芯片CLP流量控制配置详解

1. 项目概述：ATM网络流量控制与QoS的硬件实现在构建高可靠、可预测的网络时，流量控制和服务质量（QoS）从来都不是一个纯软件层面的概念。尤其在ATM（异步传输模式）这种面向连接、以固定长度信元（C…...

2026/6/18 16:47:40 阅读更多 →

Qt 中使用 QtConcurrent::run + QFutureWatcher 实现异步处理

背景在 Qt/QML 桌面应用中，C 后端经常需要执行耗时操作——音频处理、文件转换、数据分析等。如果这些操作直接在主线程（UI 线程）同步执行，界面会冻结、无法响应，Windows 甚至弹出"程序未响应"的提示。本文…...

2026/6/18 16:45:48 阅读更多 →

魔兽争霸3性能大改造：告别卡顿，3步实现丝滑对战体验

魔兽争霸3性能大改造：告别卡顿，3步实现丝滑对战体验【免费下载链接】WarcraftHelper Warcraft III Helper , support 1.20e, 1.24e, 1.26a, 1.27a, 1.27b 项目地址: https://gitcode.com/gh_mirrors/wa/WarcraftHelper 你是否还在为魔兽争霸3的卡…...

2026/6/18 7:52:34 阅读更多 →

MC68SZ328 GPIO深度解析：从寄存器配置到中断与低功耗实战

1. 项目概述与GPIO核心价值在嵌入式开发领域，尤其是面对像MC68SZ328这类资源受限但功能丰富的微控制器时，如何高效、精准地管理其通用输入输出（GPIO）端口，往往是项目成败的关键。GPIO不仅仅是简单的“开”和“关”&…...

2026/6/17 21:45:47 阅读更多 →

人生闭环能力的庖丁解牛

它的本质是：**闭环不是“做完”，而是 “有始有终且有回响” (Start-Finish-Echo)。核心矛盾：大多数人只有开环思维 (Open-Loop Thinking)：发起动作 -> 期待结果。但现实世界充满噪声和延迟，如果没有主动的确认 (…...

2026/6/18 12:39:56 阅读更多 →

SketchUp STL插件终极指南：从3D设计到打印的完整转换方案

SketchUp STL插件终极指南：从3D设计到打印的完整转换方案【免费下载链接】sketchup-stl A SketchUp Ruby Extension that adds STL (STereoLithography) file format import and export. 项目地址: https://gitcode.com/gh_mirrors/sk/sketchup-stl 想要将你…...

2026/6/18 12:39:54 阅读更多 →