Subtopic Deep Dive

Expertise Identification from Social Media
Research Guide

What is Expertise Identification from Social Media?

Expertise Identification from Social Media develops computational techniques to detect domain experts using signals from social media platforms such as posts, interactions, and network structures.

Researchers apply topic modeling, credibility scoring, and network analysis to infer expertise from microblogs, blogs, and Q&A sites. Over 20 papers since 2011 explore these methods, with Miles Efron (2011) pioneering microblog retrieval (137 citations) and Wouter Weerkamp and Maarten de Rijke (2012) introducing credibility-inspired ranking (47 citations). Recent work extends to CQA evolution and user comments (Pal et al., 2021; Momeni et al., 2021).

Curated Papers

Key Challenges

Why It Matters

Expertise identification from social media enables real-time expert discovery for hiring, knowledge sharing, and crisis response by leveraging unstructured online data. Jacqueline C. Pike et al. (2013) show social networking sites improve candidate assessment in hiring despite quality tensions. Aditya Pal et al. (2021) analyze expert evolution in CQA communities, aiding platform moderation and recommendation systems. Jin-Woo Jeong et al. (2021) demonstrate crowd-powered search engines using social queries for better information retrieval.

Key Research Challenges

Noisy Social Signals

Social media data contains spam, sarcasm, and off-topic posts that obscure expertise signals. Miles Efron (2011) highlights retrieval challenges in microblogs due to brevity and noise. Wouter Weerkamp and Maarten de Rijke (2012) address credibility assessment amid low-quality content.

Implicit Expertise Inference

Expertise often emerges from interactions rather than explicit claims, requiring advanced modeling. Elaheh Momeni et al. (2021) note varying user expertise levels in comments, complicating annotation quality prediction. Aditya Pal et al. (2021) model temporal dynamics of expert emergence in CQA.

Scalable Network Analysis

Analyzing large-scale social graphs demands efficient algorithms for expert ranking. Pradeep Kumar Roy et al. (2022) review ML/DL approaches for CQA issue analysis, citing scalability limits. Fabio Calefato et al. (2017) study question quality on Stack Overflow, revealing interaction-based challenges.

Essential Papers

Information search and retrieval in microblogs

Miles Efron · 2011 · Journal of the American Society for Information Science and Technology · 137 citations

Modern information retrieval (IR) has come to terms with numerous new media in efforts to help people find information in increasingly diverse settings. Among these new media are so-called microblo...

How to ask for technical help? Evidence-based guidelines for writing questions on Stack Overflow

Fabio Calefato, Filippo Lanubile, Nicole Novielli · 2017 · Information and Software Technology · 126 citations

Evolution of Experts in Question Answering Communities

Aditya Pal, Shuo Chang, Joseph A. Konstan · 2021 · Proceedings of the International AAAI Conference on Web and Social Media · 114 citations

Community Question Answering (CQA) services thrive as a result of a small number of highly active users, typically called experts, who provide a large number of high quality useful answers. Underst...

Analysis of community question‐answering issues via machine learning and deep learning: State‐of‐the‐art review

Pradeep Kumar Roy, Sunil Saumya, Jyoti Prakash Singh et al. · 2022 · CAAI Transactions on Intelligence Technology · 92 citations

Abstract Over the last couple of decades, community question‐answering sites (CQAs) have been a topic of much academic interest. Scholars have often leveraged traditional machine learning (ML) and ...

Early career researchers: Scholarly behaviour and the prospect of change

David Nicholas, Anthony Watkinson, Chérifa Boukacem‐Zeghmouri et al. · 2017 · Learned Publishing · 84 citations

International audience

Knowledge Based Recommender System for Academia Using Machine Learning: A Case Study on Higher Education Landscape of Pakistan

Huma Samin, Tayyaba Azim · 2019 · IEEE Access · 73 citations

Allocation of courses and research students based on faculty's subject specialization and area of interest has always remained a challenging task for university administration due to the presence o...

A Crowd-Powered Socially Embedded Search Engine

Jin-Woo Jeong, Meredith Ringel Morris, Jaime Teevan et al. · 2021 · Proceedings of the International AAAI Conference on Web and Social Media · 54 citations

People have always asked questions of their friends, but now, with social media, they can broadcast their questions to their entire social network. In this paper we study the replies received via T...

Reading Guide

Foundational Papers

Start with Miles Efron (2011) for microblog retrieval basics (137 citations), then Wouter Weerkamp and Maarten de Rijke (2012) for credibility ranking, as they establish core signals for social expertise.

Recent Advances

Study Aditya Pal et al. (2021) on CQA expert evolution and Elaheh Momeni et al. (2021) on comment annotations to grasp dynamic and implicit inference advances.

Core Methods

Topic modeling and IR on microblogs (Efron, 2011); credibility indicators (Weerkamp and de Rijke, 2012); ML/DL for CQA (Roy et al., 2022); temporal network analysis (Pal et al., 2021).

How PapersFlow Helps You Research Expertise Identification from Social Media

Discover & Search

PapersFlow's Research Agent uses searchPapers and exaSearch to find core literature like 'Information search and retrieval in microblogs' by Miles Efron (2011), then citationGraph reveals downstream works on credibility (Weerkamp and de Rijke, 2012) and findSimilarPapers uncovers related CQA expertise papers (Pal et al., 2021).

Analyze & Verify

Analysis Agent employs readPaperContent to extract methods from Efron (2011) microblog retrieval, verifyResponse with CoVe checks claims against abstracts, and runPythonAnalysis runs network metrics on interaction data from Pal et al. (2021) using pandas for expert evolution validation; GRADE scores evidence strength for credibility models.

Synthesize & Write

Synthesis Agent detects gaps in social media expertise inference, such as scalable implicit signal modeling, while Writing Agent uses latexEditText, latexSyncCitations for Efron (2011) and Pal et al. (2021), and latexCompile generates review sections; exportMermaid visualizes expert network flows from Jeong et al. (2021).

Use Cases

"Reproduce expert evolution model from Pal et al. 2021 using Python"

Research Agent → searchPapers('expert evolution CQA') → Analysis Agent → readPaperContent(Pal2021) → runPythonAnalysis(pandas temporal analysis on CQA interactions) → researcher gets matplotlib plots of expertise trajectories.

"Write LaTeX review of credibility ranking in social expert search"

Research Agent → citationGraph(Weerkamp2012) → Synthesis Agent → gap detection → Writing Agent → latexEditText(draft) → latexSyncCitations(Efron2011,Weerkamp2012) → latexCompile → researcher gets compiled PDF with bibliography.

"Find GitHub code for microblog expertise retrieval like Efron 2011"

Research Agent → searchPapers(Efron2011) → Code Discovery → paperExtractUrls → paperFindGithubRepo → githubRepoInspect → researcher gets inspected repos with topic modeling scripts.

Automated Workflows

Deep Research workflow conducts systematic review: searchPapers('expertise social media') → 50+ papers → DeepScan(7-step: readPaperContent, verifyResponse, runPythonAnalysis on networks) → structured report on methods from Efron (2011) to Roy et al. (2022). Theorizer generates hypotheses on implicit expertise from Pal et al. (2021) interactions, using Chain-of-Verification for validation.

Try Doxa for Expertise Identification from Social Media Research

Frequently Asked Questions

What is Expertise Identification from Social Media?

It uses posts, interactions, and networks from platforms like Twitter or Stack Overflow to computationally detect domain experts (Efron, 2011; Weerkamp and de Rijke, 2012).

What methods are used?

Topic modeling for microblogs (Efron, 2011), credibility ranking for blogs (Weerkamp and de Rijke, 2012), and temporal modeling for CQA experts (Pal et al., 2021); ML/DL surveyed in Roy et al. (2022).

What are key papers?

Foundational: Efron (2011, 137 citations), Weerkamp and de Rijke (2012, 47 citations); Recent: Pal et al. (2021, 114 citations), Calefato et al. (2017, 126 citations).

What open problems exist?

Scalable inference of implicit expertise from noisy data (Momeni et al., 2021); handling expert dynamics in real-time social graphs (Pal et al., 2021); cross-domain validation (Roy et al., 2022).

Research Expert finding and Q&A systems with AI

PapersFlow provides specialized AI tools for Computer Science researchers. Here are the most relevant for this topic:

AI Literature Review

Automate paper discovery and synthesis across 474M+ papers

Code & Data Discovery

Find datasets, code repositories, and computational tools

Deep Research Reports

Multi-source evidence synthesis with counter-evidence

AI Academic Writing

Write research papers with AI assistance and LaTeX support

See how researchers in Computer Science & AI use PapersFlow

Field-specific workflows, example queries, and use cases.

Computer Science & AI Guide

Start Researching Expertise Identification from Social Media with AI

Search 474M+ papers, run AI-powered literature reviews, and write with integrated citations — all in one workspace.

Try PapersFlow Free See AI Literature Review

See how PapersFlow works for Computer Science researchers

Part of the Expert finding and Q&A systems Research Guide