Interview Report User Guide

1. Recommended Reading Order

Review the report in this order:

Integrity Score and Potential Violations
Start with the overall integrity score and the number of potential risk events.
Summary
Read the high-level explanation of the candidate’s performance and risk profile.
Overall Risk Analysis
Suspicious Behavior
Review specific behavioral events, timestamps, severity, confidence.
Interview Insights, Skill Reason, and AI Usage
Use these sections for final skill assessment and hiring discussion.

Under Advanced Report Version:

Question & Answer Analysis
Examine each answer’s correctness, depth, clarity, transferability, and supporting evidence.
Detailed Consistency Analysis
Review topic-level consistency, hard-to-fake signals, and integrity risk by topic group.

2. How to Read the Top-Level Metrics

Integrity Score

The Integrity Score is a 0–10 score related to interview integrity risk. A higher score means lower observed risk.

Potential Violations

Potential Violations refers to the number of detected risk events.

This does not mean confirmed cheating. A potential violation may simply indicate behavior such as screen-directed gaze, rapid glancing, possible note-checking, or another signal that should be interpreted in context.

Summary

The Summary provides a high-level interpretation of:

Candidate competence
Answer naturalness
Behavioral risk signals
Likely benign explanations
Overall integrity risk

The Summary should be read as a directional assessment, not as a final verdict.

3. How to Read Overall Risk Analysis

This section is the most important parts of the report. It evaluates whether the candidate’s performance is stable and internally consistent.

Integrity Risk Analysis

This field summarizes the overall risk level, such as Low, Medium, or High.

The risk level should be interpreted as a directional assessment, not as a final conclusion. A higher risk level means the report found stronger or more repeated signals that may require manual review. A lower risk level means the candidate’s behavior and answers were more consistent with independent, spontaneous performance.

Risk Increasers

Risk increasers are signals that may raise concern about possible external assistance, especially when they align with changes in answer quality, timing, or follow-up performance.

Examples include:

Clear visual scanning patterns that align with a sudden jump in answer polish.
Visible use of a secondary device to play pre-captured answers.
Replaying incorrect audio for a technical question.
The candidate asks the questions aloud and immediately reads highly polished answers.
Total lack of spontaneous reasoning, natural pauses, or conversational filler.
Unexplained extreme spikes in capability.
Refusal to engage in live coding or interactive problem solving.
Repeated visual behaviors strongly indicative of off-screen reading during theoretical evaluations.

These signals should still be reviewed in context. A risk increaser is not proof of misconduct by itself, but it may justify closer manual review.

Risk Reducers

Risk reducers are signals that lower the likelihood of external assistance, such as:

Detailed real-world anecdotes
Consistent explanations across multiple questions
Strong follow-up performance
First-principles reasoning
Natural pacing and conversational delivery
Ability to explain trade-offs, edge cases, or implementation details
Stable performance across easy, medium, and hard questions

Risk reducers are especially important when minor behavior events are present. For example, brief screen glances may be less concerning if the candidate provides detailed, consistent, and explainable answers throughout the interview.

Benign Explanations

Benign explanations describe reasonable non-suspicious interpretations of observed behavior.

For example, screen scanning may be consistent with referencing personal notes, reviewing a resume, or recalling structured information.

5. How to Read Suspicious Behavior

The Suspicious Behavior section lists specific behavioral risk events.

Each event may include:

Field	Meaning
Type	The behavior category, such as `screen text reading`
Severity	Low, medium, or high
Timestamp	When the event occurred
Confidence	How confident the system is in the event detection
Topic Group	The interview topic related to the event
Evidence Notes	Short explanation of what was observed

Severity

Severity	Interpretation
Low	Brief, isolated, and plausibly benign
Medium	More noticeable or repeated; review with context
High	Stronger signal of suspicious behavior

6. How to Read Question & Answer Analysis

This section evaluates candidate competence question by question.

Each Q&A typically includes:

Field	Meaning
Question	The interviewer’s question
Answer Summary	Summary of the candidate’s answer
Difficulty	Easy, medium, or hard
Correctness	Whether the answer addressed the question correctly
Depth	How much substance and reasoning the answer contained
Clarity	Whether the answer was structured and easy to follow
Transfer	Whether the candidate could apply the concept to related scenarios
Analysis	Explanation of the score
Behavior Events	Any relevant behavior during that answer
Score	Per-question quality score

When reading this section, do not focus only on the numeric score. Also check whether the candidate:

Directly answered the question
Explained the reasoning
Provided concrete examples
Handled follow-ups
Stayed consistent with earlier answers
Avoided sudden unexplained jumps in quality

7. How to Read Consistency Analysis

Consistency metrics are used as supporting indicators when there are no clear behavioral anomalies, or when behavioral evidence is limited.

They help reviewers evaluate whether the candidate’s answers appear stable, natural, and internally coherent across the interview. These metrics do not prove misconduct. Instead, they provide an additional way to assess integrity risk based on the candidate’s answer patterns, reasoning quality, timing, and ability to handle follow-up questions.

In other words, when the report does not detect obvious suspicious behavior, consistency metrics help answer the question: "Does the candidate’s performance look like a stable and natural demonstration of their own ability?"

Topic Consistency

What it checks:
Whether the candidate gives consistent explanations within the same topic area.

How it supports integrity review:
When no behavioral anomaly is detected, stable topic consistency helps support the conclusion that the candidate understands the subject and is answering independently.

If the candidate changes definitions, contradicts earlier explanations, or shows major inconsistency within the same topic, it may reduce confidence in the answer quality and raise the need for closer review.

Risk reducer:
The candidate explains related concepts consistently across multiple questions.

Risk increaser:
The candidate gives conflicting explanations within the same topic without acknowledging or correcting the inconsistency.

Follow-up Consistency

What it checks:
Whether the candidate can maintain answer quality when the interviewer asks follow-up questions.

How it supports integrity review:
Follow-up questions are harder to prepare in advance. If the candidate can clarify, expand, and defend their answers during follow-ups, it supports genuine understanding.

If the candidate gives a strong initial answer but cannot answer simple follow-ups, the report may lower confidence in the candidate’s demonstrated competence. In the absence of behavioral evidence, this should be treated primarily as a confidence reducer, not as proof of external assistance.

Risk reducer:
The candidate remains clear and accurate under follow-up questioning.

Risk increaser:
The candidate gives polished initial answers but fails to explain, adapt, or reproduce the reasoning when probed.

Reasoning Consistency

What it checks:
Whether the candidate uses a stable reasoning process across answers.

How it supports integrity review:
A natural interview usually shows a consistent reasoning style. For example, the candidate may restate the problem, explain assumptions, walk through steps, and validate the answer.

If the candidate’s reasoning style changes sharply across questions, such as moving from step-by-step reasoning to conclusion-only answers, the report may reduce confidence in the reliability of the performance.

Risk reducer:
The candidate shows a consistent reasoning structure across different questions.

Risk increaser:
The candidate suddenly provides highly polished answers without showing intermediate reasoning.

Difficulty Consistency

What it checks:
Whether the candidate’s performance pattern makes sense across easy, medium, and hard questions.

How it supports integrity review:
In a natural interview, easier questions are usually answered more confidently, while harder questions may require more thought, clarification, or trade-off discussion.

If the candidate struggles with basic questions but performs unusually well on much harder questions without a clear explanation, the report may flag this as an unusual capability pattern.

Risk reducer:
Performance scales naturally with question difficulty.

Risk increaser:
The candidate shows unexplained extreme spikes in capability, especially when harder answers are much stronger than easier ones.

Temporal Consistency

What it checks:
Whether the timing and flow of answers appear natural.

How it supports integrity review:
Temporal consistency looks at the rhythm of the interview: pauses, response timing, answer development, and whether polished answers appear naturally or abruptly.

When no direct behavioral anomaly is detected, timing patterns can still help reviewers understand whether the candidate’s performance feels organic.

Risk reducer:
The candidate answers with natural pacing, reasonable thinking time, and spontaneous explanation.

Risk increaser:
The candidate repeatedly shows long stalls followed by unusually polished answers, without showing visible reasoning or explanation development.

Simple Reviewer Guidance

Consistency metrics are most useful when there are no obvious suspicious behavior events.

Pattern	Interpretation
No behavior anomaly + stable consistency	Strong support for low integrity risk
No behavior anomaly + strong follow-up performance	Strong risk reducer
No behavior anomaly + real examples and reasoning	Supports genuine competence
No behavior anomaly + inconsistent answers	Lowers confidence and may require review
No behavior anomaly + polished answers but weak follow-ups	Competence confidence reducer
No behavior anomaly + unexplained capability spikes	May increase integrity-risk attention

8. How to Read Hard-to-Fake Signals

Hard-to-fake signals are strong indicators of genuine competence.

Examples include:

First-principles explanation
Trade-off analysis
Concrete project examples
Debugging or correction loops
Handling follow-up constraints
Explaining why, not just what
Connecting theory to real implementation

These signals are important because a candidate who can explain concepts under probing, give specific examples, and maintain consistency is less likely to be relying on external assistance.

9. How to Read Interview Insights

The Interview Insights section summarizes strengths by category.

Category	Meaning
Tech	Technical or methodological capability
Domain	Industry-specific knowledge
Functional	Role-specific workflows and execution
Soft Skills	Communication, structure, and clarity

This section is useful for final hiring discussions because it converts individual answers into broader skill signals.

10. How to Read AI Fluency

The AI fluency section indicates whether AI usage was observed and whether it appeared permitted.

Common fields include:

Field	Meaning
Used	Whether AI usage was detected
Consent	Whether AI usage was explicitly allowed
Fluency Score	Candidate’s AI usage skill, if AI use was allowed and observed
Commentary	Explanation of the AI usage judgment

When AI use is not observed or not permitted, the AI fluency part will be not applicable.

11. Suggested Review Workflow

Step 1: Check Overall Risk

Start with Integrity Score, Potential Violations, Summary, and Overall Consistency Analysis.

Step 2: Review Behavioral Events

For each suspicious behavior event, check:

Severity
Confidence
Timestamp
Evidence notes
Whether manual review is required
Whether there is a plausible benign explanation

Step 3: Compare Against Answer Quality

A brief glance or screen-directed gaze should be interpreted differently depending on the candidate’s answer quality.

If the candidate gives detailed, consistent, and explainable answers, the behavior may be low risk.

If the candidate gives sudden perfect answers after pauses, cannot explain follow-ups, or shows repeated external-assistance signals, the risk should be reviewed more carefully.

Step 4: Evaluate Consistency

Give more weight to repeated patterns than isolated events.

Stable topic consistency, follow-up consistency, and reasoning consistency usually reduce integrity risk.

Step 5: Use Skill Summary for Hiring Discussion

Use Interview Insights and Skill Reason to evaluate whether the candidate meets the role requirements.

12. Common Misinterpretations

Misinterpretation 1: Potential Violations means confirmed cheating

Incorrect. Potential violations are risk signals, not confirmed cheating events.

Misinterpretation 2: Integrity Score is a hiring score

Incorrect. Integrity Score measures integrity risk, not job fit or technical ability.

Sherlock FAQs

Sherlock vs WeCP Advanced Proctoring

API Overview

1. Recommended Reading Order

2. How to Read the Top-Level Metrics

Integrity Score

Potential Violations

Summary

3. How to Read Overall Risk Analysis

Integrity Risk Analysis

Risk Increasers

Risk Reducers

Benign Explanations

5. How to Read Suspicious Behavior

Field

Meaning

Severity

Severity

Interpretation

6. How to Read Question & Answer Analysis

Field

Meaning

7. How to Read Consistency Analysis

Topic Consistency

Follow-up Consistency

Reasoning Consistency

Difficulty Consistency

Temporal Consistency

Simple Reviewer Guidance

Pattern

Interpretation

8. How to Read Hard-to-Fake Signals

9. How to Read Interview Insights

Category

Meaning

10. How to Read AI Fluency

Field

Meaning

11. Suggested Review Workflow

Step 1: Check Overall Risk

Step 2: Review Behavioral Events

Step 3: Compare Against Answer Quality

Step 4: Evaluate Consistency

Step 5: Use Skill Summary for Hiring Discussion

12. Common Misinterpretations

Misinterpretation 1: Potential Violations means confirmed cheating

Misinterpretation 2: Integrity Score is a hiring score