Skip to main content

Interview Report User Guide

This report helps interview reviewers evaluate a candidate’s competence, answer consistency, behavioral risk signals, AI usage indicators, and overall interview integrity risk.

Written by Gemma Azur

1. Recommended Reading Order

Review the report in this order:

  1. Integrity Score and Potential Violations
    Start with the overall integrity score and the number of potential risk events.

  2. Summary
    Read the high-level explanation of the candidate’s performance and risk profile.

  3. Overall Risk Analysis

  4. Suspicious Behavior
    Review specific behavioral events, timestamps, severity, confidence.

  5. Interview Insights, Skill Reason, and AI Usage
    Use these sections for final skill assessment and hiring discussion.

Under Advanced Report Version:

  1. Question & Answer Analysis
    Examine each answer’s correctness, depth, clarity, transferability, and supporting evidence.

  2. Detailed Consistency Analysis
    Review topic-level consistency, hard-to-fake signals, and integrity risk by topic group.


2. How to Read the Top-Level Metrics

Integrity Score

The Integrity Score is a 0–10 score related to interview integrity risk. A higher score means lower observed risk.

Potential Violations

Potential Violations refers to the number of detected risk events.

This does not mean confirmed cheating. A potential violation may simply indicate behavior such as screen-directed gaze, rapid glancing, possible note-checking, or another signal that should be interpreted in context.

Summary

The Summary provides a high-level interpretation of:

  • Candidate competence

  • Answer naturalness

  • Behavioral risk signals

  • Likely benign explanations

  • Overall integrity risk

The Summary should be read as a directional assessment, not as a final verdict.


3. How to Read Overall Risk Analysis

This section is the most important parts of the report. It evaluates whether the candidate’s performance is stable and internally consistent.

Integrity Risk Analysis

This field summarizes the overall risk level, such as Low, Medium, or High.

The risk level should be interpreted as a directional assessment, not as a final conclusion. A higher risk level means the report found stronger or more repeated signals that may require manual review. A lower risk level means the candidate’s behavior and answers were more consistent with independent, spontaneous performance.


Risk Increasers

Risk increasers are signals that may raise concern about possible external assistance, especially when they align with changes in answer quality, timing, or follow-up performance.

Examples include:

  • Clear visual scanning patterns that align with a sudden jump in answer polish.

  • Visible use of a secondary device to play pre-captured answers.

  • Replaying incorrect audio for a technical question.

  • The candidate asks the questions aloud and immediately reads highly polished answers.

  • Total lack of spontaneous reasoning, natural pauses, or conversational filler.

  • Unexplained extreme spikes in capability.

  • Refusal to engage in live coding or interactive problem solving.

  • Repeated visual behaviors strongly indicative of off-screen reading during theoretical evaluations.

These signals should still be reviewed in context. A risk increaser is not proof of misconduct by itself, but it may justify closer manual review.


Risk Reducers

Risk reducers are signals that lower the likelihood of external assistance, such as:

  • Detailed real-world anecdotes

  • Consistent explanations across multiple questions

  • Strong follow-up performance

  • First-principles reasoning

  • Natural pacing and conversational delivery

  • Ability to explain trade-offs, edge cases, or implementation details

  • Stable performance across easy, medium, and hard questions

Risk reducers are especially important when minor behavior events are present. For example, brief screen glances may be less concerning if the candidate provides detailed, consistent, and explainable answers throughout the interview.

Benign Explanations

Benign explanations describe reasonable non-suspicious interpretations of observed behavior.

For example, screen scanning may be consistent with referencing personal notes, reviewing a resume, or recalling structured information.


5. How to Read Suspicious Behavior

The Suspicious Behavior section lists specific behavioral risk events.

Each event may include:

Field

Meaning

Type

The behavior category, such as screen text reading

Severity

Low, medium, or high

Timestamp

When the event occurred

Confidence

How confident the system is in the event detection

Topic Group

The interview topic related to the event

Evidence Notes

Short explanation of what was observed

Severity

Severity

Interpretation

Low

Brief, isolated, and plausibly benign

Medium

More noticeable or repeated; review with context

High

Stronger signal of suspicious behavior


6. How to Read Question & Answer Analysis

This section evaluates candidate competence question by question.

Each Q&A typically includes:

Field

Meaning

Question

The interviewer’s question

Answer Summary

Summary of the candidate’s answer

Difficulty

Easy, medium, or hard

Correctness

Whether the answer addressed the question correctly

Depth

How much substance and reasoning the answer contained

Clarity

Whether the answer was structured and easy to follow

Transfer

Whether the candidate could apply the concept to related scenarios

Analysis

Explanation of the score

Behavior Events

Any relevant behavior during that answer

Score

Per-question quality score

When reading this section, do not focus only on the numeric score. Also check whether the candidate:

  • Directly answered the question

  • Explained the reasoning

  • Provided concrete examples

  • Handled follow-ups

  • Stayed consistent with earlier answers

  • Avoided sudden unexplained jumps in quality


7. How to Read Consistency Analysis

Consistency metrics are used as supporting indicators when there are no clear behavioral anomalies, or when behavioral evidence is limited.

They help reviewers evaluate whether the candidate’s answers appear stable, natural, and internally coherent across the interview. These metrics do not prove misconduct. Instead, they provide an additional way to assess integrity risk based on the candidate’s answer patterns, reasoning quality, timing, and ability to handle follow-up questions.

In other words, when the report does not detect obvious suspicious behavior, consistency metrics help answer the question: "Does the candidate’s performance look like a stable and natural demonstration of their own ability?"

Topic Consistency

What it checks:
Whether the candidate gives consistent explanations within the same topic area.

How it supports integrity review:
When no behavioral anomaly is detected, stable topic consistency helps support the conclusion that the candidate understands the subject and is answering independently.

If the candidate changes definitions, contradicts earlier explanations, or shows major inconsistency within the same topic, it may reduce confidence in the answer quality and raise the need for closer review.

Risk reducer:
The candidate explains related concepts consistently across multiple questions.

Risk increaser:
The candidate gives conflicting explanations within the same topic without acknowledging or correcting the inconsistency.


Follow-up Consistency

What it checks:
Whether the candidate can maintain answer quality when the interviewer asks follow-up questions.

How it supports integrity review:
Follow-up questions are harder to prepare in advance. If the candidate can clarify, expand, and defend their answers during follow-ups, it supports genuine understanding.

If the candidate gives a strong initial answer but cannot answer simple follow-ups, the report may lower confidence in the candidate’s demonstrated competence. In the absence of behavioral evidence, this should be treated primarily as a confidence reducer, not as proof of external assistance.

Risk reducer:
The candidate remains clear and accurate under follow-up questioning.

Risk increaser:
The candidate gives polished initial answers but fails to explain, adapt, or reproduce the reasoning when probed.


Reasoning Consistency

What it checks:
Whether the candidate uses a stable reasoning process across answers.

How it supports integrity review:
A natural interview usually shows a consistent reasoning style. For example, the candidate may restate the problem, explain assumptions, walk through steps, and validate the answer.

If the candidate’s reasoning style changes sharply across questions, such as moving from step-by-step reasoning to conclusion-only answers, the report may reduce confidence in the reliability of the performance.

Risk reducer:
The candidate shows a consistent reasoning structure across different questions.

Risk increaser:
The candidate suddenly provides highly polished answers without showing intermediate reasoning.


Difficulty Consistency

What it checks:
Whether the candidate’s performance pattern makes sense across easy, medium, and hard questions.

How it supports integrity review:
In a natural interview, easier questions are usually answered more confidently, while harder questions may require more thought, clarification, or trade-off discussion.

If the candidate struggles with basic questions but performs unusually well on much harder questions without a clear explanation, the report may flag this as an unusual capability pattern.

Risk reducer:
Performance scales naturally with question difficulty.

Risk increaser:
The candidate shows unexplained extreme spikes in capability, especially when harder answers are much stronger than easier ones.


Temporal Consistency

What it checks:
Whether the timing and flow of answers appear natural.

How it supports integrity review:
Temporal consistency looks at the rhythm of the interview: pauses, response timing, answer development, and whether polished answers appear naturally or abruptly.

When no direct behavioral anomaly is detected, timing patterns can still help reviewers understand whether the candidate’s performance feels organic.

Risk reducer:
The candidate answers with natural pacing, reasonable thinking time, and spontaneous explanation.

Risk increaser:
The candidate repeatedly shows long stalls followed by unusually polished answers, without showing visible reasoning or explanation development.


Simple Reviewer Guidance

Consistency metrics are most useful when there are no obvious suspicious behavior events.

Pattern

Interpretation

No behavior anomaly + stable consistency

Strong support for low integrity risk

No behavior anomaly + strong follow-up performance

Strong risk reducer

No behavior anomaly + real examples and reasoning

Supports genuine competence

No behavior anomaly + inconsistent answers

Lowers confidence and may require review

No behavior anomaly + polished answers but weak follow-ups

Competence confidence reducer

No behavior anomaly + unexplained capability spikes

May increase integrity-risk attention

8. How to Read Hard-to-Fake Signals

Hard-to-fake signals are strong indicators of genuine competence.

Examples include:

  • First-principles explanation

  • Trade-off analysis

  • Concrete project examples

  • Debugging or correction loops

  • Handling follow-up constraints

  • Explaining why, not just what

  • Connecting theory to real implementation

These signals are important because a candidate who can explain concepts under probing, give specific examples, and maintain consistency is less likely to be relying on external assistance.


9. How to Read Interview Insights

The Interview Insights section summarizes strengths by category.

Category

Meaning

Tech

Technical or methodological capability

Domain

Industry-specific knowledge

Functional

Role-specific workflows and execution

Soft Skills

Communication, structure, and clarity

This section is useful for final hiring discussions because it converts individual answers into broader skill signals.


10. How to Read AI Fluency

The AI fluency section indicates whether AI usage was observed and whether it appeared permitted.

Common fields include:

Field

Meaning

Used

Whether AI usage was detected

Consent

Whether AI usage was explicitly allowed

Fluency Score

Candidate’s AI usage skill, if AI use was allowed and observed

Commentary

Explanation of the AI usage judgment

When AI use is not observed or not permitted, the AI fluency part will be not applicable.


11. Suggested Review Workflow

Step 1: Check Overall Risk

Start with Integrity Score, Potential Violations, Summary, and Overall Consistency Analysis.

Step 2: Review Behavioral Events

For each suspicious behavior event, check:

  • Severity

  • Confidence

  • Timestamp

  • Evidence notes

  • Whether manual review is required

  • Whether there is a plausible benign explanation

Step 3: Compare Against Answer Quality

A brief glance or screen-directed gaze should be interpreted differently depending on the candidate’s answer quality.

If the candidate gives detailed, consistent, and explainable answers, the behavior may be low risk.

If the candidate gives sudden perfect answers after pauses, cannot explain follow-ups, or shows repeated external-assistance signals, the risk should be reviewed more carefully.

Step 4: Evaluate Consistency

Give more weight to repeated patterns than isolated events.

Stable topic consistency, follow-up consistency, and reasoning consistency usually reduce integrity risk.

Step 5: Use Skill Summary for Hiring Discussion

Use Interview Insights and Skill Reason to evaluate whether the candidate meets the role requirements.


12. Common Misinterpretations

Misinterpretation 1: Potential Violations means confirmed cheating

Incorrect. Potential violations are risk signals, not confirmed cheating events.

Misinterpretation 2: Integrity Score is a hiring score

Incorrect. Integrity Score measures integrity risk, not job fit or technical ability.

Did this answer your question?