Skip to content

Implement Tests for Hallucinations for Virtual Assistant Demo #199

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
riacheruvu opened this issue Feb 4, 2025 · 4 comments
Closed

Implement Tests for Hallucinations for Virtual Assistant Demo #199

riacheruvu opened this issue Feb 4, 2025 · 4 comments
Labels
good first issue Good for newcomers large_difficulty stale This issue or pull request is not active

Comments

@riacheruvu
Copy link

riacheruvu commented Feb 4, 2025

Description

Implement a suite of unit tests to check if the output of the Large Language Model (LLM) used in the Virtual Assistant Demo involves hallucinations. This is a great opportunity to explore techniques on ensuring the reliability and accuracy of the assistant's responses by identifying and mitigating instances where the LLM generates incorrect or misleading information.

This project will involve Python programming; basic experience with AI models from frameworks like PyTorch, TensorFlow, OpenVINO, or ONNX is beneficial. To learn more about the OpenVINO toolkit, visit the documentation here.

Examples of hallucinations:

  • Factual errors
  • Inaccurately summarizing information
  • Creating nonsensical content, such as random sentences (which can also manifest in the LLM trailing off in the middle of a response to a tangential topic)

Steps:

  1. Develop unit tests for Hallucination Detection:

    • Define criteria for what constitutes a hallucination and implement checks based on these criteria.
    • Create unit tests that analyze the output of the LLM used in the Virtual Assistant Demo to detect hallucinations.
  2. Enhance Testing Framework:

    • Integrate the new unit tests into the existing testing framework.
    • Ensure the tests are comprehensive and cover various scenarios where hallucinations might occur.
  3. Documentation:

    • Provide clear and detailed documentation for setting up and running the unit tests.
    • Include comments in the code to explain key sections and logic.

How to Get Started:

  1. Fork the OpenVINO Build Deploy repository.
  2. Follow these installation instructions to setup your environment and install the required dependencies for this project.
  3. Read Demo Contribution Guide.
  4. Build your feature and ensure it meets the requirements specified in the contribution guide.
  5. Submit a pull request.
@jnzw
Copy link

jnzw commented Mar 3, 2025

Team 3 is working on this issue.

Copy link

github-actions bot commented Apr 2, 2025

This issue has been marked because it has been open for 30 days with no activity. It is scheduled to close in 14 days.

@github-actions github-actions bot added the stale This issue or pull request is not active label Apr 2, 2025
@adrianboguszewski adrianboguszewski removed the stale This issue or pull request is not active label Apr 14, 2025
Copy link

This issue has been marked because it has been open for 30 days with no activity. It is scheduled to close in 14 days.

@github-actions github-actions bot added the stale This issue or pull request is not active label May 15, 2025
Copy link

This issue was closed automatically because it has been inactive for 14 days since being marked as stale. Please reopen if needed.

@github-actions github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale May 29, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers large_difficulty stale This issue or pull request is not active
Projects
None yet
Development

No branches or pull requests

3 participants