feat: Traceloop evals #36
Conversation
return True

# PRIORITY 2: Check for explicit LLM span kind (even without messages, for compatibility)
if span_kind == "llm":
Limit this check to the Traceloop-defined span kinds - https://www.traceloop.com/docs/openllmetry/contributing/semantic-conventions#llm-frameworks
# PRIORITY 3: Detect ReAct agent/task spans by kind
# These are agent workflows that contain LLM calls
if span_kind in ["agent", "task", "workflow"]:
Ideally, evals should be limited to LLM spans.
# ------------------------------------------------------------------
# Internal helpers
# ------------------------------------------------------------------
def _is_llm_span(self, span: ReadableSpan) -> bool:
Ideally, this method should return True only for spans that are LLM calls or chat calls with the model.
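A hedged sketch of that suggestion: treat a span as an LLM span only when it represents a direct model call, using the Traceloop `traceloop.span.kind` attribute and the OpenTelemetry GenAI `gen_ai.operation.name` attribute as signals. The operation list and the standalone function shape are assumptions, not the PR's actual code:

```python
# Hypothetical replacement for _is_llm_span, operating on span attributes.
# Only spans representing a direct LLM/chat call with the model pass.
LLM_OPERATIONS = {"chat", "completion", "text_completion"}

def is_llm_span(attributes: dict) -> bool:
    # Traceloop marks direct model calls with span kind "llm"
    if attributes.get("traceloop.span.kind") == "llm":
        return True
    # OTel GenAI conventions mark model calls via gen_ai.operation.name
    return attributes.get("gen_ai.operation.name") in LLM_OPERATIONS
```

Agent, task, and workflow spans would then be excluded by construction rather than special-cased.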
# Mapping from original span_id to translated INVOCATION (not span) for parent-child relationship preservation
self._original_to_translated_invocation: Dict[int, Any] = {}
# Buffer spans to process them in the correct order (parents before children)
self._span_buffer: List[ReadableSpan] = []
Look into the span buffer logic
Remove the buffer logic if it is not required.
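If buffering turns out to be unnecessary, the processor can dispatch each span as it ends instead of accumulating it in `_span_buffer`. A simplified, buffer-free sketch; the class and attribute names are illustrative, not the PR's code:

```python
class DirectSpanHandler:
    """Buffer-free alternative: handle each span in on_end, no _span_buffer."""

    def __init__(self):
        self.evaluated = []

    def on_end(self, span_kind: str, span_name: str) -> None:
        # Evaluate LLM spans immediately; drop the rest instead of buffering.
        if span_kind == "llm":
            self.evaluated.append(span_name)
```

This avoids holding spans in memory and sidesteps the parent-before-child ordering concern entirely, at the cost of losing any batch-level view of non-LLM spans.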
# STEP 2: Check if this is an LLM span that needs evaluation
if self._is_llm_span(span):
    _logger.debug(
        "🔍 TRACELOOP PROCESSOR: LLM span '%s' detected! Processing immediately for evaluations",
Remove emojis from the log messages.
        "Failed to stop LLM invocation: %s", stop_err
    )
else:
    # Non-LLM spans (tasks, workflows, tools) - buffer for optional batch processing
Revisit this logic
This builds on top of #29.