Using ILogger instead of EventSource for logging exceptions #3881

timmydo · 2022-11-09T18:56:49Z

Is your feature request related to a problem?

We're running the prometheus exporter middleware in production container environment where we have a setup with ILogger exporting to Geneva. We don't have the means to connect perfview or read files on disk to diagnose why the prometheus exporter is returning 500.

Describe the solution you'd like:

What do you want to happen instead? What is the expected behavior?

https://github.com/open-telemetry/opentelemetry-dotnet/blob/main/src/OpenTelemetry.Exporter.Prometheus.AspNetCore/PrometheusExporterMiddleware.cs#L92

            catch (Exception ex)
            {
                PrometheusExporterEventSource.Log.FailedExport(ex);
                if (!response.HasStarted)
                {
                    response.StatusCode = 500;
                }
            }

something like this

                LogFailedExportException(LogLevel.Error, this.logger);
           

    public partial class PrometheusMiddlewareLogs
    {
        [LoggerMessage(EventId = 1, Message = "ExportFailed exception")]
        static partial void LogFailedExportException(LogLevel logLevel, ILogger logger);
    }

I looked at replacing the middleware class but exporter.CollectionManager is internal so I might need to use some reflection to get around that if we have to hack a fix together.

@cijothomas

The text was updated successfully, but these errors were encountered:

cijothomas · 2022-11-09T18:59:28Z

Linking previous attempts #1867

cijothomas · 2023-02-23T20:54:51Z

Tagging for consideration in 1.6, as that is the milestone when we expect to finalize Logging changes which are currently worked on in main-logs branch.

cijothomas · 2023-02-28T17:51:32Z

Linking to #3447 (comment)

pharring · 2024-02-26T21:45:08Z

If you do this (replace internal EventSources with ILogger), please make sure that ActivityStarted/ActivityStopped events are turned into logger scopes (BeginScope/Dispose).

cijothomas · 2024-04-10T15:12:43Z

If you do this (replace internal EventSources with ILogger), please make sure that ActivityStarted/ActivityStopped events are turned into logger scopes (BeginScope/Dispose).

I am not sure why is that relevant/needed? Could you elaborate...

Code-Grump · 2024-04-10T15:23:57Z

I am somewhat confused - if I'm routing my logging via OTel and I'm having problems with an exporter, writing the problems back into that system feels like it could make the problem worse, or otherwise make it harder to diagnose. Writing to an ILogger is definitely a lot easier to work with than an EventSource, I'd just like to see an example of how this is expected to work.

cijothomas · 2024-04-10T15:37:45Z

I am somewhat confused - if I'm routing my logging via OTel and I'm having problems with an exporter, writing the problems back into that system feels like it could make the problem worse, or otherwise make it harder to diagnose. Writing to an ILogger is definitely a lot easier to work with than an EventSource, I'd just like to see an example of how this is expected to work.

It is a valid point, and yes, solving that (i.e avoiding dead-loop situations) is part of implementation details. See #1867 (comment)

cijothomas · 2024-09-16T20:12:43Z

@Yun-Ting is still working on it targeting 1.10 milestone.

Yun-Ting · 2024-10-08T00:55:44Z

I am somewhat confused - if I'm routing my logging via OTel and I'm having problems with an exporter, writing the problems back into that system feels like it could make the problem worse, or otherwise make it harder to diagnose.

While trying to implement the to ILogger migration for OTel internal diagnostics events, I hit the above mentioning issue.

The crux of the problem we want to solve here is the inability to access the file-on-disk in containerized environments.
I'm thinking we can consider the below alternative solution—write diagnostics logs to the console.

How to enable OTel self diagnostics

By setting environment variables for the at fault process or by adding key-value pair entries to appsettings.

For example,

OTel_selfDiagnostics_target = console; Otel_selfDiagnostcs_level= warning;

{ "OTel_selfDiagnostics_target": "console", "Otel_selfDiagnostcs_level": "warning" }

The assumption here is console logs will be accessible for environments where file-on-disk is not accessible.
(For example: https://learn.microsoft.com/en-us/azure/container-apps/log-streaming?tabs=bash#view-log-streams-via-the-azure-portal)

Implementation

Use the existing EventSource diagnostics infrastructure, and add support for write self diagnostics log to console when user specifies to direct logs there.
Adding support to write to console here: https://github.com/open-telemetry/opentelemetry-dotnet/blob/a55a5ac997204e571a41c495f83de828d3e07685/src/OpenTelemetry/Internal/SelfDiagnosticsConfigRefresher.cs#L37C1-L43C57

cijothomas · 2024-10-08T01:06:40Z

While trying to implement the to ILogger migration for OTel internal diagnostics events, I hit the above mentioning issue.

Isn't that solvable with suppression scope? We prevent SDK's own http spans from being looped back in already. We can use the same mechanism for logs too.

I'm thinking we can consider the below alternative solution—write diagnostics logs to the console.

Console maybe a viable alternative, given stdout is a very common supported solution in many hosting environents, but I think we should still send logs to ILogger, and let user decide if they want ILogger to route it to console or someplace else. Sometimes users want the Ilogger piped via OTel SDK + OTLP itself (eg: Metrics cardinality limit is hit - user can chose to send such logs to their OTLP Logs etc.)

CodeBlanch · 2024-10-08T17:20:24Z

@cijothomas We've been prototyping ILogger. But it seems to introduce more challenges/questions than solutions 🤣 Planning to discuss this on the SIG today if you want to join.

Here's some thoughts in no particular order...

User with manual provider
```
     using var tracerProvider = Sdk.CreateTracerProviderBuilder()
         .AddSource("MyCompany.MyProduct.MyLibrary")
         .AddOtlpExporter()
         .Build();
```
Problem 1, there is no ILoggerFactory in that provider. We could introduce one. Challenge there is a lot of the code is shared with providers created through the host and figuring out if/when to introduce an ILoggerFactory is a challenge. But let's say we solve that. Problem 2, there is no configuration. How would the user toggle logging after the app is deployed? Problem 3, there is no Microsoft.Extensions.Logging.Console available. User won't be able to turn on logging to Console without a code change and a redeploy. Or we have to take a new dependency. Yuck.
User using hosting
```
builder.Services.AddOpenTelemetry()
 .WithTracing(tracing => tracing
     .AddAspNetCoreInstrumentation()
     .AddOtlpExporter())
 .WithMetrics(metrics => metrics
     .AddAspNetCoreInstrumentation()
     .AddOtlpExpoter())
 .WithLogging(logging => logging
     .AddOtlpExporter());
```
There is an ILoggerFactory in that IServiceProvider. Should we use that one? It will send off via OTLP. Does the user want that? Or is the user trying to debug a broken pipeline and actually wants to see stuff on the Console? How do we know? How do we document all the possible things which could happen? If the user does something odd like builder.Logging.ClearProviders() what do we do?
Implementation

EventSource is nice because it is all static. Going to ILogger means everything needs to be able to access the ILoggerFactory instance. That is a code bomb. Also other components say GenevaExporter need to be able to do it too. So there's some public API required.

What I want to do is step back a bit. Instead of jumping into a solution like "use ILogger for internal logging" I think we should focus first on the scenario(s) we want to enable:

I am a user who cannot access the file system where my application is running...
- I want to be able to write internal logging to the Console/stdout.
- I want to be able to turn this on/off in a deployed application using environment variables or IConfiguration (command line, appSettings.json, settings service, whatever).

When @Yun-Ting and I discussed the scenario we felt a much simpler thing to do would be augment the feature we already have to support a console target. It will enable the scenario(s) and avoid all the problems we were running into with ILogger 🤔

Code-Grump · 2024-10-08T19:10:31Z

I feel using ILogger is attractive, but not correct. It promises the ability to write diagnostic information to any compatible sink, using an established configuration system, but it's intended to deliver the application logs via OpenTelemetry. The kind of logging we're interested in is the "telemetry isn't working" kind that we turn on because the systems connected to ILogger aren't working.

Being able to route the existing system to the console, enabled via an environment variable meets that need without a great deal of thinking. I've implemented it myself three times now and would love to not have to make it a fourth.

CodeBlanch · 2024-10-08T19:38:43Z

Obligatory

noahfalk · 2024-10-08T20:52:52Z

+1 on nailing down the scenario goal above all else :)

If its helpful it sounds like the issues you are hitting includes this one: dotnet/runtime#50777

There is lots of discussion on that thread, but the currently suggested workaround is to create your own ILogger and configure it internal to your library in situations where you can't (or don't want to) have the user pass one in through your API surface. Doing that implies taking an assembly dependency on M.E.L and perhaps particular providers like console. Depending on your scenario goal you might also decide that these dependencies aren't justified relative to the functionality you need to meet your goal.

alanwest · 2024-10-11T19:44:41Z

When @Yun-Ting and I discussed the scenario we felt a much simpler thing to do would be augment the feature we already have to support a console target. It will enable the scenario(s) and avoid all the problems we were running into with ILogger

I support this approach. However, I'm not sure there'd be anything inherently wrong with using ILogger either. To me it seems like folks are conflating a number of things together which caused the complications perceived with using ILogger. Namely, OpenTelemetry .NET's support for ILogger is orthogonal to the SDK's self diagnostic logging. In my opinion, it was never a requirement to integrate the self diagnostics into the SDK's logging pipeline. As such, the SDK could create a configure a LoggerFactory expressly for the purpose of self diagnostics that is isolated from the SDK's pipeline.

That said, either ILogger or EventSource is fine. They're both logging frameworks within the .NET ecosystem, so they're both fine options. Every other language SDK has done similar in adopting a logging framework from their ecosystem. For example, OpenTelemetry Java uses java.util.logging. To @noahfalk's point, maybe EventSource is preferable in order to avoid additional dependencies?

Irrespective of logging framework chosen, here are the requirements I'd like us to consider:

Default to writing self diagnostics to stdout. The option to write to a file is nice, but I have found it to be an additional hurdle for supporting end users.
Allow for self diagnostics to be enabled/disabled.
Enable self diagnostics by default with error (or maybe warn) severity being the default.

There was some additional discussion in the SIG meeting about using the OTLP format. I believe this too is orthogonal and should not be a requirement. Later we could entertain a separately configurable export pipeline that could send these logs over OTLP, but I suspect using log forwarders like fluent bit to scrape console logs is the more common pattern for sending these kind of logs to a backend.

Lastly, in the SIG meeting, there was some concern expressed about multi-line log message (e.g., a log message containing a stack trace). I'd be fine with a requirement that self diagnostics be configurable to emit messages on a single line, but I do not think this should be the default. Multiline log messages are very common, and log forwarders like fluent bit can be configured to handle them pretty seamlessly.

CodeBlanch · 2024-10-11T20:29:25Z

EventSource is preferable in order to avoid additional dependencies?

Yes. We have some ILogger dependencies. But we don't have ILogger Console dependencies. If we want console-by-default we would have to take a reference on Microsoft.Extensions.Logging.Console and that feels like a non-starter. Could implement our own ILoggerProvider which writes to Console to avoid that but there is also the size of the effort to switch from EventSource to ILogger. It is big so we better be sure there is a really compelling reason to take it on 😄

Default to writing self diagnostics to stdout. The option to write to a file is nice, but I have found it to be an additional hurdle for supporting end users.

Allow for self diagnostics to be enabled/disabled.

Enable self diagnostics by default with error (or maybe warn) severity being the default.

👍 from me

There was some additional discussion in the SIG meeting about using the OTLP format. I believe this too is orthogonal and should not be a requirement. Later we could entertain a separately configurable export pipeline that could send these logs over OTLP, but I suspect using log forwarders like fluent bit to scrape console logs is the more common pattern for sending these kind of logs to a backend.

That's certainly not what I was suggesting. Maybe someone else did 😄 What I was suggesting was, when it comes to writing to console/stdout, we should do single-line JSON using OTLP JSON schema. Doesn't have to be that, but if we can avoid inventing some new format great!

timmydo added the enhancement New feature or request label Nov 9, 2022

cijothomas mentioned this issue Jan 31, 2023

NotSupportedException is thrown during ConfigureBuilder #3753

Closed

cijothomas added the pkg:OpenTelemetry Issues related to OpenTelemetry NuGet package label Jan 31, 2023

cijothomas mentioned this issue Feb 23, 2023

[FEATURE REQ] Very difficult to know what went wrong when posts to Azure Monitor fail Azure/azure-sdk-for-net#34483

Closed

cijothomas added this to the 1.6.0 milestone Feb 23, 2023

cijothomas mentioned this issue Feb 28, 2023

How to know AddOtlpExporter Endpoint is working or successful? #3447

Closed

utpilla modified the milestones: 1.6.0, 1.7.0 Aug 8, 2023

utpilla modified the milestones: 1.7.0, 1.8.0 Oct 17, 2023

This was referenced Feb 22, 2024

Geneva Exporter silently dropping an exception log record open-telemetry/opentelemetry-dotnet-contrib#1433

Closed

OpenTelemetrySdkEventSource updates for activity tracking #5387

Closed

utpilla modified the milestones: 1.8.0, 1.9.0 Mar 9, 2024

CodeBlanch modified the milestones: 1.9.0, 1.10.0 Mar 13, 2024

reyang mentioned this issue Mar 14, 2024

[Exporter.OpenTelemetryProtocol] fix event source throws System.FormatException #5451

Merged

cijothomas mentioned this issue Apr 10, 2024

Outputting OTEL Self Diagnostics log to Console #2732

Closed

reyang assigned Yun-Ting May 21, 2024

Yun-Ting mentioned this issue Oct 2, 2024

[WIP] Using ILogger instead of EventSource for logging OpenTelemetry Events in OTLP exporter #5879

Closed

4 tasks

CodeBlanch modified the milestones: 1.10.0, Future Oct 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using ILogger instead of EventSource for logging exceptions #3881

Using ILogger instead of EventSource for logging exceptions #3881

timmydo commented Nov 9, 2022

cijothomas commented Nov 9, 2022

cijothomas commented Feb 23, 2023 •

edited

Loading

cijothomas commented Feb 28, 2023

pharring commented Feb 26, 2024 •

edited

Loading

cijothomas commented Apr 10, 2024

Code-Grump commented Apr 10, 2024

cijothomas commented Apr 10, 2024

cijothomas commented Sep 16, 2024

Yun-Ting commented Oct 8, 2024

cijothomas commented Oct 8, 2024 •

edited

Loading

CodeBlanch commented Oct 8, 2024

Code-Grump commented Oct 8, 2024

CodeBlanch commented Oct 8, 2024

noahfalk commented Oct 8, 2024

alanwest commented Oct 11, 2024

CodeBlanch commented Oct 11, 2024

Using ILogger instead of EventSource for logging exceptions #3881

Using ILogger instead of EventSource for logging exceptions #3881

Comments

timmydo commented Nov 9, 2022

cijothomas commented Nov 9, 2022

cijothomas commented Feb 23, 2023 • edited Loading

cijothomas commented Feb 28, 2023

pharring commented Feb 26, 2024 • edited Loading

cijothomas commented Apr 10, 2024

Code-Grump commented Apr 10, 2024

cijothomas commented Apr 10, 2024

cijothomas commented Sep 16, 2024

Yun-Ting commented Oct 8, 2024

How to enable OTel self diagnostics

Implementation

cijothomas commented Oct 8, 2024 • edited Loading

CodeBlanch commented Oct 8, 2024

Code-Grump commented Oct 8, 2024

CodeBlanch commented Oct 8, 2024

noahfalk commented Oct 8, 2024

alanwest commented Oct 11, 2024

CodeBlanch commented Oct 11, 2024

cijothomas commented Feb 23, 2023 •

edited

Loading

pharring commented Feb 26, 2024 •

edited

Loading

cijothomas commented Oct 8, 2024 •

edited

Loading