Sync manually #73

MickL · 2020-01-01T16:35:33Z

Within a serverless environment the batchInterval is a problem: The script has already run and exited, but the logs have not been synced yet. Therefor it would be nice to have a function to sync manually if there are unsynced logs.

leebenson · 2020-01-02T08:24:02Z

Thanks @MickL. The @timberio/node and @timberio/browser packages were designed for traditional server and browser environments, respectively. We definitely need to test and tweak for serverless environments, which typically have their own constraints/quirks/library limitations that might not play well out-the-box with these two packages.

Can you await async functions in your particular serverless environment? If so, it might be worth trying a very low batchInterval and batchSize (see the ITimberOptions docs for setting those) and using await timber.log(...) wherever logs are being sent, to ensure the log has reached Timber before exiting.

Try the above and feel free to re-open if the issue persists, or you spot anything else. Thanks!

MickL · 2020-01-02T10:54:24Z

I would like to see this as a feature request and would like to ask you to reopen this issue.

Wouldnt a sync function be integrated quickly?

Await is no choice since the request has to be handled quickly. Instead at the end I want to send the response and after that keep the worker alive for sending the logs. There for i have a waitUntil function that wants a promise, which could be this new sync function.

leebenson · 2020-01-02T11:08:06Z

I think you can achieve what you're looking for using Promise.all.

How about something like:

// 'holder' for log promises...
const allLogs = [];

// later, wherever you're creating a log
allLogs.push(
  timber.log("your log message here")
)

// finally, when you need to await...
event.waitUntil(Promise.all(allLogs))

This ought to work, since timber.log() itself returns a Promise; storing each log Promise in an array and waiting until they've all completed should have a comparable effect to a general 'sync', and guarantees consistency that all logs have been sent to Timber.

I think this is preferable to having a sync 'built in' to the lib, because then we'll need to internally keep logs around in memory to assess whether all promises have been fulfilled, which is potentially expensive on RAM for applications that are handling masses of logs and may not use this feature.

MickL · 2020-01-02T12:05:30Z

This is not an option. Timber is used in all different kind of services and classes the waitUntil has no access to it.

I mean you batch all logs anyway and send them all together. Why not have the sync function that is triggered by batch manually and reset the timer?

leebenson · 2020-01-02T13:02:43Z

This is not an option. Timber is used in all different kind of services and classes the waitUntil has no access to it.

I'm not familiar with the environment used by Cloudflare Worker, but it sounds like some kind of dependency injection pattern might be useful.

In the short-term while waiting for changes to the lib, this kind of pattern might unblock you:

class Logger {
  constructor(logger) {
    this.logs = [];
    this.logger = logger;
  }

  // keep a copy of each promise
  log = (...params) => this.logs.push(this.logger.log(...params))

  // await this to know when all logs have been sent
  sync = () => Promise.all(this.logs)
}

Then you could create a singleton that is injected into other services/classes, like:

// pass this to whichever services do the logging
const logger = new Logger(new Timber(optionsGoHere))

Then you'd just log as normal, but via logger:

logger.log(logDataGoesHere)

And whenever you want to confirm that all logs have been persisted:

await logger.sync()

I mean you batch all logs anyway and send them all together. Why not have the sync function that is triggered by batch manually and reset the timer?

It's actually slightly more involved than that, due to a combination of batching and throttling.

Logs are batched and sent according to the batchSize, batchInterval, and syncMax properties (you can read about the options here)

There can be multiple requests in-flight at any one time, depending on how these parameters are set.

For example, a batchSize of 10, a batchInterval of 100(ms) and a syncMax of 10 means that 10 logs that are fired in a 50ms interval would result in a request to Timber.io.

Likewise, 10 logs fired over a 150ms interval would actually arrive in 2 batches of 5, since the 100ms upper limit for a 'collection window' of logs would have been exceeded and a request would have been sent.

At the logger level, we're not currently tracking 'pending' logs. Instead, they're passed over to the throttle and batch helpers, which manage their pending workloads and ultimately resolve the log's original promise -- which ultimately bubbles up to the original await timber.log() call.

We could make some changes to the core Base class to track pending logs and provide the wait()-style function you're talking about. This shouldn't require too many changes, but would need to be created and test internally first, so there's likely to be some lead time before this is merged and released.

In the meantime, I'll re-open to track the feature and would propose a pattern such as the one above to keep you moving in the interim 👍

MickL · 2020-01-02T13:07:53Z

Thanks for the fast and detailed answer! This might not be the best solution, but it is a solution I could implement for now.

Instead of the requested "sync manually now" function, a simple function "are there any unsynced logs?" that returns a promise that resolves when everything open is synced would be nice, too.

leebenson · 2020-01-02T13:09:27Z

Instead of the requested "sync manually now" function, a simple function "are there any unsynced logs?" that returns a promise that resolves when everything open is synced would be nice, too.

👍 that's actually exactly what I had mind. Should be relatively simple to add; just wanted to make sure you weren't blocked in the meantime while waiting for it to land.

MickL · 2020-01-02T13:41:56Z

That would be nice. In a serverless environment one probably want to totally disable throttled syncing and sync manually at the end of a request, ideally after the response has been already sent. At least for me it would not make sense to send logs in between the running process, and i cant think of a good batchInterval.

If I set it too high, like 1s, the process has to wait a long time after it is finished. If I set it too small, like 50ms, logs are being sent while the process is still running and wasting performance.

So it would be nice to have both:

Function to check if there are any unsynced logs
Disable automatic syncing + Function to sync manually

leebenson closed this as completed Jan 2, 2020

leebenson mentioned this issue Jan 2, 2020

XMLHttpRequest is not defined - Does not work on Cloudflare Worker #72

Closed

leebenson reopened this Jan 2, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sync manually #73

Sync manually #73

MickL commented Jan 1, 2020

leebenson commented Jan 2, 2020

MickL commented Jan 2, 2020

leebenson commented Jan 2, 2020 •

edited

Loading

MickL commented Jan 2, 2020

leebenson commented Jan 2, 2020

MickL commented Jan 2, 2020

leebenson commented Jan 2, 2020

MickL commented Jan 2, 2020 •

edited

Loading

Sync manually #73

Sync manually #73

Comments

MickL commented Jan 1, 2020

leebenson commented Jan 2, 2020

MickL commented Jan 2, 2020

leebenson commented Jan 2, 2020 • edited Loading

MickL commented Jan 2, 2020

leebenson commented Jan 2, 2020

MickL commented Jan 2, 2020

leebenson commented Jan 2, 2020

MickL commented Jan 2, 2020 • edited Loading

leebenson commented Jan 2, 2020 •

edited

Loading

MickL commented Jan 2, 2020 •

edited

Loading