Near Real-Time Log Collection From Amazon S3 Storage
Sign Up Free Request Demo

Near Real-Time Log Collection From Amazon S3 Storage

We are very excited to announce a new capability for our Amazon S3 sources.

Until recently, the only method Sumo Logic used for discovering new data in an S3 bucket was periodic polling. However, with our new notification-based approach, users can now configure S3 sources such that Sumo Logic is notified immediately (via AWS SNS) whenever a new item is added to an S3 bucket, eliminating the need to wait for new objects to be discovered via periodic polling.

This capability is available today for all S3 sources, either new or pre-existing.

Benefits of Notification-Based Approach

Polling is still an effective approach in many cases. If one’s bucket size is not very large, or the bucket is regularly groomed, polling could still be the right choice. It is easier to set up, as you don’t need to configure SNS notifications. For very large buckets, however, a polling-based approach introduces significant lag. With the new notification-based approach, users get near-real time data and better reliability.

Near Real-time Data

With SNS notifications powering object discovery, Sumo Logic is always notified of new S3 objects within seconds, regardless of how many other objects are present in the bucket. This guarantees near real-time data collection from S3 buckets of any size. So with this approach, data from S3 buckets can be reliably used for real-time alerting.

Best of Both Worlds

Sumo Logic can now offer a unique hybrid approach combining SNS notifications and a poller, thus providing the best of both worlds —  speed and a data reliability guarantee.

This places us ahead of our competitors who are using just one of these methods. Maintaining a polling system as a backup provides two advantages:

  • Capability to discover and collect historical data already present in an S3 bucket.
  • Guarantee of 100 percent data discovery even in the event of a service disruption or missing SNS notifications. The polling system works as a backup in this case.

Architecture

The above diagram explains how the notification based approach works. The workflow can be summarized as: S3 bucket -> SNS topic -> SNS subscription -> Sumo Logic source.

An easy way to create and configure these AWS resources is to download the CloudFormation template provided while creating the source in Sumo Logic. The template can then be uploaded in your AWS console thereby completing the majority of the setup.

Setup

The notification-based discovery approach is easy to set up. The steps can be found in our documentation page.

Although SNS based discovery is an opt-in capability, we recommend using it for all your S3 sources. It is equally easy to set it up for your existing sources. A video showing the configuration steps can be found below:

Results

We used this feature internally to collect logs from our own production S3 buckets. We turned this feature on for the buckets containing ELB logs from various Sumo Logic deployments. After switching to notification-based discovery, we saw a drop in ingest delay from 8 minutes to 2 minutes. This drop in log collection latency has improved our agility when responding to production alerts and incidents. Our SRE teams are now able to identify issues more quickly and respond to them more proactively thanks to the reduction in delay.

We have heard the same story from our early adopting customers. During our customer beta, we saw a similar reduction in S3 ingest lag for those customers. In one particular case where the customer had a very large S3 bucket with hundreds of objects being added every second, we were able to reduce ingest lag from 7 hours down to couple of minutes.

Here is a graph showing the improvement in object discovery time for the above-mentioned customer:

Since making the new notification-based approach generally available for our customers, we’ve been seeing significant adoption. Want to try it out on your S3 sources? Log into the Sumo Logic platform today, and follow these steps.

If you don’t yet have a Sumo Logic account, you can sign up for a free trial today.

Additional Resources

  • Want to know more about scaling your Amazon S3 log management and storage? Check out our Amazon 101 guide to S3 logging!
  • Learn more about using the Sumo Logic app for Amazon S3 by visiting our app page.
  • For more technical articles like this, check out the Sumo Logic blog.

Get Started Today!

Sign up for your FREE Sumo Logic Trial.

Sign Up Free

Request A Free Sumo Logic Demo

Fill out the form below and a Sumo Logic representative will contact you to schedule your free demo.
“Sumo Logic brings everything together into one interface where we can quickly scan across 1,000 servers and gigabytes of logs and quickly identify problems. It’s awesome software and awesome support.”

Jon Dokuli,
VP of Engineering

Thank you for signing up for Sumo Logic.

We are creating your account now.
Please check your email.
Need more help? Contact Us
Sign up for Sumo Logic Free*
Sign up for Sumo Logic Free*
  • No credit card required to sign-up
  • Create your account in minutes
  • No expiration date*
  • *After 30 day trial period, reverts to Sumo Logic Free
    • Please Enter your email address.
    • Please enter a valid email address.
    • This email is already in use for another account.
    • Please use your company email to create an account.
    • Please agree to the Service License.
    • Free trial provisioning is temporarily offline, please call 855-LOG-SUMO to get started.
    View All Pricing Options
    Already have an account? Login