
Problem Files


Matching, Unclassifiable and Error files are collectively called Problem Files. Matching files contain classified data. Unclassifiable files could not be classified because they are password protected, KMS encrypted, or over the file size limit. Error files are those where the cross-account role is broken or the file no longer exists. This page lists the problem files found by Classification for Amazon S3, whether from new objects coming in or from the evaluation of your existing objects. The list is a running tally of every issue discovered so far. You can act on the files directly in the bucket and through this page. Whether a file has been cleaned and moved or deleted, the data tells you whether it still exists and needs to be dealt with or no longer exists.

The files are broken down by bucket and account so you know where the file entered and into which account.


Getting the Results Set

You will not see any results when you first land on this page. You must apply filters to get your working data set, and you can then search within those results. The filter takes three pieces of data: one or more accounts, the types of problem files, and the date range.


The Accounts (count in past 24h) field is a multi-select picker where you can choose one or more accounts to review. If you have many accounts, you can type into the field to narrow the account list. Each entry shows the account nickname, the account number and the number of problem files found in the last 24 hours.

The Problem File Types field is a multi-select picker where you can choose one or more of the problem types to review. By default, all three problem types are selected, but you can narrow this by clearing the types you do not want. There are three problem file types: Matching, Error and Unclassifiable.

The Date Range is a date/time calendar with presets where you can select the specific time range to evaluate. By default, the time selected is the last 24 hours.

With all three fields populated, the Apply Filters button will be enabled. Click the button to retrieve the filtered results.
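
The filter rules above can be sketched as a small data model. This is only an illustration of the described behavior, not the console's actual code: the `ProblemFileFilter` class, its field names and the `can_apply` method are hypothetical, and the defaults mirror the ones stated above (all three problem types selected, last 24 hours).

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta
from typing import List, Optional, Set

# The three problem types named on this page.
PROBLEM_TYPES = {"Matching", "Error", "Unclassifiable"}


@dataclass
class ProblemFileFilter:
    """Hypothetical model of the three filter fields."""
    accounts: List[str] = field(default_factory=list)
    problem_types: Set[str] = field(default_factory=lambda: set(PROBLEM_TYPES))
    start: Optional[datetime] = None
    end: Optional[datetime] = None

    @classmethod
    def with_defaults(cls, now: datetime) -> "ProblemFileFilter":
        # Defaults described above: all types selected, last 24 hours.
        return cls(start=now - timedelta(hours=24), end=now)

    def can_apply(self) -> bool:
        # Apply Filters is enabled only when all three fields are populated.
        has_accounts = len(self.accounts) > 0
        has_types = len(self.problem_types) > 0 and self.problem_types <= PROBLEM_TYPES
        has_range = (
            self.start is not None
            and self.end is not None
            and self.start <= self.end
        )
        return has_accounts and has_types and has_range
```

With the defaults in place, only the account selection is missing, which is why the button stays disabled until at least one account is chosen.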

Be Aware

Results may be truncated as follows:

  • 5000 results, at most, will be returned.
  • 1000 results, at most, will be returned per account.
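
To make the two limits concrete, here is a minimal sketch of how such caps could interact. The function name, the `account` key on each row and the order in which the caps are applied are assumptions for illustration; the page does not describe how the service actually enforces them.

```python
from collections import defaultdict

MAX_TOTAL = 5000        # overall cap stated above
MAX_PER_ACCOUNT = 1000  # per-account cap stated above


def truncate_results(rows):
    """Keep rows in order while honoring both caps (hypothetical helper).

    Each row is assumed to be a dict with an "account" key.
    """
    per_account = defaultdict(int)
    kept = []
    for row in rows:
        if len(kept) >= MAX_TOTAL:
            break  # overall cap reached
        account = row["account"]
        if per_account[account] >= MAX_PER_ACCOUNT:
            continue  # this account already contributed its maximum
        per_account[account] += 1
        kept.append(row)
    return kept
```

If a result set comes back with exactly 1000 rows for an account, or 5000 rows overall, it is worth narrowing the date range or the account selection to be sure nothing was cut off.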

So as not to "spam" you on the Problem Files page or in the scan results notifications, we throttle the number of entries written to both in two particular scenarios: 1) the bucket is encrypted and we do not have permissions to the key, so every object would fail to scan, and 2) access to a linked account has been broken, so we cannot retrieve the objects and every object would fail to scan.

In both scenarios, you could end up with thousands (or many more) of unclassifiable messages, depending on the object counts for the given bucket(s) or account(s). Instead, we write one message per hour for each unclassifiable scenario (per bucket and per account).
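
The one-message-per-hour behavior can be pictured as a keyed throttle. The sketch below is an assumption about how such a throttle might work, not the product's implementation: the class name, the scenario labels and the choice of `(scenario, account, bucket)` as the key are all hypothetical.

```python
from datetime import datetime, timedelta


class UnclassifiableThrottle:
    """Allow at most one message per window for each key (hypothetical)."""

    def __init__(self, window=timedelta(hours=1)):
        self.window = window
        self._last_written = {}  # key -> time of the last message written

    def should_write(self, scenario, account, bucket, now):
        # bucket may be None when the whole linked account is unreachable.
        key = (scenario, account, bucket)
        last = self._last_written.get(key)
        if last is not None and now - last < self.window:
            return False  # already reported within the last hour
        self._last_written[key] = now
        return True
```

Under this model, a bucket holding a million objects encrypted with an inaccessible key would produce at most 24 entries per day rather than a million.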

Searching the Data

The main data fields are presented in the table: object name, uploaded to bucket, account, result, scan on date, quarantine bucket and file exists check. File Exists indicates whether the file is still available.

The Search field above the filtered results is a global search across the entire table. Start typing a value for any of the fields and the table is filtered down to the matching rows. You can also separate search criteria with spaces to search by multiple values. For example, to search by a particular originating bucket and the infected status, your search bar might look like partial-bucket-name infected. That searches by both the bucket name and the status of infected, so you can drill down quickly to what you are looking for.

Searching Tips

You may see a short delay after typing in your search value since we won't actually trigger the search until typing has stopped for 1 second.
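
The delay described above is a debounce: the search runs only once typing has been quiet for one second. A minimal sketch follows, with time passed in explicitly so the behavior is easy to follow. The class and method names are hypothetical and are not taken from the product.

```python
class SearchDebouncer:
    """Release the latest search text only after a quiet period (hypothetical)."""

    def __init__(self, quiet_seconds=1.0):
        self.quiet_seconds = quiet_seconds
        self._pending = None         # latest text typed, not yet searched
        self._last_keystroke = None  # time of the most recent keystroke

    def on_input(self, text, now):
        # Every keystroke replaces the pending text and restarts the timer.
        self._pending = text
        self._last_keystroke = now

    def poll(self, now):
        # Return the text to search for, or None if typing is still in progress.
        if self._pending is None:
            return None
        if now - self._last_keystroke < self.quiet_seconds:
            return None
        text, self._pending = self._pending, None
        return text
```

Typing "v" at 0.0 s and "vi" at 0.5 s yields nothing when polled at 1.0 s, and releases "vi" once a full second has passed since the last keystroke.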

  • Match words out of order:
    For example, if you search for Virus Found, it matches a row containing the words Virus and Found, regardless of the order or position in which they appear in the table.
  • Partial word matching:
    Because filtering provides immediate feedback, parts of words can be matched in the result set. For example, Vir will match Virus.
  • Multiple Searches:
    The table provides functionality to enter multiple words separated by a space and the search will return all rows containing at least one of those words.
  • Preserved text:
    You can search for an exact phrase by enclosing the search text in double quotes. For example, "Virus Found" will match only text that contains the phrase Virus Found. It will not match Virus is in Found.
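
The tips above can be summarized in a short matching sketch. It is an approximation written for illustration, not the table's real search code: `parse_search` and `row_matches` are hypothetical names, and case-insensitive matching is an assumption. The `require_all` flag defaults to requiring every term, which fits the bucket-plus-status example; set it to False for the "at least one of those words" reading.

```python
import re


def parse_search(query):
    """Split a query into lowercase terms, keeping "quoted phrases" whole."""
    terms = []
    for phrase, word in re.findall(r'"([^"]*)"|(\S+)', query):
        term = (phrase or word).lower()
        if term:
            terms.append(term)
    return terms


def row_matches(row, query, require_all=True):
    """Check one table row (a dict of column -> value) against a query."""
    terms = parse_search(query)
    if not terms:
        return True  # an empty search shows every row
    # Global search: look across every column of the row at once.
    haystack = " ".join(str(value) for value in row.values()).lower()
    hits = (term in haystack for term in terms)  # substring = partial match
    return all(hits) if require_all else any(hits)
```

For a row whose result is Virus Found, the queries Found Virus, Vir and "Virus Found" all match, while a row whose result is Virus is in Found matches the first two but not the quoted phrase.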

Last update: May 20, 2022