Imply Help Center Help Center home page
Submit a request
Sign in
  1. Imply Help Center
  2. Customer Knowledge Base
  3. Data Ingestion

Data Ingestion

  • What happens when changing granularity during data ingestion?
  • Reduced throughput from Kafka/Kinesis indexing service due to WARN "Ingestion was throttled.. because persists were pending"
  • Hadoop Ingestion failed due to error "io.druid.java.util.common.ISE: Job[class io.druid.indexer.DetermineHashedPartitionsJob] failed!"
  • How to estimate cost savings from rollup aggregation
  • Ingestion failed from AWS S3 due to error "java.net.SocketException: Connection reset"
  • Data ingestion failure due to error `Numeric columns do not support multivalue rows`
  • Avro ingestion from Kafka using Confluent Schema Registry
  • Kafka Indexing -> Segment Flow
  • Kerberized Kafka Ingestion
  • Issue: Indexing tasks from Kafka or Kinesis are finishing successfully, but without any data ingested.
  • Unable to append new rows from Kafka indexing when segments already exist
  • Temporary space issue with Dimension Partitioning on EMR instances in Hadoop_index job
  • MapR Hadoop as deep storage for Druid cluster using mapr client
  • Deep storage data/segment deletion using coordinator kill
  • Kafka, SASL / SCRAM credentials in supervisor spec
  • Sample ingestion snippet - transformSpec 'if, then, else' and 'like'
  • Using single-dimension partitioning to improve read performance
  • Starting sequenceNumber [1234] is no longer available for partition [shardId-xxxx] (earliest: [null]) and resetOffsetAutomatically is not enabled
  • How to update heap memory, direct memory for a specific Kafka/kinesis supervisor
  • Assigning ingestion jobs to run in a specific tier of middleManager nodes based on worker category in Cloud
  • How to avoid index_parallel deadlocks in a busy cluster
  • Explanation of Supervisor Exceptions: LOST_CONNECTION_WITH_STREAM, UNABLE_TO_CONNECT_TO_STREAM, UNHEALTHY_TASKS
  • Inconsistent number of rows on the Druid console post re-indexing Job
  • "No such previous checkpoint found" error in streaming Supervisor
  • ERROR :403 , "errorMessage":"Unauthorized" when submitting MSQ job via API
  • MSQ Ingestion failing with java.lang.NoClassDefFoundError: org/apache/hadoop/fs/FileSystem
  • Historical process fails to come up and reports "Packet len is out of range!"
Imply Help Center