
2 posts tagged with "Lightfeed Database"


Introducing Lightfeed API and SDK

· 4 min read
Lightfeed Team

Today we're launching the Lightfeed API along with official TypeScript and Python SDKs (GitHub). These tools provide direct access to structured web data that's been extracted through our platform, making it easy to bring this data into your applications.


Web Data Made Simple

The Lightfeed API offers key advantages that transform how you work with web data:

  • Instant data retrieval through simple API calls, with none of the wait times and costs of traditional crawl-and-scrape operations: your data is pre-extracted and ready to search
  • Intelligent extraction, no manual maintenance: our AI-powered extraction can search connected pages, reason over their context, and extract data from it. It adapts to website changes automatically, so there is no need to maintain scraping scripts by hand.
  • Clean, consistent, and up-to-date data from any list of websites

This approach shifts your focus from data collection infrastructure to actually using the insights from your web data in your applications.

Key Features

Semantic search finds content based on meaning, not just keywords.

Tip: This API uses AI language model embeddings to understand the meaning behind your search queries rather than just matching keywords. When you search for "innovative AI solutions", it can find records about "generative AI solutions" or "large language models" because it understands that these concepts are related. It can return more accurate matches than traditional search engines, which rely primarily on exact keyword matching.

import { LightfeedClient } from "lightfeed";

// Initialize client with your API key
const client = new LightfeedClient({
  apiKey: "YOUR_API_KEY",
});

const response = await client.searchRecords("your-database-id", {
  search: {
    text: "innovative AI solutions",
    threshold: 0.2,
  },
});

console.log(`Found ${response.results.length} matching records`);
console.log(response.results);
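
To make the idea of embedding-based matching concrete, here is a minimal local sketch. It is not Lightfeed's implementation: the three-dimensional vectors are made-up toy values, `cosineSimilarity` and `semanticFilter` are hypothetical helpers, and treating `threshold` as a minimum cosine similarity is an assumption used only for illustration.

```typescript
// Toy illustration of embedding search. The vectors are invented;
// a real embedding model produces hundreds of dimensions.
type Embedded = { text: string; vector: number[] };

// Cosine similarity: 1 means same direction, 0 means unrelated.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error("Vectors must have the same length");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Keep records whose similarity to the query meets the threshold
// (assumed semantics), ordered from most to least similar.
function semanticFilter(
  query: number[],
  records: Embedded[],
  threshold: number
): Embedded[] {
  return records
    .map((record) => ({ record, score: cosineSimilarity(query, record.vector) }))
    .filter(({ score }) => score >= threshold)
    .sort((x, y) => y.score - x.score)
    .map(({ record }) => record);
}

// Hypothetical embedding of the query "innovative AI solutions".
const queryVector = [1, 0, 0];

const records: Embedded[] = [
  { text: "large language models", vector: [0.7, 0.7, 0] },
  { text: "generative AI solutions", vector: [0.9, 0.1, 0] },
  { text: "quarterly earnings report", vector: [0, 0, 1] },
];

const matches = semanticFilter(queryVector, records, 0.2);
console.log(matches.map((m) => m.text));
```

Neither matching record shares the word "innovative" with the query; they are kept because their vectors point in a similar direction, while the unrelated record falls below the threshold.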

Flexible Filtering

Find exactly what you need by applying specific conditions to narrow down results.

import { LightfeedClient } from "lightfeed";

const client = new LightfeedClient({
  apiKey: "YOUR_API_KEY",
});

const response = await client.filterRecords("your-database-id", {
  filter: {
    rules: [
      {
        column: "industry",
        operator: "equals",
        value: "Finance",
      },
      {
        column: "employees",
        operator: "greater_than",
        value: 100,
      },
    ],
  },
  pagination: {
    limit: 100,
  },
});

console.log(
  `Retrieved ${response.results.length} finance companies with more than 100 employees`
);
console.log(response.results);
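
To show what the two rules above select, the sketch below evaluates the same kind of rules against a few in-memory rows. It is an illustration only: the `Rule` type, the `matchesRule` and `applyRules` helpers, and the sample rows are hypothetical, and combining rules with AND is an assumption based on the log message in the example, not confirmed API behavior.

```typescript
// Local stand-in for rule evaluation; not the Lightfeed implementation.
type Operator = "equals" | "greater_than";
type Rule = { column: string; operator: Operator; value: string | number };
type Row = Record<string, string | number>;

function matchesRule(row: Row, rule: Rule): boolean {
  const cell = row[rule.column];
  if (rule.operator === "equals") {
    return cell === rule.value;
  }
  // "greater_than" is strict: a value equal to the bound does not match.
  return (
    typeof cell === "number" &&
    typeof rule.value === "number" &&
    cell > rule.value
  );
}

// Assumed semantics: a row is kept only if every rule matches (logical AND).
function applyRules(rows: Row[], rules: Rule[]): Row[] {
  return rows.filter((row) => rules.every((rule) => matchesRule(row, rule)));
}

const rules: Rule[] = [
  { column: "industry", operator: "equals", value: "Finance" },
  { column: "employees", operator: "greater_than", value: 100 },
];

// Invented sample data.
const rows: Row[] = [
  { name: "Acme Bank", industry: "Finance", employees: 250 },
  { name: "Tiny Fund", industry: "Finance", employees: 100 },
  { name: "Big Retail", industry: "Retail", employees: 5000 },
];

const kept = applyRules(rows, rules);
console.log(kept.map((row) => row.name));
```

Note that a finance company with exactly 100 employees is excluded here, because `greater_than` is modeled as a strict comparison.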

Time-Range Queries

Access data from specific time periods to track changes over time.

import { LightfeedClient } from "lightfeed";

const client = new LightfeedClient({
  apiKey: "your-api-key",
});

const response = await client.getRecords("your-database-id", {
  start_time: "2024-12-01T00:00:00Z",
  end_time: "2024-12-31T23:59:59Z",
  limit: 100,
});

console.log(`Retrieved ${response.results.length} records`);
console.log(response);
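
Hand-writing the timestamps gets error-prone around month lengths and leap years. Below is a small sketch of a helper that builds the `start_time` and `end_time` strings for a calendar month in UTC, in the same format as the example above. `monthRangeUtc` is a hypothetical helper, not part of the SDK.

```typescript
// Build ISO 8601 UTC bounds for a calendar month (month is 1-12).
function monthRangeUtc(
  year: number,
  month: number
): { start_time: string; end_time: string } {
  if (!Number.isInteger(month) || month < 1 || month > 12) {
    throw new Error("month must be an integer from 1 to 12");
  }
  const start = new Date(Date.UTC(year, month - 1, 1));
  // Date.UTC rolls month index 12 over to January of the next year.
  const nextMonthStart = new Date(Date.UTC(year, month, 1));
  // One second before the next month begins.
  const end = new Date(nextMonthStart.getTime() - 1000);

  // Drop the milliseconds that toISOString() adds, to match "…:59Z".
  const toSeconds = (d: Date): string =>
    d.toISOString().replace(/\.\d{3}Z$/, "Z");

  return { start_time: toSeconds(start), end_time: toSeconds(end) };
}

const december = monthRangeUtc(2024, 12);
console.log(december);
```

The returned object can be spread into the options passed to `getRecords`, alongside `limit`.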

Getting Started

Ready to build with web data today? Getting started with the Lightfeed API is simple:

  1. Generate your API key in the Lightfeed dashboard
  2. Install our client library:
    npm install lightfeed    # JavaScript/TypeScript
    pip install lightfeed    # Python
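
Once you have a key, one common pattern is to keep it out of source code and read it from the environment. The sketch below assumes a variable named `LIGHTFEED_API_KEY`; that name and the `readApiKey` helper are illustrative choices, not something the SDK defines.

```typescript
// Read an API key from an environment-like object, failing early if absent.
// The default variable name is a hypothetical convention.
function readApiKey(
  env: Record<string, string | undefined>,
  name: string = "LIGHTFEED_API_KEY"
): string {
  const value = env[name]?.trim();
  if (!value) {
    throw new Error(
      `Missing ${name}; generate a key in the Lightfeed dashboard first`
    );
  }
  return value;
}

// In Node.js you would pass process.env; a plain object is used here
// so the sketch runs anywhere.
const apiKey = readApiKey({ LIGHTFEED_API_KEY: "  example-key  " });
console.log(apiKey.length);
```

The resulting string can then be passed as `apiKey` when constructing `LightfeedClient`.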


Introducing Lightfeed Extract

· 3 min read
Lightfeed Team

We're thrilled to launch Lightfeed Extract, a powerful, business-grade web data extraction tool that turns any website into clean, structured, and up-to-date data, all from a simple prompt.


Say goodbye to custom scrapers, brittle workflows, and writing code. Lightfeed handles the heavy lifting and, even better, keeps your data fresh in a continuously maintained, queryable database.

The Web Data Challenge

If you need clean structured data from websites - whether tracking competitors, monitoring pricing trends, extracting business intelligence, training AI models, or powering applications - you're probably familiar with the limitations of existing tools:

Common Extraction Pain Points

  • Manual Scraping and Maintenance: Traditional scrapers require custom code for each website and break when layouts change - forcing teams to constantly rewrite and fix code instead of focusing on business goals.

  • Limited Extraction Depth: Most tools only extract data from specified URLs, missing critical information buried in subpages and linked content.

  • No Integrated Database: Most scrapers don't provide a persistent database, which forces slow, repeated website crawling for each data request instead of fast queries and makes it impossible to track changes, search historic data, or quickly find relevant information.

  • Data Quality Issues: Raw extracted data requires significant post-processing to clean, normalize, and deduplicate - creating additional engineering complexity and introducing potential errors.

  • Anti-Scraping Measures: Modern websites implement various protection mechanisms - including CAPTCHAs, request throttling, and automated bot detection - making reliable data collection increasingly challenging.

The Lightfeed Solution

Lightfeed transforms how organizations extract and maintain clean, structured and up-to-date web data at scale. Our platform leverages Large Language Models (LLMs) and AI agents that can read, understand and interact with website content, making data extraction reliable and fully automated.

Key Benefits

Adaptive AI Extraction

Extract data from any website using simple natural language instructions, without writing code. The extraction adapts to website changes automatically.

Deep Content Discovery and Enrichment

Automatically extract data from linked pages and subpages, while enriching information from multiple sources and third-party websites to create comprehensive datasets.

Fast Database Access

Access consistently up-to-date structured data through instant queries instead of slow crawling, with built-in AI search capabilities to track changes and find the most relevant information.

Automated Data Processing

Get clean, normalized data with automatic deduplication and formatting.

Reliable Scraping

Extract data consistently even from the hardest websites—solving CAPTCHAs automatically and using premium proxies to bypass anti-bot measures.

Getting Started with Lightfeed Extract

Ready to transform how you extract structured data from the web?