Skip to content

Latest commit

 

History

History
164 lines (139 loc) · 5.18 KB

File metadata and controls

164 lines (139 loc) · 5.18 KB

ML API's

Pre-trained ML API’s

  • For App Developers

Sight

Vision AI

  • Image Recognition/analysis
  • Label Detection
    • Extracts info in image across categories
  • Text Detection (OCR)
    • Detect and extract text from images
  • Safe Search
    • Recognize explicit content
  • Landmark Detection
  • Logo Detection
  • Image Properties
    • Dominant colors, pixel counts
  • Crop Hints
    • Crop coordinates of dominant object/face
  • Web Detection
    • Find matching web entries
  • Object Localizer
    • Returns labels and bounding boxes for detected objects.
  • Product Search
    • Uses image and specific region(s) or largest object of interest to return matching items from product set.

AutoML Vision

  • Object Detection
    • Bounding box smart multi-object detection, Google Vision API on steroids.
  • Edge
    • The IoT version of Vision detection for Edge Devices.
    • Optimized to achieve high accuracy for low latency use cases on memory-constrained devices.
    • Use Edge Connect to securely deploy the AutoML model to IoT devices (such as Edge TPUs, GPUs, and mobile devices) and run predictions locally on the device.

Video Intelligence API

  • Has pre-trained models that recognize a vast number of objects, places, and actions in stored and streaming video.
  • Labels, shot changes, explicit content, subtitles
  • Use cases:
    • Content moderation
    • Recommended content
    • Media archives
    • Contextual advertisements

AutoML Video Intelligence

  • Video media tagging.
  • Train custom video classification models.
  • Ideal for projects that require custom labels which aren’t covered by the pre-trained Video Intelligence API.
  • Detect shot changes
    • Detect scene changes in a segment or throughout the video.

Language

Natural Language API

  • Syntax analysis
  • Entity analysis
  • Sentiment analysis
  • Content classification
  • Multi-language

AutoML Natural Language

  • Handling things like domain specific sentiment analysis and more.
  • Can classifies text using own custom labels.

Translation API

  • Detect and translate languages
  • Beta:
    • Glossary
    • Batch translations

AutoML Translation

  • Upload translated language pairs -> Train -> Evaluate

Conversation

Cloud Speech-to-Text API

  • Convert audio to text
  • Multi-lingual support
  • Understand sentence structure

Cloud Text-to-Speech API

  • Convert text to audio
  • Multiple languages/voices
  • Natural sounding synthesis

Dialogflow Enterprise Edition

  • Conversational experiences
  • Virtual assistants
  • Sentiment Analysis
    • Model chat-oriented conversations and responses, to assist you as you build interactive chatbots.
  • Text-to-Speech
    • Chatbots trigger synthesized speech for more natural user interaction.

Cloud AutoML

  • Enables developers with limited machine learning expertise to train high-quality models specific to their business needs.
  • Relies on transfer learning and neural architecture search technology.

AutoML Tables

  • Workflow:
    • Table input
    • Define data schema and labels
    • Analyze input features
    • Train (automatic)
      • Feature engineering
        • Normalize and bucketize numeric features
        • Create one-hot encoding and embeddings for categorical features
        • Perform basic processing for text features
        • Extract date- and time-related features from Timestamp columns.
      • Model selection
        • Parallel model testing
          • Linear
          • Feedforward deep neural network
          • Gradient Boosted Decision Tree
          • AdaNet
          • Ensembles of various model architectures
      • Hyperparameter tuning
    • Evaluate model behavior
    • Deploy
  • Structured Data
    • Can use data from BigQuery or GCS (CSV)

AutoML Tables vs BigQuery ML

  • BQ
    • More focused on rapid experimentation or iteration with what data to include in the model and want to use simpler model types for this purpose.
      • Can potentially return model in minutes
  • AutoML
  • Have finalized the data.
  • Optimizing for maximizing model quality without needing to manually do feature engineering, model selection, ensembling, and so on.
  • Willing to wait longer to attain that model quality.
    • Takes at least an hour to train.
  • Have a wide variety of feature inputs (beyond numbers and classes) that would benefit from the additional automated feature engineering that AutoML Tables provides.

Cloud Job Discovery

  • More relevant job searches
  • Power recruitment, job boards

Basic Steps for Most APIs

  • Enable API
  • Create API key
  • Authenticate with API key
  • Encode in base64 (optional)
  • Make an API request
  • Requests and outputs via JSON

Structured Data

  • AutoML Tables
  • Cloud Inference API
    • Quickly run large scale correlations over types time series data.
  • Recommendations AI (Beta)
  • BigQuery ML (beta)

Cost

  • Pay per API request per feature
  • Feature as in Landmark Detection

How to convert images, video, etc for use with API?

  • Can use Cloud Storage URI for GCS stored objects
  • Encode in base64 format

How to combine API’s for scenarios?

  • Search customer service calls and analyze sentiment
    • Speech to Text then Sentiment Analysis with Natural Language