logo

oneDAL documentation

Introduction

  • Installation

Developer Guide

  • Data Analytics Pipeline
  • oneAPI Interfaces
    • Introduction
      • Get Started with oneDAL
      • Build applications with oneDAL
      • Glossary
      • Mathematical Notations
    • Computational Modes
    • Data Management
      • Array
      • Accessors
        • Column accessor
        • Row accessor
      • Data Sources
        • CSV data source
      • Graphs
        • Undirected adjacency vector graph
        • Directed adjacency vector graph
      • Tables
        • Homogeneous table
    • Algorithms
      • Clustering
        • DBSCAN
        • K-Means
        • K-Means initialization
      • Covariance
        • Covariance
      • Decomposition
        • Principal Components Analysis (PCA)
      • Ensembles
        • Decision Forest Classification and Regression (DF)
      • Graph
        • Subgraph Isomorphism
        • Connected Components
      • Kernel Functions
        • Linear kernel
        • Polynomial kernel
        • Radial Basis Function (RBF) kernel
        • Sigmoid kernel
      • Nearest Neighbors (kNN)
        • k-Nearest Neighbors Classification and Search (k-NN)
      • Pairwise Distances
        • Minkowski distance
        • Chebyshev distance
        • Cosine distance
      • Statistics
        • Basic Statistics
      • Support Vector Machines
        • Support Vector Machine Classifier and Regression (SVM)
    • Single Program Multiple Data
    • oneAPI Examples
      • DPC++
        • basic_statistics_dense_batch.cpp
        • column_accessor_homogen.cpp
        • cor_dense_batch.cpp
        • cov_dense_batch.cpp
        • dbscan_brute_force_batch.cpp
        • df_cls_hist_batch.cpp
        • df_cls_traverse_model.cpp
        • df_reg_hist_batch.cpp
        • df_reg_traverse_model.cpp
        • kmeans_init_dense.cpp
        • kmeans_lloyd_dense_batch.cpp
        • knn_cls_brute_force_dense_batch.cpp
        • knn_reg_brute_force_dense_batch.cpp
        • knn_search_brute_force_dense_batch.cpp
        • linear_kernel_dense_batch.cpp
        • linear_regression_dense_batch.cpp
        • pca_cor_dense_batch.cpp
        • rbf_kernel_dense_batch.cpp
        • svm_two_class_thunder_dense_batch.cpp
      • C++
        • basic_statistics_dense_batch.cpp
        • column_accessor_homogen.cpp
        • connected_components_batch.cpp
        • cor_dense_batch.cpp
        • cov_dense_batch.cpp
        • dbscan_brute_force_batch.cpp
        • df_cls_dense_batch.cpp
        • df_reg_dense_batch.cpp
        • directed_graph.cpp
        • graph_service_functions.cpp
        • jaccard_batch.cpp
        • jaccard_batch_app.cpp
        • kmeans_init_dense.cpp
        • kmeans_lloyd_dense_batch.cpp
        • knn_cls_brute_force_dense_batch.cpp
        • knn_cls_kd_tree_dense_batch.cpp
        • knn_search_brute_force_dense_batch.cpp
        • linear_kernel_dense_batch.cpp
        • linear_regression_dense_batch.cpp
        • louvain_batch.cpp
        • pca_dense_batch.cpp
        • polynomial_kernel_dense_batch.cpp
        • rbf_kernel_dense_batch.cpp
        • shortest_paths_batch.cpp
        • sigmoid_kernel_dense_batch.cpp
        • subgraph_isomorphism_batch.cpp
        • svm_multi_class_thunder_dense_batch.cpp
        • svm_nu_cls_thunder_dense_batch.cpp
        • svm_nu_reg_thunder_dense_batch.cpp
        • svm_reg_thunder_dense_batch.cpp
        • svm_two_class_smo_dense_batch.cpp
        • svm_two_class_thunder_dense_batch.cpp
        • triangle_counting_batch.cpp
    • Appendix
      • Decision Tree
      • k-d Tree
  • DAAL Interfaces
    • CPU and GPU Support
    • Library Usage
      • Algorithms
      • Computation Modes
      • Training and Prediction
        • Classification Usage Model
        • Regression Usage Model
        • Recommendation Systems Usage Model
    • Data Management
      • Numeric Tables
        • Generic Interfaces
        • Essential Interfaces for Algorithms
        • Types of Numeric Tables
      • Data Sources
      • Data Dictionaries
      • Data Serialization and Deserialization
      • Data Compression
      • Data Model
    • Analysis
      • K-Means Clustering
        • Batch Processing
        • Distributed Processing
        • Batch Processing
        • Distributed Processing
      • Density-Based Spatial Clustering of Applications with Noise
        • Batch Processing
        • Distributed Processing
      • Correlation and Variance-Covariance Matrices
        • Batch Processing
        • Online Processing
        • Distributed Processing
      • Principal Component Analysis
        • Batch Processing
        • Online Processing
        • Distributed Processing
      • Principal Components Analysis Transform
      • Singular Value Decomposition
        • Batch and Online Processing
        • Distributed Processing
      • Association Rules
      • Kernel Functions
      • Expectation-Maximization
      • Cholesky Decomposition
      • QR Decomposition
        • QR Decomposition without Pivoting
        • Pivoted QR Decomposition
      • Outlier Detection
        • Multivariate Outlier Detection
        • Multivariate BACON Outlier Detection
        • Univariate Outlier Detection
      • Distance Matrix
        • Correlation Distance Matrix
        • Cosine Distance Matrix
      • Distributions
        • Uniform Distribution
        • Normal Distribution
        • Bernoulli Distribution
      • Engines
        • mt19937
        • mcg59
        • mt2203
      • Moments of Low Order
        • Batch Processing
        • Online Processing
        • Distributed Processing
      • Quantile
      • Quality Metrics
        • Working with the Default Metric Set
        • Working with User-defined Quality Metrics
      • Sorting
      • Normalization
        • Z-score
        • Min-max
      • Optimization Solvers
        • Objective Function
        • Iterative Solver
    • Training and Prediction
      • Decision Forest
        • Decision Forest
        • Regression Decision Forest
        • Classification Decision Forest
      • Decision Trees
        • Decision Tree
        • Regression Decision Tree
        • Classification Decision Tree
      • Gradient Boosted Trees
        • Gradient Boosted Trees
        • Regression Gradient Boosted Trees
        • Classification Gradient Boosted Trees
      • Stump
        • Classification Stump
        • Regression Stump
      • Linear and Ridge Regressions
        • Linear Regression
        • Ridge Regression
        • Linear and Ridge Regressions Computation
      • LASSO and Elastic Net Regressions
        • LASSO
        • Elastic Net
        • LASSO and Elastic Net Computation
      • k-Nearest Neighbors (kNN) Classifier
      • Implicit Alternating Least Squares
        • Batch Processing
        • Distributed Processing
        • Batch Processing
        • Distributed Processing: Training
        • Distributed Processing: Prediction of Ratings
      • Logistic Regression
      • Naïve Bayes Classifier
        • Batch Processing
        • Online Processing
        • Distributed Processing
      • Support Vector Machine Classifier
      • Multi-class Classifier
      • Boosting
        • AdaBoost Classifier
        • AdaBoost Multiclass Classifier
        • BrownBoost Classifier
        • LogitBoost Classifier
    • Services
      • Extracting Version Information
      • Handling Errors
      • Managing Memory
      • Managing the Computational Environment
      • Providing a Callback for the Host Application
  • Bibliography

API Reference

  • C++ API
    • Data Management
      • Array
      • Accessors
        • Column accessor
        • Row accessor
      • Data Sources
        • CSV data source
      • Graphs
        • Undirected adjacency vector graph
        • Directed adjacency vector graph
      • Graph Service
        • Undirected adjacency vector graph service
        • Directed adjacency vector graph service
      • Tables
        • Homogeneous table
    • Algorithms
      • Clustering
        • DBSCAN
        • K-Means
        • K-Means initialization
      • Covariance
        • Covariance
      • Decomposition
        • Principal Components Analysis (PCA)
      • Ensembles
        • Decision Forest Classification and Regression (DF)
      • Graph
        • Subgraph Isomorphism
        • Connected Components
      • Kernel Functions
        • Linear kernel
        • Polynomial kernel
        • Radial Basis Function (RBF) kernel
        • Sigmoid kernel
      • Nearest Neighbors (kNN)
        • k-Nearest Neighbors Classification (k-NN)
      • Pairwise Distances
        • Minkowski distance
        • Chebyshev distance
        • Cosine distance
      • Statistics
        • Basic Statistics
      • Support Vector Machines
        • Support Vector Machine Classifier (SVM)
    • Distributed Model: Single Process Multiple Data
      • Distributed SPMD model
      • Communicators

Notes

  • Known Issues
    • oneAPI examples failing with segfault
    • PCA examples failing on GPU devices
    • Static debug configuration not working
    • DPC++ examples not working
    • Incorrect linker behavior
    • Level Zero runtime dependency
    • Potential performance degradations
    • Undeclared identifier CL_DEVICE_IL_VERSION_KHR
    • DLL file not found
  • Notices and Disclaimers

Contributing Guide

  • Coding Guidelines
Theme by the Executable Book Project

Analysis¶

  • K-Means Clustering
  • Density-Based Spatial Clustering of Applications with Noise
  • Correlation and Variance-Covariance Matrices
  • Principal Component Analysis
  • Principal Components Analysis Transform
  • Singular Value Decomposition
  • Association Rules
  • Kernel Functions
  • Expectation-Maximization
  • Cholesky Decomposition
  • QR Decomposition
  • Outlier Detection
  • Distance Matrix
  • Distributions
  • Engines
  • Moments of Low Order
  • Quantile
  • Quality Metrics
  • Sorting
  • Normalization

Optimization Solvers

  • Optimization Solvers
    • Objective Function
      • Computation
      • Sum of Functions
      • Mean Squared Error Algorithm
      • Objective Function with Precomputed Characteristics Algorithm
      • Logistic Loss
      • Cross-entropy Loss
    • Iterative Solver
      • Computation
      • Limited-Memory Broyden-Fletcher-Goldfarb-Shanno Algorithm
      • Stochastic Gradient Descent Algorithm
      • Adaptive Subgradient Method
      • Coordinate Descent Algorithm
      • Stochastic Average Gradient Accelerated Method

previous

Data Model

next

K-Means Clustering

© Copyright 2014 - 2021, Intel Corporation.

Cookies