Flow
sklearn.cluster.hierarchical.FeatureAgglomeration

sklearn.cluster.hierarchical.FeatureAgglomeration

Visibility: public Uploaded 14-08-2021 by Sergey Redyuk sklearn==0.19.1 numpy>=1.8.2 scipy>=0.13.3 0 runs
0 likes downloaded by 0 people 0 issues 0 downvotes , 0 total downloads
  • openml-python python scikit-learn sklearn sklearn_0.19.1
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Agglomerate features. Similar to AgglomerativeClustering, but recursively merges features instead of samples.

Parameters

affinityMetric used to compute the linkage. Can be "euclidean", "l1", "l2", "manhattan", "cosine", or 'precomputed' If linkage is "ward", only "euclidean" is accepteddefault: "euclidean"
compute_full_treeStop early the construction of the tree at n_clusters. This is useful to decrease computation time if the number of clusters is not small compared to the number of features. This option is useful only when specifying a connectivity matrix. Note also that when varying the number of clusters and using caching, it may be advantageous to compute the full tree linkage : {"ward", "complete", "average"}, optional, default "ward" Which linkage criterion to use. The linkage criterion determines which distance to use between sets of features. The algorithm will merge the pairs of cluster that minimize this criterion - ward minimizes the variance of the clusters being merged - average uses the average of the distances of each feature of the two sets - complete or maximum linkage uses the maximum distances between all features of the two setsdefault: "auto"
connectivityConnectivity matrix. Defines for each feature the neighboring features following a given structure of the data This can be a connectivity matrix itself or a callable that transforms the data into a connectivity matrix, such as derived from kneighbors_graph. Default is None, i.e, the hierarchical clustering algorithm is unstructureddefault: null
linkagedefault: "average"
memoryUsed to cache the output of the computation of the tree By default, no caching is done. If a string is given, it is the path to the caching directorydefault: null
n_clustersThe number of clusters to finddefault: 2
pooling_funcThis combines the values of agglomerated features into a single value, and should accept an array of shape [M, N] and the keyword argument `axis=1`, and reduce it to an array of size [M].default: {"oml-python:serialized_object": "function", "value": "numpy.mean"}

0
Runs

List all runs
Parameter:
Rendering chart
Rendering table