Submitting Author: (@joshuabmoore)
All current maintainers: (@joshuabmoore, @benfulcher)
Package Name: pyhctsa
One-Line Description of Package: Python package for highly comparative time-series analysis
Repository Link: https://github.com/DynamicsAndNeuralSystems/pyhctsa
Version submitted: 0.1.2
EiC: @crhea93
Editor: TBD
Reviewer 1: TBD
Reviewer 2: TBD
Archive: TBD
JOSS DOI: TBD
Version accepted: TBD
Date accepted (month/day/year): TBD
Code of Conduct & Commitment to Maintain Package
Description
- Include a brief paragraph describing what your package does:
As time-series data grows increasingly massive and complex, researchers rely on extracting "features", statistical summaries of data patterns, to distill meaningful patterns and insights from these datasets.pyhctsa is a native Python library designed to bridge the gap between advanced time-series analysis and modern open-source workflows. It ports the massive library of the original and highly successful MATLAB-based "Highly Comparative Time-Series Analysis" (HCTSA) framework into Python, enabling users to automatically extract and compare thousands of diverse time-series features from their data.
Scope
Domain Specific
Community Partnerships
If your package is associated with an
existing community please check below:
-
For all submissions, explain how and why the package falls under the categories you indicated above. In your explanation, please address the following points (briefly, 1-2 sentences for each):
- Who is the target audience and what are scientific applications of this package?
The target audience of pyhctsa encompasses a broad spectrum of the global research community, ranging from data scientists and physicists to biologists and engineers. The impact of pyhctsa is expected to be as far-reaching as its predecessor, the original MATLAB HCTSA framework, for which scientific utility has been proven through a myriad of publications across diverse disciplines including neuroscience, medicine, geoscience, engineering, and others. By bringing the framework to the Python open-source ecosystem, the package will facilitate seamless integration with modern machine learning workflows and pipelines.
- Are there other Python packages that accomplish the same thing? If so, how does yours differ?
tsfresh - Limited set of time-series features, most of which comprise simple distributional, spectral and a few hand-selected information-theoretic measures.
tsfel - Limited in scope, with time-series feature set restricted to only a handful of basic distributional, spectral and only a handful of nonlinear analysis-based features.
Kats - Limited scope as far as breadth of time-series features. Package seems to not be maintained in the past 3-4 years. Many open issues which have not been resolved.
Compared to the above packages that rely on a narrow, a priori selection of descriptive statistics, pyhctsa provides a truly comprehensive, interdisciplinary library of over 4000 features, ranging from nonlinear dynamics and scaling laws, to information theory and model-based quantities. By casting a wider net across time-series analysis methods, pyhctsa shifts the focus from simple feature extraction to scientific discovery, enabling the identification of the dynamical "fingerprints" that underlie time-series data across diverse problems. As the Python successor to the MATLAB-based HCTSA, long considered the gold standard in highly comparative time-series analysis, pychtsa intends to uphold the same level of depth and rigour.
- If you made a pre-submission enquiry, please paste the link to the corresponding issue, forum post, or other discussion, or
@tag the editor you contacted:
n/a
Technical checks
For details about the pyOpenSci packaging requirements, see our packaging guide. Confirm each of the following by checking the box. This package:
Publication Options
JOSS Checks
Note: JOSS accepts our review as theirs. You will NOT need to go through another full review. JOSS will only review your paper.md file. Be sure to link to this pyOpenSci issue when a JOSS issue is opened for your package. Also be sure to tell the JOSS editor that this is a pyOpenSci reviewed package once you reach this step.
Are you OK with Reviewers Submitting Issues and/or pull requests to your Repo Directly?
This option will allow reviewers to open smaller issues that can then be linked to PR's rather than submitting a more dense text based review. It will also allow you to demonstrate addressing the issue via PR links.
Confirm each of the following by checking the box.
Please fill out our survey
P.S. Have feedback/comments about our review process? Leave a comment here
Editor and Review Templates
The editor template can be found here.
The review template can be found here.
Submitting Author: (@joshuabmoore)
All current maintainers: (@joshuabmoore, @benfulcher)
Package Name: pyhctsa
One-Line Description of Package: Python package for highly comparative time-series analysis
Repository Link: https://github.com/DynamicsAndNeuralSystems/pyhctsa
Version submitted: 0.1.2
EiC: @crhea93
Editor: TBD
Reviewer 1: TBD
Reviewer 2: TBD
Archive: TBD
JOSS DOI: TBD
Version accepted: TBD
Date accepted (month/day/year): TBD
Code of Conduct & Commitment to Maintain Package
Description
As time-series data grows increasingly massive and complex, researchers rely on extracting "features", statistical summaries of data patterns, to distill meaningful patterns and insights from these datasets.
pyhctsais a native Python library designed to bridge the gap between advanced time-series analysis and modern open-source workflows. It ports the massive library of the original and highly successful MATLAB-based "Highly Comparative Time-Series Analysis" (HCTSA) framework into Python, enabling users to automatically extract and compare thousands of diverse time-series features from their data.Scope
Please indicate which category or categories.
Check out our package scope page to learn more about our
scope. (If you are unsure of which category you fit, we suggest you make a pre-submission inquiry):
Domain Specific
Community Partnerships
If your package is associated with an
existing community please check below:
For all submissions, explain how and why the package falls under the categories you indicated above. In your explanation, please address the following points (briefly, 1-2 sentences for each):
The target audience of
pyhctsaencompasses a broad spectrum of the global research community, ranging from data scientists and physicists to biologists and engineers. The impact ofpyhctsais expected to be as far-reaching as its predecessor, the original MATLABHCTSAframework, for which scientific utility has been proven through a myriad of publications across diverse disciplines including neuroscience, medicine, geoscience, engineering, and others. By bringing the framework to the Python open-source ecosystem, the package will facilitate seamless integration with modern machine learning workflows and pipelines.tsfresh- Limited set of time-series features, most of which comprise simple distributional, spectral and a few hand-selected information-theoretic measures.tsfel- Limited in scope, with time-series feature set restricted to only a handful of basic distributional, spectral and only a handful of nonlinear analysis-based features.Kats- Limited scope as far as breadth of time-series features. Package seems to not be maintained in the past 3-4 years. Many open issues which have not been resolved.Compared to the above packages that rely on a narrow, a priori selection of descriptive statistics,
pyhctsaprovides a truly comprehensive, interdisciplinary library of over 4000 features, ranging from nonlinear dynamics and scaling laws, to information theory and model-based quantities. By casting a wider net across time-series analysis methods,pyhctsashifts the focus from simple feature extraction to scientific discovery, enabling the identification of the dynamical "fingerprints" that underlie time-series data across diverse problems. As the Python successor to the MATLAB-basedHCTSA, long considered the gold standard in highly comparative time-series analysis,pychtsaintends to uphold the same level of depth and rigour.@tagthe editor you contacted:n/a
Technical checks
For details about the pyOpenSci packaging requirements, see our packaging guide. Confirm each of the following by checking the box. This package:
Publication Options
JOSS Checks
paper.mdmatching JOSS's requirements with a high-level description in the package root or ininst/.Note: JOSS accepts our review as theirs. You will NOT need to go through another full review. JOSS will only review your paper.md file. Be sure to link to this pyOpenSci issue when a JOSS issue is opened for your package. Also be sure to tell the JOSS editor that this is a pyOpenSci reviewed package once you reach this step.
Are you OK with Reviewers Submitting Issues and/or pull requests to your Repo Directly?
This option will allow reviewers to open smaller issues that can then be linked to PR's rather than submitting a more dense text based review. It will also allow you to demonstrate addressing the issue via PR links.
Confirm each of the following by checking the box.
Please fill out our survey
submission and improve our peer review process. We will also ask our reviewers
and editors to fill this out.
P.S. Have feedback/comments about our review process? Leave a comment here
Editor and Review Templates
The editor template can be found here.
The review template can be found here.
Footnotes
Please fill out a pre-submission inquiry before submitting a data visualization package. ↩