Overview

Time Series Forecasting using Large Language Models

Overview

If Large Language Models can find patterns within textual data and produce the next words in an autoregressive way, can we make Large Language Models to predict the value of a variable in a future timestep given its historical data. We set out to investigate the capability of Large Language Models (LLMs) to perform multivariate time-series forecasting, benchmarking their performance against classical methods (e.g., Prophet) and specialized time-series foundation models. Our study encompasses a range of LLM-based approaches—including single-pass prompts, sliding-window forecasting, multivariate input sequences, diffusion-inspired denoising, and hybrid reprogramming layers

Features

Implementation of time series tokenization techniques for LLMs
Prompt engineering strategies tailored for time series data
Methods for aligning time series embeddings with language space
Evaluation framework for comparing LLM-based approaches with traditional forecasting methods
Adaptations of popular LLM architectures for time series tasks

Key Methods

Prompt Engineering : We frame forecasting as a plain-language instruction: “Given the past LOOKBACK hourly temperatures: […], predict the next HORIZON hourly temperatures. Reply with a comma-separated list of numbers.”
Data Splitting: Different approaches to splitting the data for training the LLM and generating a prediction

a. Batch Processing: In the simplest setup, we feed one contiguous year (8,640 hours) of raw temperature values to the quantized Mistral-7B model and request a 30-day (720-hour) forecast in a single pass
b. Sliding-Window Forecasting: To mitigate long-horizon drift, we implement a sliding-window strategy: partition the historical series into overlapping segments of fixed length (e.g., 168 hours for a 7-day lookback) and iteratively predict the next window (24 hours), appending each prediction to the input for the subsequent step

Post Prediciton Corrections: To correct systematic biases in LLM outputs, we explore two residual‐correction strategies correcting the generated outputs of LLM based on residuals
Diffusion based models: Taking inspiration from predictor-corrector networks in diffusion models, we have applied Score based diffusion models to correct time series outputs

Results

We have seen that LLM+Score based diffusion models had higher accuracy compared to Large Language models and traditional time series models. Further analysis has to be performed on different types of data and huge volumes of data validate the scalability of these models. For more information look into Project Report.pdf - https://github.com/rayapudisaiakhil/TimeSeries-using-LLM-s/blob/main/IE%207374%20Gen%20AI%20-%20Project%20Report.pdf

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
prudhvi		prudhvi
.gitignore		.gitignore
IE 7374 Gen AI - Project Report.pdf		IE 7374 Gen AI - Project Report.pdf
LICENSE		LICENSE
README.md		README.md
batch_process.ipynb		batch_process.ipynb
diffusion_model.ipynb		diffusion_model.ipynb
finetuning.ipynb		finetuning.ipynb
residual_finetuning.ipynb		residual_finetuning.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Time Series Forecasting using Large Language Models

Overview

Features

Key Methods

Results

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Time Series Forecasting using Large Language Models

Overview

Features

Key Methods

Results

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages