large language model
Created|Updated|aifoundationlarge language model
|Word Count:48|Reading Time:1mins
References
Create Environment
use conda or miniconda1
2
3conda env remove d2l-zh
conda create -n d2l-zh -y python=3.8 pip
conda activate d2l-zh
install dependencies1
pip install jupyter d2l torch torchvision
download d2l-zh.zip1
2
3wget http://zh-v2.d2l.ai/d2l-zh.zip
unzip d2l-zh.zip
jupyter notebook
Author: Chris Wen
Copyright Notice: All articles on this blog are licensed under CC BY-NC-SA 4.0 unless otherwise stated.
Related Articles

2025-07-19
large language model
Large Language ModelDefinitionA large language model is a machine learning model that is trained on a large corpus of text data, such as Wikipedia or the Web. These models can generate high-quality text that is similar to the training data, but can also generate text that is not present in the training data. History < 1990s: IBM’s statistical language model (SLM) 1990s-2000s: Neural language models (NLLMs) 2001s: n-gram model 2010s: GPT-2, GPT-3 Dataset PreprocessingTokenizationSplitting ...

2025-09-14
Machine Learning
Overview of Machine LearningWhat is Machine Learning?Machine Learning (ML) is a subset of artificial intelligence (AI) that enables computers to learn from data and improve their performance over time without being explicitly programmed. ML algorithms build mathematical models based on sample data, known as “training data,” to make predictions or decisions without being explicitly programmed to perform the task. Artificial intelligence is a field of computer science that aims to create machi...

2025-06-19
Classification Algorithms
Dataset IntroductionNSL-KDDThe NSL-KDD dataset is a dataset for intrusion detection, which is a type of supervised learning problem. It consists of a large number of network traffic records that are labeled as either normal or malicious. The dataset contains a total of 41,478 network traffic records, which are categorized into 10 different types of attacks, such as DoS, Probe, U2R, R2L, etc. The dataset is publicly available and can be downloaded from the following link: https://www.unb.ca/ci...

2025-06-21
A Taxonomy and Terminology of Adversarial Machine Learning
BackgroundMachine learning (ML) components are increasingly being deployed in critical applications, form computer vision to cybersecurity. However, the data-driven nature of ML introduces new security challenges compared to traditional knowledge-based AI systems. Adversaries can exploit vulnerabilities in ML models through a variety of adversarial attacks, posing significant risks to the integrity, availability, and confidentiality of these systems. Key Attack TypesThe taxonomy of adversaria...

2025-06-20
Comparative Analysis of Classification Algorithms
Comparative Analysis of Classification AlgorithmsAbstractThis report provides a comprehensive analysis of the performance of three popular classification algorithms:Decision Tree, Logistic Regression, Naive Bayes, Random Forest, Support Vector Machines (SVM) and Multilayer Perceptron.The purpose of the study is to evaluate and compare these algorithms based on key performance metrics such as accuracy, precision, recall, F1-score, and false alarm rate (www.evidentlyai.com, n.d.), using two dif...
Announcement
Maintain the motivation to learn and stay humble when facing every problem.
