S型函数

S型函数（英語：sigmoid function，或稱乙狀函數）是一種函数，因其函數圖像形状像字母S得名。其形狀曲線至少有2個焦點，也叫“二焦點曲線函數”。S型函数是有界、可微的实函数，在实数范围内均有取值，且导数恒为非负^[1]，有且只有一个拐点。S型函数和S型曲线指的是同一事物。

逻辑斯谛函数是一种常见的S型函数，其公式如下：^[1]

S(t)={\frac {1}{1+e^{-t}}}.

其级数展开为：

s:=1/2+{\frac {1}{4}}t-{\frac {1}{48}}t^{3}+{\frac {1}{480}}t^{5}-{\frac {17}{80640}}t^{7}+{\frac {31}{1451520}}t^{9}-{\frac {691}{319334400}}t^{11}+O(t^{12})

其他S型函數案例見下。在一些學科領域，特別是人工神经网络中，S型函數通常特指邏輯斯諦函數。

常見的S型函數

逻辑斯谛函数

f(x)={\frac {1}{1+e^{-x}}}

雙曲正切函數（等價於逻辑斯谛函数的平移與縮放）

f(x)=\tanh x={\frac {e^{x}-e^{-x}}{e^{x}+e^{-x}}}

反正切函數

f(x)=\arctan x

古德曼函數

f(x)=\operatorname {gd} (x)=\int _{0}^{x}{\frac {1}{\cosh t}}\,dt=2\arctan \left(\tanh \left({\frac {x}{2}}\right)\right)

误差函数

f(x)=\operatorname {erf} (x)={\frac {2}{\sqrt {\pi }}}\int _{0}^{x}e^{-t^{2}}\,dt

廣義邏輯斯諦函數（英语：Generalised logistic function）

f(x)=(1+e^{-x})^{-\alpha },\quad \alpha >0

平滑階躍函數（英语：Smoothstep）

f(x)={\begin{cases}\displaystyle {\frac {\int _{0}^{x}{\bigl (}1-u^{2}{\bigr )}^{N}\ du}{\int _{0}^{1}{{\bigl (}1-u^{2}{\bigr )}^{N}\ du}}},&|x|\leq 1\\\operatorname {sgn}(x)&|x|\geq 1\\\end{cases}}\,\quad N\geq 1

一些代數函數, 例如

f(x)={\frac {x}{\sqrt {1+x^{2}}}}

所有連續非負的凸形函數的積分都是S型函數，因此許多常見概率分布的累积分布函数會是S型函數。一個常見的例子是误差函数，它是正态分布的累积分布函数。

参考文献

^ ^1.0 ^1.1 Han, Jun; Morag, Claudio. The influence of the sigmoid function parameters on the speed of backpropagation learning. Mira, José; Sandoval, Francisco (编). From Natural to Artificial Neural Computation. Lecture Notes in Computer Science 930. 1995: 195–201. ISBN 978-3-540-59497-0. doi:10.1007/3-540-59497-3_175.

Mitchell, Tom M. Machine Learning. WCB–McGraw–Hill. 1997. ISBN 0-07-042807-7. . In particular see "Chapter 4: Artificial Neural Networks" (in particular pp. 96–97) where Mitchell uses the word "logistic function" and the "sigmoid function" synonymously – this function he also calls the "squashing function" – and the sigmoid (aka logistic) function is used to compress the outputs of the "neurons" in multi-layer neural nets.
Humphrys, Mark. Continuous output, the sigmoid function. [2015-02-01]. （原始内容存档于2015-02-02）. Properties of the sigmoid, including how it can shift along axes and how its domain may be transformed.

参见

维基共享资源上的相关多媒体资源：S型函数

可微分计算

概论

可微分编程
自動微分
张量微积分（英语：Tensor calculus）
信息几何
统计流形
神经形态工程（英语：Neuromorphic engineering）
模式识别
运算学习理论（英语：Computational learning theory）
归纳偏置

概念

梯度下降
- SGD（英语：Stochastic gradient descent）
聚类
回归
- 过拟合
幻觉
对抗（英语：Adversarial machine learning）
注意力
卷积
損失函數
反向传播
激活函数
- softmax
- sigmoid
- ReLU
正则化
数据集
扩散（英语：Diffusion process）
自回归

应用

硬件

TPU
VPU
IPU（英语：Graphcore）
憶阻器
SpiNNaker（英语：SpiNNaker）

软件库

Theano
TensorFlow
- Keras
PyTorch
JAX
Flux.jl（英语：Flux (machine-learning framework)）

实现

视觉·语音	AlexNet WaveNet 人像合成手寫识别 OCR 语音合成语音识别人脸识别 AlphaFold DALL-E Midjourney Stable Diffusion Sora Whisper（英语：Whisper (speech recognition system)）

自然语言	Word2vec Seq2seq BERT LaMDA Bard NMT 辩手项目（英语：Project Debater）沃森 GPT GPT-1 GPT-2 GPT-3 GPT-4 GPT-J（英语：GPT-J） ChatGPT 文心一言 Chinchilla AI（英语：Chinchilla AI） PaLM（英语：PaLM） BLOOM（英语：BLOOM (language model)） LLaMA TAIDE

决策	AlphaGo Q学习 SARSA OpenAI Five（英语：OpenAI Five）自动驾驶 MuZero 行动选择（英语：Action selection） Auto-GPT 机器人控制（英语：Robot control）