← Python 3 使用 SMTP 发送邮件 Python 3 XML 解析 →

Python 3 多线程

多线程类似于同时执行多个不同程序，多线程运行有如下优点

使用线程可以把占据长时间的程序中的任务放到后台去处理
用户界面可以更加吸引人，这样比如用户点击了一个按钮去触发某些事件的处理，可以弹出一个进度条来显示处理的进度
程序的运行速度可能加快
在一些等待的任务实现上如用户输入、文件读写和网络收发数据等，线程就比较有用了。在这种情况下我们可以释放一些珍贵的资源如内存占用等等

线程在执行过程中与进程还是有区别的

每个独立的线程有一个程序运行的入口、顺序执行序列和程序的出口

但是线程不能够独立执行，必须依存在应用程序中，由应用程序提供多个线程执行控制
每个线程都有它自己的一组 CPU 寄存器，称为线程的上下文，该上下文反映了线程上次运行该线程的 CPU 寄存器的状态

指令指针和堆栈指针寄存器是线程上下文中两个最重要的寄存器，线程总是在进程得到上下文中运行的，这些地址都用于标志拥有线程的进程地址空间中的内存

线程可以被抢占 ( 中断 )
在其它线程正在运行时，线程可以暂时搁置 ( 也称为睡眠 ) -- 这就是线程的退让

Python 多线程编程

Python 中创建一个线程有两种方式：函数或者用类来包装线程对象

函数

调用 thread 模块中的 start_new_thread() 函数来产生新线程

thread.start_new_thread ( function, args[, kwargs] )

参数说明

参数	说明
function	线程函数
args	传递给线程函数的参数,他必须是个 tuple 类型
kwargs	可选参数

导入模块

import thread

范例

        import thread
import time

# 为线程定义一个函数
def print_time( threadName, delay):
   count = 0
   while count < 5:
      time.sleep(delay)
      count += 1
      print ("%s: %s" % ( threadName, time.ctime(time.time()) ))

# 创建两个线程
try:
   thread.start_new_thread( print_time, ("Thread:1:", 2, ) )
   thread.start_new_thread( print_time, ("Thread:2:", 4, ) )
except:
   print ("Error: unable to start thread")

while 1:
   pass

运行以上 Python 范例，输出结果如下

Thread:1:: Wed Jan 17 18:45:36 2018
Thread:2:: Wed Jan 17 18:45:38 2018
Thread:1:: Wed Jan 17 18:45:38 2018
Thread:1:: Wed Jan 17 18:45:40 2018
Thread:2:: Wed Jan 17 18:45:42 2018
Thread:1:: Wed Jan 17 18:45:42 2018
Thread:1:: Wed Jan 17 18:45:44 2018
Thread:2:: Wed Jan 17 18:45:46 2018
Thread:2:: Wed Jan 17 18:45:50 2018
Thread:2:: Wed Jan 17 18:45:54 2018

线程的结束一般依靠线程函数的自然结束

也可以在线程函数中调用 thread.exit() 会抛出 SystemExit exception，以达到退出线程的目的

线程模块 threading

除了使用 thread 模块相关函数创建线程外，Python 还内置了 threading 提供了对线程的支持，threading 提供了低级别的、原始的线程以及一个简单的锁

threading 模块提供了以下函数用来管理线程

方法	说明
threading.currentThread()	返回当前的线程变量
threading.enumerate()	返回一个包含正在运行的线程的 list 正在运行指线程启动后、结束前，不包括启动前和终止后的线程
threading.activeCount()	返回正在运行的线程数量与 len(threading.enumerate()) 有相同的结果

除了上面的函数外，threading 还提供了 Thread 类来处理线程

threading.Thread 类提供了以下方法

方法	说明
run()	用以表示线程活动的方法
start()	启动线程活动
join([time])	等待至线程中止。这阻塞调用线程直至线程的join() 方法被调用中止-正常退出或者抛出未处理的异常-或者是可选的超时发生
isAlive()	返回线程是否活动的
getName()	返回线程名
setName()	设置线程名

范例

我们已经把 threading 介绍的差不多了，现在使用 threading 模块来创建几个线程

使用 Threading 模块创建线程，可以写一个类继承 threading.Thread，然后重写 __init__ 方法和 run()方法

import threading
import time

exitFlag = 0

class myThread (threading.Thread):
    '''继承父类 threading.Thread'''
    def __init__(self, threadID, name, counter):
        threading.Thread.__init__(self)
        self.threadID = threadID
        self.name = name
        self.counter = counter
    def run(self):
        ''' 把要执行的代码写到 run 函数里面 
            线程在创建后会直接运行 run 函数
        ''' 
        print ("Starting " + self.name)
        print_time(self.name, self.counter, 5)
        print ("Exiting " + self.name)

def print_time(threadName, delay, counter):
    while counter:
        if exitFlag:
            threading.Thread.exit()
        time.sleep(delay)
        print ("%s: %s" % (threadName, time.ctime(time.time())))
        counter -= 1

# 创建新线程
thread1 = myThread(1, "Thread-1", 1)
thread2 = myThread(2, "Thread-2", 2)

# 开启线程
thread1.start()
thread2.start()

print ("Exiting Main Thread")

运行以上 Python 代码，输出结果如下

$ python main.py 
Starting Thread-1Starting Thread-2
 Exiting Main Thread

Thread-1: Wed Jan 17 19:06:27 2018
Thread-2: Wed Jan 17 19:06:28 2018
 Thread-1: Wed Jan 17 19:06:28 2018
Thread-1: Wed Jan 17 19:06:29 2018
Thread-2: Wed Jan 17 19:06:30 2018
 Thread-1: Wed Jan 17 19:06:30 2018
Thread-1: Wed Jan 17 19:06:31 2018
Exiting Thread-1
Thread-2: Wed Jan 17 19:06:32 2018
Thread-2: Wed Jan 17 19:06:34 2018
Thread-2: Wed Jan 17 19:06:36 2018
Exiting Thread-2

线程同步

多线程的优势在于可以同时运行多个任务 ( 至少感觉起来是这样)

但是当线程需要共享数据时，可能存在数据不同步的问题

考虑这样一种情况：一个列表里所有元素都是 0，线程 "set" 从后向前把所有元素改成 1，而线程 "print" 负责从前往后读取列表并打印

那么，可能线程 "set" 开始改的时候，线程 "print" 便来打印列表了，输出就成了一半0一半 1，这就是数据的不同步

为了避免这种情况，引入了锁的概念。

锁有两种状态——锁定和未锁定

每当一个线程比如"set"要访问共享数据时，必须先获得锁定
如果已经有别的线程比如 "print" 获得锁定了，那么就让线程 "set" 暂停，也就是同步阻塞
等到线程"print"访问完毕，释放锁以后，再让线程 "set" 继续

经过这样的处理，打印列表时要么全部输出0，要么全部输出1，不会再出现一半0一半1的尴尬场面

为了保证数据的正确性，需要对多个线程进行同步

threading.Thread 对象的 Lock 和 Rlock 可以实现简单的线程同步

这两个对象都有 acquire() 方法和 release() 方法，对于那些需要每次只允许一个线程操作的数据，可以将其操作放到 acquire 和 release 方法之间

下面的范例实现到了对资源的锁和解锁

#!/usr/bin/python

import threading
import time

class myThread (threading.Thread):
    def __init__(self, threadID, name, counter):
        threading.Thread.__init__(self)
        self.threadID = threadID
        self.name = name
        self.counter = counter
    def run(self):
        print ("Starting " + self.name)
       # 获得锁，成功获得锁定后返回True
       # 可选的timeout参数不填时将一直阻塞直到获得锁定
       # 否则超时后将返回False
        threadLock.acquire()
        print_time(self.name, self.counter, 3)
        # 释放锁
        threadLock.release()

def print_time(threadName, delay, counter):
    while counter:
        time.sleep(delay)
        print ("%s: %s" % (threadName, time.ctime(time.time())))
        counter -= 1

threadLock = threading.Lock()
threads = []

# 创建新线程
thread1 = myThread(1, "Thread:1", 1)
thread2 = myThread(2, "Thread:2", 2)

# 开启新线程
thread1.start()
thread2.start()

# 添加线程到线程列表
threads.append(thread1)
threads.append(thread2)

# 等待所有线程完成
for t in threads:
    t.join()
print ("Exiting Main Thread")

运行上面的 Python 代码，输出结果如下

$ python main.py 
Starting Thread:1
Starting Thread:2
Thread:1: Wed Jan 17 19:13:58 2018
Thread:1: Wed Jan 17 19:13:59 2018
Thread:1: Wed Jan 17 19:14:00 2018
Thread:2: Wed Jan 17 19:14:02 2018
Thread:2: Wed Jan 17 19:14:04 2018
Thread:2: Wed Jan 17 19:14:06 2018
Exiting Main Thread

线程优先级队列 ( Queue )

Python queue 模块中提供了同步的、线程安全的队列类，包括 FIFO (先入先出 ) 队列 Queue，LIFO ( 后入先出 ) 队列 LifoQueue，和优先级队列 PriorityQueue

这些队列都实现了锁原语，能够在多线程中直接使用

可以使用队列来实现线程间的同步

导入模块

import queue

queue.Queue 类常用方法

下表列出了 queue.Queue 类中常用的方法

方法	说明
Queue.qsize()	返回队列的大小
Queue.empty()	如果队列为空，返回 True,反之 False
Queue.full()	如果队列满了，返回 True,反之 False
Queue.full	与 maxsize 大小对应
Queue.get([block[, timeout]])	获取队列，timeout等待时间
Queue.get_nowait()	相当Queue.get(False)
Queue.put(item)	写入队列，timeout 等待时间
Queue.put_nowait(item)	相当 Queue.put(item, False)
Queue.task_done()	完成一项工作后，该函数向任务已经完成的队列发送一个信号
Queue.join()	实际上意味着等到队列为空，再执行别的操作

范例

下面的代码使用 queue.Queue 类来完成线程间的同步

#!/usr/bin/python

import Queue
import threading
import time

exitFlag = 0

class myThread (threading.Thread):
    def __init__(self, threadID, name, q):
        threading.Thread.__init__(self)
        self.threadID = threadID
        self.name = name
        self.q = q
    def run(self):
        print ("Starting " + self.name)
        process_data(self.name, self.q)
        print ("Exiting " + self.name)

def process_data(threadName, q):
    while not exitFlag:
        queueLock.acquire()
        if not workQueue.empty():
            data = q.get()
            queueLock.release()
            print ("%s processing %s" % (threadName, data))
        else:
            queueLock.release()
        time.sleep(1)

threadList = ["Thread-1", "Thread-2", "Thread-3"]
nameList = ["One", "Two", "Three", "Four", "Five"]
queueLock = threading.Lock()
workQueue = queue.Queue(10)
threads = []
threadID = 1

# 创建新线程
for tName in threadList:
    thread = myThread(threadID, tName, workQueue)
    thread.start()
    threads.append(thread)
    threadID += 1

# 填充队列
queueLock.acquire()
for word in nameList:
    workQueue.put(word)
queueLock.release()

# 等待队列清空
while not workQueue.empty():
    pass

# 通知线程是时候退出
exitFlag = 1

# 等待所有线程完成
for t in threads:
    t.join()
print ("Exiting Main Thread")

运行以上 Python 代码，输出结果如下

Starting Thread-1
Starting Thread-2
Starting Thread-3
Thread-1 processing One
Thread-2 processing Two
Thread-3 processing Three
Thread-1 processing Four
Thread-2 processing Five
Exiting Thread-3
Exiting Thread-1
Exiting Thread-2
Exiting Main Thread

← Python 3 使用 SMTP 发送邮件 Python 3 XML 解析 →

简单教程

Python 3 多线程

Python 多线程编程

参数说明

导入模块

范例

范例

线程同步

线程优先级队列 ( Queue )

导入模块

queue.Queue 类常用方法

范例

Python3 基础教程