Botulinum neurotoxins are secreted as protein aggregates referred to as progenitor toxins or toxin complexes. These aggregates comprise the neurotoxin plus a set of toxin-associated proteins (TAPs). TAPs include non-toxic non-hemagglutinin proteins (NTNH) and hemagglutinins (HA). As seen in Table 1 from Hines et al. 2005, PMID: 16085839, a number of proteins known to form protein aggregates have been identified for the seven botulinum serotypes.

Neurotoxins, designated BoNT/A to BoNT/G, are synthesized as single polypeptides. Proteolytic activation yields an N-terminal light chain (Lc) and a C-terminal heavy chain (Hc) that remain connected by a disulfide bond. Genes encoding neurotoxin proteins reside on the Clostridium botulinum chromosome (types A, B, E, F), bacteriophages (types C, D) and plasmid (type G).

NCBI Queries for BoNT Gene Sequences

Lists of gene-specific and gene-related nucleotide and protein sequences can be retrieved from NCBI's Protein and Nucleotide databases by text searches based on the protein or gene name for the Clostridium botulinum genes.

Click the links below to view the results for each gene. Each query is presented with the associated Gene Symbol, Putative Identification, NCBI Search String, NCBI Protein Link, NCBI Nucleotide Link and Pathema Annotation Link.

| Gene Symbol | Putative Identification | NCBI Search String | NCBI Protein Link | NCBI Nucleotide Link | Pathema Annotation Link |
|---|---|---|---|---|---|
| ADP-ribosyltransferase C3 | ADP-ribosyltransferase C3 | Clostridium botulinum[organism] AND ADP-ribosyltransferase C3 | Protein | Nucleotide | |
| BoNT | neurotoxin | Clostridium botulinum[organism] AND BoNT | Protein | Nucleotide | |
| botA | botulinum neurotoxin type A botA | Clostridium botulinum[organism] AND botA | Protein | Nucleotide | Pathema Annotation |
| botB | botulinum neurotoxin type B botB | Clostridium botulinum[organism] AND botB | Protein | Nucleotide | |
| botE | botulinum neurotoxin type E botE | Clostridium botulinum[organism] AND botE | Protein | Nucleotide | |
| botR | transcriptional regulator botR | Clostridium botulinum[organism] AND botR | Protein | Nucleotide | Pathema Annotation |
| C2 toxin | C2 toxin (component I or II) | Clostridium botulinum[organism] AND C2 toxin | Protein | Nucleotide | |
| FlaB | flagellin | Clostridium botulinum[organism] AND flagellin | Protein | Nucleotide | Pathema Annotation |
| HA1 | HA1 | Clostridium botulinum[organism] AND HA1 | Protein | Nucleotide | |
| HA2 | HA2 | Clostridium botulinum[organism] AND HA2 | Protein | Nucleotide | |
| HA3 | HA3 | Clostridium botulinum[organism] AND HA3 | Protein | Nucleotide | |
| HA-17 | haemagglutinin HA-17 | Clostridium botulinum[organism] AND HA-17 | Protein | Nucleotide | Pathema Annotation |
| HA-19 | HA-19 | Clostridium botulinum[organism] AND HA-19 | Protein | Nucleotide | |
| HA-33 | HA-33 | Clostridium botulinum[organism] AND HA-33 | Protein | Nucleotide | |
| HA-34 | haemagglutinin component HA-34 | Clostridium botulinum[organism] AND HA-34 | Protein | Nucleotide | Pathema Annotation |
| HA-70 | haemagglutinin HA-70 | Clostridium botulinum[organism] AND HA-70 | Protein | Nucleotide | Pathema Annotation |
| HA-II | HA-II | Clostridium botulinum[organism] AND HA-II | Protein | Nucleotide | |
| ORF-1 | ORF-1 | Clostridium botulinum[organism] AND ORF-1 | Protein | Nucleotide | |
| ORF-22 | ORF-22 | Clostridium botulinum[organism] AND ORF-22 | Protein | Nucleotide | |
| ORF-X1 | ORF-X1 | Clostridium botulinum[organism] AND ORF-X1 | Protein | Nucleotide | |
| ORF-X2 | ORF-X2 | Clostridium botulinum[organism] AND ORF-X2 | Protein | Nucleotide | |
| P-21 | P-21 | Clostridium botulinum[organism] AND P-21 | Protein | Nucleotide | |
| P-47 | P-47 | Clostridium botulinum[organism] AND P-47 | Protein | Nucleotide | |
| NTNH | nontoxic-nonhemagglutinin | Clostridium botulinum[organism] AND NTNH | Protein | Nucleotide | Pathema Annotation |
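
The search strings listed above can also be submitted programmatically. The following is a minimal sketch, assuming NCBI's public E-utilities ESearch service is used; this page does not state how its own links are generated. The function names `build_search_string` and `build_esearch_url` are hypothetical and chosen for illustration only. The sketch builds the URL but does not send the request.

```python
from urllib.parse import urlencode

# Base URL of NCBI's E-utilities ESearch service (assumed access route;
# the page itself only provides clickable search links).
ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_search_string(term, organism="Clostridium botulinum"):
    """Return a search string in the same form as the table's search strings."""
    return f"{organism}[organism] AND {term}"


def build_esearch_url(term, db="protein", retmax=20):
    """Build an ESearch URL for one gene term.

    db is "protein" for the Protein database or "nuccore" for Nucleotide.
    """
    params = {"db": db, "term": build_search_string(term), "retmax": retmax}
    # urlencode percent-encodes the brackets and turns spaces into "+".
    return f"{ESEARCH_URL}?{urlencode(params)}"


if __name__ == "__main__":
    # Print one URL per database for the botA query.
    for db in ("protein", "nuccore"):
        print(build_esearch_url("botA", db=db))
```

Keeping the search-string builder separate from the URL builder lets the same string be pasted into the NCBI web search box unchanged.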

Clostridium botulinum Genes

Clostridium botulinum genes, including BoNT genes, that are involved in cellular processes such as pathogenesis, germination, toxin production and resistance are presented here by Serotype (A-G), Gene Symbol, PubMed ID, Protein Accession, Nucleotide Accession and Source Strain. Clostridium botulinum strains are linked to the corresponding Strain entry on the BoNT_strains page.

| Serotype | Gene Symbol | PubMed ID | Protein Accession | Nucleotide Accession | Source Strain |
|---|---|---|---|---|---|
| type A | BoNT | PMID: 8863443 | CAA63551 | X92973 | 62A |
| type A | BoNT | PMID: 14557061 | AAQ06331 | AF488749 | Allergan-Hall A |
| type A | BoNT/A | PMID: 2160960 | AAA23262 | M30196 | 62A |
| type A | BoNT/A | PMID: 8310180 | CAA51824 | X73423 | Kyoto-F |
| type A | BoNT/A | PMID: 8863443 | CAA61234 | X87974 | Kyoto-F |
| type A | BoNT/A | PMID: 16842364 | ABC26002 | DQ310546 | Mascarpone |
| type A | BoNT/A | | ABA29018 | DQ185901 | 657Ba |
| type A | BoNT/A | | ABD65472 | DQ409059 | Hall A |
| type A | BoNT/A | | ABA29017 | DQ185900 | Loch Maree |
| type A | BoNT/A | PMID: 12732962 | AAM75961 | AF461540 | Hall A-hyper |
| type A | BoNT/A2 | PMID: 15956371 | AAX53156 | AY953275 | FRI-H1A2 |
| type A | botA | PMID: 2185020 | CAA36289 | X52066 | NCTC 2916 |
| type A | BotR | PMID: 12732962 | AAM75952 | AF461538 | 62A |
| type A | BotR | PMID: 12732962 | AAM75959 | AF461540 | Hall A-hyper |
| type A | BotR | PMID: 12732962 | AAR89495 | AY497357 | NCTC 2916 |
| type A | BotR/A | PMID: 16842364 | ABC25999 | DQ310546 | Mascarpone |
| type A | botR/OrfX | PMID: 14557061 | AAQ06332 | AF488750 | Allergan-Hall A |
| type A | HA-17 | PMID: 12732962 | AAM75950 | AF461538 | 62A |
| type A | HA-17 | PMID: 14557061 | AAQ06327 | AF488745 | Allergan-Hall A |
| type A | HA-17 | | ABD65468 | DQ409059 | Hall A |
| type A | HA-17 | PMID: 12732962 | AAM75957 | AF461540 | Hall A-hyper |
| type A | HA-17 | PMID: 8764477 | AAB42188 | L42537 | NCTC 2916 |
| type A | HA-19/20 | PMID: 16085839 | | | |
| type A | HA-33 | PMID: 8631890 | CAA61129 | X87850 | 667Ab |
| type A | HA-33 | PMID: 9504990 | CAA74632 | Y14239 | NCTC 2916 |
| type A | HA-33 | PMID: 12732962 | AAM75951 | AF461538 | 62A |
| type A | HA-33 | PMID: 12732962 | AAM75958 | AF461540 | Hall A-hyper |
| type A | HA-33 | | CAA55718 | X79104 | NCTC 7272 |
| type A | HA-33 Chain A | PMID: 15701519 | 1YBIA | | Hall A |
| type A | HA-33 Chain B | PMID: 15701519 | 1YBIB | | Hall A |
| type A | HA-34 | PMID: 14557061 | AAQ06328 | AF488746 | Allergan-Hall A |
| type A | HA-34 | PMID: 14557061 | | | |
| type A | HA-34 | PMID: 8764477 | AAB42189 | L42537 | NCTC 2916 |
| type A | HA-70 | PMID: 12732962 | AAM75949 | AF461538 | 62A |
| type A | HA-70 | PMID: 14557061 | AAQ06329 | AF488747 | Allergan-Hall A |
| type A | HA-70 | | ABD65467 | DQ409059 | Hall A |
| type A | HA-70 | PMID: 8764477 | AAB42187 | L42537 | NCTC 2916 |
| type A | HA-70 | PMID: 12732962 | AAM75956 | AF461540 | Hall A-hyper |
| type A | HA-II | PMID: 8631890 | CAA61130 | X87850 | 667Ab |
| type A | HA-II | | CAA55719 | X79104 | NCTC 7272 |
| type A | NTNH | PMID: 8631890 | CAA61127 | X87850 | 667Ab |
| type A | NTNH | PMID: 8631890 | CAA61123 | X87848 | 667Ab |
| type A | NTNH | PMID: 8631890 | CAA61125 | X87849 | 667Ab |
| type A | NTNH | PMID: 8713133 | BAA12299 | D84289 | 7103-H |
| type A | NTNH | PMID: 8863443 | CAA63550 | X92973 | 62A |
| type A | NTNH | PMID: 8863443 | CAA61233 | X87974 | Kyoto-F |
| type A | NTNH | PMID: 8863443 | CAA65348 | X96493 | Kyoto-F |
| type A | NTNH | PMID: 9504990 | CAA74630 | Y14238 | NCTC 2916 |
| type A | NTNH | PMID: 9504990 | CAA74634 | Y14239 | NCTC 2916 |
| type A | NTNH | PMID: 12732962 | AAM75953 | AF461538 | 62A |
| type A | NTNH | PMID: 12732962 | AAM75960 | AF461540 | Hall A-hyper |
| type A | NTNH | PMID: 14557061 | AAQ06330 | AF488748 | Allergan-Hall A |
| type A | NTNH | PMID: 16842364 | ABC26001 | DQ310546 | Mascarpone |
| type A | NTNH | | ABD65471 | DQ409059 | Hall A |
| type A | NTNH | | CAA55716 | X79104 | NCTC 7272 |
| type A | ORF-1 | PMID: 12732962 | | AF461539 | 62A |
| type A | ORF22-a | PMID: 8521962 | BAA11049 | D67030 | NIH |
| type A | ORFX1 | PMID: 16842364 | ABC25998 | DQ310546 | Mascarpone |
| type A | ORF-X1 | PMID: 9465394 | BAA24887 | AB004778 | Kyoto-F |
| type A | ORF-X1 | PMID: 15158256 | AAR89496 | AY497357 | NCTC 2916 |
| type A | ORFX2 | PMID: 16842364 | ABC25997 | DQ310546 | Mascarpone |
| type A | ORF-X2 | PMID: 9465394 | BAA24886 | AB004778 | Kyoto-F |
| type A | P-21 | PMID: 8863443 | CAA65345 | X96491 | Chiba-H |
| type A | P-21 | PMID: 8863443 | CAA65347 | X96493 | Kyoto-F |
| type A | P-21 | PMID: 8863443 | CAA65346 | X96492 | NCTC 9837 |
| type A | P-21 | PMID: 9465394 | BAA24888 | AB004778 | Kyoto-F |
| type A | P-21 | PMID: 9504990 | CAA74633 | Y14239 | NCTC 2916 |
| type A | P-21 | | CAA55717 | X79104 | NCTC 7272 |
| type A | P-47 | PMID: 8863443 | CAA65349 | X96493 | Kyoto-F |
| type A | P-47 | PMID: 9504990 | CAA74629 | Y14238 | NCTC 2916 |
| type A | P-47 | PMID: 15158256 | AAR89499 | AY497357 | NCTC 2916 |
| type A | P-47 | PMID: 16842364 | ABC26000 | DQ310546 | Mascarpone |
| type A | putative flagellin | PMID: 12732962 | AAM75948 | AF461538 | 62A |
| type A | putative flagellin | PMID: 12732962 | AAM75955 | AF461540 | Hall A-hyper |
| type A | putative flagellin | PMID: 12732962 | AAM75964 | AF461542 | NCTC 2916 |
| type B | BoNT | PMID: 9767710 | CAA73968 | Y13630 | CDC 3281 (ATCC 43757) |
| type B | BoNT | PMID: 9767710 | CAA73972 | Y13631 | CDC 3281 (ATCC 43757) |
| type B | BoNT/B | PMID: 7764370 | CAA50482 | X71343 | Eklund 17B (ATCC 25765) |
| type B | BoNT/B | PMID: 7764370 | CAA50482 | X71343 | Eklund 17B (ATCC 25765) |
| type B | BoNT/B | PMID: 8408542 | CAA50148 | X70817 | NCTC 7273 |
| type B | BoNT/B | PMID: 15240298 | AAS59787 | AY555068 | CDC 4848 |
| type B | BoNT/B | PMID: 16272395 | BAE48264 | AB232927 | Okra |
| type B | botB | | AAA23211 | M81186 | |
| type B | btcB | PMID: 11097932 | AAF87749 | AF278540 | 213B |
| type B | gerAA | | CAC17460 | AJ295694 | NCTC 7273 |
| type B | gerAB | | CAC17461 | AJ295694 | NCTC 7273 |
| type B | gerAC | | CAD28560 | AJ295694 | NCTC 7273 |
| type B | HA1 | PMID: 16272395 | BAE48261 | AB232927 | Okra |
| type B | HA-17 | PMID: 9290060 | CAA70496 | Y09312 | Eklund 17B |
| type B | HA-17 | PMID: 9767710 | CAA73964 | Y13630 | CDC 3281 (ATCC 43757) |
| type B | HA-17 | PMID: 8310180 | | | |
| type B | HA2 | PMID: 16272395 | BAE48260 | AB232927 | Okra |
| type B | HA-33 | PMID: 9767710 | CAA73965 | Y13630 | CDC 3281 (ATCC 43757) |
| type B | HA-33 | | CAA55714 | X79103 | 17B |
| type B | HA-33 | | CAA55710 | X79102 | NCTC 7273 |
| type B | HA-70 | PMID: 9290060 | CAA70495 | Y09312 | Eklund 17B |
| type B | HA-70 | PMID: 9767710 | CAA73963 | Y13630 | CDC 3281 (ATCC 43757) |
| type B | HA-II | | CAA55715 | X79103 | 17B |
| type B | HA-II | | CAA55711 | X79102 | NCTC 7273 |
| type B | hem33/B | | AAB64348 | U63808 | Lamanna |
| type B | NTNH | PMID: 8863443 | CAA55074 | X78230 | NCTC 7273 |
| type B | NTNH | PMID: 9290060 | CAA55073 | X78229 | Eklund 17B (ATCC 25765) |
| type B | NTNH | PMID: 9767710 | CAA73967 | Y13630 | CDC 3281 (ATCC 43757) |
| type B | NTNH | PMID: 16272395 | BAE48263 | AB232927 | Okra |
| type B | NTNH | | CAA55712 | X79103 | 17B |
| type B | NTNH | | CAA55708 | X79102 | NCTC 7273 |
| type B | NTNH | | AAB64350 | U63808 | Lamanna |
| type B | NTNH | PMID: 9767710 | CAA73971 | Y13631 | CDC 3281 (ATCC 43757) |
| type B | ORF-22 | | AAB64349 | U63808 | Lamanna |
| type B | P-21 | PMID: 9767710 | CAA73969 | Y13631 | CDC 3281 (ATCC 43757) |
| type B | P-21 | PMID: 9767710 | CAA73966 | Y13631 | CDC 3281 (ATCC 43757) |
| type B | P-21 | PMID: 16272395 | BAE48262 | AB232927 | Okra |
| type B | P-21 | | CAA55713 | X79103 | 17B |
| type B | P-21 | | CAA55709 | X79102 | NCTC 7273 |
| type B | P-47 | PMID: 9767710 | CAA73970 | Y13631 | CDC 3281 (ATCC 43757) |
| type C | ANTP-17 | PMID: 8028579 | CAA51310 | X72793 | 468 |
| type C | ANTP-33 | PMID: 8028579 | CAA51311 | X72793 | 468 |
| type C | ANTP-70 | PMID: 8028579 | CAA51309 | X72793 | 468 |
| type C | BoNT | PMID: 8593068 | BAA08418 | D49440 | 6813 |
| type C | BoNT | PMID: 11676492 | BAB71749 | AB061780 | C-Yoichi |
| type C | BoNT/C1 | PMID: 8028579 | CAA51313 | X72793 | 468 |
| type C | C2 toxin | PMID: 8645309 | 2210236A | | |
| type C | C2 toxin (component I) | PMID: 11741886 | CAA11969 | AJ224480 | 92-13 |
| type C | C2 toxin (component-I) | PMID: 8645309 | BAA09942 | D63903 | (C)-203U28 |
| type C | C2 toxin (component-I) | PMID: 8645309 | BAA32536 | D88982 | (C)-203U28 |
| type C | C2 toxin (component-II) | PMID: 8645309 | BAA32537 | D88982 | (C)-203U28 |
| type C | HA-17 | PMID: 2222445 | CAA44260 | X62389 | C-Stockholm (C-St) |
| type C | HA-17 | PMID: 11676492 | BAB71746 | AB061780 | C-Yoichi |
| type C | HA-17 | | BAA89710 | AB037166 | C-6814 |
| type C | HA-17 (cha 17) | PMID: 7802661 | AAB32848 | S74768 | |
| type C | HA-33 | PMID: 2205574 | CAA37210 | X53041 | C-Stockholm (C-St) |
| type C | HA-33 | PMID: 2222445 | CAA4426 | X62389 | C-Stockholm (C-St) |
| type C | HA-33 | PMID: 11676492 | BAB71747 | AB061780 | C-Yoichi |
| type C | HA-33 | | BAA89711 | AB037166 | C-6814 |
| type C | HA-33 (cha 33) | PMID: 7802661 | AAB32847 | S74768 | |
| type C | HA-70 | PMID: 2222445 | BAA07575 | D38562 | C-Stockholm (C-St) |
| type C | HA-70 | PMID: 11676492 | BAB71745 | AB061780 | C-Yoichi |
| type C | HA-70 | | BAA89709 | AB037166 | C-6814 |
| type C | HA-70 (cha 70) | PMID: 7802661 | AAB32849 | S74768 | |
| type C | NTNHA | PMID: 11676492 | BAB71748 | AB061780 | C-Yoichi |
| type C | NTNHA | | BAA89712 | AB037166 | C-6814 |
| type C | ORF-22 | PMID: 11676492 | BAB71744 | AB061780 | C-Yoichi |
| type C | ORF-22 | | BAA89708 | AB037166 | C-6814 |
| type C | probable BoNT regulator protein | PMID: 16287978 | BAE47779 | AP008983 | C-Stockholm (C-St) |
| type D | HA-17 | PMID: 11713244 | BAA90658 | AB037920 | D-4947 |
| type D | HA-70 | PMID: 11713244 | BAA90657 | AB037920 | D-4947 |
| type D | ADP-ribosyltransferase C3 | PMID: 8225604 | BAA04492 | D17555 | South African |
| type D | BoNT/D | PMID: 2216736 | CAA38175 | X54254 | BVD/-3 |
| type D | HA1 | PMID: 9802560 | BAA75082 | AB012112 | 1873 |
| type D | HA2 | PMID: 9802560 | BAA75081 | AB012112 | 1873 |
| type D | HA3 | PMID: 9802560 | BAA75080 | AB012112 | 1873 |
| type D | HA-33 | PMID: 11713244 | BAA90659 | AB037920 | D-4947 |
| type D | NTNH | PMID: 9802560 | BAA75083 | AB012112 | 1873 |
| type D | NTNHA | PMID: 11713244 | BAA90660 | AB037920 | D-4947 |
| type D | NTX | PMID: 9802560 | BAA75084 | AB012112 | 1873 |
| type D | ORF-22 | PMID: 9802560 | BAA75079 | AB012112 | 1873 |
| type D | ORF-22 | PMID: 11713244 | BAA90656 | AB037920 | D-4947 |
| type E | BoNT/E | PMID: 1543481 | CAA43999 | X62089 | Beluga |
| type E | BoNT/E | PMID: 1543481 | CAA43998 | X62088 | ATCC 43755 (Clostridium butyricum) |
| type E | BoNT/E | PMID: 1543481 | CAA43998 | X62088 | ATCC 43181 (Clostridium butyricum) |
| type E | BoNT/E | PMID: 11133447 | BAB07885 | AB040123 | Iwanai |
| type E | botE | PMID: 1541280 | CAA44558 | X62683 | NCTC 11219 |
| type E | FlaB | | ABG49497 | DQ658239 | Bennett |
| type E | neurotoxin binding protein | PMID: 16177380 | AAD09563 | U70780 | Alaska |
| type E | P-47 | PMID: 9465394 | BAA24881 | D88418 | Iwanai |
| type F | BoNT/F | PMID: 8486245 | CAA48329 | X68262 | ATCC 43756 (Clostridium baratii) |
| type F | BoNT/F | PMID: 9732534 | CAA67512 | X99064 | Langeland |
| type F | BoNT/F | PMID: 15240298 | AAS59788 | AY555069 | CDC BL2821 |
| type F | BoNT/F | | CAA57358 | X81714 | NCTC 10281 |
| type F | BoNT/F | | AAA23263 | M92906 | 202F |
| type F | NTNH | PMID: 1398040 | CAA50404 | X71086 | 202F |
| type F | NTNH | PMID: 7764998 | AAC60474 | S73676 | |
| type F | NTNH | PMID: 8863443 | CAA65352 | X96494 | Langeland NCTC 10281 |
| type F | NTNH | PMID: 9732534 | CAA71744 | Y10770 | 202F |
| type F | NTNH | PMID: 9732534 | CAA67511 | X99064 | Langeland |
| type F | ORF-X1 | PMID: 9465394 | BAA24890 | AB004779 | Langeland |
| type F | ORF-X2 | PMID: 9465394 | BAA24889 | AB004779 | Langeland |
| type F | P-21 | PMID: 8863443 | CAA65350 | X96494 | Langeland NCTC 10281 |
| type F | P-21 | PMID: 9465394 | BAA24891 | AB004779 | Langeland |
| type F | P-47 | PMID: 8863443 | CAA65351 | X96494 | Langeland NCTC 10281 |
| type F | P-47 | PMID: 9732534 | CAA71743 | Y10770 | 202F |
| type G | BoNT/G | PMID: 8268233 | CAA52275 | X74162 | 113/30, NCFB 3012 |
| type G | BoNT | PMID: 9290060 | CAA61229 | X87972 | ATCC 27322 |
| type G | HA-70 | PMID: 9290060 | CAA61225 | X87972 | ATCC 27322 |
| type G | NTNH | PMID: 9290060 | CAA61228 | X87972 | ATCC 27322 |
| type G | HA-II | PMID: 9290060 | CAA61226 | X87972 | ATCC 27322 |
| type G | P-21 | PMID: 9290060 | CAA61227 | X87972 | ATCC 27322 |
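
A Nucleotide Accession from the table above can be turned into a direct sequence request. The sketch below assumes NCBI's public E-utilities EFetch service is the access route, which this page does not specify, and the helper name `build_efetch_url` is hypothetical. It only constructs the URL for a FASTA record and does not download anything.

```python
from urllib.parse import urlencode

# Base URL of NCBI's E-utilities EFetch service (assumed access route).
EFETCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def build_efetch_url(accession, db="nuccore"):
    """Build an EFetch URL that requests one record in FASTA text format.

    Use db="nuccore" for nucleotide accessions and db="protein" for
    protein accessions.
    """
    params = {"db": db, "id": accession, "rettype": "fasta", "retmode": "text"}
    return f"{EFETCH_URL}?{urlencode(params)}"


if __name__ == "__main__":
    # X92973 is the nucleotide accession listed for BoNT from strain 62A.
    print(build_efetch_url("X92973"))
    # CAA63551 is the matching protein accession in the same row.
    print(build_efetch_url("CAA63551", db="protein"))
```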

Pathema's Annotated BoNT Genes

Detailed results of Pathema's annotation of Clostridium botulinum can be found at the Clostridium botulinum Hall strain A Genome Page. You can also navigate directly to subsets of Pathema's annotated genes by the links provided below.

Genes involved in specific Role Categories can be viewed at Pathema's Role Categories Search page, including Amino acid biosynthesis, Cellular processing, Energy metabolism, Protein synthesis and Transcription searches. Here, we provide links to a subset of toxin-related Role Categories.

Genome Annotation Data for Clostridium botulinum Genomes

Genome annotation data for Clostridium botulinum genomes were obtained from the Entrez Genome database at NCBI. The table lists the Genome Name, the associated RefSeq Accession (linked to a graphical display of the genome annotation) and the Number of Annotated Genes (linked to NCBI's Entrez Gene database). Additional genome-related information on the corresponding Pathema Genome Pages can be viewed by clicking on the Genome Name link.

| Genome Name | RefSeq Accession | Number of Annotated Genes |
|---|---|---|
| Clostridium botulinum Hall strain A | | |
| Clostridium botulinum phage C-St, complete genome | NC_007581 | 198 |

Synthetic Genes

The difficulty and high cost of culturing large quantities of toxin-producing Clostridium botulinum, and the danger involved in working with highly toxic components such as formalin, have led to the use of synthetic genes in new vaccine development. Because naturally occurring clostridial DNA contains rare codons and has an AT-rich base composition, synthetic genes are constructed for optimal expression in heterologous expression systems such as E. coli and yeast (Byrne MP and LA Smith 2000, PMID: 11086225).
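
The AT-rich base composition mentioned above is straightforward to quantify. The following is a minimal sketch, assuming a plain DNA string as input; the function name `at_content` and the example sequence are hypothetical and do not come from any record cited on this page.

```python
def at_content(sequence):
    """Return the fraction of A and T bases among the A, C, G and T bases.

    Characters other than A, C, G and T (for example N or gaps) are ignored.
    Returns 0.0 when the sequence contains no countable bases.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    at_count = sum(1 for b in bases if b in "AT")
    return at_count / len(bases)


if __name__ == "__main__":
    # Made-up fragment for illustration only; not a real clostridial sequence.
    example = "ATGAAATTTATTAATAAAGGT"
    print(f"AT content: {at_content(example):.2f}")
```

Comparing this value for a native gene and its synthetic counterpart gives a quick check on how far the redesign shifted the base composition; it does not by itself assess codon usage.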

Synthetically constructed genes are presented with their Synthetic Gene Name, Gene Source, PubMed ID, Nucleotide Accession and Expression System. The Gene Source is one of the following: the N-terminal catalytic endopeptidase domain (light chain, Lc), the translocation domain (HN or H1 of the heavy chain, Hc) or the C-terminal binding domain (HC or H2 of the heavy chain, Hc).


| Synthetic Gene Name | Gene Source | PubMed ID | Nucleotide Accession | Expression System |
|---|---|---|---|---|
| bntAC-1(botA) | BoNT/A Hc | PMID: 7790092 | U22962 | Escherichia coli |
| rBoNT(H(C)) | BoNT Hc | PMID: 11086225 | | Pichia pastoris |
| rBoNT/A LC | BoNT/A Lc | PMID: 11195972 | | Escherichia coli |
| BoNT/E Hc | BoNT/E Hc | | DQ497397 | |

Pathema's Genome Tools and Resources

Navigate to Pathema's Clostridium botulinum Resource Page to view Genome Annotation-related information including links to pre-computed Searches, Pathema's Genome Tools, Comparative Analysis resources and gene-specific information.