; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC11g0141 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC11g0141
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionThioredoxin
Genome locationMC11:1011330..1015221
RNA-Seq ExpressionMC11g0141
SyntenyMC11g0141
Gene Ontology termsGO:0006662 - glycerol ether metabolic process (biological process)
GO:0015035 - protein disulfide oxidoreductase activity (molecular function)
InterPro domainsIPR013766 - Thioredoxin domain
IPR017937 - Thioredoxin, conserved site
IPR036249 - Thioredoxin-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7037881.1 Thioredoxin H1 [Cucurbita argyrosperma subsp. argyrosperma]2.96e-6585.71Show/hide
Query:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR
        M  K EVIAC TVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAP+FAELA KMS+VIFLKVDVDEL  VAAEWGVSALPCF+FLKNG +VDRFVGAR
Subjt:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR

Query:  RDVLQKIILQHA
        +D LQKI+LQ+A
Subjt:  RDVLQKIILQHA

XP_022158840.1 thioredoxin H1-like [Momordica charantia]3.73e-77100Show/hide
Query:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR
        MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR
Subjt:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR

Query:  RDVLQKIILQHA
        RDVLQKIILQHA
Subjt:  RDVLQKIILQHA

XP_022982430.1 thioredoxin H1-like [Cucurbita maxima]6.21e-6786.61Show/hide
Query:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR
        M  K EVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAP+FAELA K+S+VIFLKVDVDEL  VAAEWGVSALPCF+FLKNG +VDRFVGAR
Subjt:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR

Query:  RDVLQKIILQHA
        +D LQKI+LQHA
Subjt:  RDVLQKIILQHA

XP_023525273.1 thioredoxin H1-like [Cucurbita pepo subsp. pepo]3.60e-6685.71Show/hide
Query:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR
        M  K EVIACHTVGSWKQQ+LKGKQSNKLIVVDFTAAWCGPCRAMAP+FAELA KMS+VIFLKVDVDEL  VAAEWGVSALPCF+FLKNG +VDRFVGAR
Subjt:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR

Query:  RDVLQKIILQHA
        +D LQKI+LQ+A
Subjt:  RDVLQKIILQHA

XP_038898480.1 thioredoxin H1-like [Benincasa hispida]9.10e-6683.04Show/hide
Query:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR
        M+ K EVI CHTVGSWKQQLLKGKQS+KLIVVDFTAAWCGPCRA+AP+FAELA KMS+VIFLKVDVD+LTTVAAEWGVSALPCF+FLKNG +V+RFVGAR
Subjt:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR

Query:  RDVLQKIILQHA
        +D L+K++LQHA
Subjt:  RDVLQKIILQHA

TrEMBL top hitse value%identityAlignment
A0A1S3CSR4 Thioredoxin8.30e-6581.25Show/hide
Query:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR
        M +K +VI CHTV SWKQQLLKGKQS+KLIVVDFTAAWCGPCRA+AP+F ELA KMS+VIFLKVDVD+LTTVAAEWGVSALPCF+FLKNGN+VDRFVGAR
Subjt:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR

Query:  RDVLQKIILQHA
        +D L+K++L HA
Subjt:  RDVLQKIILQHA

A0A5A7V9W3 Thioredoxin8.30e-6581.25Show/hide
Query:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR
        M +K +VI CHTV SWKQQLLKGKQS+KLIVVDFTAAWCGPCRA+AP+F ELA KMS+VIFLKVDVD+LTTVAAEWGVSALPCF+FLKNGN+VDRFVGAR
Subjt:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR

Query:  RDVLQKIILQHA
        +D L+K++L HA
Subjt:  RDVLQKIILQHA

A0A6J1DWZ0 thioredoxin H1-like1.81e-77100Show/hide
Query:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR
        MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR
Subjt:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR

Query:  RDVLQKIILQHA
        RDVLQKIILQHA
Subjt:  RDVLQKIILQHA

A0A6J1FL98 thioredoxin H1-like7.32e-6584.82Show/hide
Query:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR
        M  K EVIAC TVGSWKQQLLKGKQSNKLIVVDFTA WCGPCRAMAP+FAELA KMS+VIFLKVDVDEL  VAAEWGVSALPCF+FLKNG +VDRFVGAR
Subjt:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR

Query:  RDVLQKIILQHA
        +D LQKI+LQ+A
Subjt:  RDVLQKIILQHA

A0A6J1IZB4 thioredoxin H1-like3.01e-6786.61Show/hide
Query:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR
        M  K EVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAP+FAELA K+S+VIFLKVDVDEL  VAAEWGVSALPCF+FLKNG +VDRFVGAR
Subjt:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR

Query:  RDVLQKIILQHA
        +D LQKI+LQHA
Subjt:  RDVLQKIILQHA

SwissProt top hitse value%identityAlignment
P29448 Thioredoxin H14.4e-3455.05Show/hide
Query:  DKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGARRD
        ++ +VIACHTV +W +QL K  +S  L+VVDFTA+WCGPCR +AP FA+LA K+ +V+FLKVD DEL +VA++W + A+P F+FLK G ++D+ VGA++D
Subjt:  DKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGARRD

Query:  VLQKIILQH
         LQ  I +H
Subjt:  VLQKIILQH

P29449 Thioredoxin H-type 19.9e-3456.36Show/hide
Query:  DKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGARRD
        ++ +V  CH V  W +   KG ++ KL+VVDFTA+WCGPCR +AP+ A++A KM  VIFLKVDVDEL TV+AEW V A+P FVF+K+G  VDR VGA+++
Subjt:  DKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGARRD

Query:  VLQKIILQHA
         LQ+ I++HA
Subjt:  VLQKIILQHA

Q07090 Thioredoxin H-type 21.1e-3255.86Show/hide
Query:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR
        M ++ +VI  HTV +W + L KG    KLIVVDFTA+WCGPC+ +A  +AELA KM +V FLKVDVDEL +VA +W V A+P F+FLK G +VD+ VGA+
Subjt:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR

Query:  RDVLQKIILQH
        +D LQ+ I +H
Subjt:  RDVLQKIILQH

Q39362 Thioredoxin H-type 21.6e-3158.18Show/hide
Query:  DKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANK-MSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGARR
        ++ +VI CH +  W  QL   KQSNKLIV+DFTA+WC PCR +AP+FA+LA K MSS IF KVDVDEL  VA E+GV A+P FV +K+GN+VD+ VGAR+
Subjt:  DKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANK-MSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGARR

Query:  DVLQKIILQH
        + L   I +H
Subjt:  DVLQKIILQH

Q43636 Thioredoxin H-type7.6e-3457.8Show/hide
Query:  DKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGARRD
        ++ +VI CHTV +W +QL KG  +  LIVVDFTA+WCGPCR +AP  AELA K+ +V FLKVDVDEL TVA EW V ++P F+FLK G ++D+ VGA++D
Subjt:  DKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGARRD

Query:  VLQKIILQH
         LQ+ I +H
Subjt:  VLQKIILQH

Arabidopsis top hitse value%identityAlignment
AT1G19730.1 Thioredoxin superfamily protein2.5e-3258.18Show/hide
Query:  DKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANK-MSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGARR
        ++ +VI CHT   W  QL K K+SNKLIV+DFTA+WC PCR +AP+F +LA K MSS IF KVDVDEL +VA E+GV A+P FVF+K G +VD+ VGA +
Subjt:  DKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANK-MSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGARR

Query:  DVLQKIILQH
        + LQ  I++H
Subjt:  DVLQKIILQH

AT1G45145.1 thioredoxin H-type 51.5e-3251.35Show/hide
Query:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR
        M  + EVIACHT+  W +++    +S KLIV+DFTA+WC PCR +AP+FAE+A K ++V+F K+DVDEL  VA E+ V A+P FVF+K GN++DR VGA 
Subjt:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR

Query:  RDVLQKIILQH
        +D + + +++H
Subjt:  RDVLQKIILQH

AT3G17880.1 tetraticopeptide domain-containing thioredoxin6.6e-2542.99Show/hide
Query:  EVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGARRDVLQ
        EVI+ H+    + +    K++++L+++ FTA WCGPCR M+PL++ LA + S V+FLKVD+D+   VAA W +S++P F F+++G  VD+ VGA +  L+
Subjt:  EVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGARRDVLQ

Query:  KIILQHA
        + I QH+
Subjt:  KIILQHA

AT3G51030.1 thioredoxin H-type 13.2e-3555.05Show/hide
Query:  DKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGARRD
        ++ +VIACHTV +W +QL K  +S  L+VVDFTA+WCGPCR +AP FA+LA K+ +V+FLKVD DEL +VA++W + A+P F+FLK G ++D+ VGA++D
Subjt:  DKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGARRD

Query:  VLQKIILQH
         LQ  I +H
Subjt:  VLQKIILQH

AT5G42980.1 thioredoxin 35.8e-2950.45Show/hide
Query:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR
        M  + EVIACHTV  W ++L    +S KLIV+DFTA WC PCR +AP+FA+LA K   V+F KVDVDEL TVA E+ V A+P F+F+K G + +  VGA 
Subjt:  MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGAR

Query:  RDVLQKIILQH
        ++ +   + +H
Subjt:  RDVLQKIILQH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGACAAGAGGGAAGTGATTGCGTGCCATACCGTTGGATCGTGGAAGCAGCAGCTCCTCAAGGGAAAACAATCCAATAAACTGATTGTTGTGGACTTCACTGCCGC
GTGGTGCGGTCCATGCCGTGCCATGGCTCCACTGTTTGCAGAGTTGGCCAATAAGATGAGTAGTGTCATATTCTTGAAGGTCGATGTTGATGAATTGACTACTGTTGCTG
CGGAGTGGGGAGTGAGTGCACTTCCTTGCTTCGTATTCTTGAAGAATGGAAATTTGGTGGACAGGTTTGTAGGTGCAAGAAGAGATGTGTTGCAGAAGATTATATTACAG
CATGCCTAA
mRNA sequenceShow/hide mRNA sequence
AGATACACCCAATCATAACCAAAAATGAGAAAAACATACTACAGATTAGAAAACAGATCGTTGTCTCGTCCCAAACAAGCGGGATAATGATACAGTGTGACCTTTGGTCC
ACTGATCAAAGATCCTCAGCATCCATTCAACCGCCAAACCAAACAGAACGTCCATCTCCACGACCTCCATAAATAATAATCTAATTCAAGCAAATGTTAAATACAAGTTG
TGTGTGAAGTTTCTTGAAAACTTATGGTGGTCAGTGCAAAGCAAGTAATGGTGGTCAGTGCAAAGCAAGTAATGGTGGTAGCTTAGAAGAGAGGTGTGGGCTGAGTACTA
AAAGGATCAAAAACTAGTTGTGTTGGCTAAGTTGCTATGAAAATGAACTAACAGAACAATATTCATCATCACAAACCAAACCATACAAGAAATTTCACCATCCAACAGTC
CAACATAATGAACTATTTCACTCAACCTTAACATTGCCCAACAATAGGAAGCTAATTTAGCTTTCACAAGTACAATAGTCGTCAAAGAAAGACTAACATCTGATGCTCAT
CTTAAATGTGAAGATCTTTAGATGTTCAGCTTTGAGAGCCATAGACCTCTCAATAAATATCCAAGTGAAATCGAAGTCTGGCTTCAGAAGAAGGAAGGCCAAGAGCTCAA
ATTATTCACTTAACTCTTCAGATTATTTGCAATTTCTATATCAAGAGTGTAAGCAATGGTACACCAGATGAAGCGAATTCAATTATATATCAAAATTTACAGCTTTAACA
ACATCCGTCCTCTGTATAAAATACGTGTAAAAGCAAGCAAAATTAAGCATTAGTACCTCCAATTTGCTACCTTATGCTTCAATTATCTAGCTTAATCTCTTTCCACGGAA
ACTTCTTGGACTTCCCTTTCAACTGGTGGACGGGAACACATCTTTAACAGCGGAAGCGGCGGTCAAGTAATTAAGAGTGTTTCTCAATGTTTCCTTCTCCCTTTTCAACC
TCGCCGCCGTATCCTCCAATTCAAGCAACGCCTGCTGCTCTCTCGGCGCACCTTCAAAAGTGCTTCCGACGAAGAACGAGAACGGAGTTGGGAATAAGTTCCTTCTCAAA
TCTTGAACCTCTTTCTCCGGCTTCCCACTGAGTCGGTTCGACAACCGAATCACATCCTTCATATACGATTCAACCTCGTTGGCCAATGCATCTAAATCCTCTTCTCCACT
GCCGGTCGGCCGGTCTTCAAGCCAGGTCACTTCGGCGACGAGGTACGGTTTCGTCCGAACAAGATTGGTGACACGGAACCGCTCTTGACCCTTACAGATGAGGAAGAATC
GGTCGTCAACGAGGCGCTCGTGTTTGACGACCTCACCGACGCATCCAACATCCGTCGTACCCGAAACCGCATCCGTATAAATAACTCCGAAGCGGAGATCGGTCTGGAGC
AGAGTGTGCATCATCATCCGGTAGCGGAACTCGAAGATCTGCAGAGGAAGAATAGCGCCGGGGAAGAGAACGAGAGGAAGAGGAAACAGCGGTAGCTCAACGACGTCATC
GGATTTCGGAGAACCGGTGTGGTGCTTTTCCGGGAAAGACGAAGCAGAACACTTAAGCGGATTCGGCTTGTAGCGGCGGTGGCGAAGGGCGGCCGGAGAGAAGAATCTAG
GGCTAGGGTTTAAGGAAGATGAAATGGTGTTGGGATTTAGAATGGGTTTCTGGGATAGAGACGAAGAGGAAGAGGAGGGAATTAGCTGAGGGAGAGCCATTGTAGTGCTG
CCAGTGTTCCAAGTGTTCTAGATAGAGAACCAAATCGCTCGTTTGCTTGTAGAGAGACTGTAATTCTACTCTCTGTTCTCTGCCTCACAAAAATCAAATTTCCCCACAAA
TTAAAATTATTTCACTCCATTTCCATCCCTCGCTTTCTCTCTCCAATCTGAAAAATCTGGTGGTTTTAAGCAGAGGCCGAGTCGAATTGAGATTGTGGATAATATAGAGA
GAGGAAATACTCGAGAATGTAGGATGTGGACTGAGATTTTCCAACGGGGGACTAGAACTTGAAGCAGAGCCGAAAACCTGCCGATACTACGACTCTGATGATACGAATTT
TCATTTTCATTTGGGTTTATTTGAGAAACTCGGCCCATTATTGGTGAGGAAGCCCATTTCGAGCCCACATATTTCCTCCAGAAAGGCCCAGATATTACATATTGAGGCCC
AGCAACTTTGGGCTCGTGTGAATGGCTAGAGATTGAACGATGCTTATATAGTTGCGTACCGCGTAGCAATAACTGCAAAATCAGATGCTCTTCTCTTTTAGCTGTATTTG
GTCTGGCCTAACGCACTTCCATCGACGGGCGTTTGGAGGAAATTGTTCAGATTCTTGAAGTTTTCATCGTTGGACGACGATCTTACAGAGAGAATCACTTACAACTTACA
AGTTACAGCTCTCTGTGCTTTTGTTTTCGAGGTCGAAAATGGTGGACAAGAGGGAAGTGATTGCGTGCCATACCGTTGGATCGTGGAAGCAGCAGCTCCTCAAGGGAAAA
CAATCCAATAAACTGATTGTTGTGGACTTCACTGCCGCGTGGTGCGGTCCATGCCGTGCCATGGCTCCACTGTTTGCAGAGTTGGCCAATAAGATGAGTAGTGTCATATT
CTTGAAGGTCGATGTTGATGAATTGACTACTGTTGCTGCGGAGTGGGGAGTGAGTGCACTTCCTTGCTTCGTATTCTTGAAGAATGGAAATTTGGTGGACAGGTTTGTAG
GTGCAAGAAGAGATGTGTTGCAGAAGATTATATTACAGCATGCCTAATGGAAGTCTATGCAACTTTTGTGAGTTGCTATAAAGTGATTCTTTTAGTTAGAACAATTTGAA
TTTTTCTTGTGTTGGATTGTTTTTAATCAGTTTACTATTATGAGGCTACCGGACTCGTTATGTAATAAGTTAACATCATTATTTTCTTGAATAAAAATGAGCATCAATTT
ATAGTTGCTTTAAACATTCTCAGTGGTTTCTCCAACTGCTTAAACTATCATACTTGGACATTGAAAAATGTAACTTGCCAAGAGGCTAATGAAAATACTGATCCAAATTA
AACAGAATGTTAAAATGAGGAGAGATTCTGAAAAGGGGATTCATCAAAAGAGAAAATTACTCTAATGACAATAAATACATAATTTTATTATAACTTAAACTCATCTCATC
TGATTAATACAACGCTTAGGAACAGGGACTTTGGCACAAATGGCTACCATGATTTTTCAGGAAAAACAATGTAATGACCTTCAATCTTCCATTCCACTAATGATAAATTA
CTAATATTTATCATATATAATATGCAATGCTCTACATATGATCAATGAAATTTACCTTCAGCGTGTAAGTGACCCCCTTGGCTTATGGAGCAAGAATACAGGCAGTCAAA
AAAATTCTTATGAACCACGCCCATCGATATCTGCTACCGCACAATCGACCTCAACTAATAATTGTTTCAATTCACTTGGACATGGGTCTTCTCCAAAGTTACCTCTGGTT
GTTGACACCAAGCAACTCTCCTGTCCAAGGCATTT
Protein sequenceShow/hide protein sequence
MVDKREVIACHTVGSWKQQLLKGKQSNKLIVVDFTAAWCGPCRAMAPLFAELANKMSSVIFLKVDVDELTTVAAEWGVSALPCFVFLKNGNLVDRFVGARRDVLQKIILQ
HA