; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC08g1855 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC08g1855
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionDSBA domain-containing protein
Genome locationMC08:26823155..26826527
RNA-Seq ExpressionMC08g1855
SyntenyMC08g1855
Gene Ontology termsGO:0016491 - oxidoreductase activity (molecular function)
InterPro domainsIPR001853 - DSBA-like thioredoxin domain
IPR036249 - Thioredoxin-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048695.1 hypothetical protein E6C27_scaffold4358G00140 [Cucumis melo var. makuwa]3.85e-16083.77Show/hide
Query:  MEKLQIWPCPGFNCHCGSFLPTDSKIF-----------QGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWH
        ME L+I P   FNCHCGSFLP DS+              GE K+ RIM ESVGSRNM+KKLIQIDISSDTVCPWCFVGKKNLDKAI+ASQDQYDFEL WH
Subjt:  MEKLQIWPCPGFNCHCGSFLPTDSKIF-----------QGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWH

Query:  PFQLNPSAPKEGVVKREFYRTKFGIQSEQMESRMAEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECA
        PFQLNPSAPKEGVVK E+YR+KFGIQSEQME+RMAEVFRGLGLDYD SGLTGNTLDSH+LIYLAGQQGL KQHDLVEELC+GYFTQGKYIGDR+FLLECA
Subjt:  PFQLNPSAPKEGVVKREFYRTKFGIQSEQMESRMAEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECA

Query:  RKAGVEGAAEFLGSDDNGVNQVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK
        RKAGVEGAAEFL + DNGV +VKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK
Subjt:  RKAGVEGAAEFLGSDDNGVNQVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK

TYK14915.1 hypothetical protein E5676_scaffold1623G00040 [Cucumis melo var. makuwa]4.69e-16184.15Show/hide
Query:  MEKLQIWPCPGFNCHCGSFLPTDSKIFQ-----------GEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWH
        ME L+I P   FNCHCGSFLP DS+              GEYK+ RIM ESVGSRNM+KKLIQIDISSDTVCPWCFVGKKNLDKAI+ASQDQYDFEL WH
Subjt:  MEKLQIWPCPGFNCHCGSFLPTDSKIFQ-----------GEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWH

Query:  PFQLNPSAPKEGVVKREFYRTKFGIQSEQMESRMAEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECA
        PFQLNPSAPKEGVVK E+YR+KFGIQSEQME+RMAEVFRGLGLDYD SGLTGNTLDSH+LIYLAGQQGL KQHDLVEELC+GYFTQGKYIGDR+FLLECA
Subjt:  PFQLNPSAPKEGVVKREFYRTKFGIQSEQMESRMAEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECA

Query:  RKAGVEGAAEFLGSDDNGVNQVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK
        RKAGVEGAAEFL + DNGV +VKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK
Subjt:  RKAGVEGAAEFLGSDDNGVNQVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK

XP_022155021.1 uncharacterized protein LOC111022167 [Momordica charantia]2.96e-168100Show/hide
Query:  SKIFQGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSAPKEGVVKREFYRTKFGIQSEQMESRM
        SKIFQGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSAPKEGVVKREFYRTKFGIQSEQMESRM
Subjt:  SKIFQGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSAPKEGVVKREFYRTKFGIQSEQMESRM

Query:  AEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKYSGKIS
        AEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKYSGKIS
Subjt:  AEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKYSGKIS

Query:  GVPFYVINGKHKLSGAQPPEVFLRAFQVAGK
        GVPFYVINGKHKLSGAQPPEVFLRAFQVAGK
Subjt:  GVPFYVINGKHKLSGAQPPEVFLRAFQVAGK

XP_022980705.1 uncharacterized protein LOC111479994 [Cucurbita maxima]2.76e-15289.41Show/hide
Query:  LPTDS-KIFQGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSAPKEGVVKREFYRTKFGIQSEQ
        LP +S + FQG+ KY R MAESVGSR+M+KKLIQIDISSDTVCPWCFVGKKNLDKAI+ASQDQY FELKWHPFQLNPSAPKEGVVK+EFYR+KFGIQSEQ
Subjt:  LPTDS-KIFQGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSAPKEGVVKREFYRTKFGIQSEQ

Query:  MESRMAEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKY
        ME+RM EVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELC+GYFTQGKYIGDR+FLLECARKA VEGAAEFL SDD GV +VKEELEKY
Subjt:  MESRMAEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKY

Query:  SGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK
        SGKISGVPFYVINGKHKL+GAQPPEVFLRAFQVAGK
Subjt:  SGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK

XP_038900410.1 uncharacterized protein YwbO [Benincasa hispida]3.82e-15490.48Show/hide
Query:  SKIFQGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSAPKEGVVKREFYRTKFGIQSEQMESRM
        S+ FQGE KY RIMAESVGSRNM+KKLIQIDISSDTVCPWCFVGKKNLDKAI+ASQDQY+FELKWHPFQLNPSAPKEG+VKRE+YR+KFGIQS+QME+RM
Subjt:  SKIFQGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSAPKEGVVKREFYRTKFGIQSEQMESRM

Query:  AEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKYSGKIS
        AEVFRGLGLDYDMSGLTGNTLDSH+LIYLAGQQGL KQHDLVEELC+GYFTQGKYIGDR FLLECARKAGVEGAAEFL S DNGV ++KEEL+KYSGKIS
Subjt:  AEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKYSGKIS

Query:  GVPFYVINGKHKLSGAQPPEVFLRAFQVAGK
        GVPFYVINGKHKLSGAQPPEVFLRAFQVAGK
Subjt:  GVPFYVINGKHKLSGAQPPEVFLRAFQVAGK

TrEMBL top hitse value%identityAlignment
A0A5A7U4Z3 DSBA domain-containing protein1.86e-16083.77Show/hide
Query:  MEKLQIWPCPGFNCHCGSFLPTDSKIF-----------QGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWH
        ME L+I P   FNCHCGSFLP DS+              GE K+ RIM ESVGSRNM+KKLIQIDISSDTVCPWCFVGKKNLDKAI+ASQDQYDFEL WH
Subjt:  MEKLQIWPCPGFNCHCGSFLPTDSKIF-----------QGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWH

Query:  PFQLNPSAPKEGVVKREFYRTKFGIQSEQMESRMAEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECA
        PFQLNPSAPKEGVVK E+YR+KFGIQSEQME+RMAEVFRGLGLDYD SGLTGNTLDSH+LIYLAGQQGL KQHDLVEELC+GYFTQGKYIGDR+FLLECA
Subjt:  PFQLNPSAPKEGVVKREFYRTKFGIQSEQMESRMAEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECA

Query:  RKAGVEGAAEFLGSDDNGVNQVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK
        RKAGVEGAAEFL + DNGV +VKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK
Subjt:  RKAGVEGAAEFLGSDDNGVNQVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK

A0A5D3CTQ6 DSBA domain-containing protein2.27e-16184.15Show/hide
Query:  MEKLQIWPCPGFNCHCGSFLPTDSKIFQ-----------GEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWH
        ME L+I P   FNCHCGSFLP DS+              GEYK+ RIM ESVGSRNM+KKLIQIDISSDTVCPWCFVGKKNLDKAI+ASQDQYDFEL WH
Subjt:  MEKLQIWPCPGFNCHCGSFLPTDSKIFQ-----------GEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWH

Query:  PFQLNPSAPKEGVVKREFYRTKFGIQSEQMESRMAEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECA
        PFQLNPSAPKEGVVK E+YR+KFGIQSEQME+RMAEVFRGLGLDYD SGLTGNTLDSH+LIYLAGQQGL KQHDLVEELC+GYFTQGKYIGDR+FLLECA
Subjt:  PFQLNPSAPKEGVVKREFYRTKFGIQSEQMESRMAEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECA

Query:  RKAGVEGAAEFLGSDDNGVNQVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK
        RKAGVEGAAEFL + DNGV +VKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK
Subjt:  RKAGVEGAAEFLGSDDNGVNQVKEELEKYSGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK

A0A6J1DQG1 uncharacterized protein LOC1110221671.43e-168100Show/hide
Query:  SKIFQGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSAPKEGVVKREFYRTKFGIQSEQMESRM
        SKIFQGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSAPKEGVVKREFYRTKFGIQSEQMESRM
Subjt:  SKIFQGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSAPKEGVVKREFYRTKFGIQSEQMESRM

Query:  AEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKYSGKIS
        AEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKYSGKIS
Subjt:  AEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKYSGKIS

Query:  GVPFYVINGKHKLSGAQPPEVFLRAFQVAGK
        GVPFYVINGKHKLSGAQPPEVFLRAFQVAGK
Subjt:  GVPFYVINGKHKLSGAQPPEVFLRAFQVAGK

A0A6J1GTB1 uncharacterized protein LOC1114574031.56e-15188.98Show/hide
Query:  LPTDS-KIFQGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSAPKEGVVKREFYRTKFGIQSEQ
        LP +S + FQG+ KY R MAESVGSR+M+KKLIQIDISSDTVCPWCFVGKKNLDKAI+ASQDQY FELKWHPFQLNPSAPKEGVVK+EFYR+KFGIQSEQ
Subjt:  LPTDS-KIFQGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSAPKEGVVKREFYRTKFGIQSEQ

Query:  MESRMAEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKY
        ME+RM EVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELC+GYFTQGKYIGDR+FLLECA KA VEGAAEFL SDD GV +VKEELEKY
Subjt:  MESRMAEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKY

Query:  SGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK
        SGKISGVPFYVINGKHKL+GAQPPEVFLRAFQVAGK
Subjt:  SGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK

A0A6J1IXA2 uncharacterized protein LOC1114799941.34e-15289.41Show/hide
Query:  LPTDS-KIFQGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSAPKEGVVKREFYRTKFGIQSEQ
        LP +S + FQG+ KY R MAESVGSR+M+KKLIQIDISSDTVCPWCFVGKKNLDKAI+ASQDQY FELKWHPFQLNPSAPKEGVVK+EFYR+KFGIQSEQ
Subjt:  LPTDS-KIFQGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSAPKEGVVKREFYRTKFGIQSEQ

Query:  MESRMAEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKY
        ME+RM EVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELC+GYFTQGKYIGDR+FLLECARKA VEGAAEFL SDD GV +VKEELEKY
Subjt:  MESRMAEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKY

Query:  SGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK
        SGKISGVPFYVINGKHKL+GAQPPEVFLRAFQVAGK
Subjt:  SGKISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK

SwissProt top hitse value%identityAlignment
P39598 Uncharacterized protein YwbO1.6e-1024.27Show/hide
Query:  IQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSA-----PKEGVVKREFYRTKFGIQSEQMESRMAEVFRGLGLDYDMSGLTGNTLD
        + I + SD VCP+CFVGK   ++AI       D E++W PF+L PS      P     K+  ++T     +E+           LG++ +   ++ +   
Subjt:  IQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSA-----PKEGVVKREFYRTKFGIQSEQMESRMAEVFRGLGLDYDMSGLTGNTLD

Query:  SHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKYS---GKISGVPFYVINGKHKLSGAQPP
                  +  +K H+    +   +F + + IGD + L + A + G++GA+     +      V+ +  K++     I+ VP ++I G   + GA   
Subjt:  SHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKYS---GKISGVPFYVINGKHKLSGAQPP

Query:  EVFLRA
        +VF +A
Subjt:  EVFLRA

Arabidopsis top hitse value%identityAlignment
AT5G38900.1 Thioredoxin superfamily protein3.2e-8367.59Show/hide
Query:  MAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSAPKEGVVKREFYRTKFGIQSEQMESRMAEVFRGLGLDYDM
        MAES  S    KKLIQID+SSD+VCPWCFVGKKNLDKAI AS+DQY+FE++W PF L+PSAPKEGV K+EFY  K+G + + M +RM+EVF+GLGL++D 
Subjt:  MAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSAPKEGVVKREFYRTKFGIQSEQMESRMAEVFRGLGLDYDM

Query:  SGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKYSGKISGVPFYVINGKHKL
        +GLTGN+LDSHRLI+  G+Q  +KQH LVEEL +GYFTQGK+IGDREFL+E A K G+EGA EFL   +NGV +VKEEL KYS  I+GVP Y INGK KL
Subjt:  SGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKYSGKISGVPFYVINGKHKL

Query:  SGAQPPEVFLRAFQVA
        SGAQPPE F  AF+ A
Subjt:  SGAQPPEVFLRAFQVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAGCTTCAAATATGGCCATGCCCCGGATTCAACTGCCACTGCGGTTCGTTTCTCCCAACAGATTCCAAAATTTTCCAGGGGGAGTACAAATATTTCAGGATCAT
GGCTGAGTCGGTTGGAAGTAGGAATATGGAGAAAAAGCTTATACAAATTGATATAAGCTCAGATACGGTTTGCCCGTGGTGCTTTGTAGGCAAAAAGAATCTTGACAAAG
CTATAGCGGCTTCTCAGGATCAATATGATTTTGAGTTAAAGTGGCATCCATTCCAGCTCAATCCTTCTGCTCCCAAGGAGGGCGTTGTTAAGAGGGAATTTTACAGGACG
AAGTTCGGAATTCAATCTGAACAGATGGAATCTCGGATGGCAGAGGTGTTTAGGGGTCTTGGATTGGATTATGACATGTCTGGACTGACGGGAAATACTCTAGACAGCCA
CAGACTTATATATTTGGCAGGTCAACAGGGTCTAGACAAGCAACATGACCTTGTGGAGGAGTTGTGCGTTGGATACTTCACTCAAGGAAAATACATTGGTGACCGGGAAT
TTCTTTTGGAATGTGCTAGAAAGGCTGGGGTGGAAGGAGCAGCTGAGTTTCTCGGGTCAGATGATAACGGAGTGAACCAGGTTAAGGAAGAGCTTGAGAAGTACTCGGGA
AAAATTTCAGGAGTCCCCTTCTACGTTATCAATGGGAAGCATAAACTGAGTGGTGCTCAACCCCCTGAGGTTTTCCTAAGAGCTTTTCAAGTGGCTGGAAAATAA
mRNA sequenceShow/hide mRNA sequence
ATTTTAGAACAACATATACGTGTAGCCACGAAAACGGTATTTTATTTTTTAACATTGAAAAAAACACCGCAACTATATTTTAAATTTTATTTTTCACTCCCTTAGGCTTA
TTTATTATTTATTATTTCCCGGTGTGAAAGTCGTAGGTGGTAGGTAAACGTTGAAAATGGAGAAGCTTCAAATATGGCCATGCCCCGGATTCAACTGCCACTGCGGTTCG
TTTCTCCCAACAGATTCCAAAATTTTCCAGGGGGAGTACAAATATTTCAGGATCATGGCTGAGTCGGTTGGAAGTAGGAATATGGAGAAAAAGCTTATACAAATTGATAT
AAGCTCAGATACGGTTTGCCCGTGGTGCTTTGTAGGCAAAAAGAATCTTGACAAAGCTATAGCGGCTTCTCAGGATCAATATGATTTTGAGTTAAAGTGGCATCCATTCC
AGCTCAATCCTTCTGCTCCCAAGGAGGGCGTTGTTAAGAGGGAATTTTACAGGACGAAGTTCGGAATTCAATCTGAACAGATGGAATCTCGGATGGCAGAGGTGTTTAGG
GGTCTTGGATTGGATTATGACATGTCTGGACTGACGGGAAATACTCTAGACAGCCACAGACTTATATATTTGGCAGGTCAACAGGGTCTAGACAAGCAACATGACCTTGT
GGAGGAGTTGTGCGTTGGATACTTCACTCAAGGAAAATACATTGGTGACCGGGAATTTCTTTTGGAATGTGCTAGAAAGGCTGGGGTGGAAGGAGCAGCTGAGTTTCTCG
GGTCAGATGATAACGGAGTGAACCAGGTTAAGGAAGAGCTTGAGAAGTACTCGGGAAAAATTTCAGGAGTCCCCTTCTACGTTATCAATGGGAAGCATAAACTGAGTGGT
GCTCAACCCCCTGAGGTTTTCCTAAGAGCTTTTCAAGTGGCTGGAAAATAAAAGCTTTTGGATTTAACACCATCAATAAGAATGTAACTTGTTTGCTGAAACACCAGAAT
ACATACTTGTTGATCTTGTATTTATTACCATATTGAAAATGCATGTCAGATCAGACTACTATTTACTGTTTTCCCAGCAATAAAATACTAGGTTGTACTCTGTTAGCTTA
GCCCATGTTGACATTTTAATAGCTTTGTTTGGACAGAAATATAGTATATAATGAAATGGAGTTATTCAAATGATCTGGACC
Protein sequenceShow/hide protein sequence
MEKLQIWPCPGFNCHCGSFLPTDSKIFQGEYKYFRIMAESVGSRNMEKKLIQIDISSDTVCPWCFVGKKNLDKAIAASQDQYDFELKWHPFQLNPSAPKEGVVKREFYRT
KFGIQSEQMESRMAEVFRGLGLDYDMSGLTGNTLDSHRLIYLAGQQGLDKQHDLVEELCVGYFTQGKYIGDREFLLECARKAGVEGAAEFLGSDDNGVNQVKEELEKYSG
KISGVPFYVINGKHKLSGAQPPEVFLRAFQVAGK