; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g0016 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g0016
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionRubredoxin-like domain-containing protein
Genome locationMC06:133933..140359
RNA-Seq ExpressionMC06g0016
SyntenyMC06g0016
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005506 - iron ion binding (molecular function)
InterPro domainsIPR024934 - Rubredoxin-like domain
IPR024935 - Rubredoxin domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138023.1 uncharacterized protein LOC101207574 isoform X1 [Cucumis sativus]2.30e-7574.44Show/hide
Query:  MASVSATSLRSFQLP---HSK---EDGGADRHSN-----SNRLYLKSSFFSPLRTPSNSKIPPLGSQKSGKIAAAPKFSISMRVASKQAYICRDCGYIYN
        MAS+SA+SL SF LP   H K   EDGG DR+S      SNRL LKSSF SPLR     KIP L  Q S   AA+PKFS  MRVASKQAYICRDCGYIYN
Subjt:  MASVSATSLRSFQLP---HSK---EDGGADRHSN-----SNRLYLKSSFFSPLRTPSNSKIPPLGSQKSGKIAAAPKFSISMRVASKQAYICRDCGYIYN

Query:  DRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALVGLYLYLNSVY
        DRTPFDKLPDKYFCPVCGAPKRRFRPYEQ+V+KN NE DVRKARKAQIQ+DEA+G VLPIAAA+GIVALVGLYLYLNS +
Subjt:  DRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALVGLYLYLNSVY

XP_022134987.1 uncharacterized protein LOC111007102 [Momordica charantia]6.35e-146100Show/hide
Query:  MSLVFWPKPQSLSPLPSLLFSSLIDSLLHFLFCLSSTLHPPMASVSATSLRSFQLPHSKEDGGADRHSNSNRLYLKSSFFSPLRTPSNSKIPPLGSQKSG
        MSLVFWPKPQSLSPLPSLLFSSLIDSLLHFLFCLSSTLHPPMASVSATSLRSFQLPHSKEDGGADRHSNSNRLYLKSSFFSPLRTPSNSKIPPLGSQKSG
Subjt:  MSLVFWPKPQSLSPLPSLLFSSLIDSLLHFLFCLSSTLHPPMASVSATSLRSFQLPHSKEDGGADRHSNSNRLYLKSSFFSPLRTPSNSKIPPLGSQKSG

Query:  KIAAAPKFSISMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALV
        KIAAAPKFSISMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALV
Subjt:  KIAAAPKFSISMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALV

Query:  GLYLYLNSVY
        GLYLYLNSVY
Subjt:  GLYLYLNSVY

XP_022927654.1 uncharacterized protein LOC111434474 [Cucurbita moschata]2.27e-7878.29Show/hide
Query:  MASVSATSLRSFQLP---HSK---EDGGADRHSNSNRLYLKSSFFSPLRTPSNSKIPPLGSQKSGKIAAAPKFSISMRVASKQAYICRDCGYIYNDRTPF
        MASVSATSLR FQLP   HSK   EDGGADRHS+   L LKSSFFSPLR      IP L  Q S  +AAAPK  +SMRVASKQAYICRDCGYIYNDRTPF
Subjt:  MASVSATSLRSFQLP---HSK---EDGGADRHSNSNRLYLKSSFFSPLRTPSNSKIPPLGSQKSGKIAAAPKFSISMRVASKQAYICRDCGYIYNDRTPF

Query:  DKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALVGLYLYLNSVY
        +KLPDKYFCPVCGAPKRRFRPYEQSVTKN NE D RKARKAQIQ+DEA+G VLPIAAAVGIVALVGLYLYLN+ +
Subjt:  DKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALVGLYLYLNSVY

XP_023531329.1 uncharacterized protein LOC111793604 [Cucurbita pepo subsp. pepo]5.59e-7978.86Show/hide
Query:  MASVSATSLRSFQLP---HSK---EDGGADRHSNSNRLYLKSSFFSPLRTPSNSKIPPLGSQKSGKIAAAPKFSISMRVASKQAYICRDCGYIYNDRTPF
        MASVSATSLR FQLP   HSK   EDGGADRHS+   L LKSSFFSPLR      IP L  Q S  +AAAPK  +SMRVASKQAYICRDCGYIYNDRTPF
Subjt:  MASVSATSLRSFQLP---HSK---EDGGADRHSNSNRLYLKSSFFSPLRTPSNSKIPPLGSQKSGKIAAAPKFSISMRVASKQAYICRDCGYIYNDRTPF

Query:  DKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALVGLYLYLNSVY
        DKLPDKYFCPVCGAPKRRFRPYEQSVTKN NE D RKARKAQIQ+DEA+G VLPIAAAVGIVALVGLYLYLN+ +
Subjt:  DKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALVGLYLYLNSVY

XP_038880712.1 uncharacterized protein LOC120072320 isoform X1 [Benincasa hispida]4.21e-7978.53Show/hide
Query:  MASVSATSLRSFQLPH---SKEDGGADRHS-----NSNRLYLKSSFFSPLRTPSNSKIPPLGSQKSGKIAAAPKFSISMRVASKQAYICRDCGYIYNDRT
        MASVSA+SL SF LP    SKEDGGADR+S      SNRL LKSSF SPLR     KIP   +Q S   AAAPKFS  MRVASKQAYICRDCGYIYNDRT
Subjt:  MASVSATSLRSFQLPH---SKEDGGADRHS-----NSNRLYLKSSFFSPLRTPSNSKIPPLGSQKSGKIAAAPKFSISMRVASKQAYICRDCGYIYNDRT

Query:  PFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALVGLYLYLNSVY
        PFDKLPDKYFCPVCGAPKRRFRPYEQSVTKN NE DVRKARKA+IQ+DEA+G VLPIAAAVGIVALVGLYLYLNSV+
Subjt:  PFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALVGLYLYLNSVY

TrEMBL top hitse value%identityAlignment
A0A5A7UPF0 Rubredoxin family protein1.11e-7575Show/hide
Query:  MASVSATSLRSFQLP---HSK---EDGGADRHSN-----SNRLYLKSSFFSPLRTPSNSKIPPLGSQKSGKIAAAPKFSISMRVASKQAYICRDCGYIYN
        MASVSA+SL SF LP   H K   EDGG DR+S      SNRL LKSSF SPLR     KIP L  Q S   AA+PKFS  MRVASKQAYICRDCGYIYN
Subjt:  MASVSATSLRSFQLP---HSK---EDGGADRHSN-----SNRLYLKSSFFSPLRTPSNSKIPPLGSQKSGKIAAAPKFSISMRVASKQAYICRDCGYIYN

Query:  DRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALVGLYLYLNSVY
        DRTPFDKLPDKYFCPVCGAPKRRFRPYEQ+V KN NE D+RKARKAQIQ+DEA+G VLPIAAA+GIVALVGLYLYLNSV+
Subjt:  DRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALVGLYLYLNSVY

A0A6J1C1D8 uncharacterized protein LOC1110071023.07e-146100Show/hide
Query:  MSLVFWPKPQSLSPLPSLLFSSLIDSLLHFLFCLSSTLHPPMASVSATSLRSFQLPHSKEDGGADRHSNSNRLYLKSSFFSPLRTPSNSKIPPLGSQKSG
        MSLVFWPKPQSLSPLPSLLFSSLIDSLLHFLFCLSSTLHPPMASVSATSLRSFQLPHSKEDGGADRHSNSNRLYLKSSFFSPLRTPSNSKIPPLGSQKSG
Subjt:  MSLVFWPKPQSLSPLPSLLFSSLIDSLLHFLFCLSSTLHPPMASVSATSLRSFQLPHSKEDGGADRHSNSNRLYLKSSFFSPLRTPSNSKIPPLGSQKSG

Query:  KIAAAPKFSISMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALV
        KIAAAPKFSISMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALV
Subjt:  KIAAAPKFSISMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALV

Query:  GLYLYLNSVY
        GLYLYLNSVY
Subjt:  GLYLYLNSVY

A0A6J1EIL7 uncharacterized protein LOC1114344741.10e-7878.29Show/hide
Query:  MASVSATSLRSFQLP---HSK---EDGGADRHSNSNRLYLKSSFFSPLRTPSNSKIPPLGSQKSGKIAAAPKFSISMRVASKQAYICRDCGYIYNDRTPF
        MASVSATSLR FQLP   HSK   EDGGADRHS+   L LKSSFFSPLR      IP L  Q S  +AAAPK  +SMRVASKQAYICRDCGYIYNDRTPF
Subjt:  MASVSATSLRSFQLP---HSK---EDGGADRHSNSNRLYLKSSFFSPLRTPSNSKIPPLGSQKSGKIAAAPKFSISMRVASKQAYICRDCGYIYNDRTPF

Query:  DKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALVGLYLYLNSVY
        +KLPDKYFCPVCGAPKRRFRPYEQSVTKN NE D RKARKAQIQ+DEA+G VLPIAAAVGIVALVGLYLYLN+ +
Subjt:  DKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALVGLYLYLNSVY

A0A6J1IG97 uncharacterized protein LOC1114731101.10e-7878.29Show/hide
Query:  MASVSATSLRSFQLP---HSK---EDGGADRHSNSNRLYLKSSFFSPLRTPSNSKIPPLGSQKSGKIAAAPKFSISMRVASKQAYICRDCGYIYNDRTPF
        MASVSATSLR FQLP   HSK   EDGGADRHS+   L LKSSFFSPLR      IP L  Q S  +AAAPK  +SMRVASKQAYICRDCGYIYNDRTPF
Subjt:  MASVSATSLRSFQLP---HSK---EDGGADRHSNSNRLYLKSSFFSPLRTPSNSKIPPLGSQKSGKIAAAPKFSISMRVASKQAYICRDCGYIYNDRTPF

Query:  DKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALVGLYLYLNSVY
        +KLPDKYFCPVCGAPKRRFRPYEQSVTKN NE D RKARKAQIQ+DEA+G VLPIAAAVGIVALVGLYLYLN+ +
Subjt:  DKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALVGLYLYLNSVY

E5GBM0 Electron transporter1.11e-7575Show/hide
Query:  MASVSATSLRSFQLP---HSK---EDGGADRHSN-----SNRLYLKSSFFSPLRTPSNSKIPPLGSQKSGKIAAAPKFSISMRVASKQAYICRDCGYIYN
        MASVSA+SL SF LP   H K   EDGG DR+S      SNRL LKSSF SPLR     KIP L  Q S   AA+PKFS  MRVASKQAYICRDCGYIYN
Subjt:  MASVSATSLRSFQLP---HSK---EDGGADRHSN-----SNRLYLKSSFFSPLRTPSNSKIPPLGSQKSGKIAAAPKFSISMRVASKQAYICRDCGYIYN

Query:  DRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALVGLYLYLNSVY
        DRTPFDKLPDKYFCPVCGAPKRRFRPYEQ+V KN NE D+RKARKAQIQ+DEA+G VLPIAAA+GIVALVGLYLYLNSV+
Subjt:  DRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALVGLYLYLNSVY

SwissProt top hitse value%identityAlignment
O26258 Probable rubredoxin1.3e-0537.7Show/hide
Query:  SISMRVASKQAYICRDCGYIYNDR-----------TPFDKLPDKYFCPVCGAPKRRFRPYE
        +I   V++ + Y CR CGYIY+             TPF+ LP+ + CP CGA K+ F+P +
Subjt:  SISMRVASKQAYICRDCGYIYNDR-----------TPFDKLPDKYFCPVCGAPKRRFRPYE

P04170 Rubredoxin-13.3e-0651.16Show/hide
Query:  QAYICRDCGYIY----NDRTPFDKLPDKYFCPVCGAPKRRFRP
        Q Y+C  CGY Y    +D  PFD+LPD + CPVCG  K +F P
Subjt:  QAYICRDCGYIY----NDRTPFDKLPDKYFCPVCGAPKRRFRP

P24297 Rubredoxin6.2e-0540Show/hide
Query:  YICRDCGYIYND-----------RTPFDKLPDKYFCPVCGAPKRRFRPYE
        ++C+ CGYIY++            T F++LPD + CP+CGAPK  F   E
Subjt:  YICRDCGYIYND-----------RTPFDKLPDKYFCPVCGAPKRRFRPYE

P58992 Rubredoxin-11.4e-0440Show/hide
Query:  AYICRDCGYIYNDR-----------TPFDKLPDKYFCPVCGAPKRRFRPY
        +++C +CGYIY+              PFDKLPD + CPVC  PK +F  +
Subjt:  AYICRDCGYIYNDR-----------TPFDKLPDKYFCPVCGAPKRRFRPY

Q9AL94 Rubredoxin6.9e-0439.22Show/hide
Query:  YICRDCGYIY-----------NDRTPFDKLPDKYFCPVCGAPKRRFRPYEQ
        Y+C  CGYIY           N  T F+ +PD + CP+CG  K +F P E+
Subjt:  YICRDCGYIY-----------NDRTPFDKLPDKYFCPVCGAPKRRFRPYEQ

Arabidopsis top hitse value%identityAlignment
AT5G17170.1 rubredoxin family protein9.5e-0932.71Show/hide
Query:  QKSGKIAAAPKFSISMRVASK--QAYICRDCGYIYNDRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAV
        +K  K  A P+F   +    K    +IC DCG+IY     FD+ PD Y CP C APK+RF  Y+ +  K +                   G + PI   V
Subjt:  QKSGKIAAAPKFSISMRVASK--QAYICRDCGYIYNDRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAV

Query:  GIVALVG
        G++A +G
Subjt:  GIVALVG

AT5G51010.1 Rubredoxin-like superfamily protein7.7e-4372.17Show/hide
Query:  LGSQKSGKIAAAPKFSISMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAA
        LG++KS   ++AP+F  SMRV+SKQAYICRDCGYIYNDRTPFDKLPD YFCPVC APKRRFR Y   V+KNVN+ DVRKARKA++QRDEAVG  LPI  A
Subjt:  LGSQKSGKIAAAPKFSISMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAA

Query:  VGIVALVGLYLYLNS
        VG++AL  LY Y+NS
Subjt:  VGIVALVGLYLYLNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCTCGTTTTCTGGCCGAAGCCCCAATCTCTATCTCCCCTTCCTTCTCTTCTCTTCTCTTCTCTGATTGATTCTCTTCTTCACTTTCTTTTCTGTCTTAGCTCCAC
CCTCCACCCACCAATGGCTTCTGTTTCAGCTACTTCTCTCAGGAGCTTCCAGCTGCCACATTCTAAGGAAGACGGCGGCGCTGATCGGCACTCCAACTCCAACCGCTTGT
ATCTGAAATCGTCCTTCTTCTCTCCTTTACGCACTCCCTCCAACTCCAAGATTCCTCCTTTAGGGAGCCAAAAATCTGGTAAAATTGCAGCTGCACCCAAGTTCTCCATC
TCCATGCGCGTCGCCTCCAAGCAAGCCTATATCTGTCGTGATTGCGGGTACATTTACAATGATCGAACTCCTTTTGACAAATTGCCTGATAAGTATTTCTGTCCTGTCTG
TGGTGCTCCTAAGCGACGATTTAGACCTTATGAGCAATCCGTGACAAAAAATGTTAACGAACTCGATGTGAGGAAGGCGAGGAAGGCGCAGATTCAGAGAGATGAAGCTG
TTGGGAATGTGCTGCCTATTGCTGCTGCAGTTGGAATCGTGGCACTTGTAGGTTTATACTTGTACCTGAATAGCGTGTATTAG
mRNA sequenceShow/hide mRNA sequence
AAAAGGGTGAATGAGGAGTGAGTATTTTGGTTCGAGGGTGTTGGGTGTGTAGATAGGATCCGATAAAATTGATTGATTGTCCGCCACTCGTAGTCTGTATTCACTATTCG
CCCATGTCTCTCGTTTTCTGGCCGAAGCCCCAATCTCTATCTCCCCTTCCTTCTCTTCTCTTCTCTTCTCTGATTGATTCTCTTCTTCACTTTCTTTTCTGTCTTAGCTC
CACCCTCCACCCACCAATGGCTTCTGTTTCAGCTACTTCTCTCAGGAGCTTCCAGCTGCCACATTCTAAGGAAGACGGCGGCGCTGATCGGCACTCCAACTCCAACCGCT
TGTATCTGAAATCGTCCTTCTTCTCTCCTTTACGCACTCCCTCCAACTCCAAGATTCCTCCTTTAGGGAGCCAAAAATCTGGTAAAATTGCAGCTGCACCCAAGTTCTCC
ATCTCCATGCGCGTCGCCTCCAAGCAAGCCTATATCTGTCGTGATTGCGGGTACATTTACAATGATCGAACTCCTTTTGACAAATTGCCTGATAAGTATTTCTGTCCTGT
CTGTGGTGCTCCTAAGCGACGATTTAGACCTTATGAGCAATCCGTGACAAAAAATGTTAACGAACTCGATGTGAGGAAGGCGAGGAAGGCGCAGATTCAGAGAGATGAAG
CTGTTGGGAATGTGCTGCCTATTGCTGCTGCAGTTGGAATCGTGGCACTTGTAGGTTTATACTTGTACCTGAATAGCGTGTATTAGATCCAGTGGCCAGTGGGTGGCTCA
GTGCTTCTCACTATTATGTTATATTGATTTACATTCTATATGTATTTTTTCCACCCCCCAATTCAAAAACCTGGCATATTGCTATAACGTTGAGGCATGCAAGATTATTT
TTATATTAAATTGTTTCTTGAGACTTGAACATGTTAATTTGTGTTGAATGGGGTTTGATTCGGTAGTTAATTTGATGAATGAGGTGCACCACCAGGAAATGTGCTTCTAA
GGATGGTTAGTGAAGTGAGATTAGGATCTTGTGTGATTTGGAACTGAAAATTGGTGGTTGTTTTTTATTGTGATTTGTACAATAAATCTGAAACATGGATCAGGATCACG
TAACTCGAGGTGGAGGCGTGTTATCCAGAAATGACTTGAACAGTATAAGTGACCTTTGGGGTTGAGAGAATGGAGCTTCGTGAGAGGCACCTCGAATGGTGGCAAATGAA
AGGATATTACCATTACCATAAACTTGGCTCCAACCTCCAACCTGAGAAGTGAGACACAGGCCCAGGGGGAAATGCTTAATTCACAATCTCCAACCAAATCAGGACTCGGG
ACTCGGGACAAATACGGCTTACCTGCTTTCCATGGAACCAAGCTCCGTATGGAACACTCGTGTTCAGACCCAGCTCTGTTGCCAACCTGTGAACCAGCCGCCGACTTCCC
ATCAATGGAATTACCGAATCTTGATCTCCACTGAAACCAAAGAATTAGGTCCCTTTAATTTTAATGTATATGTACGGTTTGGTAACGCTCGTCAAATTGTAACCCTAACC
CACCTGTAAATCAAGACTCGTATGCCGGTTCTGACAAGTGACCCAACAATGGAGATGGTTGGTATTTCCAGGTTGAGTAGCTCATAGTCCAGAGCTCTGGTGTAATGTCA
TTGACAAGCAGTAATAGTCAATAGTCAATAGTCAGACTCAGAATTCAGAAATGAAAATCAAAGCAGAGTGGAATTACTCGCTGCAAACAGCCCACTTGTGGACTCCGACG
AGACGGGCATGAAGGGCCTTTTTCACATCCTCTCGGTTCAGATATTTAACAGTTTCATCTTCGATGCAAACATCTATCCTTTCTCGGACATGCTGCGGCTGTGGGCTTAA
GTACTTGGACTGTGAGAGCACTGAAGGAATGCAGACATCGAGAGTAACGTCATATTTGTCCACGAACTTACTGGTTTCTGTGTTGACTTGGGTCATCACCCGCAGACAAA
CCCCTGAGATTGAGTCTCTGTAGTACTCGCTAACATACCGGGAATAGTTACAAGCACATGTGAACAATCTGTAAGTGGAGTCTGATATCAGACCATGCGACCAAAAGAAC
TCCGCCCTTGAATTCAAGTCTGTCGCATATTCCATAACCGGATTCCCTAGCTGCACATCAAACACTTTCATTTTTCCCTTTCTTTTCTCTTCCACGCCACATAAAAACAA
AACTTACAGCAATTCCCTGCAGATTGAATAACTTTTCCTTCCTGTTCAACTCGGTCATAAGCCTCGCCAGCTGAGGAATGTAATGGCCTGGTATAATAAGGAAAAAGATT
TGGTTTTTCTCTTTTCTTTGGGGGGTTCCCATTGCAAGAGAGAGAGGGAGAAGGAGGAGAGAGAGAGAAAAAAGGGAGAAATGTCTTTTATCTGGGTACCTGCGTAACTC
TCTCCTGTTAGAAATAGATCTCTATGTTTGTAATGAGGGAATTTACTGAACCACCTTTGCAAGAATATGAGATTGTCTCTTGCTGAAGATAGCAACATGAAAAAACGTAG
GTTTGAAGCATGAATAAAAGACACAAGCATACAGCATGATTTGTGTTCTTTTTTAGCATTGACAAAAATACTGACTAAAACATAAGTAAATTTCTCAGTAATTTCTTTGC
CAAATCAATAATCATCTTCTGAAGAGAGAGAGAGAGAAGGGGTTGCGCTACCCTTGAACAGAAGGCAGAAGGGATGCAAAAATACCTGTGGCCTCGTCATCCATTGTCGC
ATGAGAGGCGCTATTATCAGCATAAGAGAACCCAACTCCTGCGGGTGTCTCTAAGTACAACATGTTTGCTTCTGAAATGATCAAATTTGAACTTGCTTTTACTGATCAGC
AGCTATATAATATCCAGCAAATATGGAGAAACTGGTGTAGTTGATATAGAAGTAGATACCTCGGTTCCAACTGTATTCATTTTTCACCAAAACCTCCCCGTTCGGCCTAA
AAGGTCCATTTTCTGAGAACGCCCCAACTCCAAGGGAAGAACAACCAGGGCCTGCAAGAAATCAGAAAAATGTCCTCCCTAACCAGTCAAGCAAGTTTGAAGGACTTGGA
ATTAAAGAAGGTAAGAAGGTAAGGAGTCAACACACCTCCATTAAGCCAGAGAACCAAAGGCTTGGAATCAGGGTCAGTTTCTGCTTCAACCAAGTAGTAAAACAGTGCTC
TCTGCTTTTGGTCATCTACATGAATGTAACCTGAAAACTGGTGAAACCCCACACGCGGCTGTCCAGGAAGGCTAGTGATCTTGTCAGAATGTGCAAAAGAGGAGCCAACC
TCCTTGCAAATGCAAAGATGAAGAATAAGAGCAGCCATTGTCGCGGCCTTCCATGTTGAAGAAAACATGGTTTAGACGGTGAGAGACAACTAAGACAAAAGGACCTAGAG
AAAGTGCTGTTTAAAATGCTGCTCGGTGGATGGGGGGTCGGCGTGGGGTACTAAATAAGCAGGCGATCTAATATTTAAACCGCCATCATACTTGAGCCTTATTACTTGGC
AGGCCTGATACAGAATAAAACTAATCAATTTTGCGATTTCTGTTCATATCCAATAGCATTATTTGGTAGGACACATCCAAAACATAAAAGATTGGTTTTTCGAAAATGAA
GAAACAAATGGAATATAAGGCTGCTTTATCAGAATTCAGAGCCCTGAAAGTGCTGTTATGATGTTTCCCTATCTTCTGTTCTCTAACAACAACAAAAAAGTGTCTCTCGT
GAGGCGTACATTGTAAAGTGGCTTTACACAAACTACTTTCCACTCTCAGAAATCAGTTGTACAGTTATTTGTAACACAAATTTCTATTAGATTTCAAGTATGGTATGATT
ATCTGACAGACTCTTGGGGGATTTTGAAGAATCTTGACAGGATTTTCAGGGGTTCTTTTTAGTTTTTCGAGGAGACAATTGGGTTTTCTTGTTCGACTTGTACTTCGCCT
ATAACTGCACTGCTCTACAGCCAAATAATAGGTTGTATCCCGACTTTCAAATGCTCCTTCAGGTTCAAATAAGATAATCTGGAAGGGAAGTGAGGGTGCGGGAAAGAATG
CCTTAATCTAATTATTGGTTATCATCAAGATCCACATGCTTGAACCAACCCCCCAATTTATGATAATGGTAATGGGGCCTCTCCCTCACTTCATATTTCATAATTCATAT
CATGCCTACTGGTAGCTTTAGTGTAGTAGGTGAGCTACCAAGAAGCAGCGGATCCAAACGAAACCATTTTCTCATTTGTTGTATAGCTATAGATGGAGACTTTACAGTGT
GTAGTGTAGTGTAGTGTAGGCCTTTGGACCACTCTGGAAAAGGTGCCAGTGGGGTTCCTCTGATTTGCTTAAATTGTGGCGCTCTACAAATAACACCAACCACCAAAATA
CAAAGCGACACACAGGTACCAATATCATTTTCCATACCATGCGCTTAACCCCCGGCCCCAACACACTCAATAATGCATTTTTTTATCATTTGAGTTTTAGAAATTGGATT
TTCATATCTACAACTACAATCACTTTGTTTCCGCCTTGCCTTCTCTCTTTCCGAGGCAAAAAGGTAGAGAGCTTTATCTTAAAAGAGACTCTATGCTTTGAAAAGCTTTG
GTTAGTAGGGCAGTAGCGTCAACTCCGGTCTCCACCACACCCCCACAATCCATGAAATTGACTGGTCGAGCCGCCCCTCGTCTCGAGAGGCACAAAATTAAACAATACAA
CACGACAAACTTCACATTGAGTTCTCTTCAAGTTCTAACAAAGCTGAAATGAATTACTTGCATCTCTTAGAACAGGTCCAATTTGGTTTTTGACTTCCTCAAGTAGAATG
TTGGTTTGGTTGAAAAAATCGTTTTTTGAGGAAACTACTCAACCGTGGGGTTAAGATTGGGGTTAACAACGAGGTCCTCGATCAAATCTGCAGCTTCTTGAACGGTGGTT
ATGTTCTGAGCGTTATCCTCCTCGATATTGATGTCAAATTCTTCCTCTAAGGCCATAATTATCTCCACCTGTCAGTTGATTGGTGTGAGCATAGAAAAGGGTAAGGCCGT
GGCCGTAAAGGTGGAGATGGATTACATACAAGGTCGAGGGAGTCGGCACCCAGAGCTAAGAACTTGGATTCAGGGGTGAGCTCTGATTCGGCAGGCAAGGCCAATTGTTT
CCTCACAACTGAACACACTTTATCAACTGTCTCTGGTTTTGCCTGCATTAATTCCACTTTTTAATACTTGCTTTGCCAATGGACACCACAATGTCAAACTAATTATTAAA
TAATTACCGCACAAGAGATACGGAGCTGGGATGTTCTCAGAACATGCAAACCATTCTTTCTCCATCCAAATTTCACACTTGAAATTCTTCTCACAACCTGCCACCATATA
CAAATGCATTCCACAACATTTTAACTTTGGCAAAAATCATATTTTAATTATAGGCTGGTTCAGAGAATACCTGGTTGATGTTGATCTTGGTGGGCGGGATTAGAGGTAGC
TGGAATCTCAGGCAAGAAGCTGAAAAGGAAGCCATTGTAATTATAAGAAGAAAGAGAGTTTAGTTATATGGAAAAGGCGAAACTGTATAAGGTGAAGGAGTTGGAAGGAG
AGGGAATGGATTGGATATGTAGCTGAGGTGAGTGAGAACTGAGGGATGAACGAGTGGTGCCCATTGCCTCTCCTTCATCAATTTCATCGATTCCTATCTACTTCTCACTT
CTCACTGGCCGAATTTTGACACACCCATCTCGTGGGCCCATTTTTTAAATATCGCTCTTCTTTTTTCTTTTAACAAAGTTCAACTAAAAATAGAATGCAA
Protein sequenceShow/hide protein sequence
MSLVFWPKPQSLSPLPSLLFSSLIDSLLHFLFCLSSTLHPPMASVSATSLRSFQLPHSKEDGGADRHSNSNRLYLKSSFFSPLRTPSNSKIPPLGSQKSGKIAAAPKFSI
SMRVASKQAYICRDCGYIYNDRTPFDKLPDKYFCPVCGAPKRRFRPYEQSVTKNVNELDVRKARKAQIQRDEAVGNVLPIAAAVGIVALVGLYLYLNSVY