; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g09130 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g09130
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:7542496..7545524
RNA-Seq ExpressionMoc09g09130
SyntenyMoc09g09130
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]7.2e-4545.37Show/hide
Query:  IFEYSFRLPIHPLAQEFLVRTGLAPAQVASNGWGVVFSLAVLFWLRCREVEELDLLRVDQLLACFELKRISRKSGRYYL---------------------
        +FEY  RLP+HP  QEFL RTGLAPAQVA NGWGV+F+LA+LFWLR R+ EE +LL VDQLLACFE KRI++K GR+Y+                     
Subjt:  IFEYSFRLPIHPLAQEFLVRTGLAPAQVASNGWGVVFSLAVLFWLRCREVEELDLLRVDQLLACFELKRISRKSGRYYL---------------------

Query:  -------------------------FRNLVAIKPIPQLSSSTFNILKFYKDKFKSGRKLNNFLTNKLLATSGLLNYNSLLVPLEAHRPNLELAMVCGFSQ
                                 F NLV+I+P+P+L+ ++F+ LK+YK++F  GRK+   +T++LL  SGLL+YN  + P+E  RPN  LAMVC F+ 
Subjt:  -------------------------FRNLVAIKPIPQLSSSTFNILKFYKDKFKSGRKLNNFLTNKLLATSGLLNYNSLLVPLEAHRPNLELAMVCGFSQ

Query:  GVKRDRPSQG---VASASKRASTPVVV
        GVKR    +     A+ S +  TP VV
Subjt:  GVKRDRPSQG---VASASKRASTPVVV

XP_022150867.1 uncharacterized protein LOC111018913 [Momordica charantia]1.2e-4741.43Show/hide
Query:  DTAQDKEVLDVSPIRDVRRRASPKKSKKNKRKAHSSENMVEEGRAAPRVSSFGDFVDDPAARIGGTSDIEIRFKIKPSSAQVKERAIEMSGSCFDRCWRR
        D     EV DVSP+++V+R++   KSK NKRK  SS+++V E     RV       +DP AR+G T DI +RFKI+PSSA +KE+  + S  CFDR  ++
Subjt:  DTAQDKEVLDVSPIRDVRRRASPKKSKKNKRKAHSSENMVEEGRAAPRVSSFGDFVDDPAARIGGTSDIEIRFKIKPSSAQVKERAIEMSGSCFDRCWRR

Query:  ASKFVSAPGSAIQRMLDYSAETHAAICQAAIMVKAELDGRNLFTVKEIEASS-----AASLEGELKEARAEAHSWKFTSEVDKAELKSAKAEAARHMELL
        ASKFV  P S I++++DY+ + HA  C AAI++K++LD R+L  V E EA S     A +LE ELKEAR E    K   E   A+ KS + E     EL 
Subjt:  ASKFVSAPGSAIQRMLDYSAETHAAICQAAIMVKAELDGRNLFTVKEIEASS-----AASLEGELKEARAEAHSWKFTSEVDKAELKSAKAEAARHMELL

Query:  RGAHAVAKVLEKEKFVLLKKNDEFERCYADFEEKLKARDSKVEKLKAEIELQRSKLSNGVLLEEAFPP----------------------------EVDL
        +  + + K LE EKF L+++ND   R         K   S+V++LK E+EL ++KLSNGVLLEEAF                              ++DL
Subjt:  RGAHAVAKVLEKEKFVLLKKNDEFERCYADFEEKLKARDSKVEKLKAEIELQRSKLSNGVLLEEAFPP----------------------------EVDL

Query:  GPIKLQYTKKWVLGPNETPGP
         P+K  YTKKW  GP +T GP
Subjt:  GPIKLQYTKKWVLGPNETPGP

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]3.2e-4540.24Show/hide
Query:  IGGTSDIEIRFKIKPSSAQVKERAIEMSGSCFDRCWRRASKFVSAPGSAIQRMLDYSAETHAAICQAAIMVKAELDGRNLFTVKEIEASSAA-----SLE
        +GGT D+  RF+++PSS+ VK++   +S +C DRC +RASKFVS PGS +QR +D +AE   A   +AIMVKAELDGR     KE E SSAA     +L+
Subjt:  IGGTSDIEIRFKIKPSSAQVKERAIEMSGSCFDRCWRRASKFVSAPGSAIQRMLDYSAETHAAICQAAIMVKAELDGRNLFTVKEIEASSAA-----SLE

Query:  GELKEARAEAHSWKFTSEVD-KAELKSAKAEAARHMELLRGAHAVAKVLEKEKFVLLKKNDEFERCYADFEEKLKARDSKVEKLKAEIELQRSKLSNGVL
        GEL +A+ E    +  +EVD KAEL   K E  +H   LR AHA+ K LEKEKF LLK+ D       D  + L+ +D+ + +L AE++  + +L+NG L
Subjt:  GELKEARAEAHSWKFTSEVD-KAELKSAKAEAARHMELLRGAHAVAKVLEKEKFVLLKKNDEFERCYADFEEKLKARDSKVEKLKAEIELQRSKLSNGVL

Query:  LEEAFPP------------------------------EVDLGPIKLQYTKKWVLGPNETPGPQDVVDQYLKDLDSE-AELEEGEGASFSSQEVDGASLPT
        LEE+F                                ++DL  +K +Y++KW  GPN TPGPQ +V +Y+++LDS+ +++EE +  S    E+      T
Subjt:  LEEAFPP------------------------------EVDLGPIKLQYTKKWVLGPNETPGPQDVVDQYLKDLDSE-AELEEGEGASFSSQEVDGASLPT

Query:  TGAISSQELGLGDSQELDILTLQGELGSHLRSN
           + SQ+ G   SQE+++L  +GEL SHL S+
Subjt:  TGAISSQELGLGDSQELDILTLQGELGSHLRSN

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.7e-7548.01Show/hide
Query:  EFANRLYSELEEEIDNFRFLDEDGDDSDTSTLDQGLEFPSQMPESYLGSLHKRYSISDDIILRLPKEGEQADNPPEGCVTLYLKIFEYSFRLPIHPLAQE
        + A RL S+L EEI+N R + +DG+DSD ST  QGLE+PS++PE YLGSL + ++I ++I+LRLP+EGE+ADNPPEG VTLY K+FEY  RLP+HP  QE
Subjt:  EFANRLYSELEEEIDNFRFLDEDGDDSDTSTLDQGLEFPSQMPESYLGSLHKRYSISDDIILRLPKEGEQADNPPEGCVTLYLKIFEYSFRLPIHPLAQE

Query:  FLVRTGLAPAQVASNGWGVVFSLAVLFWLRCREVEELDLLRVDQLLACFELKRISRKSGRYYL-------------------------------------
        FL RTGLAPAQVA NGWGV+F+LA+LFWLR R+ EE +L  VDQLLACFE KRI++K GR+Y+                                     
Subjt:  FLVRTGLAPAQVASNGWGVVFSLAVLFWLRCREVEELDLLRVDQLLACFELKRISRKSGRYYL-------------------------------------

Query:  ---------FRNLVAIKPIPQLSSSTFNILKFYKDKFKSGRKLNNFLTNKLLATSGLLNYNSLLVPLEAHRPNLELAMVCGFSQGVKRDRPSQG---VAS
                 F NLV+I+P+P+L+ ++F+ LK+YK++F  GRK+   +T++LL  SGLL+YN  + P+E+ RPN ELAMVCGF+ GVKR    +     A+
Subjt:  ---------FRNLVAIKPIPQLSSSTFNILKFYKDKFKSGRKLNNFLTNKLLATSGLLNYNSLLVPLEAHRPNLELAMVCGFSQGVKRDRPSQG---VAS

Query:  ASKRASTPVVVDLPVEVEVVEVHQDAS
         S + +TP VV    E   + +  ++S
Subjt:  ASKRASTPVVVDLPVEVEVVEVHQDAS

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.9e-6938.68Show/hide
Query:  KSGRYYL-----FRNLVAIKPIPQLSSSTFNILKFYKDKFKSGRKLNNFLTNKLLATSGLLNYNSLLVPLEAHRPNLELAMVCGFSQGVKRDRPSQGVAS
        +SGR +      F NLV+IK IP+L+ +TF+ LK YKD F   RK+   +T+KLL  SGLL+YN L+  +EA RPN ELAMVCGF+  VKR    +  A 
Subjt:  KSGRYYL-----FRNLVAIKPIPQLSSSTFNILKFYKDKFKSGRKLNNFLTNKLLATSGLLNYNSLLVPLEAHRPNLELAMVCGFSQGVKRDRPSQGVAS

Query:  ASKRASTPVVVDLP-------------VEVEVVEVHQDASTLKGVDTAQDKEVLDVSPIRDVRRRASPKKSKKNKRKAHSSENMVEEGRAAPRVSSFGDF
         +   + PV   +P             V   V+E+           + ++ E LDVSP+ +VR   SP + ++ K+K  SS    E G      +S  D 
Subjt:  ASKRASTPVVVDLP-------------VEVEVVEVHQDASTLKGVDTAQDKEVLDVSPIRDVRRRASPKKSKKNKRKAHSSENMVEEGRAAPRVSSFGDF

Query:  VDDPAARIGGTSDIEIRFKIKPSSAQVKERAIEMSGSCFDRCWRRASKFVSAPGSAIQRMLDYSAETHAAICQAAIMVKAELDGRNLFTVKEIEASSAA-
        VDDP AR+ GTS++ +RF ++PSS+ VK++   +S +C DR  RRASKFVS PGS +QR +D  AE   A    A+MVKAELDGR     KE E S AA 
Subjt:  VDDPAARIGGTSDIEIRFKIKPSSAQVKERAIEMSGSCFDRCWRRASKFVSAPGSAIQRMLDYSAETHAAICQAAIMVKAELDGRNLFTVKEIEASSAA-

Query:  ----SLEGELKEARAEAHSWKFTSEVDKAELKSAKAEAARHMELLRGAHAVAKVLEKEKFVLLKKNDEFERCYADFEEKLKARDSKVEKLKAEIELQRSK
            +L+GEL +A+ E    +  +EVD A++   K E  +H   LR AHA+ K LEKEKF LLK+ D       D  + L+ +D+ + +L  E++  + +
Subjt:  ----SLEGELKEARAEAHSWKFTSEVDKAELKSAKAEAARHMELLRGAHAVAKVLEKEKFVLLKKNDEFERCYADFEEKLKARDSKVEKLKAEIELQRSK

Query:  LSNGVLLEEAFPP------------------------------EVDLGPIKLQYTKKWVLGPNETPGPQDVVDQYLKDLDSE-AELEEGEGASFSSQEV
        L+NG LLEE+F                                ++DL  +K +Y++KW  GPN TP PQ +VD+Y+++LDS+ +++EE +  S    EV
Subjt:  LSNGVLLEEAFPP------------------------------EVDLGPIKLQYTKKWVLGPNETPGPQDVVDQYLKDLDSE-AELEEGEGASFSSQEV

TrEMBL top hitse value%identityAlignment
A0A6J1CR42 uncharacterized protein LOC1110138263.5e-4545.37Show/hide
Query:  IFEYSFRLPIHPLAQEFLVRTGLAPAQVASNGWGVVFSLAVLFWLRCREVEELDLLRVDQLLACFELKRISRKSGRYYL---------------------
        +FEY  RLP+HP  QEFL RTGLAPAQVA NGWGV+F+LA+LFWLR R+ EE +LL VDQLLACFE KRI++K GR+Y+                     
Subjt:  IFEYSFRLPIHPLAQEFLVRTGLAPAQVASNGWGVVFSLAVLFWLRCREVEELDLLRVDQLLACFELKRISRKSGRYYL---------------------

Query:  -------------------------FRNLVAIKPIPQLSSSTFNILKFYKDKFKSGRKLNNFLTNKLLATSGLLNYNSLLVPLEAHRPNLELAMVCGFSQ
                                 F NLV+I+P+P+L+ ++F+ LK+YK++F  GRK+   +T++LL  SGLL+YN  + P+E  RPN  LAMVC F+ 
Subjt:  -------------------------FRNLVAIKPIPQLSSSTFNILKFYKDKFKSGRKLNNFLTNKLLATSGLLNYNSLLVPLEAHRPNLELAMVCGFSQ

Query:  GVKRDRPSQG---VASASKRASTPVVV
        GVKR    +     A+ S +  TP VV
Subjt:  GVKRDRPSQG---VASASKRASTPVVV

A0A6J1DBX9 uncharacterized protein LOC1110189135.7e-4841.43Show/hide
Query:  DTAQDKEVLDVSPIRDVRRRASPKKSKKNKRKAHSSENMVEEGRAAPRVSSFGDFVDDPAARIGGTSDIEIRFKIKPSSAQVKERAIEMSGSCFDRCWRR
        D     EV DVSP+++V+R++   KSK NKRK  SS+++V E     RV       +DP AR+G T DI +RFKI+PSSA +KE+  + S  CFDR  ++
Subjt:  DTAQDKEVLDVSPIRDVRRRASPKKSKKNKRKAHSSENMVEEGRAAPRVSSFGDFVDDPAARIGGTSDIEIRFKIKPSSAQVKERAIEMSGSCFDRCWRR

Query:  ASKFVSAPGSAIQRMLDYSAETHAAICQAAIMVKAELDGRNLFTVKEIEASS-----AASLEGELKEARAEAHSWKFTSEVDKAELKSAKAEAARHMELL
        ASKFV  P S I++++DY+ + HA  C AAI++K++LD R+L  V E EA S     A +LE ELKEAR E    K   E   A+ KS + E     EL 
Subjt:  ASKFVSAPGSAIQRMLDYSAETHAAICQAAIMVKAELDGRNLFTVKEIEASS-----AASLEGELKEARAEAHSWKFTSEVDKAELKSAKAEAARHMELL

Query:  RGAHAVAKVLEKEKFVLLKKNDEFERCYADFEEKLKARDSKVEKLKAEIELQRSKLSNGVLLEEAFPP----------------------------EVDL
        +  + + K LE EKF L+++ND   R         K   S+V++LK E+EL ++KLSNGVLLEEAF                              ++DL
Subjt:  RGAHAVAKVLEKEKFVLLKKNDEFERCYADFEEKLKARDSKVEKLKAEIELQRSKLSNGVLLEEAFPP----------------------------EVDL

Query:  GPIKLQYTKKWVLGPNETPGP
         P+K  YTKKW  GP +T GP
Subjt:  GPIKLQYTKKWVLGPNETPGP

A0A6J1DF31 uncharacterized protein LOC1110199091.6e-4540.24Show/hide
Query:  IGGTSDIEIRFKIKPSSAQVKERAIEMSGSCFDRCWRRASKFVSAPGSAIQRMLDYSAETHAAICQAAIMVKAELDGRNLFTVKEIEASSAA-----SLE
        +GGT D+  RF+++PSS+ VK++   +S +C DRC +RASKFVS PGS +QR +D +AE   A   +AIMVKAELDGR     KE E SSAA     +L+
Subjt:  IGGTSDIEIRFKIKPSSAQVKERAIEMSGSCFDRCWRRASKFVSAPGSAIQRMLDYSAETHAAICQAAIMVKAELDGRNLFTVKEIEASSAA-----SLE

Query:  GELKEARAEAHSWKFTSEVD-KAELKSAKAEAARHMELLRGAHAVAKVLEKEKFVLLKKNDEFERCYADFEEKLKARDSKVEKLKAEIELQRSKLSNGVL
        GEL +A+ E    +  +EVD KAEL   K E  +H   LR AHA+ K LEKEKF LLK+ D       D  + L+ +D+ + +L AE++  + +L+NG L
Subjt:  GELKEARAEAHSWKFTSEVD-KAELKSAKAEAARHMELLRGAHAVAKVLEKEKFVLLKKNDEFERCYADFEEKLKARDSKVEKLKAEIELQRSKLSNGVL

Query:  LEEAFPP------------------------------EVDLGPIKLQYTKKWVLGPNETPGPQDVVDQYLKDLDSE-AELEEGEGASFSSQEVDGASLPT
        LEE+F                                ++DL  +K +Y++KW  GPN TPGPQ +V +Y+++LDS+ +++EE +  S    E+      T
Subjt:  LEEAFPP------------------------------EVDLGPIKLQYTKKWVLGPNETPGPQDVVDQYLKDLDSE-AELEEGEGASFSSQEVDGASLPT

Query:  TGAISSQELGLGDSQELDILTLQGELGSHLRSN
           + SQ+ G   SQE+++L  +GEL SHL S+
Subjt:  TGAISSQELGLGDSQELDILTLQGELGSHLRSN

A0A6J1DXS5 uncharacterized protein LOC1110255028.5e-7648.01Show/hide
Query:  EFANRLYSELEEEIDNFRFLDEDGDDSDTSTLDQGLEFPSQMPESYLGSLHKRYSISDDIILRLPKEGEQADNPPEGCVTLYLKIFEYSFRLPIHPLAQE
        + A RL S+L EEI+N R + +DG+DSD ST  QGLE+PS++PE YLGSL + ++I ++I+LRLP+EGE+ADNPPEG VTLY K+FEY  RLP+HP  QE
Subjt:  EFANRLYSELEEEIDNFRFLDEDGDDSDTSTLDQGLEFPSQMPESYLGSLHKRYSISDDIILRLPKEGEQADNPPEGCVTLYLKIFEYSFRLPIHPLAQE

Query:  FLVRTGLAPAQVASNGWGVVFSLAVLFWLRCREVEELDLLRVDQLLACFELKRISRKSGRYYL-------------------------------------
        FL RTGLAPAQVA NGWGV+F+LA+LFWLR R+ EE +L  VDQLLACFE KRI++K GR+Y+                                     
Subjt:  FLVRTGLAPAQVASNGWGVVFSLAVLFWLRCREVEELDLLRVDQLLACFELKRISRKSGRYYL-------------------------------------

Query:  ---------FRNLVAIKPIPQLSSSTFNILKFYKDKFKSGRKLNNFLTNKLLATSGLLNYNSLLVPLEAHRPNLELAMVCGFSQGVKRDRPSQG---VAS
                 F NLV+I+P+P+L+ ++F+ LK+YK++F  GRK+   +T++LL  SGLL+YN  + P+E+ RPN ELAMVCGF+ GVKR    +     A+
Subjt:  ---------FRNLVAIKPIPQLSSSTFNILKFYKDKFKSGRKLNNFLTNKLLATSGLLNYNSLLVPLEAHRPNLELAMVCGFSQGVKRDRPSQG---VAS

Query:  ASKRASTPVVVDLPVEVEVVEVHQDAS
         S + +TP VV    E   + +  ++S
Subjt:  ASKRASTPVVVDLPVEVEVVEVHQDAS

A0A6J1DZB3 uncharacterized protein LOC1110256659.1e-7038.68Show/hide
Query:  KSGRYYL-----FRNLVAIKPIPQLSSSTFNILKFYKDKFKSGRKLNNFLTNKLLATSGLLNYNSLLVPLEAHRPNLELAMVCGFSQGVKRDRPSQGVAS
        +SGR +      F NLV+IK IP+L+ +TF+ LK YKD F   RK+   +T+KLL  SGLL+YN L+  +EA RPN ELAMVCGF+  VKR    +  A 
Subjt:  KSGRYYL-----FRNLVAIKPIPQLSSSTFNILKFYKDKFKSGRKLNNFLTNKLLATSGLLNYNSLLVPLEAHRPNLELAMVCGFSQGVKRDRPSQGVAS

Query:  ASKRASTPVVVDLP-------------VEVEVVEVHQDASTLKGVDTAQDKEVLDVSPIRDVRRRASPKKSKKNKRKAHSSENMVEEGRAAPRVSSFGDF
         +   + PV   +P             V   V+E+           + ++ E LDVSP+ +VR   SP + ++ K+K  SS    E G      +S  D 
Subjt:  ASKRASTPVVVDLP-------------VEVEVVEVHQDASTLKGVDTAQDKEVLDVSPIRDVRRRASPKKSKKNKRKAHSSENMVEEGRAAPRVSSFGDF

Query:  VDDPAARIGGTSDIEIRFKIKPSSAQVKERAIEMSGSCFDRCWRRASKFVSAPGSAIQRMLDYSAETHAAICQAAIMVKAELDGRNLFTVKEIEASSAA-
        VDDP AR+ GTS++ +RF ++PSS+ VK++   +S +C DR  RRASKFVS PGS +QR +D  AE   A    A+MVKAELDGR     KE E S AA 
Subjt:  VDDPAARIGGTSDIEIRFKIKPSSAQVKERAIEMSGSCFDRCWRRASKFVSAPGSAIQRMLDYSAETHAAICQAAIMVKAELDGRNLFTVKEIEASSAA-

Query:  ----SLEGELKEARAEAHSWKFTSEVDKAELKSAKAEAARHMELLRGAHAVAKVLEKEKFVLLKKNDEFERCYADFEEKLKARDSKVEKLKAEIELQRSK
            +L+GEL +A+ E    +  +EVD A++   K E  +H   LR AHA+ K LEKEKF LLK+ D       D  + L+ +D+ + +L  E++  + +
Subjt:  ----SLEGELKEARAEAHSWKFTSEVDKAELKSAKAEAARHMELLRGAHAVAKVLEKEKFVLLKKNDEFERCYADFEEKLKARDSKVEKLKAEIELQRSK

Query:  LSNGVLLEEAFPP------------------------------EVDLGPIKLQYTKKWVLGPNETPGPQDVVDQYLKDLDSE-AELEEGEGASFSSQEV
        L+NG LLEE+F                                ++DL  +K +Y++KW  GPN TP PQ +VD+Y+++LDS+ +++EE +  S    EV
Subjt:  LSNGVLLEEAFPP------------------------------EVDLGPIKLQYTKKWVLGPNETPGPQDVVDQYLKDLDSE-AELEEGEGASFSSQEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGGTTCGAGCATATCGAACTCGGACATTCTTCCCCAAAATGAAGGCTTAATTCCTGAGAAAATGGTTGTCTTCCTCCTTGGAGAGTTTTCTCCCCCAAACATTGA
CCCCCTCTCTGCTAAGTTTGATCTCGATCTGGTAGAGAAGTTATTCCCCTTGGGGAAACCAATCCACGTGGTGACTTCTGATGCCGATACAATGAATATTACCGTTGCAA
AGTCTCGAAAACGAAACCATCATTCGGCGCGCACTTGGATACCTCAGATTCCTTCCCCTGGAAAGGCCGACTCTGTCGAGGAATTTGCCAATAGGTTATATTCGGAGTTA
GAAGAAGAGATAGATAACTTTAGGTTCCTTGATGAGGATGGGGATGATAGTGACACCTCCACCTTGGACCAGGGTTTGGAGTTTCCCTCTCAAATGCCGGAGAGCTATCT
TGGTTCTCTTCATAAGAGGTATAGCATTTCGGACGACATCATTCTTAGGCTCCCTAAAGAAGGGGAGCAGGCAGATAATCCTCCAGAAGGATGTGTAACCTTGTACTTGA
AGATATTCGAGTACAGCTTCCGCCTACCCATTCATCCTTTGGCGCAAGAGTTCCTGGTTCGAACTGGGCTAGCTCCTGCTCAAGTGGCCTCCAATGGATGGGGTGTGGTC
TTTAGTTTAGCCGTACTATTCTGGCTTAGGTGTCGAGAGGTAGAAGAATTAGATCTCCTTAGGGTCGACCAGCTTTTAGCATGTTTCGAGCTTAAGCGAATTTCTAGGAA
GTCGGGTCGATACTATTTGTTTAGGAACTTAGTTGCTATCAAACCCATTCCCCAACTCTCCTCATCCACTTTCAACATCTTAAAATTTTACAAGGACAAGTTCAAGAGTG
GTAGGAAACTGAACAACTTCCTGACCAACAAGCTTCTCGCAACTTCAGGTCTGCTCAACTATAACTCGTTGCTTGTTCCTCTTGAGGCTCACAGACCCAACTTGGAGCTT
GCAATGGTTTGCGGATTCTCTCAAGGCGTGAAACGGGACCGCCCAAGCCAAGGGGTCGCTTCTGCTTCAAAGAGGGCATCCACCCCCGTTGTAGTCGACCTTCCTGTCGA
GGTCGAGGTGGTGGAGGTTCACCAAGATGCCTCCACCCTTAAGGGAGTCGACACTGCTCAGGACAAGGAGGTCTTGGACGTTTCCCCCATCAGGGATGTTCGGAGACGGG
CCTCCCCTAAGAAGTCGAAGAAGAACAAGCGCAAAGCCCATTCTTCGGAGAATATGGTGGAGGAGGGCCGGGCTGCACCGAGGGTTAGCTCCTTCGGGGATTTTGTTGAT
GATCCGGCAGCAAGAATTGGAGGCACCTCGGACATCGAGATAAGGTTCAAGATCAAGCCTTCCAGCGCTCAAGTAAAGGAGAGAGCCATAGAGATGTCGGGCTCCTGTTT
CGACCGCTGCTGGAGGAGGGCTTCCAAGTTCGTGAGCGCTCCGGGGTCAGCCATCCAGCGAATGCTGGACTACTCTGCTGAGACCCACGCCGCCATTTGCCAAGCGGCCA
TTATGGTGAAGGCCGAACTGGACGGGCGCAACCTTTTCACCGTGAAGGAAATAGAGGCTTCTTCAGCCGCTTCCTTAGAAGGGGAGCTCAAAGAGGCCCGAGCTGAGGCC
CATTCGTGGAAGTTTACTTCTGAGGTCGACAAGGCTGAACTTAAAAGTGCCAAGGCGGAGGCTGCTCGCCACATGGAGCTTCTAAGGGGTGCGCACGCTGTGGCCAAAGT
CCTGGAGAAAGAGAAGTTCGTGTTGCTGAAGAAGAACGACGAATTCGAGCGTTGCTATGCAGACTTTGAAGAGAAACTGAAAGCCCGAGACTCCAAGGTGGAGAAGCTGA
AAGCCGAAATCGAGCTTCAGAGGTCCAAACTCAGCAATGGGGTGTTGTTGGAGGAGGCTTTTCCTCCTGAGGTCGACCTTGGGCCCATCAAGCTTCAATACACCAAGAAG
TGGGTCTTAGGTCCCAACGAGACTCCTGGCCCCCAAGACGTGGTGGACCAGTACTTGAAGGACCTCGACTCCGAGGCCGAGCTTGAGGAGGGTGAAGGCGCCAGCTTCTC
TTCCCAAGAGGTTGATGGGGCGAGCCTTCCTACCACTGGAGCGATTTCCTCCCAAGAGCTTGGGCTGGGGGACTCCCAGGAGCTCGACATCCTGACTTTGCAGGGTGAAC
TCGGGTCTCACCTCAGAAGCAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGGGTTCGAGCATATCGAACTCGGACATTCTTCCCCAAAATGAAGGCTTAATTCCTGAGAAAATGGTTGTCTTCCTCCTTGGAGAGTTTTCTCCCCCAAACATTGA
CCCCCTCTCTGCTAAGTTTGATCTCGATCTGGTAGAGAAGTTATTCCCCTTGGGGAAACCAATCCACGTGGTGACTTCTGATGCCGATACAATGAATATTACCGTTGCAA
AGTCTCGAAAACGAAACCATCATTCGGCGCGCACTTGGATACCTCAGATTCCTTCCCCTGGAAAGGCCGACTCTGTCGAGGAATTTGCCAATAGGTTATATTCGGAGTTA
GAAGAAGAGATAGATAACTTTAGGTTCCTTGATGAGGATGGGGATGATAGTGACACCTCCACCTTGGACCAGGGTTTGGAGTTTCCCTCTCAAATGCCGGAGAGCTATCT
TGGTTCTCTTCATAAGAGGTATAGCATTTCGGACGACATCATTCTTAGGCTCCCTAAAGAAGGGGAGCAGGCAGATAATCCTCCAGAAGGATGTGTAACCTTGTACTTGA
AGATATTCGAGTACAGCTTCCGCCTACCCATTCATCCTTTGGCGCAAGAGTTCCTGGTTCGAACTGGGCTAGCTCCTGCTCAAGTGGCCTCCAATGGATGGGGTGTGGTC
TTTAGTTTAGCCGTACTATTCTGGCTTAGGTGTCGAGAGGTAGAAGAATTAGATCTCCTTAGGGTCGACCAGCTTTTAGCATGTTTCGAGCTTAAGCGAATTTCTAGGAA
GTCGGGTCGATACTATTTGTTTAGGAACTTAGTTGCTATCAAACCCATTCCCCAACTCTCCTCATCCACTTTCAACATCTTAAAATTTTACAAGGACAAGTTCAAGAGTG
GTAGGAAACTGAACAACTTCCTGACCAACAAGCTTCTCGCAACTTCAGGTCTGCTCAACTATAACTCGTTGCTTGTTCCTCTTGAGGCTCACAGACCCAACTTGGAGCTT
GCAATGGTTTGCGGATTCTCTCAAGGCGTGAAACGGGACCGCCCAAGCCAAGGGGTCGCTTCTGCTTCAAAGAGGGCATCCACCCCCGTTGTAGTCGACCTTCCTGTCGA
GGTCGAGGTGGTGGAGGTTCACCAAGATGCCTCCACCCTTAAGGGAGTCGACACTGCTCAGGACAAGGAGGTCTTGGACGTTTCCCCCATCAGGGATGTTCGGAGACGGG
CCTCCCCTAAGAAGTCGAAGAAGAACAAGCGCAAAGCCCATTCTTCGGAGAATATGGTGGAGGAGGGCCGGGCTGCACCGAGGGTTAGCTCCTTCGGGGATTTTGTTGAT
GATCCGGCAGCAAGAATTGGAGGCACCTCGGACATCGAGATAAGGTTCAAGATCAAGCCTTCCAGCGCTCAAGTAAAGGAGAGAGCCATAGAGATGTCGGGCTCCTGTTT
CGACCGCTGCTGGAGGAGGGCTTCCAAGTTCGTGAGCGCTCCGGGGTCAGCCATCCAGCGAATGCTGGACTACTCTGCTGAGACCCACGCCGCCATTTGCCAAGCGGCCA
TTATGGTGAAGGCCGAACTGGACGGGCGCAACCTTTTCACCGTGAAGGAAATAGAGGCTTCTTCAGCCGCTTCCTTAGAAGGGGAGCTCAAAGAGGCCCGAGCTGAGGCC
CATTCGTGGAAGTTTACTTCTGAGGTCGACAAGGCTGAACTTAAAAGTGCCAAGGCGGAGGCTGCTCGCCACATGGAGCTTCTAAGGGGTGCGCACGCTGTGGCCAAAGT
CCTGGAGAAAGAGAAGTTCGTGTTGCTGAAGAAGAACGACGAATTCGAGCGTTGCTATGCAGACTTTGAAGAGAAACTGAAAGCCCGAGACTCCAAGGTGGAGAAGCTGA
AAGCCGAAATCGAGCTTCAGAGGTCCAAACTCAGCAATGGGGTGTTGTTGGAGGAGGCTTTTCCTCCTGAGGTCGACCTTGGGCCCATCAAGCTTCAATACACCAAGAAG
TGGGTCTTAGGTCCCAACGAGACTCCTGGCCCCCAAGACGTGGTGGACCAGTACTTGAAGGACCTCGACTCCGAGGCCGAGCTTGAGGAGGGTGAAGGCGCCAGCTTCTC
TTCCCAAGAGGTTGATGGGGCGAGCCTTCCTACCACTGGAGCGATTTCCTCCCAAGAGCTTGGGCTGGGGGACTCCCAGGAGCTCGACATCCTGACTTTGCAGGGTGAAC
TCGGGTCTCACCTCAGAAGCAACTGA
Protein sequenceShow/hide protein sequence
MSGSSISNSDILPQNEGLIPEKMVVFLLGEFSPPNIDPLSAKFDLDLVEKLFPLGKPIHVVTSDADTMNITVAKSRKRNHHSARTWIPQIPSPGKADSVEEFANRLYSEL
EEEIDNFRFLDEDGDDSDTSTLDQGLEFPSQMPESYLGSLHKRYSISDDIILRLPKEGEQADNPPEGCVTLYLKIFEYSFRLPIHPLAQEFLVRTGLAPAQVASNGWGVV
FSLAVLFWLRCREVEELDLLRVDQLLACFELKRISRKSGRYYLFRNLVAIKPIPQLSSSTFNILKFYKDKFKSGRKLNNFLTNKLLATSGLLNYNSLLVPLEAHRPNLEL
AMVCGFSQGVKRDRPSQGVASASKRASTPVVVDLPVEVEVVEVHQDASTLKGVDTAQDKEVLDVSPIRDVRRRASPKKSKKNKRKAHSSENMVEEGRAAPRVSSFGDFVD
DPAARIGGTSDIEIRFKIKPSSAQVKERAIEMSGSCFDRCWRRASKFVSAPGSAIQRMLDYSAETHAAICQAAIMVKAELDGRNLFTVKEIEASSAASLEGELKEARAEA
HSWKFTSEVDKAELKSAKAEAARHMELLRGAHAVAKVLEKEKFVLLKKNDEFERCYADFEEKLKARDSKVEKLKAEIELQRSKLSNGVLLEEAFPPEVDLGPIKLQYTKK
WVLGPNETPGPQDVVDQYLKDLDSEAELEEGEGASFSSQEVDGASLPTTGAISSQELGLGDSQELDILTLQGELGSHLRSN