; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g17710 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g17710
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:13425341..13435211
RNA-Seq ExpressionMoc08g17710
SyntenyMoc08g17710
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]1.5e-4146.96Show/hide
Query:  MRFKIEPSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLDYITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWTTLETATSLKGELKEARA
        MRF++E S++GVK++  ++S +C DRC RRAS+FVS  GS +QR +D   EA  A+  + +M+KA+LDGR+ LT  E E   TTLE AT+LKGEL     
Subjt:  MRFKIEPSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLDYITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWTTLETATSLKGELKEARA

Query:  EARAEAQVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKENFALLKQNDELERRGVALEEELKAKDSEVEKLKAELELEKSKLSNRVRLEEAF
        +A+ E  + ++  +A K +L   K +  +H   LR AH + K LEKE F LLK+ D+       L + L+ KD+ + +L  EL+  K +L++   LEE+F
Subjt:  EARAEAQVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKENFALLKQNDELERRGVALEEELKAKDSEVEKLKAELELEKSKLSNRVRLEEAF

Query:  RQHPDFDGFAKDFSDAGFKFLMKGLKEIAP
        RQHP+FDGFAKDFSDAGFKFLMKG+    P
Subjt:  RQHPDFDGFAKDFSDAGFKFLMKGLKEIAP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]8.2e-4344.35Show/hide
Query:  GELVDDLTARMGTSDIEMRFKIEPSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLDYITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWT
        G  ++ +T  +G   I  + +IEPS++GV+++  ++S +  DRC RRASKFVS  GS +QR +DY  EA  A+ Q+ + +KA+LDGR++L   E E    
Subjt:  GELVDDLTARMGTSDIEMRFKIEPSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLDYITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWT

Query:  TLETATSLKGELKEARAEARAEAQVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKENFALLKQNDELERRGVALEEELKAKDSEVEKLKAEL
         LETA+S    +K+   +A +E +  K+  E+   + +  K +  R    LR AH + + LE+E F LLK+ D+       + + L+AKD E+E   AEL
Subjt:  TLETATSLKGELKEARAEARAEAQVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKENFALLKQNDELERRGVALEEELKAKDSEVEKLKAEL

Query:  ELEKSKLSNRVRLEEAFRQHPDFDGFAKDFSDAGFKFLMKGLKEIAPE
        E  K +LSN V LEEAFRQHPDFDGFAKDFSDAGFKFLMKG+    P+
Subjt:  ELEKSKLSNRVRLEEAFRQHPDFDGFAKDFSDAGFKFLMKGLKEIAPE

XP_022150867.1 uncharacterized protein LOC111018913 [Momordica charantia]2.2e-4846.37Show/hide
Query:  QRRLHAKGSRHCLGRTHSSEDMVKEGWAAPGVSPFGELVDDLTARMG-TSDIEMRFKIEPSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLD
        QR+  +  S++   +T SS+D+V E      V     L +D  AR+G T DI MRFKIEPS+AG+KE+  K S  CFDR  ++ASKFV V  S I++++D
Subjt:  QRRLHAKGSRHCLGRTHSSEDMVKEGWAAPGVSPFGELVDDLTARMG-TSDIEMRFKIEPSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLD

Query:  YITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWTTLETATSLKGELKEARAEARAEAQVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKE
        Y  + H  +C   I+MK+KLD RDL+ +NE EA    LE AT+L+ ELK    EAR E +V KS  EA   + KS + +     E  +  +V+ K LE E
Subjt:  YITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWTTLETATSLKGELKEARAEARAEAQVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKE

Query:  NFALLKQNDELERRGVALEEELKAKDSEVEKLKAELELEKSKLSNRVRLEEAFRQHPDFDGFAKDFSDAGFKFLMKGLKEIAPEDLDSE
         F L+++ND L R       + K   SEV++LK E+EL K+KLSN V LEEAF+ H DFD F  DFSD  FKFLMKG+ E+A  DLD E
Subjt:  NFALLKQNDELERRGVALEEELKAKDSEVEKLKAELELEKSKLSNRVRLEEAFRQHPDFDGFAKDFSDAGFKFLMKGLKEIAPEDLDSE

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]3.9e-4540.72Show/hide
Query:  GTSDIEMRFKIEPSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLDYITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWTTLETATSLKGE
        GT D+  RF++EPS++GVK++  ++S +C DRC +RASKFVS  GS +QR +D   EA  A+  + IM+KA+LDGR+ L   E E S   LE AT+LKGE
Subjt:  GTSDIEMRFKIEPSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLDYITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWTTLETATSLKGE

Query:  LKEARAEA---RAEAQVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKENFALLKQNDELERRGVALEEELKAKDSEVEKLKAELELEKSKLS
        L +A+ E    RAE           K EL   K +  +H   LR AH + K LEKE F LLK+ D+       L + L+ KD+ + +L AEL+  K +L+
Subjt:  LKEARAEA---RAEAQVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKENFALLKQNDELERRGVALEEELKAKDSEVEKLKAELELEKSKLS

Query:  NRVRLEEAFRQHPDFDGFAKDFSDAGFKFLMKGLKEIAPE---DLDS------------------------------EAELEDGEGASFSSQEVDGASLP
        N   LEE+FRQH DFDGFAKDFSDAGFKFLMKG+    P    DL +                              +++  D E     SQE +     
Subjt:  NRVRLEEAFRQHPDFDGFAKDFSDAGFKFLMKGLKEIAPE---DLDS------------------------------EAELEDGEGASFSSQEVDGASLP

Query:  ATGVISSQEVGLQDSQELDIMASQGELGPHLGSN
           V S Q+     SQE++++ S+GEL  HLGS+
Subjt:  ATGVISSQEVGLQDSQELDIMASQGELGPHLGSN

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]2.8e-5940.33Show/hide
Query:  AVRPVPQLSQSTFDILKFYKDKFKSGRKISNFLIDKHLTASGLLEYNPLLVPPKAHRPNSEL-------GCEAKASQPRG-CFCFKKGAQPCRGR-PSHR
        +++ +P+L+Q+TFD LK YKD F   RKI   + DK L  SGLL+YNPL+   +A RPNSEL       G   + S+ R        G +P     P   
Subjt:  AVRPVPQLSQSTFDILKFYKDKFKSGRKISNFLIDKHLTASGLLEYNPLLVPPKAHRPNSEL-------GCEAKASQPRG-CFCFKKGAQPCRGR-PSHR

Query:  GQGGGSPQRRLHAKGSRHCLGRTHSSEDMVKEGWAAPGVSPFGE----------------------------------LVDDLTARM-GTSDIEMRFKIE
         QG   P   +        L    S E   +E   A  VSP  E                                  LVDD  ARM GTS++ MRF +E
Subjt:  GQGGGSPQRRLHAKGSRHCLGRTHSSEDMVKEGWAAPGVSPFGE----------------------------------LVDDLTARM-GTSDIEMRFKIE

Query:  PSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLDYITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWTTLETATSLKGELKEARAEARAEA
        PS++GVK++  ++S +C DR  RRASKFVS  GS +QR +D + EA  A+    +M+KA+LDGR+ L   E E S+  LE AT+LKGEL     +A+ E 
Subjt:  PSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLDYITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWTTLETATSLKGELKEARAEARAEA

Query:  QVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKENFALLKQNDELERRGVALEEELKAKDSEVEKLKAELELEKSKLSNRVRLEEAFRQHPDF
         + ++  +A KV+L   K +  +H   LR AH + K LEKE F LLK+ D+       L + L+ KD+ + +L  EL+  K +L+N   LEE+FRQHPDF
Subjt:  QVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKENFALLKQNDELERRGVALEEELKAKDSEVEKLKAELELEKSKLSNRVRLEEAFRQHPDF

Query:  DGFAKDFSDAGFKFLMKGLKEIAP
        DGFAKDFSDAGFKFLMKG+    P
Subjt:  DGFAKDFSDAGFKFLMKGLKEIAP

TrEMBL top hitse value%identityAlignment
A0A6J1D1N9 uncharacterized protein LOC1110161937.5e-4246.96Show/hide
Query:  MRFKIEPSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLDYITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWTTLETATSLKGELKEARA
        MRF++E S++GVK++  ++S +C DRC RRAS+FVS  GS +QR +D   EA  A+  + +M+KA+LDGR+ LT  E E   TTLE AT+LKGEL     
Subjt:  MRFKIEPSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLDYITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWTTLETATSLKGELKEARA

Query:  EARAEAQVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKENFALLKQNDELERRGVALEEELKAKDSEVEKLKAELELEKSKLSNRVRLEEAF
        +A+ E  + ++  +A K +L   K +  +H   LR AH + K LEKE F LLK+ D+       L + L+ KD+ + +L  EL+  K +L++   LEE+F
Subjt:  EARAEAQVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKENFALLKQNDELERRGVALEEELKAKDSEVEKLKAELELEKSKLSNRVRLEEAF

Query:  RQHPDFDGFAKDFSDAGFKFLMKGLKEIAP
        RQHP+FDGFAKDFSDAGFKFLMKG+    P
Subjt:  RQHPDFDGFAKDFSDAGFKFLMKGLKEIAP

A0A6J1D971 uncharacterized protein LOC1110185384.0e-4344.35Show/hide
Query:  GELVDDLTARMGTSDIEMRFKIEPSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLDYITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWT
        G  ++ +T  +G   I  + +IEPS++GV+++  ++S +  DRC RRASKFVS  GS +QR +DY  EA  A+ Q+ + +KA+LDGR++L   E E    
Subjt:  GELVDDLTARMGTSDIEMRFKIEPSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLDYITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWT

Query:  TLETATSLKGELKEARAEARAEAQVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKENFALLKQNDELERRGVALEEELKAKDSEVEKLKAEL
         LETA+S    +K+   +A +E +  K+  E+   + +  K +  R    LR AH + + LE+E F LLK+ D+       + + L+AKD E+E   AEL
Subjt:  TLETATSLKGELKEARAEARAEAQVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKENFALLKQNDELERRGVALEEELKAKDSEVEKLKAEL

Query:  ELEKSKLSNRVRLEEAFRQHPDFDGFAKDFSDAGFKFLMKGLKEIAPE
        E  K +LSN V LEEAFRQHPDFDGFAKDFSDAGFKFLMKG+    P+
Subjt:  ELEKSKLSNRVRLEEAFRQHPDFDGFAKDFSDAGFKFLMKGLKEIAPE

A0A6J1DBX9 uncharacterized protein LOC1110189131.1e-4846.37Show/hide
Query:  QRRLHAKGSRHCLGRTHSSEDMVKEGWAAPGVSPFGELVDDLTARMG-TSDIEMRFKIEPSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLD
        QR+  +  S++   +T SS+D+V E      V     L +D  AR+G T DI MRFKIEPS+AG+KE+  K S  CFDR  ++ASKFV V  S I++++D
Subjt:  QRRLHAKGSRHCLGRTHSSEDMVKEGWAAPGVSPFGELVDDLTARMG-TSDIEMRFKIEPSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLD

Query:  YITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWTTLETATSLKGELKEARAEARAEAQVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKE
        Y  + H  +C   I+MK+KLD RDL+ +NE EA    LE AT+L+ ELK    EAR E +V KS  EA   + KS + +     E  +  +V+ K LE E
Subjt:  YITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWTTLETATSLKGELKEARAEARAEAQVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKE

Query:  NFALLKQNDELERRGVALEEELKAKDSEVEKLKAELELEKSKLSNRVRLEEAFRQHPDFDGFAKDFSDAGFKFLMKGLKEIAPEDLDSE
         F L+++ND L R       + K   SEV++LK E+EL K+KLSN V LEEAF+ H DFD F  DFSD  FKFLMKG+ E+A  DLD E
Subjt:  NFALLKQNDELERRGVALEEELKAKDSEVEKLKAELELEKSKLSNRVRLEEAFRQHPDFDGFAKDFSDAGFKFLMKGLKEIAPEDLDSE

A0A6J1DF31 uncharacterized protein LOC1110199091.9e-4540.72Show/hide
Query:  GTSDIEMRFKIEPSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLDYITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWTTLETATSLKGE
        GT D+  RF++EPS++GVK++  ++S +C DRC +RASKFVS  GS +QR +D   EA  A+  + IM+KA+LDGR+ L   E E S   LE AT+LKGE
Subjt:  GTSDIEMRFKIEPSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLDYITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWTTLETATSLKGE

Query:  LKEARAEA---RAEAQVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKENFALLKQNDELERRGVALEEELKAKDSEVEKLKAELELEKSKLS
        L +A+ E    RAE           K EL   K +  +H   LR AH + K LEKE F LLK+ D+       L + L+ KD+ + +L AEL+  K +L+
Subjt:  LKEARAEA---RAEAQVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKENFALLKQNDELERRGVALEEELKAKDSEVEKLKAELELEKSKLS

Query:  NRVRLEEAFRQHPDFDGFAKDFSDAGFKFLMKGLKEIAPE---DLDS------------------------------EAELEDGEGASFSSQEVDGASLP
        N   LEE+FRQH DFDGFAKDFSDAGFKFLMKG+    P    DL +                              +++  D E     SQE +     
Subjt:  NRVRLEEAFRQHPDFDGFAKDFSDAGFKFLMKGLKEIAPE---DLDS------------------------------EAELEDGEGASFSSQEVDGASLP

Query:  ATGVISSQEVGLQDSQELDIMASQGELGPHLGSN
           V S Q+     SQE++++ S+GEL  HLGS+
Subjt:  ATGVISSQEVGLQDSQELDIMASQGELGPHLGSN

A0A6J1DZB3 uncharacterized protein LOC1110256651.4e-5940.33Show/hide
Query:  AVRPVPQLSQSTFDILKFYKDKFKSGRKISNFLIDKHLTASGLLEYNPLLVPPKAHRPNSEL-------GCEAKASQPRG-CFCFKKGAQPCRGR-PSHR
        +++ +P+L+Q+TFD LK YKD F   RKI   + DK L  SGLL+YNPL+   +A RPNSEL       G   + S+ R        G +P     P   
Subjt:  AVRPVPQLSQSTFDILKFYKDKFKSGRKISNFLIDKHLTASGLLEYNPLLVPPKAHRPNSEL-------GCEAKASQPRG-CFCFKKGAQPCRGR-PSHR

Query:  GQGGGSPQRRLHAKGSRHCLGRTHSSEDMVKEGWAAPGVSPFGE----------------------------------LVDDLTARM-GTSDIEMRFKIE
         QG   P   +        L    S E   +E   A  VSP  E                                  LVDD  ARM GTS++ MRF +E
Subjt:  GQGGGSPQRRLHAKGSRHCLGRTHSSEDMVKEGWAAPGVSPFGE----------------------------------LVDDLTARM-GTSDIEMRFKIE

Query:  PSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLDYITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWTTLETATSLKGELKEARAEARAEA
        PS++GVK++  ++S +C DR  RRASKFVS  GS +QR +D + EA  A+    +M+KA+LDGR+ L   E E S+  LE AT+LKGEL     +A+ E 
Subjt:  PSNAGVKERAIKMSGSCFDRCWRRASKFVSVLGSAIQRLLDYITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWTTLETATSLKGELKEARAEARAEA

Query:  QVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKENFALLKQNDELERRGVALEEELKAKDSEVEKLKAELELEKSKLSNRVRLEEAFRQHPDF
         + ++  +A KV+L   K +  +H   LR AH + K LEKE F LLK+ D+       L + L+ KD+ + +L  EL+  K +L+N   LEE+FRQHPDF
Subjt:  QVWKSTSEADKVELKSAKAKTARHMEALRGAHVVAKSLEKENFALLKQNDELERRGVALEEELKAKDSEVEKLKAELELEKSKLSNRVRLEEAFRQHPDF

Query:  DGFAKDFSDAGFKFLMKGLKEIAP
        DGFAKDFSDAGFKFLMKG+    P
Subjt:  DGFAKDFSDAGFKFLMKGLKEIAP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAAGTTTCTTATCCATTTCTACCAACTTTTCTTCCATGGTGGATAGAGCTTCCTTGGATGCCATTACTTTGTTTGTGAAGTTCTTCAACTTTGCAGCTTGT
GAAATGCTTCGCTTGTTGAAGTTCTTCGACTTTGCGGCTTGTGAAGTTCTTCGACTTTGCAGCTTTCTCGTACTCTTCAAGTCTCCAATTCTTCAAGTCTTGTTG
GCTTTGGATGCTTCGCTTGACTCTGAACGATTGTCTGATCGTAGCAGAAAACAGGCAAGATTATCATGGCGAAAGCGGCATCTCTCCTTTCTGCTTCTATTATTC
TTCTTGCTCTCAGAAGGAGATTTGTTGGCGGTGAATGGAGAGGTATTCTTCTTGCTCTCAGAAGGAGATTTGTTGGCGGTGAATGGAGAGGTTAGGATAGATCTG
ACTGTCGGGTTATTGGAAATATGCGAATTCGGCTCTTCAACTTCGTCAAGTTCCTCTAGTTCAAGTTCTTCTGATAGGGTAGATCGGAGTCGGAGCCCTTCGCCT
GAAAAGGCTGACTCTGTCGAGGAATTCGCCAGTAGATATAGTATTCCGGACAATATTGTCCTCAAGCTCCCTAAAGAAGAGAAGCGAGCAGACAATTCTCCTGAA
GGGTGCGTGATCTTATACTTGAAGATGTTCGAGTACGGTTTCCGCCTACCCATTCATCCTCTGGCGCAAAGGTGTCAAGAGGTAGAAGAGTTAGAACTCCTCATG
GTCGACCAGCTTCTAGCATTTGCTGTCAGACCTGTACCTCAGCTTTCCCAGTCCACTTTTGACATCTTGAAATTCTACAAGGACAAGTTCAAGAGTGGCAGGAAG
ATCAGCAACTTTTTGATAGACAAACACCTTACAGCTTCAGGTCTGCTTGAGTACAATCCGCTGCTCGTTCCTCCCAAGGCTCACAGACCCAACTCGGAGCTCGGG
TGTGAAGCGAAAGCGTCTCAACCAAGGGGTTGCTTCTGCTTCAAAAAAGGCGCCCAGCCTTGTCGTGGTCGACCTTCCCACCGAGGTCAAGGTGGTGGAAGTCCA
CAAAGACGCTTACATGCCAAAGGGAGTCGGCACTGTTTAGGACGGACCCACTCTTCCGAGGACATGGTGAAGGAAGGTTGGGCTGCTCCTGGGGTTAGCCCCTTT
GGGGAGCTAGTTGACGACCTGACGGCAAGGATGGGCACATCGGACATTGAGATGAGGTTCAAGATCGAACCTTCCAATGCCGGGGTGAAGGAGAGAGCCATAAAG
ATGTCTGGCTCGTGCTTTGACCGTTGTTGGAGGAGGGCTTCCAAGTTTGTGAGTGTTCTAGGGTCGGCCATCCAACGACTGTTGGACTATATTACCGAGGCTCAT
ACTGCCGCCTGCCAAACGACCATTATGATGAAGGCCAAGCTCGACGGGCGCGATCTTCTCACCATAAATGAGCATGAGGCTTCCTGGACTACTTTGGAGACGGCT
ACTTCCTTAAAAGGGGAGCTTAAAGAGGCCCGAGCTGAGGCCCGAGCTGAGGCCCAGGTGTGGAAATCCACCTCCGAGGCCGACAAGGTAGAACTTAAAAGCGCC
AAGGCGAAAACCGCTCGCCACATGGAGGCTCTAAGGGGTGCCCATGTTGTGGCCAAGAGCCTAGAGAAGGAGAACTTCGCCCTTTTGAAGCAAAATGACGAACTT
GAACGTCGCGGGGTGGCCCTCGAGGAGGAGCTGAAGGCCAAAGACTCCGAGGTGGAGAAGCTCAAGGCCGAGCTGGAGCTTGAGAAGTCAAAACTCAGCAACAGG
GTGCGTTTGGAGGAAGCCTTTCGCCAACATCCTGATTTTGATGGGTTTGCCAAAGACTTCAGTGATGCGGGCTTCAAATTCCTGATGAAAGGACTTAAGGAAATA
GCTCCCGAGGACCTCGACTCTGAGGCCGAGCTTGAGGATGGCGAAGGCGCCAGCTTCTCTTCCCAGGAGGTTGACGGGGCCAGCCTTCCTGCCACTGGAGTGATT
TCCTCCCAAGAGGTTGGGCTGCAGGACTCCCAGGAGCTTGATATTATGGCTTCACAGGGTGAACTTGGGCCGCACCTCGGAAGCAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTAAGTTTCTTATCCATTTCTACCAACTTTTCTTCCATGGTGGATAGAGCTTCCTTGGATGCCATTACTTTGTTTGTGAAGTTCTTCAACTTTGCAGCTTGT
GAAATGCTTCGCTTGTTGAAGTTCTTCGACTTTGCGGCTTGTGAAGTTCTTCGACTTTGCAGCTTTCTCGTACTCTTCAAGTCTCCAATTCTTCAAGTCTTGTTG
GCTTTGGATGCTTCGCTTGACTCTGAACGATTGTCTGATCGTAGCAGAAAACAGGCAAGATTATCATGGCGAAAGCGGCATCTCTCCTTTCTGCTTCTATTATTC
TTCTTGCTCTCAGAAGGAGATTTGTTGGCGGTGAATGGAGAGGTATTCTTCTTGCTCTCAGAAGGAGATTTGTTGGCGGTGAATGGAGAGGTTAGGATAGATCTG
ACTGTCGGGTTATTGGAAATATGCGAATTCGGCTCTTCAACTTCGTCAAGTTCCTCTAGTTCAAGTTCTTCTGATAGGGTAGATCGGAGTCGGAGCCCTTCGCCT
GAAAAGGCTGACTCTGTCGAGGAATTCGCCAGTAGATATAGTATTCCGGACAATATTGTCCTCAAGCTCCCTAAAGAAGAGAAGCGAGCAGACAATTCTCCTGAA
GGGTGCGTGATCTTATACTTGAAGATGTTCGAGTACGGTTTCCGCCTACCCATTCATCCTCTGGCGCAAAGGTGTCAAGAGGTAGAAGAGTTAGAACTCCTCATG
GTCGACCAGCTTCTAGCATTTGCTGTCAGACCTGTACCTCAGCTTTCCCAGTCCACTTTTGACATCTTGAAATTCTACAAGGACAAGTTCAAGAGTGGCAGGAAG
ATCAGCAACTTTTTGATAGACAAACACCTTACAGCTTCAGGTCTGCTTGAGTACAATCCGCTGCTCGTTCCTCCCAAGGCTCACAGACCCAACTCGGAGCTCGGG
TGTGAAGCGAAAGCGTCTCAACCAAGGGGTTGCTTCTGCTTCAAAAAAGGCGCCCAGCCTTGTCGTGGTCGACCTTCCCACCGAGGTCAAGGTGGTGGAAGTCCA
CAAAGACGCTTACATGCCAAAGGGAGTCGGCACTGTTTAGGACGGACCCACTCTTCCGAGGACATGGTGAAGGAAGGTTGGGCTGCTCCTGGGGTTAGCCCCTTT
GGGGAGCTAGTTGACGACCTGACGGCAAGGATGGGCACATCGGACATTGAGATGAGGTTCAAGATCGAACCTTCCAATGCCGGGGTGAAGGAGAGAGCCATAAAG
ATGTCTGGCTCGTGCTTTGACCGTTGTTGGAGGAGGGCTTCCAAGTTTGTGAGTGTTCTAGGGTCGGCCATCCAACGACTGTTGGACTATATTACCGAGGCTCAT
ACTGCCGCCTGCCAAACGACCATTATGATGAAGGCCAAGCTCGACGGGCGCGATCTTCTCACCATAAATGAGCATGAGGCTTCCTGGACTACTTTGGAGACGGCT
ACTTCCTTAAAAGGGGAGCTTAAAGAGGCCCGAGCTGAGGCCCGAGCTGAGGCCCAGGTGTGGAAATCCACCTCCGAGGCCGACAAGGTAGAACTTAAAAGCGCC
AAGGCGAAAACCGCTCGCCACATGGAGGCTCTAAGGGGTGCCCATGTTGTGGCCAAGAGCCTAGAGAAGGAGAACTTCGCCCTTTTGAAGCAAAATGACGAACTT
GAACGTCGCGGGGTGGCCCTCGAGGAGGAGCTGAAGGCCAAAGACTCCGAGGTGGAGAAGCTCAAGGCCGAGCTGGAGCTTGAGAAGTCAAAACTCAGCAACAGG
GTGCGTTTGGAGGAAGCCTTTCGCCAACATCCTGATTTTGATGGGTTTGCCAAAGACTTCAGTGATGCGGGCTTCAAATTCCTGATGAAAGGACTTAAGGAAATA
GCTCCCGAGGACCTCGACTCTGAGGCCGAGCTTGAGGATGGCGAAGGCGCCAGCTTCTCTTCCCAGGAGGTTGACGGGGCCAGCCTTCCTGCCACTGGAGTGATT
TCCTCCCAAGAGGTTGGGCTGCAGGACTCCCAGGAGCTTGATATTATGGCTTCACAGGGTGAACTTGGGCCGCACCTCGGAAGCAACTGA
Protein sequenceShow/hide protein sequence
MLSFLSISTNFSSMVDRASLDAITLFVKFFNFAACEMLRLLKFFDFAACEVLRLCSFLVLFKSPILQVLLALDASLDSERLSDRSRKQARLSWRKRHLSFLLLLF
FLLSEGDLLAVNGEVFFLLSEGDLLAVNGEVRIDLTVGLLEICEFGSSTSSSSSSSSSSDRVDRSRSPSPEKADSVEEFASRYSIPDNIVLKLPKEEKRADNSPE
GCVILYLKMFEYGFRLPIHPLAQRCQEVEELELLMVDQLLAFAVRPVPQLSQSTFDILKFYKDKFKSGRKISNFLIDKHLTASGLLEYNPLLVPPKAHRPNSELG
CEAKASQPRGCFCFKKGAQPCRGRPSHRGQGGGSPQRRLHAKGSRHCLGRTHSSEDMVKEGWAAPGVSPFGELVDDLTARMGTSDIEMRFKIEPSNAGVKERAIK
MSGSCFDRCWRRASKFVSVLGSAIQRLLDYITEAHTAACQTTIMMKAKLDGRDLLTINEHEASWTTLETATSLKGELKEARAEARAEAQVWKSTSEADKVELKSA
KAKTARHMEALRGAHVVAKSLEKENFALLKQNDELERRGVALEEELKAKDSEVEKLKAELELEKSKLSNRVRLEEAFRQHPDFDGFAKDFSDAGFKFLMKGLKEI
APEDLDSEAELEDGEGASFSSQEVDGASLPATGVISSQEVGLQDSQELDIMASQGELGPHLGSN