; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g15700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g15700
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr3:10481611..10486399
RNA-Seq ExpressionMoc03g15700
SyntenyMoc03g15700
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]2.6e-3939.43Show/hide
Query:  IRFHIEPSRAKMRERVEQIGSANYFRCMRWATKFVCAPGSDMQRLIDITAEQHATCPRD----KSELDGHEKLSEKDKESILTALDSATAVEGELQEVRS
        +RF +E S + ++++V +I +    RC+R A++FV  PGS +QR ID  AE            K+ELDG E L+ K++E+  T L++AT ++GEL + + 
Subjt:  IRFHIEPSRAKMRERVEQIGSANYFRCMRWATKFVCAPGSDMQRLIDITAEQHATCPRD----KSELDGHEKLSEKDKESILTALDSATAVEGELQEVRS

Query:  KLKATQEKF--KLAKVEVEVEKNLKNLKAIYAILKDLEVEKFKL---------------------------------NSAFLEETFRKHPDFDGFARDFR
        ++   + +   K   ++ E EK+  +L+A +AI K LE EKF+L                                 + A LEE+FR+HP+FDGFA+DF 
Subjt:  KLKATQEKF--KLAKVEVEVEKNLKNLKAIYAILKDLEVEKFKL---------------------------------NSAFLEETFRKHPDFDGFARDFR

Query:  DTGFKFLMKGVAEVAPNL--DLEPIKKRYAKRWASDPNNTPGPQSLVEKYLKELNSEYQDNGKDEDDLTHEGEDVGASR
        D GFKFLMKG+A   P+L  DL  +KKRY++ WAS PN TPGPQSLV+KY++EL+S+Y D  ++ED  + E   VG ++
Subjt:  DTGFKFLMKGVAEVAPNL--DLEPIKKRYAKRWASDPNNTPGPQSLVEKYLKELNSEYQDNGKDEDDLTHEGEDVGASR

XP_022150867.1 uncharacterized protein LOC111018913 [Momordica charantia]1.3e-3838.28Show/hide
Query:  SVEALKVFLLREVHGGSSPKKAKKNKEKKTSSEGAVSEVQRSSFLDLIDDSEAKMGATIDVEIRFHIEPSRAKMRERVEQIGSANYFRCMRWATKFVCAP
        S+E   V  L+EV   S   K+K NK K  SS+  V EV+      L +D +A++GAT D+ +RF IEPS A ++E+V +  S  + R ++ A+KFV  P
Subjt:  SVEALKVFLLREVHGGSSPKKAKKNKEKKTSSEGAVSEVQRSSFLDLIDDSEAKMGATIDVEIRFHIEPSRAKMRERVEQIGSANYFRCMRWATKFVCAP

Query:  GSDMQRLIDITAEQHA-TCPRD---KSELDGHEKLSEKDKESILTALDSATAVEGELQEVRSKLKATQEKF--KLAKVEVEVEKNLKNLKAIYAILKDLE
         S ++++ID T + HA +C      KS+LD  + +   ++E+   AL+ AT +E EL+E R + +  + K   K    E EVE   +  K+ Y I+K LE
Subjt:  GSDMQRLIDITAEQHA-TCPRD---KSELDGHEKLSEKDKESILTALDSATAVEGELQEVRSKLKATQEKF--KLAKVEVEVEKNLKNLKAIYAILKDLE

Query:  VEKFKL---------------------------------NSAFLEETFRKHPDFDGFARDFRDTGFKFLMKGVAEVAPNLDLEPIKKRYAKRWASDPNNT
         EKFKL                                 N   LEE F+ H DFD F  DF D  FKFLMKG+ EVA +LDLEP+K+ Y K+WAS P  T
Subjt:  VEKFKL---------------------------------NSAFLEETFRKHPDFDGFARDFRDTGFKFLMKGVAEVAPNLDLEPIKKRYAKRWASDPNNT

Query:  PGP
         GP
Subjt:  PGP

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]1.9e-4239.72Show/hide
Query:  MGATIDVEIRFHIEPSRAKMRERVEQIGSANYFRCMRWATKFVCAPGSDMQRLIDITAEQHATCPRD----KSELDGHEKLSEKDKESILTALDSATAVE
        MG T DV  RF +EPS + ++++V +I +    RC++ A+KFV  PGS +QR ID  AE            K+ELDG E L+ K++E+   AL++AT ++
Subjt:  MGATIDVEIRFHIEPSRAKMRERVEQIGSANYFRCMRWATKFVCAPGSDMQRLIDITAEQHATCPRD----KSELDGHEKLSEKDKESILTALDSATAVE

Query:  GELQEVRSKLKATQEKF--KLAKVEVEVEKNLKNLKAIYAILKDLEVEKFKL---------------------------------NSAFLEETFRKHPDF
        GEL + + ++   + +   K   ++ E EK+  +L+A +AI K LE EKF+L                                 N + LEE+FR+H DF
Subjt:  GELQEVRSKLKATQEKF--KLAKVEVEVEKNLKNLKAIYAILKDLEVEKFKL---------------------------------NSAFLEETFRKHPDF

Query:  DGFARDFRDTGFKFLMKGVAEVAPNL--DLEPIKKRYAKRWASDPNNTPGPQSLVEKYLKELNSEYQDNGKDEDDLTHEGEDVGASR
        DGFA+DF D GFKFLMKG+A   P+L  DL  +KK+Y+++WAS PN TPGPQSLV KY++EL+S+Y D  ++ED  + E  ++G ++
Subjt:  DGFARDFRDTGFKFLMKGVAEVAPNL--DLEPIKKRYAKRWASDPNNTPGPQSLVEKYLKELNSEYQDNGKDEDDLTHEGEDVGASR

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]4.3e-4237.1Show/hide
Query:  RAMGFPSKMPENYLRPLRKRYQIPYNISLLVPRAGEKADDPSEGCITFYLNMFEYGFRLPVDPFIQE------------------------------TRE
        + + +PS++PE+YL  LR+ + IP NI L +P  GE+AD+P EG +T Y  MFEYG RLP+ PF+QE                               R+
Subjt:  RAMGFPSKMPENYLRPLRKRYQIPYNISLLVPRAGEKADDPSEGCITFYLNMFEYGFRLPVDPFIQE------------------------------TRE

Query:  VEGFELLVAKQLLAYFEVKRISNKPCRYYLCARKGTGGLSR-------------------VVLPSCSLSFLITLLTSLFILAIAIRPMPELTQPTFDVLN
         E  EL    QLLA FE KRI+ KP R+Y+CARKG GG+ +                   +       SF    + + F   ++IRP+PELTQ +FD L 
Subjt:  VEGFELLVAKQLLAYFEVKRISNKPCRYYLCARKGTGGLSR-------------------VVLPSCSLSFLITLLTSLFILAIAIRPMPELTQPTFDVLN

Query:  YYKDKFKGRGKFGTLITDKLLLAFGLLDFNPLLVPVEATRPNLELG----------DKAKGGSRQPKVGSSSYPKAAEIASDA------MVEVERLQGPP
        YYK++F    K GTL+TD+LLL  GLLD+NP + P+E++RPN EL            K+KG +   +   SS P    +   A      ++E+E   G P
Subjt:  YYKDKFKGRGKFGTLITDKLLLAFGLLDFNPLLVPVEATRPNLELG----------DKAKGGSRQPKVGSSSYPKAAEIASDA------MVEVERLQGPP

Query:  PGEKRPRSHS
          EKRPR  +
Subjt:  PGEKRPRSHS

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.6e-6836.81Show/hide
Query:  LCARKGTGGL-----------------SRVVLPSCSLSFLITLLTSLFILAIAIRPMPELTQPTFDVLNYYKDKFKGRGKFGTLITDKLLLAFGLLDFNP
        +CARKGTGG+                 S   L           + + F   ++I+ +PEL Q TFD L +YKD F    K  TL+TDKLLL  GLLD+NP
Subjt:  LCARKGTGGL-----------------SRVVLPSCSLSFLITLLTSLFILAIAIRPMPELTQPTFDVLNYYKDKFKGRGKFGTLITDKLLLAFGLLDFNP

Query:  LLVPVEATRPNLELG----------DKAKGGSRQ--------------PKVGSSSYPKAAEIASDAMVEVERLQGPPPGEKRPRSHSVEALKVFLLREVH
        L+  +EA+RPN EL            K+KG +                P+  +      +      ++E++ L G   GEKR R  S EAL V  L EV 
Subjt:  LLVPVEATRPNLELG----------DKAKGGSRQ--------------PKVGSSSYPKAAEIASDAMVEVERLQGPPPGEKRPRSHSVEALKVFLLREVH

Query:  GGSSPKKAKKNKEKKTSSEGAVSEVQRSSFLDLIDDSEAKMGATIDVEIRFHIEPSRAKMRERVEQIGSANYFRCMRWATKFVCAPGSDMQRLIDITAEQ
        G S  ++ +K K+  +SSE        +S  DL+DD EA+M  T +V +RF +EPS + ++++V +I +    R +R A+KFV  PGS +QR ID  AE 
Subjt:  GGSSPKKAKKNKEKKTSSEGAVSEVQRSSFLDLIDDSEAKMGATIDVEIRFHIEPSRAKMRERVEQIGSANYFRCMRWATKFVCAPGSDMQRLIDITAEQ

Query:  HAT----CPRDKSELDGHEKLSEKDKESILTALDSATAVEGELQEVRSKLKATQEKF--KLAKVEVEVEKNLKNLKAIYAILKDLEVEKFKL--------
                   K+ELDG E L+ K++E+   AL++AT ++GEL + + ++   + +   K+  ++ E EK+  +L+A +AI K LE EKF+L        
Subjt:  HAT----CPRDKSELDGHEKLSEKDKESILTALDSATAVEGELQEVRSKLKATQEKF--KLAKVEVEVEKNLKNLKAIYAILKDLEVEKFKL--------

Query:  -------------------------NSAFLEETFRKHPDFDGFARDFRDTGFKFLMKGVAEVAPNL--DLEPIKKRYAKRWASDPNNTPGPQSLVEKYLK
                                 N   LEE+FR+HPDFDGFA+DF D GFKFLMKG+A   P+L  DL  +KK+Y+++WAS PN TP PQSLV+KY++
Subjt:  -------------------------NSAFLEETFRKHPDFDGFARDFRDTGFKFLMKGVAEVAPNL--DLEPIKKRYAKRWASDPNNTPGPQSLVEKYLK

Query:  ELNSEYQDNGKDEDDLTHEGEDVGASR
        EL+S+Y D  ++ED  + E  +VG ++
Subjt:  ELNSEYQDNGKDEDDLTHEGEDVGASR

TrEMBL top hitse value%identityAlignment
A0A6J1D1N9 uncharacterized protein LOC1110161931.3e-3939.43Show/hide
Query:  IRFHIEPSRAKMRERVEQIGSANYFRCMRWATKFVCAPGSDMQRLIDITAEQHATCPRD----KSELDGHEKLSEKDKESILTALDSATAVEGELQEVRS
        +RF +E S + ++++V +I +    RC+R A++FV  PGS +QR ID  AE            K+ELDG E L+ K++E+  T L++AT ++GEL + + 
Subjt:  IRFHIEPSRAKMRERVEQIGSANYFRCMRWATKFVCAPGSDMQRLIDITAEQHATCPRD----KSELDGHEKLSEKDKESILTALDSATAVEGELQEVRS

Query:  KLKATQEKF--KLAKVEVEVEKNLKNLKAIYAILKDLEVEKFKL---------------------------------NSAFLEETFRKHPDFDGFARDFR
        ++   + +   K   ++ E EK+  +L+A +AI K LE EKF+L                                 + A LEE+FR+HP+FDGFA+DF 
Subjt:  KLKATQEKF--KLAKVEVEVEKNLKNLKAIYAILKDLEVEKFKL---------------------------------NSAFLEETFRKHPDFDGFARDFR

Query:  DTGFKFLMKGVAEVAPNL--DLEPIKKRYAKRWASDPNNTPGPQSLVEKYLKELNSEYQDNGKDEDDLTHEGEDVGASR
        D GFKFLMKG+A   P+L  DL  +KKRY++ WAS PN TPGPQSLV+KY++EL+S+Y D  ++ED  + E   VG ++
Subjt:  DTGFKFLMKGVAEVAPNL--DLEPIKKRYAKRWASDPNNTPGPQSLVEKYLKELNSEYQDNGKDEDDLTHEGEDVGASR

A0A6J1DBX9 uncharacterized protein LOC1110189136.2e-3938.28Show/hide
Query:  SVEALKVFLLREVHGGSSPKKAKKNKEKKTSSEGAVSEVQRSSFLDLIDDSEAKMGATIDVEIRFHIEPSRAKMRERVEQIGSANYFRCMRWATKFVCAP
        S+E   V  L+EV   S   K+K NK K  SS+  V EV+      L +D +A++GAT D+ +RF IEPS A ++E+V +  S  + R ++ A+KFV  P
Subjt:  SVEALKVFLLREVHGGSSPKKAKKNKEKKTSSEGAVSEVQRSSFLDLIDDSEAKMGATIDVEIRFHIEPSRAKMRERVEQIGSANYFRCMRWATKFVCAP

Query:  GSDMQRLIDITAEQHA-TCPRD---KSELDGHEKLSEKDKESILTALDSATAVEGELQEVRSKLKATQEKF--KLAKVEVEVEKNLKNLKAIYAILKDLE
         S ++++ID T + HA +C      KS+LD  + +   ++E+   AL+ AT +E EL+E R + +  + K   K    E EVE   +  K+ Y I+K LE
Subjt:  GSDMQRLIDITAEQHA-TCPRD---KSELDGHEKLSEKDKESILTALDSATAVEGELQEVRSKLKATQEKF--KLAKVEVEVEKNLKNLKAIYAILKDLE

Query:  VEKFKL---------------------------------NSAFLEETFRKHPDFDGFARDFRDTGFKFLMKGVAEVAPNLDLEPIKKRYAKRWASDPNNT
         EKFKL                                 N   LEE F+ H DFD F  DF D  FKFLMKG+ EVA +LDLEP+K+ Y K+WAS P  T
Subjt:  VEKFKL---------------------------------NSAFLEETFRKHPDFDGFARDFRDTGFKFLMKGVAEVAPNLDLEPIKKRYAKRWASDPNNT

Query:  PGP
         GP
Subjt:  PGP

A0A6J1DF31 uncharacterized protein LOC1110199099.3e-4339.72Show/hide
Query:  MGATIDVEIRFHIEPSRAKMRERVEQIGSANYFRCMRWATKFVCAPGSDMQRLIDITAEQHATCPRD----KSELDGHEKLSEKDKESILTALDSATAVE
        MG T DV  RF +EPS + ++++V +I +    RC++ A+KFV  PGS +QR ID  AE            K+ELDG E L+ K++E+   AL++AT ++
Subjt:  MGATIDVEIRFHIEPSRAKMRERVEQIGSANYFRCMRWATKFVCAPGSDMQRLIDITAEQHATCPRD----KSELDGHEKLSEKDKESILTALDSATAVE

Query:  GELQEVRSKLKATQEKF--KLAKVEVEVEKNLKNLKAIYAILKDLEVEKFKL---------------------------------NSAFLEETFRKHPDF
        GEL + + ++   + +   K   ++ E EK+  +L+A +AI K LE EKF+L                                 N + LEE+FR+H DF
Subjt:  GELQEVRSKLKATQEKF--KLAKVEVEVEKNLKNLKAIYAILKDLEVEKFKL---------------------------------NSAFLEETFRKHPDF

Query:  DGFARDFRDTGFKFLMKGVAEVAPNL--DLEPIKKRYAKRWASDPNNTPGPQSLVEKYLKELNSEYQDNGKDEDDLTHEGEDVGASR
        DGFA+DF D GFKFLMKG+A   P+L  DL  +KK+Y+++WAS PN TPGPQSLV KY++EL+S+Y D  ++ED  + E  ++G ++
Subjt:  DGFARDFRDTGFKFLMKGVAEVAPNL--DLEPIKKRYAKRWASDPNNTPGPQSLVEKYLKELNSEYQDNGKDEDDLTHEGEDVGASR

A0A6J1DXS5 uncharacterized protein LOC1110255022.1e-4237.1Show/hide
Query:  RAMGFPSKMPENYLRPLRKRYQIPYNISLLVPRAGEKADDPSEGCITFYLNMFEYGFRLPVDPFIQE------------------------------TRE
        + + +PS++PE+YL  LR+ + IP NI L +P  GE+AD+P EG +T Y  MFEYG RLP+ PF+QE                               R+
Subjt:  RAMGFPSKMPENYLRPLRKRYQIPYNISLLVPRAGEKADDPSEGCITFYLNMFEYGFRLPVDPFIQE------------------------------TRE

Query:  VEGFELLVAKQLLAYFEVKRISNKPCRYYLCARKGTGGLSR-------------------VVLPSCSLSFLITLLTSLFILAIAIRPMPELTQPTFDVLN
         E  EL    QLLA FE KRI+ KP R+Y+CARKG GG+ +                   +       SF    + + F   ++IRP+PELTQ +FD L 
Subjt:  VEGFELLVAKQLLAYFEVKRISNKPCRYYLCARKGTGGLSR-------------------VVLPSCSLSFLITLLTSLFILAIAIRPMPELTQPTFDVLN

Query:  YYKDKFKGRGKFGTLITDKLLLAFGLLDFNPLLVPVEATRPNLELG----------DKAKGGSRQPKVGSSSYPKAAEIASDA------MVEVERLQGPP
        YYK++F    K GTL+TD+LLL  GLLD+NP + P+E++RPN EL            K+KG +   +   SS P    +   A      ++E+E   G P
Subjt:  YYKDKFKGRGKFGTLITDKLLLAFGLLDFNPLLVPVEATRPNLELG----------DKAKGGSRQPKVGSSSYPKAAEIASDA------MVEVERLQGPP

Query:  PGEKRPRSHS
          EKRPR  +
Subjt:  PGEKRPRSHS

A0A6J1DZB3 uncharacterized protein LOC1110256657.6e-6936.81Show/hide
Query:  LCARKGTGGL-----------------SRVVLPSCSLSFLITLLTSLFILAIAIRPMPELTQPTFDVLNYYKDKFKGRGKFGTLITDKLLLAFGLLDFNP
        +CARKGTGG+                 S   L           + + F   ++I+ +PEL Q TFD L +YKD F    K  TL+TDKLLL  GLLD+NP
Subjt:  LCARKGTGGL-----------------SRVVLPSCSLSFLITLLTSLFILAIAIRPMPELTQPTFDVLNYYKDKFKGRGKFGTLITDKLLLAFGLLDFNP

Query:  LLVPVEATRPNLELG----------DKAKGGSRQ--------------PKVGSSSYPKAAEIASDAMVEVERLQGPPPGEKRPRSHSVEALKVFLLREVH
        L+  +EA+RPN EL            K+KG +                P+  +      +      ++E++ L G   GEKR R  S EAL V  L EV 
Subjt:  LLVPVEATRPNLELG----------DKAKGGSRQ--------------PKVGSSSYPKAAEIASDAMVEVERLQGPPPGEKRPRSHSVEALKVFLLREVH

Query:  GGSSPKKAKKNKEKKTSSEGAVSEVQRSSFLDLIDDSEAKMGATIDVEIRFHIEPSRAKMRERVEQIGSANYFRCMRWATKFVCAPGSDMQRLIDITAEQ
        G S  ++ +K K+  +SSE        +S  DL+DD EA+M  T +V +RF +EPS + ++++V +I +    R +R A+KFV  PGS +QR ID  AE 
Subjt:  GGSSPKKAKKNKEKKTSSEGAVSEVQRSSFLDLIDDSEAKMGATIDVEIRFHIEPSRAKMRERVEQIGSANYFRCMRWATKFVCAPGSDMQRLIDITAEQ

Query:  HAT----CPRDKSELDGHEKLSEKDKESILTALDSATAVEGELQEVRSKLKATQEKF--KLAKVEVEVEKNLKNLKAIYAILKDLEVEKFKL--------
                   K+ELDG E L+ K++E+   AL++AT ++GEL + + ++   + +   K+  ++ E EK+  +L+A +AI K LE EKF+L        
Subjt:  HAT----CPRDKSELDGHEKLSEKDKESILTALDSATAVEGELQEVRSKLKATQEKF--KLAKVEVEVEKNLKNLKAIYAILKDLEVEKFKL--------

Query:  -------------------------NSAFLEETFRKHPDFDGFARDFRDTGFKFLMKGVAEVAPNL--DLEPIKKRYAKRWASDPNNTPGPQSLVEKYLK
                                 N   LEE+FR+HPDFDGFA+DF D GFKFLMKG+A   P+L  DL  +KK+Y+++WAS PN TP PQSLV+KY++
Subjt:  -------------------------NSAFLEETFRKHPDFDGFARDFRDTGFKFLMKGVAEVAPNL--DLEPIKKRYAKRWASDPNNTPGPQSLVEKYLK

Query:  ELNSEYQDNGKDEDDLTHEGEDVGASR
        EL+S+Y D  ++ED  + E  +VG ++
Subjt:  ELNSEYQDNGKDEDDLTHEGEDVGASR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGTGAGGACAGCGATGCCTCGACCTCGGGCCATGGGGTTTCCTTCAAAGATGCCCGAAAACTACCTTAGACCGCTACGTAAGCGGTACCAAATCCCTTATAACAT
AAGTTTACTCGTACCTCGGGCTGGAGAAAAAGCTGACGATCCTTCGGAGGGATGCATCACTTTTTATCTAAACATGTTCGAATACGGGTTTCGACTGCCTGTCGACCCCT
TCATCCAGGAGACTCGTGAAGTGGAAGGTTTTGAGCTACTTGTGGCCAAGCAGCTCCTCGCCTACTTCGAGGTGAAACGAATTTCCAATAAGCCCTGCCGCTATTATTTG
TGTGCAAGGAAGGGCACAGGGGGATTATCAAGGGTCGTACTACCCTCTTGTTCTTTGTCGTTTTTGATTACTTTGCTAACTTCTCTGTTTATTCTTGCAATTGCCATCAG
ACCGATGCCCGAGCTAACGCAACCGACTTTTGACGTGTTAAATTACTACAAGGACAAATTCAAAGGTAGAGGAAAGTTCGGGACTTTGATAACCGACAAGCTACTGCTCG
CCTTCGGTTTGCTTGATTTTAACCCTCTCCTCGTTCCTGTTGAGGCCACAAGACCGAACTTGGAGTTGGGTGACAAGGCTAAGGGTGGAAGTCGCCAACCCAAGGTTGGG
TCGAGCTCATACCCAAAAGCTGCGGAGATAGCGAGTGACGCCATGGTCGAGGTGGAGCGTCTTCAAGGTCCTCCCCCTGGCGAAAAAAGACCTCGCAGTCATTCTGTTGA
GGCACTGAAGGTTTTCCTTTTGAGGGAGGTGCATGGGGGCTCTTCTCCAAAGAAGGCCAAAAAGAATAAAGAAAAAAAGACTTCTTCAGAGGGTGCTGTCTCCGAGGTTC
AGAGGAGTAGCTTCTTGGACTTGATTGACGATTCAGAGGCTAAGATGGGAGCTACTATCGATGTGGAGATTAGGTTCCATATCGAACCCTCTAGAGCCAAGATGAGGGAG
AGAGTTGAACAGATCGGAAGCGCCAACTACTTTCGATGCATGAGGTGGGCCACCAAGTTTGTTTGTGCCCCTGGGTCCGACATGCAACGCCTTATAGACATTACTGCCGA
GCAGCATGCCACGTGCCCTCGTGATAAAAGCGAGCTCGACGGGCACGAGAAGCTATCTGAGAAAGATAAAGAATCTATCTTGACTGCCTTGGACTCTGCTACCGCTGTCG
AGGGTGAGCTTCAGGAAGTAAGGTCCAAACTGAAGGCCACCCAGGAAAAGTTCAAACTTGCCAAGGTGGAGGTGGAGGTGGAGAAGAACTTAAAGAACCTCAAGGCAATT
TACGCCATTCTGAAGGATTTAGAGGTTGAGAAGTTCAAACTCAACAGTGCCTTCCTTGAGGAGACCTTCCGGAAGCACCCCGATTTTGACGGCTTCGCCAGGGACTTCAG
GGACACGGGGTTCAAGTTCCTAATGAAGGGCGTTGCTGAAGTGGCTCCCAACCTCGACCTCGAACCCATCAAGAAGCGATACGCAAAGCGATGGGCCTCTGATCCGAACA
ACACCCCTGGTCCCCAAAGCTTGGTGGAAAAATATTTGAAGGAGTTGAACTCTGAGTATCAGGACAATGGTAAGGATGAGGATGACCTAACCCATGAGGGTGAAGATGTT
GGCGCTTCCCGAGCCAAAGATGAAAGACTGCTCCTGTTACACCTGTTGTCAATTACTCTCAGGAGGCTGGACCCGAGGCATATATGCCTTAGTCCTGTCTTACAGCTTTT
CTTATTGAACAGGTCGGCTCTTGGGTATCAACATGCCTCTGAACCTTCCGTAGGGCCAAATGTCCGACCTGAAAAAGTCAAGGTCGAACTCGAGCGTTGTAATGTCTGGC
CATCGTCGGCATTCCGATCTCGACAGGAACTACGGCCTCGATGCGAAAAGCCAATAAGAAAGGGGTCTCTCCAATTGACTTACGGGGAGTCATTTTATAGTACCAGAGGA
CTTCAAATGGGCAGGGGACAAACTGAGGTGGCTGATGCTGAGCTTTGCGTAGAATTCTTTAAACTTCCTGTTATCAAATATATTGCCATTGTCTATCACAATGGCATTAG
GAATGCCGAAGCGACACATAATGTTGGTGTAGATGAAGGCCGTGATTTTGGTTTCAGATCCGATCACTTCTACTCCTCATTGAGCAAAGGGCCAAGGGGTAGTGATAGGG
GTGAGCAGCTTGGGTGGCTAGTGAATGATGTTTGCAAAGCGTTGGCAGTTGTCACACCTCTTCACAAATTCTTGAGCATCTCGATCTATGATGGGCTAGTAGTAACCTTT
CAGACTACCTTGGACGATAATGACCTGGCCCCTGAGTGGTTGCCACAAACTCCTTCGTGGATTTCCCTGAGAAAATCTGTCGAGACATTGCCCGTCAAGTATCCTCTGAT
TATATCCATCCATAACGGAGGTTGGGAATCGACTTCCATCACGCCTAGTTCCAAAATTGACGGAGTTTCAAATATCTCAACGGAGATCGATCTAACGAGATCGGTCTTTT
TTGAACTAGGCGAGGTGCGCTCTGACTTTGCTCAGCTTGCACCAAGCCCTTTAGCCACGCGCAGACCGGGCAACAGCACTTCATACTCGGCCTTGTTGTTTGAGGTACGA
AAATTGAAGTGGAGGGTATACTCGAAATGCGTGCCATCTGGTGCGAGTAGAAGCACTCCAGTGCCACATCCTTTGTCATTCGATGGCCCATCAACAAACATAGTCCAAGA
GAGGTTAGGCTTCAAGGTCGAGGCTGAGCTTCATCCGGTAGTTGCGAAGTACCCCGAAGGTTTCAGTCAAGTTGGCAAGGTGCGACCTGGACTGCTTGCTTTTAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGGTGAGGACAGCGATGCCTCGACCTCGGGCCATGGGGTTTCCTTCAAAGATGCCCGAAAACTACCTTAGACCGCTACGTAAGCGGTACCAAATCCCTTATAACAT
AAGTTTACTCGTACCTCGGGCTGGAGAAAAAGCTGACGATCCTTCGGAGGGATGCATCACTTTTTATCTAAACATGTTCGAATACGGGTTTCGACTGCCTGTCGACCCCT
TCATCCAGGAGACTCGTGAAGTGGAAGGTTTTGAGCTACTTGTGGCCAAGCAGCTCCTCGCCTACTTCGAGGTGAAACGAATTTCCAATAAGCCCTGCCGCTATTATTTG
TGTGCAAGGAAGGGCACAGGGGGATTATCAAGGGTCGTACTACCCTCTTGTTCTTTGTCGTTTTTGATTACTTTGCTAACTTCTCTGTTTATTCTTGCAATTGCCATCAG
ACCGATGCCCGAGCTAACGCAACCGACTTTTGACGTGTTAAATTACTACAAGGACAAATTCAAAGGTAGAGGAAAGTTCGGGACTTTGATAACCGACAAGCTACTGCTCG
CCTTCGGTTTGCTTGATTTTAACCCTCTCCTCGTTCCTGTTGAGGCCACAAGACCGAACTTGGAGTTGGGTGACAAGGCTAAGGGTGGAAGTCGCCAACCCAAGGTTGGG
TCGAGCTCATACCCAAAAGCTGCGGAGATAGCGAGTGACGCCATGGTCGAGGTGGAGCGTCTTCAAGGTCCTCCCCCTGGCGAAAAAAGACCTCGCAGTCATTCTGTTGA
GGCACTGAAGGTTTTCCTTTTGAGGGAGGTGCATGGGGGCTCTTCTCCAAAGAAGGCCAAAAAGAATAAAGAAAAAAAGACTTCTTCAGAGGGTGCTGTCTCCGAGGTTC
AGAGGAGTAGCTTCTTGGACTTGATTGACGATTCAGAGGCTAAGATGGGAGCTACTATCGATGTGGAGATTAGGTTCCATATCGAACCCTCTAGAGCCAAGATGAGGGAG
AGAGTTGAACAGATCGGAAGCGCCAACTACTTTCGATGCATGAGGTGGGCCACCAAGTTTGTTTGTGCCCCTGGGTCCGACATGCAACGCCTTATAGACATTACTGCCGA
GCAGCATGCCACGTGCCCTCGTGATAAAAGCGAGCTCGACGGGCACGAGAAGCTATCTGAGAAAGATAAAGAATCTATCTTGACTGCCTTGGACTCTGCTACCGCTGTCG
AGGGTGAGCTTCAGGAAGTAAGGTCCAAACTGAAGGCCACCCAGGAAAAGTTCAAACTTGCCAAGGTGGAGGTGGAGGTGGAGAAGAACTTAAAGAACCTCAAGGCAATT
TACGCCATTCTGAAGGATTTAGAGGTTGAGAAGTTCAAACTCAACAGTGCCTTCCTTGAGGAGACCTTCCGGAAGCACCCCGATTTTGACGGCTTCGCCAGGGACTTCAG
GGACACGGGGTTCAAGTTCCTAATGAAGGGCGTTGCTGAAGTGGCTCCCAACCTCGACCTCGAACCCATCAAGAAGCGATACGCAAAGCGATGGGCCTCTGATCCGAACA
ACACCCCTGGTCCCCAAAGCTTGGTGGAAAAATATTTGAAGGAGTTGAACTCTGAGTATCAGGACAATGGTAAGGATGAGGATGACCTAACCCATGAGGGTGAAGATGTT
GGCGCTTCCCGAGCCAAAGATGAAAGACTGCTCCTGTTACACCTGTTGTCAATTACTCTCAGGAGGCTGGACCCGAGGCATATATGCCTTAGTCCTGTCTTACAGCTTTT
CTTATTGAACAGGTCGGCTCTTGGGTATCAACATGCCTCTGAACCTTCCGTAGGGCCAAATGTCCGACCTGAAAAAGTCAAGGTCGAACTCGAGCGTTGTAATGTCTGGC
CATCGTCGGCATTCCGATCTCGACAGGAACTACGGCCTCGATGCGAAAAGCCAATAAGAAAGGGGTCTCTCCAATTGACTTACGGGGAGTCATTTTATAGTACCAGAGGA
CTTCAAATGGGCAGGGGACAAACTGAGGTGGCTGATGCTGAGCTTTGCGTAGAATTCTTTAAACTTCCTGTTATCAAATATATTGCCATTGTCTATCACAATGGCATTAG
GAATGCCGAAGCGACACATAATGTTGGTGTAGATGAAGGCCGTGATTTTGGTTTCAGATCCGATCACTTCTACTCCTCATTGAGCAAAGGGCCAAGGGGTAGTGATAGGG
GTGAGCAGCTTGGGTGGCTAGTGAATGATGTTTGCAAAGCGTTGGCAGTTGTCACACCTCTTCACAAATTCTTGAGCATCTCGATCTATGATGGGCTAGTAGTAACCTTT
CAGACTACCTTGGACGATAATGACCTGGCCCCTGAGTGGTTGCCACAAACTCCTTCGTGGATTTCCCTGAGAAAATCTGTCGAGACATTGCCCGTCAAGTATCCTCTGAT
TATATCCATCCATAACGGAGGTTGGGAATCGACTTCCATCACGCCTAGTTCCAAAATTGACGGAGTTTCAAATATCTCAACGGAGATCGATCTAACGAGATCGGTCTTTT
TTGAACTAGGCGAGGTGCGCTCTGACTTTGCTCAGCTTGCACCAAGCCCTTTAGCCACGCGCAGACCGGGCAACAGCACTTCATACTCGGCCTTGTTGTTTGAGGTACGA
AAATTGAAGTGGAGGGTATACTCGAAATGCGTGCCATCTGGTGCGAGTAGAAGCACTCCAGTGCCACATCCTTTGTCATTCGATGGCCCATCAACAAACATAGTCCAAGA
GAGGTTAGGCTTCAAGGTCGAGGCTGAGCTTCATCCGGTAGTTGCGAAGTACCCCGAAGGTTTCAGTCAAGTTGGCAAGGTGCGACCTGGACTGCTTGCTTTTAACTAG
Protein sequenceShow/hide protein sequence
MMVRTAMPRPRAMGFPSKMPENYLRPLRKRYQIPYNISLLVPRAGEKADDPSEGCITFYLNMFEYGFRLPVDPFIQETREVEGFELLVAKQLLAYFEVKRISNKPCRYYL
CARKGTGGLSRVVLPSCSLSFLITLLTSLFILAIAIRPMPELTQPTFDVLNYYKDKFKGRGKFGTLITDKLLLAFGLLDFNPLLVPVEATRPNLELGDKAKGGSRQPKVG
SSSYPKAAEIASDAMVEVERLQGPPPGEKRPRSHSVEALKVFLLREVHGGSSPKKAKKNKEKKTSSEGAVSEVQRSSFLDLIDDSEAKMGATIDVEIRFHIEPSRAKMRE
RVEQIGSANYFRCMRWATKFVCAPGSDMQRLIDITAEQHATCPRDKSELDGHEKLSEKDKESILTALDSATAVEGELQEVRSKLKATQEKFKLAKVEVEVEKNLKNLKAI
YAILKDLEVEKFKLNSAFLEETFRKHPDFDGFARDFRDTGFKFLMKGVAEVAPNLDLEPIKKRYAKRWASDPNNTPGPQSLVEKYLKELNSEYQDNGKDEDDLTHEGEDV
GASRAKDERLLLLHLLSITLRRLDPRHICLSPVLQLFLLNRSALGYQHASEPSVGPNVRPEKVKVELERCNVWPSSAFRSRQELRPRCEKPIRKGSLQLTYGESFYSTRG
LQMGRGQTEVADAELCVEFFKLPVIKYIAIVYHNGIRNAEATHNVGVDEGRDFGFRSDHFYSSLSKGPRGSDRGEQLGWLVNDVCKALAVVTPLHKFLSISIYDGLVVTF
QTTLDDNDLAPEWLPQTPSWISLRKSVETLPVKYPLIISIHNGGWESTSITPSSKIDGVSNISTEIDLTRSVFFELGEVRSDFAQLAPSPLATRRPGNSTSYSALLFEVR
KLKWRVYSKCVPSGASRSTPVPHPLSFDGPSTNIVQERLGFKVEAELHPVVAKYPEGFSQVGKVRPGLLAFN