; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g14340 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g14340
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC110412945
Genome locationchr9:12236027..12242440
RNA-Seq ExpressionMoc09g14340
SyntenyMoc09g14340
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]1.6e-8752.01Show/hide
Query:  MFYNGLNGQTRTILDAAAGGTLLSKTLENAYILLEDMAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAAD-TYS
        MFYNGLNGQTRTI+DAA+GGTL+SKT E A  LLE+MA+N++QWP+ER+ AK+VAG++E++ +++L AQV  L++ IS L+      S E VA+      
Subjt:  MFYNGLNGQTRTILDAAAGGTLLSKTLENAYILLEDMAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAAD-TYS

Query:  YYEPTIEQAQYVNNINFCYKGNQQQSSLPTHYHPGLRNHENFSYANNRNVLQP--PPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLE
          E + EQ QYVNN N+ Y+GN     +P +YHPGLRNHEN SY N +NVLQP  PPGF SQP+EKK SLED + +F+ E+ +R  + +++++ +E    
Subjt:  YYEPTIEQAQYVNNINFCYKGNQQQSSLPTHYHPGLRNHENFSYANNRNVLQP--PPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLE

Query:  GNTTAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPREHCKAVTLRSGKELQEPEKKKMEEPVITNEEWENKEEVVKE--ATPTLQ--DDKPTSSIASSL
             +KN+EV IGQ+A+T+N  Q+G FPS+TEVNP+E CKA+TLRSGKE++    K+ +         ++K++V +E     TL+  D  PT S   + 
Subjt:  GNTTAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPREHCKAVTLRSGKELQEPEKKKMEEPVITNEEWENKEEVVKE--ATPTLQ--DDKPTSSIASSL

Query:  P---NSLPYPQRFQKKKIDAQFAKFLEIFKKLHINIPFADALEQMPNY
        P     LPYPQRFQK+K+D QF+KFL+IFKK+HINIPFADALEQMPNY
Subjt:  P---NSLPYPQRFQKKKIDAQFAKFLEIFKKLHINIPFADALEQMPNY

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]1.6e-8752.01Show/hide
Query:  MFYNGLNGQTRTILDAAAGGTLLSKTLENAYILLEDMAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAAD-TYS
        MFYNGLNGQTRTI+DAA+GGTL+SKT E A  LLE+MA+N++QWP+ER+ AK+VAG++E++ +++L AQV  L++ IS L+      S E VA+      
Subjt:  MFYNGLNGQTRTILDAAAGGTLLSKTLENAYILLEDMAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAAD-TYS

Query:  YYEPTIEQAQYVNNINFCYKGNQQQSSLPTHYHPGLRNHENFSYANNRNVLQP--PPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLE
          E + EQ QYVNN N+ Y+GN     +P +YHPGLRNHEN SY N +NVLQP  PPGF SQP+EKK SLED + +F+ E+ +R  + +++++ +E    
Subjt:  YYEPTIEQAQYVNNINFCYKGNQQQSSLPTHYHPGLRNHENFSYANNRNVLQP--PPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLE

Query:  GNTTAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPREHCKAVTLRSGKELQEPEKKKMEEPVITNEEWENKEEVVKE--ATPTLQ--DDKPTSSIASSL
             +KN+EV IGQ+A+T+N  Q+G FPS+TEVNP+E CKA+TLRSGKE++    K+ +         ++K++V +E     TL+  D  PT S   + 
Subjt:  GNTTAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPREHCKAVTLRSGKELQEPEKKKMEEPVITNEEWENKEEVVKE--ATPTLQ--DDKPTSSIASSL

Query:  P---NSLPYPQRFQKKKIDAQFAKFLEIFKKLHINIPFADALEQMPNY
        P     LPYPQRFQK+K+D QF+KFL+IFKK+HINIPFADALEQMPNY
Subjt:  P---NSLPYPQRFQKKKIDAQFAKFLEIFKKLHINIPFADALEQMPNY

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]2.7e-8752.59Show/hide
Query:  MFYNGLNGQTRTILDAAAGGTLLSKTLENAYILLEDMAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAAD-TYS
        MFYNGLNGQTRTI+DAA+GGTL+SKT E A  LLE+MA+N++QWP+ER+ AK+VAG+++++ +++L AQV  L++ IS L+      S E +A+      
Subjt:  MFYNGLNGQTRTILDAAAGGTLLSKTLENAYILLEDMAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAAD-TYS

Query:  YYEPTIEQAQYVNNINFCYKGNQQQSSLPTHYHPGLRNHENFSYANNRNVLQP--PPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLE
          E + EQ QYVNN N+ Y+GN     +P +YHPGLRNHEN SY N +NVLQP  PPGF SQP+E+K SLED + +F+ E+ +R  + +++++ +E    
Subjt:  YYEPTIEQAQYVNNINFCYKGNQQQSSLPTHYHPGLRNHENFSYANNRNVLQP--PPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLE

Query:  GNTTAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPREHCKAVTLRSGKELQ-EPEKKKMEEPVITN-EEWENKEEVVKEATPTLQ--DDKPTSSIASSL
            AIKN+EV IGQ+A+T+N  Q+G FPS+TEVNP+E CKA+TLRSGKE++  P K+    P   N  + +NK E  +    TL+  D  PT S   + 
Subjt:  GNTTAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPREHCKAVTLRSGKELQ-EPEKKKMEEPVITN-EEWENKEEVVKEATPTLQ--DDKPTSSIASSL

Query:  P---NSLPYPQRFQKKKIDAQFAKFLEIFKKLHINIPFADALEQMPNY
        P     LPYPQRFQK+K+D QF+KFL+IFKK+HINIPFADALEQMPNY
Subjt:  P---NSLPYPQRFQKKKIDAQFAKFLEIFKKLHINIPFADALEQMPNY

XP_022157708.1 uncharacterized protein LOC111024361 [Momordica charantia]2.5e-12557.29Show/hide
Query:  MFYNGLNGQTRTILDAAAGGTLLSKTLENAYILLEDMAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAADTYSY
        MFYNGLNGQTRTILDAAAGGTLLS+T ENAYILL+DMA NSFQWPSERS AK+VAGMYEIDE+SSLKAQVQALTNA+SKLSGPGT HS ELVAA DTYSY
Subjt:  MFYNGLNGQTRTILDAAAGGTLLSKTLENAYILLEDMAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAADTYSY

Query:  YEPTIEQAQYVNNINFCYKGNQQQSSLPTHYHPGLRNHENFSYANNRNVLQPPPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLEGNT
        YEPTIEQAQ                                              FTS P EKKSSLEDLLGAFINE RSRASRIENQVEGMEVKLEGNT
Subjt:  YEPTIEQAQYVNNINFCYKGNQQQSSLPTHYHPGLRNHENFSYANNRNVLQPPPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLEGNT

Query:  TAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPREHCKAVTLRSGKELQEPEKKKMEEPVITNEEWENKEEVVKEATPTLQDDKPTSSIASSLPNSLPYP
        T+IKNMEV IGQ+A TLNTMQKGKFPSD EV PREHCKAVTLRSGKELQEPEKKKMEEPVIT EE ENKEEVVKEATP LQ DKPTSSI SS PNSLPYP
Subjt:  TAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPREHCKAVTLRSGKELQEPEKKKMEEPVITNEEWENKEEVVKEATPTLQDDKPTSSIASSLPNSLPYP

Query:  Q----------RFQ------KKKIDA-------------------------------------QFAKFL----------------EIFKKLH--------
        Q          RF       K+K++A                                      F K L                ++  K+         
Subjt:  Q----------RFQ------KKKIDA-------------------------------------QFAKFL----------------EIFKKLH--------

Query:  --------------------------INIPFADALEQMPNYTVVFDISRAMKYPKEVSTCHRIDVADATVAEIRYFVSFDDALEKCMFGYDIEEIGELDQ
                                  I++       ++    VVFDIS AMKY +EVSTCHRIDV DATVAEIR FVSF DA EKCM G  IE IGELDQ
Subjt:  --------------------------INIPFADALEQMPNYTVVFDISRAMKYPKEVSTCHRIDVADATVAEIRYFVSFDDALEKCMFGYDIEEIGELDQ

Query:  E
        E
Subjt:  E

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]9.9e-9052.89Show/hide
Query:  MFYNGLNGQTRTILDAAAGGTLLSKTLENAYILLEDMAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAAD-TYS
        MFYNGLNGQTRTI+DAA+GGTL+SKT E A  LLE+MA+N++QWP+ER+ AK+VAG++E++  ++L AQV +L++ +S L+        E VAA+  T  
Subjt:  MFYNGLNGQTRTILDAAAGGTLLSKTLENAYILLEDMAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAAD-TYS

Query:  YYEPTIEQAQYVNNINFCYKGNQQQSSLPTHYHPGLRNHENFSYANNRNVLQPPPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLEGN
          E + EQ QY+NN N+ Y+GN     +P +YHPGLRNHENFSY N +NVLQPPPGF SQP+EKK SLED + +F+ E+++   + ++Q++ +E      
Subjt:  YYEPTIEQAQYVNNINFCYKGNQQQSSLPTHYHPGLRNHENFSYANNRNVLQPPPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLEGN

Query:  TTAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPREHCKAVTLRSGKELQ-EPEKKKMEEPVITNE-EWENK--EEVVKEATPTLQDDKPTSSIASSLP-
           +KN+EV IGQ+A+T+N  Q+G FPS+TEVNP+E CKA+TLRSG+E++  P K+    P   N  + +NK  EE + E T    D  P+ S   + P 
Subjt:  TTAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPREHCKAVTLRSGKELQ-EPEKKKMEEPVITNE-EWENK--EEVVKEATPTLQDDKPTSSIASSLP-

Query:  --NSLPYPQRFQKKKIDAQFAKFLEIFKKLHINIPFADALEQMPNY
            LPYPQRFQK+K+D QF+KFL+IFKK+HINIPFADALEQMPNY
Subjt:  --NSLPYPQRFQKKKIDAQFAKFLEIFKKLHINIPFADALEQMPNY

TrEMBL top hitse value%identityAlignment
A0A2I4FP56 uncharacterized protein LOC1090008371.8e-6037.59Show/hide
Query:  MFYNGLNGQTRTILDAAAGGTLLSKTLENAYILLEDMAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAAD-TYS
        MFY   NGQT+T +DA +GG L+SKT E    LLE+MA++++QWP++R+ AK+VAG++E++ ++++ AQV  L++ IS L       S E VAA   T  
Subjt:  MFYNGLNGQTRTILDAAAGGTLLSKTLENAYILLEDMAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAAD-TYS

Query:  YYEPTIEQAQYVNNINFCYKGNQQQSSLPTHYHPGLRNHENFSYANNRNVL--QPPPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLE
          E + EQ QY+NN N+ Y+GN     +P H+HPGLRNHEN SY N +NVL  QPP GF SQ ++KK SLE+ + +F+ E+ ++    ++Q++ +E    
Subjt:  YYEPTIEQAQYVNNINFCYKGNQQQSSLPTHYHPGLRNHENFSYANNRNVL--QPPPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLE

Query:  GNTTAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPREHCKAVTLRSGKELQEPEKKKMEEPVITNEEWENKEEVVKEATPTLQDDKPTSSIASSLPNS-
            AIKN+EV IGQ+A+T+N  Q+  FPS+TEVNP E  KA+TLRSG+E+++P  K+           ++K +V +E     + DKP   +A S P++ 
Subjt:  GNTTAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPREHCKAVTLRSGKELQEPEKKKMEEPVITNEEWENKEEVVKEATPTLQDDKPTSSIASSLPNS-

Query:  ------LPYPQ----------RFQK----KKIDAQFAKFLEIFKKLHIN----------IPF------------ADALEQMPNYTVVFDISRAMKYPKEV
              LPYPQ          +F K    KK   +  + +++F++ +I           +PF             +   ++    V+F+I +  + P+E 
Subjt:  ------LPYPQ----------RFQK----KKIDAQFAKFLEIFKKLHIN----------IPF------------ADALEQMPNYTVVFDISRAMKYPKEV

Query:  STCHRIDVADATVAE
        STC R+DV    V E
Subjt:  STCHRIDVADATVAE

A0A2P5AMA4 Uncharacterized protein1.0e-6346.86Show/hide
Query:  MAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAAD-TYSYYEPTIEQAQYVNNINFCYKGNQQQSSLPTHYHPGL
        MA+N++QWP+E S   RVA + E+D +++L  QV ALT+ IS L+      S+E +  ++ ++      +EQ  Y+NN NF Y+GN   ++LPTHYHP L
Subjt:  MAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAAD-TYSYYEPTIEQAQYVNNINFCYKGNQQQSSLPTHYHPGL

Query:  RNHENFSYANNRNVLQPPPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLEGNTTAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPRE
        RNHENFSYANNRNVLQPP GF     EKK SL+D+L  FI E++ R ++ E +++ +E         +K++EV I Q+A+++      KFPSDTE NP++
Subjt:  RNHENFSYANNRNVLQPPPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLEGNTTAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPRE

Query:  HCKAVTLRSGKELQEPEKKKMEEPVITNEEWENKEEVVKEATPTLQDDKPTSSIASSLPNSLPYPQRFQKKKIDAQFAKFLEIFKKLHINIPFADALEQM
        HCK +TLRS KE++ P++K         +  E   EV+ +       D P       +   L YPQRFQKKK+D+QFAKF+EIFKKLHINIPFADAL+QM
Subjt:  HCKAVTLRSGKELQEPEKKKMEEPVITNEEWENKEEVVKEATPTLQDDKPTSSIASSLPNSLPYPQRFQKKKIDAQFAKFLEIFKKLHINIPFADALEQM

Query:  PNY
         NY
Subjt:  PNY

A0A6J1DU19 uncharacterized protein LOC1110243611.2e-12557.29Show/hide
Query:  MFYNGLNGQTRTILDAAAGGTLLSKTLENAYILLEDMAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAADTYSY
        MFYNGLNGQTRTILDAAAGGTLLS+T ENAYILL+DMA NSFQWPSERS AK+VAGMYEIDE+SSLKAQVQALTNA+SKLSGPGT HS ELVAA DTYSY
Subjt:  MFYNGLNGQTRTILDAAAGGTLLSKTLENAYILLEDMAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAADTYSY

Query:  YEPTIEQAQYVNNINFCYKGNQQQSSLPTHYHPGLRNHENFSYANNRNVLQPPPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLEGNT
        YEPTIEQAQ                                              FTS P EKKSSLEDLLGAFINE RSRASRIENQVEGMEVKLEGNT
Subjt:  YEPTIEQAQYVNNINFCYKGNQQQSSLPTHYHPGLRNHENFSYANNRNVLQPPPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLEGNT

Query:  TAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPREHCKAVTLRSGKELQEPEKKKMEEPVITNEEWENKEEVVKEATPTLQDDKPTSSIASSLPNSLPYP
        T+IKNMEV IGQ+A TLNTMQKGKFPSD EV PREHCKAVTLRSGKELQEPEKKKMEEPVIT EE ENKEEVVKEATP LQ DKPTSSI SS PNSLPYP
Subjt:  TAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPREHCKAVTLRSGKELQEPEKKKMEEPVITNEEWENKEEVVKEATPTLQDDKPTSSIASSLPNSLPYP

Query:  Q----------RFQ------KKKIDA-------------------------------------QFAKFL----------------EIFKKLH--------
        Q          RF       K+K++A                                      F K L                ++  K+         
Subjt:  Q----------RFQ------KKKIDA-------------------------------------QFAKFL----------------EIFKKLH--------

Query:  --------------------------INIPFADALEQMPNYTVVFDISRAMKYPKEVSTCHRIDVADATVAEIRYFVSFDDALEKCMFGYDIEEIGELDQ
                                  I++       ++    VVFDIS AMKY +EVSTCHRIDV DATVAEIR FVSF DA EKCM G  IE IGELDQ
Subjt:  --------------------------INIPFADALEQMPNYTVVFDISRAMKYPKEVSTCHRIDVADATVAEIRYFVSFDDALEKCMFGYDIEEIGELDQ

Query:  E
        E
Subjt:  E

A0A6J1E1F6 uncharacterized protein LOC1110255512.4e-6557.69Show/hide
Query:  MEVKLEGNTTAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPREHCKAVTLRSGKELQEPEKKKMEEPVITNEEWENKEEVVKEATPTLQDDKPTSSIAS
        MEVKLEGNTTAIKNMEV IGQMASTLN MQKGKFPSDTEVNPREHCKAVTLRSGKELQEPEKKKMEEPVIT EE ENKEEVVKEATPTLQ DKPTSSIAS
Subjt:  MEVKLEGNTTAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPREHCKAVTLRSGKELQEPEKKKMEEPVITNEEWENKEEVVKEATPTLQDDKPTSSIAS

Query:  SLPNSLPYPQ----------RFQ------KKKIDA-------------------------------------QFAKFL----------------------
        SLPNSLPYPQ          RF       K+K++A                                      F K L                      
Subjt:  SLPNSLPYPQ----------RFQ------KKKIDA-------------------------------------QFAKFL----------------------

Query:  ----EIFKKLHINIPFADALEQMPNYTVVFDISRAMKYPKEVSTCHRIDVADATVAEIRYFVSFDDALEKCMFGYDIEEIGELDQE
             +  +  I++       ++    VVFDISRAMKYPKEVSTC  IDV DATVAEIR FV F DALEKCM G DIEEIGELDQE
Subjt:  ----EIFKKLHINIPFADALEQMPNYTVVFDISRAMKYPKEVSTCHRIDVADATVAEIRYFVSFDDALEKCMFGYDIEEIGELDQE

A0A6P9DWY0 uncharacterized protein LOC1183440261.8e-6551.72Show/hide
Query:  MFYNGLNGQTRTILDAAAGGTLLSKTLENAYILLEDMAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAAD-TYS
        MFYNGLNGQT+TI+DAA+GGTL+SKT E A  LLE+MA+N++QWP ER+  K+VAG++E++ +++L AQV +L++ IS L+      S E VAA   T  
Subjt:  MFYNGLNGQTRTILDAAAGGTLLSKTLENAYILLEDMAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAAD-TYS

Query:  YYEPTIEQAQYVNNINFCYKGNQQQSSLPTHYHPGLRNHENFSYANNRNVLQPPPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLEGN
          E + EQ QY+NN N+ Y+GN     +P +YH GLRNHEN SY N +NVLQP PGF SQP+EKK SLED + +F+ E+ +R  + +++++ +E      
Subjt:  YYEPTIEQAQYVNNINFCYKGNQQQSSLPTHYHPGLRNHENFSYANNRNVLQPPPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLEGN

Query:  TTAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPREHCKAVTLRSGKELQEPEKKKMEEPV
          AIKN+EV IGQ+A+T+N  Q+G FPS+TEVNPRE CKA+TLRSG+EL      +   P+
Subjt:  TTAIKNMEVHIGQMASTLNTMQKGKFPSDTEVNPREHCKAVTLRSGKELQEPEKKKMEEPV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTACAATGGACTGAATGGACAAACAAGGACTATACTGGATGCTGCAGCAGGAGGCACTTTATTATCCAAAACACTTGAGAATGCTTACATATTACTGGAGGACAT
GGCAGCCAATAGTTTCCAATGGCCTAGTGAGAGATCGTATGCCAAAAGAGTTGCTGGAATGTATGAAATCGATGAGGTAAGTTCCCTAAAAGCTCAAGTTCAAGCTTTGA
CTAATGCTATCTCTAAACTTTCAGGACCAGGAACTTTTCATTCAAAAGAGTTGGTGGCAGCAGCAGATACATATTCTTACTATGAGCCAACCATCGAGCAAGCTCAGTAT
GTCAATAATATAAATTTTTGCTACAAGGGAAATCAGCAACAAAGCTCGTTGCCAACACACTATCATCCAGGGTTAAGGAATCATGAAAATTTTTCTTATGCTAACAACAG
GAATGTTTTGCAACCTCCACCAGGTTTTACATCTCAGCCAACTGAAAAGAAATCCTCCCTTGAGGATCTACTTGGGGCTTTCATCAATGAGTCTAGAAGTCGAGCTAGTC
GGATTGAAAATCAGGTAGAAGGGATGGAAGTTAAATTGGAAGGAAACACAACTGCCATCAAGAACATGGAGGTGCATATAGGGCAAATGGCATCCACATTGAACACTATG
CAGAAAGGGAAGTTTCCAAGTGACACTGAAGTTAACCCACGAGAACATTGCAAAGCCGTCACTTTGAGAAGCGGAAAGGAGCTCCAGGAGCCTGAAAAGAAAAAAATGGA
AGAACCAGTCATCACAAATGAGGAATGGGAAAATAAGGAGGAAGTTGTAAAGGAGGCCACTCCTACTCTACAGGATGACAAGCCTACTAGTTCTATTGCTTCTAGTCTTC
CTAACTCTTTACCTTATCCTCAGCGTTTCCAAAAGAAAAAGATTGATGCTCAATTTGCTAAATTTTTAGAAATTTTTAAGAAACTTCACATTAATATTCCTTTTGCAGAT
GCACTGGAACAAATGCCAAATTATACCGTTGTGTTCGACATATCTCGTGCCATGAAATATCCCAAGGAGGTAAGTACATGCCATAGGATAGATGTTGCTGATGCCACTGT
GGCTGAAATAAGATATTTTGTTTCATTTGATGATGCCCTTGAAAAATGCATGTTTGGTTATGATATTGAGGAAATAGGGGAATTAGATCAGGAGCGCCATGGCGGATTCA
AGGGCGCTACAGCGCTCTGCAGCGCTTGGCCCAACAGAATTTCCCGAGAGCAGAGCGCCATGGTGCTCACTTATAGCGTCATGGCACTGTCGCGATATCTGGGATTGCGT
GTTCTTTTAGAGCGATTTTGGGAGTTCAAGTTGTGGGTTTTTGTCAAATCCACACGTTCATCCCAACCAACGTTTTTGTCGTATTGTTGTGTTTGTACTGTTATGGCTCC
AAATCGAGCACGTGGTGTCTCCTCTTCTTGTTCATATGATAGAAAAAAATTTATTAATGAGGAGGCATCCACTTGGTTCTCTACTGTAGAGGCCCATAAAAGTCTTATTC
AGGAACGGGGCCTATTGCTTAATGAGGTCCACCAACAGAGCATGTATGAGAATATTGTTTCTAGAAGGTGGGTCAATTTTTTTCAGCAGCCCAATGTCGCCGTGGTCCCC
CTTGTTCGTGAATTTTATGCTAATATCCAGGAGGGGTCCAATATTTCTTTTGTCCGGGGAAGAAGCGTGGCTTTCGACAGTATCTCTATCAATGAATTTTTTGAGCTTCC
CAATTTTGATCGGGACGACTATAATCGGTATGCTAGTGAGGAGCTTGACATGGATCAGATCCTTAGTAGTTTATGCCGCCCGAATGCTGAATGGAAAATGCGGCACAGTG
AAGCCATTACCTTCAAATCTGCAGATCTGAGTGTTCACAATAAGAGCATTAAGCATGCAAGACGCGGTACCACAACCGGTGGCCTTCCCCATCCCATGTTGATAACCGAT
CTCTGTAAGAAGGTTGGTGTTGTTTGCGACCCCACGGAGACATTCCTAGGTCCCAAAAGTGTGATGGACAAGAATTACATTTTGTCTATCCGTGGGTGGAAGCCCCTTAT
GGAGATTGACGGCCCTACGGAAGAACAACCGGAAGTCCATGCCGACCCACCTCCACAGCAGCGCGCTCCCCGACCACTTGAGGAGGAAGTTCGCCATCTTTCACGCCAGT
TCCATCGTTTTCAGGTGAATCGGAGGCTGCAATTTGACTATCTTGTTCAATGCTTTCAGGCTCAACAATCATCGCAACCTCTTCCTCCACTTCCATAA
mRNA sequenceShow/hide mRNA sequence
ATGTTTTACAATGGACTGAATGGACAAACAAGGACTATACTGGATGCTGCAGCAGGAGGCACTTTATTATCCAAAACACTTGAGAATGCTTACATATTACTGGAGGACAT
GGCAGCCAATAGTTTCCAATGGCCTAGTGAGAGATCGTATGCCAAAAGAGTTGCTGGAATGTATGAAATCGATGAGGTAAGTTCCCTAAAAGCTCAAGTTCAAGCTTTGA
CTAATGCTATCTCTAAACTTTCAGGACCAGGAACTTTTCATTCAAAAGAGTTGGTGGCAGCAGCAGATACATATTCTTACTATGAGCCAACCATCGAGCAAGCTCAGTAT
GTCAATAATATAAATTTTTGCTACAAGGGAAATCAGCAACAAAGCTCGTTGCCAACACACTATCATCCAGGGTTAAGGAATCATGAAAATTTTTCTTATGCTAACAACAG
GAATGTTTTGCAACCTCCACCAGGTTTTACATCTCAGCCAACTGAAAAGAAATCCTCCCTTGAGGATCTACTTGGGGCTTTCATCAATGAGTCTAGAAGTCGAGCTAGTC
GGATTGAAAATCAGGTAGAAGGGATGGAAGTTAAATTGGAAGGAAACACAACTGCCATCAAGAACATGGAGGTGCATATAGGGCAAATGGCATCCACATTGAACACTATG
CAGAAAGGGAAGTTTCCAAGTGACACTGAAGTTAACCCACGAGAACATTGCAAAGCCGTCACTTTGAGAAGCGGAAAGGAGCTCCAGGAGCCTGAAAAGAAAAAAATGGA
AGAACCAGTCATCACAAATGAGGAATGGGAAAATAAGGAGGAAGTTGTAAAGGAGGCCACTCCTACTCTACAGGATGACAAGCCTACTAGTTCTATTGCTTCTAGTCTTC
CTAACTCTTTACCTTATCCTCAGCGTTTCCAAAAGAAAAAGATTGATGCTCAATTTGCTAAATTTTTAGAAATTTTTAAGAAACTTCACATTAATATTCCTTTTGCAGAT
GCACTGGAACAAATGCCAAATTATACCGTTGTGTTCGACATATCTCGTGCCATGAAATATCCCAAGGAGGTAAGTACATGCCATAGGATAGATGTTGCTGATGCCACTGT
GGCTGAAATAAGATATTTTGTTTCATTTGATGATGCCCTTGAAAAATGCATGTTTGGTTATGATATTGAGGAAATAGGGGAATTAGATCAGGAGCGCCATGGCGGATTCA
AGGGCGCTACAGCGCTCTGCAGCGCTTGGCCCAACAGAATTTCCCGAGAGCAGAGCGCCATGGTGCTCACTTATAGCGTCATGGCACTGTCGCGATATCTGGGATTGCGT
GTTCTTTTAGAGCGATTTTGGGAGTTCAAGTTGTGGGTTTTTGTCAAATCCACACGTTCATCCCAACCAACGTTTTTGTCGTATTGTTGTGTTTGTACTGTTATGGCTCC
AAATCGAGCACGTGGTGTCTCCTCTTCTTGTTCATATGATAGAAAAAAATTTATTAATGAGGAGGCATCCACTTGGTTCTCTACTGTAGAGGCCCATAAAAGTCTTATTC
AGGAACGGGGCCTATTGCTTAATGAGGTCCACCAACAGAGCATGTATGAGAATATTGTTTCTAGAAGGTGGGTCAATTTTTTTCAGCAGCCCAATGTCGCCGTGGTCCCC
CTTGTTCGTGAATTTTATGCTAATATCCAGGAGGGGTCCAATATTTCTTTTGTCCGGGGAAGAAGCGTGGCTTTCGACAGTATCTCTATCAATGAATTTTTTGAGCTTCC
CAATTTTGATCGGGACGACTATAATCGGTATGCTAGTGAGGAGCTTGACATGGATCAGATCCTTAGTAGTTTATGCCGCCCGAATGCTGAATGGAAAATGCGGCACAGTG
AAGCCATTACCTTCAAATCTGCAGATCTGAGTGTTCACAATAAGAGCATTAAGCATGCAAGACGCGGTACCACAACCGGTGGCCTTCCCCATCCCATGTTGATAACCGAT
CTCTGTAAGAAGGTTGGTGTTGTTTGCGACCCCACGGAGACATTCCTAGGTCCCAAAAGTGTGATGGACAAGAATTACATTTTGTCTATCCGTGGGTGGAAGCCCCTTAT
GGAGATTGACGGCCCTACGGAAGAACAACCGGAAGTCCATGCCGACCCACCTCCACAGCAGCGCGCTCCCCGACCACTTGAGGAGGAAGTTCGCCATCTTTCACGCCAGT
TCCATCGTTTTCAGGTGAATCGGAGGCTGCAATTTGACTATCTTGTTCAATGCTTTCAGGCTCAACAATCATCGCAACCTCTTCCTCCACTTCCATAA
Protein sequenceShow/hide protein sequence
MFYNGLNGQTRTILDAAAGGTLLSKTLENAYILLEDMAANSFQWPSERSYAKRVAGMYEIDEVSSLKAQVQALTNAISKLSGPGTFHSKELVAAADTYSYYEPTIEQAQY
VNNINFCYKGNQQQSSLPTHYHPGLRNHENFSYANNRNVLQPPPGFTSQPTEKKSSLEDLLGAFINESRSRASRIENQVEGMEVKLEGNTTAIKNMEVHIGQMASTLNTM
QKGKFPSDTEVNPREHCKAVTLRSGKELQEPEKKKMEEPVITNEEWENKEEVVKEATPTLQDDKPTSSIASSLPNSLPYPQRFQKKKIDAQFAKFLEIFKKLHINIPFAD
ALEQMPNYTVVFDISRAMKYPKEVSTCHRIDVADATVAEIRYFVSFDDALEKCMFGYDIEEIGELDQERHGGFKGATALCSAWPNRISREQSAMVLTYSVMALSRYLGLR
VLLERFWEFKLWVFVKSTRSSQPTFLSYCCVCTVMAPNRARGVSSSCSYDRKKFINEEASTWFSTVEAHKSLIQERGLLLNEVHQQSMYENIVSRRWVNFFQQPNVAVVP
LVREFYANIQEGSNISFVRGRSVAFDSISINEFFELPNFDRDDYNRYASEELDMDQILSSLCRPNAEWKMRHSEAITFKSADLSVHNKSIKHARRGTTTGGLPHPMLITD
LCKKVGVVCDPTETFLGPKSVMDKNYILSIRGWKPLMEIDGPTEEQPEVHADPPPQQRAPRPLEEEVRHLSRQFHRFQVNRRLQFDYLVQCFQAQQSSQPLPPLP