; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS012837 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS012837
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein isoform 2
Genome locationscaffold63:3865090..3868325
RNA-Seq ExpressionMS012837
SyntenyMS012837
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649209.1 hypothetical protein Csa_014401 [Cucumis sativus]9.4e-8887.75Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRRLPNK QKFPNNATQDRPRKLQRFMDSRSSLRQGALA +RSNFQGNQFPLA EVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDA-PPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWG
        APIRPR F RRAPNW+KTR +A PPV RKPFTNG F+PKV  +APAQPQTNT PRQRPQTLDSLFANMKEQRLRVLSQRQNG GA QRNGG +QQRPPWG
Subjt:  APIRPRGFNRRAPNWSKTRFDA-PPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWG

Query:  RGRF
        +  F
Subjt:  RGRF

XP_004148996.1 uncharacterized protein LOC101210049 [Cucumis sativus]9.4e-8887.75Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRRLPNK QKFPNNATQDRPRKLQRFMDSRSSLRQGALA +RSNFQGNQFPLA EVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDA-PPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWG
        APIRPR F RRAPNW+KTR +A PPV RKPFTNG F+PKV  +APAQPQTNT PRQRPQTLDSLFANMKEQRLRVLSQRQNG GA QRNGG +QQRPPWG
Subjt:  APIRPRGFNRRAPNWSKTRFDA-PPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWG

Query:  RGRF
        +  F
Subjt:  RGRF

XP_022136543.1 uncharacterized protein LOC111008218 [Momordica charantia]1.4e-10499.51Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR
        APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAA AQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR
Subjt:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR

Query:  GRFG
        GRFG
Subjt:  GRFG

XP_022941743.1 uncharacterized protein LOC111447019 isoform X1 [Cucurbita moschata]4.3e-8586.76Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRR PNK QKFPNNATQDRPRKLQRFMD+R+SLRQGALAK+RSNFQGNQF LA EVAR AAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR
        APIRPR FNRR PNW KTR +APPVQRKPF NGTFIPK  I AP Q QTN  PRQRPQTLDSLFANMKEQRLRVLSQRQNG GA QRN G RQQRPPWGR
Subjt:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR

Query:  GRFG
        GR G
Subjt:  GRFG

XP_022982836.1 uncharacterized protein LOC111481570 isoform X1 [Cucurbita maxima]4.8e-8485.78Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MA KPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRR PNK QKFPNNATQDRPRKLQRFMD+R+SLRQGA AK+RSNFQGNQF LA EVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR
        APIRPR FNR  PNW KTR +APPVQRKPF NGTFIPK  IAAP Q QTN  PRQ+PQTLDSLFANMKEQRLRVLSQRQNG GA QRN G RQQRPPWGR
Subjt:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR

Query:  GRFG
        GR G
Subjt:  GRFG

TrEMBL top hitse value%identityAlignment
A0A0A0KUX2 Uncharacterized protein4.5e-8887.75Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRRLPNK QKFPNNATQDRPRKLQRFMDSRSSLRQGALA +RSNFQGNQFPLA EVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDA-PPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWG
        APIRPR F RRAPNW+KTR +A PPV RKPFTNG F+PKV  +APAQPQTNT PRQRPQTLDSLFANMKEQRLRVLSQRQNG GA QRNGG +QQRPPWG
Subjt:  APIRPRGFNRRAPNWSKTRFDA-PPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWG

Query:  RGRF
        +  F
Subjt:  RGRF

A0A5A7TPS5 Pentatricopeptide repeat (PPR) superfamily protein isoform 27.5e-8384.47Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRRLPNK QKFPNNATQDRPRKLQRFMDSRSSLRQGALA +RSNFQGNQF LA EVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDA-PPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQN--GNGAPQRNGGGRQQRPP
        APIRPR F RRAPNW+KTR DA PPV +K FTNG F+PKV  +APAQ QTN  PRQRPQTLDSLFANMKEQRLRVLSQRQN  G GA Q+  GGRQQRPP
Subjt:  APIRPRGFNRRAPNWSKTRFDA-PPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQN--GNGAPQRNGGGRQQRPP

Query:  WGRGRF
        WG+  F
Subjt:  WGRGRF

A0A6J1C7V3 uncharacterized protein LOC1110082187.0e-10599.51Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR
        APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAA AQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR
Subjt:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR

Query:  GRFG
        GRFG
Subjt:  GRFG

A0A6J1FPB6 uncharacterized protein LOC111447019 isoform X12.1e-8586.76Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRR PNK QKFPNNATQDRPRKLQRFMD+R+SLRQGALAK+RSNFQGNQF LA EVAR AAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR
        APIRPR FNRR PNW KTR +APPVQRKPF NGTFIPK  I AP Q QTN  PRQRPQTLDSLFANMKEQRLRVLSQRQNG GA QRN G RQQRPPWGR
Subjt:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR

Query:  GRFG
        GR G
Subjt:  GRFG

A0A6J1J5N5 uncharacterized protein LOC111481570 isoform X12.3e-8485.78Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MA KPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRR PNK QKFPNNATQDRPRKLQRFMD+R+SLRQGA AK+RSNFQGNQF LA EVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR
        APIRPR FNR  PNW KTR +APPVQRKPF NGTFIPK  IAAP Q QTN  PRQ+PQTLDSLFANMKEQRLRVLSQRQNG GA QRN G RQQRPPWGR
Subjt:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR

Query:  GRFG
        GR G
Subjt:  GRFG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G10970.1 unknown protein3.2e-3348.56Show/hide
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAP
        KP+TTE +A+TEKKMDM+LD+IIKM K NT     K++R+ NK +KF + A ++   K QR+MDSRS +RQGA AKKRSNFQGNQFP+   VARKAA A 
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAP

Query:  IRPRGFN-RRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAP---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR
         R R +N  R  N +++RF APP Q +    G F+ K          Q Q N      RQ PQTLDS FANMKE+R+R+     N +       G  QQ+
Subjt:  IRPRGFN-RRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAP---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR

Query:  ---PPWGR
            PW R
Subjt:  ---PPWGR

AT4G10970.2 unknown protein3.2e-3348.56Show/hide
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAP
        KP+TTE +A+TEKKMDM+LD+IIKM K NT     K++R+ NK +KF + A ++   K QR+MDSRS +RQGA AKKRSNFQGNQFP+   VARKAA A 
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAP

Query:  IRPRGFN-RRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAP---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR
         R R +N  R  N +++RF APP Q +    G F+ K          Q Q N      RQ PQTLDS FANMKE+R+R+     N +       G  QQ+
Subjt:  IRPRGFN-RRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAP---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR

Query:  ---PPWGR
            PW R
Subjt:  ---PPWGR

AT4G10970.3 unknown protein3.2e-3348.56Show/hide
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAP
        KP+TTE +A+TEKKMDM+LD+IIKM K NT     K++R+ NK +KF + A ++   K QR+MDSRS +RQGA AKKRSNFQGNQFP+   VARKAA A 
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAP

Query:  IRPRGFN-RRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAP---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR
         R R +N  R  N +++RF APP Q +    G F+ K          Q Q N      RQ PQTLDS FANMKE+R+R+     N +       G  QQ+
Subjt:  IRPRGFN-RRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAP---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR

Query:  ---PPWGR
            PW R
Subjt:  ---PPWGR

AT4G10970.4 unknown protein3.2e-3348.56Show/hide
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAP
        KP+TTE +A+TEKKMDM+LD+IIKM K NT     K++R+ NK +KF + A ++   K QR+MDSRS +RQGA AKKRSNFQGNQFP+   VARKAA A 
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAP

Query:  IRPRGFN-RRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAP---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR
         R R +N  R  N +++RF APP Q +    G F+ K          Q Q N      RQ PQTLDS FANMKE+R+R+     N +       G  QQ+
Subjt:  IRPRGFN-RRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAP---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR

Query:  ---PPWGR
            PW R
Subjt:  ---PPWGR

AT4G10970.5 unknown protein7.0e-2544.44Show/hide
Query:  MDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAPIRPRGFN-RRAPN-
        MDM+LD+IIKM K NT     K++R+ NK +KF + A ++   K QR+MDSRS +RQGA AKKRSNFQGNQFP+   VARKAA A  R R +N  R  N 
Subjt:  MDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAPIRPRGFN-RRAPN-

Query:  ------------WSKTRFDAPPVQRKPFTNGTFIPKVAIAAP---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR-
                    W   RF APP Q +    G F+ K          Q Q N      RQ PQTLDS FANMKE+R+R+     N +       G  QQ+ 
Subjt:  ------------WSKTRFDAPPVQRKPFTNGTFIPKVAIAAP---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR-

Query:  --PPWGR
           PW R
Subjt:  --PPWGR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCTAAGCCACTTACTACTGAGGCAATTGCCATAACCGAGAAGAAGATGGACATGGCTTTAGACGACATTATCAAAATGTCCAAAAATACTGGAAACAAAACTAG
AAAGCAAAGAAGACTTCCGAACAAAACGCAGAAATTTCCAAATAATGCTACTCAGGATAGACCTAGGAAGTTGCAGCGATTTATGGACTCAAGATCTTCTCTAAGACAGG
GGGCTTTGGCCAAAAAAAGGTCAAATTTTCAAGGGAATCAGTTTCCTTTGGCAGCAGAGGTTGCAAGAAAGGCTGCAGTTGCTCCAATTCGCCCTAGAGGTTTTAATCGG
AGGGCACCAAATTGGAGTAAGACAAGGTTTGATGCTCCACCGGTACAGAGGAAGCCTTTTACTAATGGAACCTTTATTCCCAAGGTAGCAATAGCAGCACCGGCGCAGCC
CCAGACAAATACAATGCCAAGACAGAGGCCACAAACACTTGACTCGTTGTTTGCCAACATGAAGGAGCAGAGGTTGAGGGTGTTGTCGCAGCGACAAAATGGAAATGGTG
CTCCACAACGCAATGGTGGCGGTCGCCAGCAAAGACCTCCATGGGGAAGAGGCCGCTTCGGC
mRNA sequenceShow/hide mRNA sequence
ATGGCGGCTAAGCCACTTACTACTGAGGCAATTGCCATAACCGAGAAGAAGATGGACATGGCTTTAGACGACATTATCAAAATGTCCAAAAATACTGGAAACAAAACTAG
AAAGCAAAGAAGACTTCCGAACAAAACGCAGAAATTTCCAAATAATGCTACTCAGGATAGACCTAGGAAGTTGCAGCGATTTATGGACTCAAGATCTTCTCTAAGACAGG
GGGCTTTGGCCAAAAAAAGGTCAAATTTTCAAGGGAATCAGTTTCCTTTGGCAGCAGAGGTTGCAAGAAAGGCTGCAGTTGCTCCAATTCGCCCTAGAGGTTTTAATCGG
AGGGCACCAAATTGGAGTAAGACAAGGTTTGATGCTCCACCGGTACAGAGGAAGCCTTTTACTAATGGAACCTTTATTCCCAAGGTAGCAATAGCAGCACCGGCGCAGCC
CCAGACAAATACAATGCCAAGACAGAGGCCACAAACACTTGACTCGTTGTTTGCCAACATGAAGGAGCAGAGGTTGAGGGTGTTGTCGCAGCGACAAAATGGAAATGGTG
CTCCACAACGCAATGGTGGCGGTCGCCAGCAAAGACCTCCATGGGGAAGAGGCCGCTTCGGC
Protein sequenceShow/hide protein sequence
MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAPIRPRGFNR
RAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAPAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGRGRFG