CuGenDBv2

MC10g0754 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC10g0754
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Pentatricopeptide repeat (PPR) superfamily protein isoform 2
Genome location: MC10:6307479..6311990
RNA-Seq Expression: MC10g0754
Synteny: MC10g0754
Gene Ontology terms: NA
InterPro domains: NA
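
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the field can be split into its parts and the gene span computed. The helper name `parse_location` is hypothetical, and the span assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string such as 'MC10:6307479..6311990'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("MC10:6307479..6311990")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span covers the whole gene model on chromosome MC10, including any introns and untranslated regions.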


Homology
GenBank top hits (e value, % identity, alignment)
KAE8649209.1 hypothetical protein Csa_014401 [Cucumis sativus]; e value 5.72e-111; % identity 88.06
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRRLPNK QKFPNNATQDRPRKLQRFMDSRSSLRQGALA +RSNFQGNQFPLA EVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDA-PPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWG
        APIRPR F RRAPNW+KTR +A PPV RKPFTNG F+PKV+  A AQPQTNT PRQRPQTLDSLFANMKEQRLRVLSQRQNG GA QRNGG +QQRPPWG
Subjt:  APIRPRGFNRRAPNWSKTRFDA-PPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWG

Query:  R
        +
Subjt:  R

KAG6600514.1 Nucleoside diphosphate kinase IV, chloroplastic/mitochondrial, partial [Cucurbita argyrosperma subsp. sororia]; e value 3.80e-111; % identity 79.83
Query:  LFRSGSSEDFCPNFLRDLSPIVFSSDELSSRQEMAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDS
        LF  G+SE   P+    +S IVFSSDELS  QEMAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRR PNK QKFPNNATQDRPRKLQRFMD+
Subjt:  LFRSGSSEDFCPNFLRDLSPIVFSSDELSSRQEMAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDS

Query:  RSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAPIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANM
        R+SLRQGALAK+RSNFQGNQF LA EVAR AAVAPIRPR FNRR PNW KTR +APPVQRKPF NGTFIPK+    + QP  N  PRQRPQTLDSLFANM
Subjt:  RSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAPIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANM

Query:  KEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGRGRFGN
        KEQRLRVLSQRQNG  A QRNG  RQQRPPWGRGR  N
Subjt:  KEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGRGRFGN

XP_004148996.1 uncharacterized protein LOC101210049 [Cucumis sativus]; e value 3.51e-111; % identity 88.06
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRRLPNK QKFPNNATQDRPRKLQRFMDSRSSLRQGALA +RSNFQGNQFPLA EVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDA-PPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWG
        APIRPR F RRAPNW+KTR +A PPV RKPFTNG F+PKV+  A AQPQTNT PRQRPQTLDSLFANMKEQRLRVLSQRQNG GA QRNGG +QQRPPWG
Subjt:  APIRPRGFNRRAPNWSKTRFDA-PPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWG

Query:  R
        +
Subjt:  R

XP_022136543.1 uncharacterized protein LOC111008218 [Momordica charantia]; e value 1.15e-138; % identity 100
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR
        APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR
Subjt:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR

Query:  GRFGNYA
        GRFGNYA
Subjt:  GRFGNYA

XP_022941743.1 uncharacterized protein LOC111447019 isoform X1 [Cucurbita moschata]; e value 6.24e-109; % identity 86.34
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRR PNK QKFPNNATQDRPRKLQRFMD+R+SLRQGALAK+RSNFQGNQF LA EVAR AAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR
        APIRPR FNRR PNW KTR +APPVQRKPF NGTFIPK  I A  Q QTN  PRQRPQTLDSLFANMKEQRLRVLSQRQNG GA QRNG  RQQRPPWGR
Subjt:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR

Query:  GRFGN
        GR GN
Subjt:  GRFGN

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KUX2 Uncharacterized protein; e value 1.70e-111; % identity 88.06
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRRLPNK QKFPNNATQDRPRKLQRFMDSRSSLRQGALA +RSNFQGNQFPLA EVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDA-PPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWG
        APIRPR F RRAPNW+KTR +A PPV RKPFTNG F+PKV+  A AQPQTNT PRQRPQTLDSLFANMKEQRLRVLSQRQNG GA QRNGG +QQRPPWG
Subjt:  APIRPRGFNRRAPNWSKTRFDA-PPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWG

Query:  R
        +
Subjt:  R

A0A5D3CYD8 Uncharacterized protein; e value 1.26e-104; % identity 84.73
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRRLPNK QKFPNNATQDRPRKLQRFMDSRSSLRQGALA +RSNFQGNQF LA EVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDAPP-VQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNG--APQRNGGGRQQRPP
        APIRPR F RRAPNW+KTR DAPP V +K FTNG F+PKV+  A AQ QTN  PRQRPQTLDSLFANMKEQRLRVLSQRQNG G  A Q+  GGRQQRPP
Subjt:  APIRPRGFNRRAPNWSKTRFDAPP-VQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNG--APQRNGGGRQQRPP

Query:  WGR
        WG+
Subjt:  WGR

A0A6J1C7V3 uncharacterized protein LOC111008218; e value 5.59e-139; % identity 100
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR
        APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR
Subjt:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR

Query:  GRFGNYA
        GRFGNYA
Subjt:  GRFGNYA

A0A6J1FPB6 uncharacterized protein LOC111447019 isoform X1; e value 3.02e-109; % identity 86.34
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRR PNK QKFPNNATQDRPRKLQRFMD+R+SLRQGALAK+RSNFQGNQF LA EVAR AAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR
        APIRPR FNRR PNW KTR +APPVQRKPF NGTFIPK  I A  Q QTN  PRQRPQTLDSLFANMKEQRLRVLSQRQNG GA QRNG  RQQRPPWGR
Subjt:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR

Query:  GRFGN
        GR GN
Subjt:  GRFGN

A0A6J1J5N5 uncharacterized protein LOC111481570 isoform X1; e value 7.06e-108; % identity 85.37
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV
        MA KPLTTEAIAITEKKMDMALDDIIKMSKNTGNK RKQRR PNK QKFPNNATQDRPRKLQRFMD+R+SLRQGA AK+RSNFQGNQF LA EVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAV

Query:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR
        APIRPR FNR  PNW KTR +APPVQRKPF NGTFIPK  IAA  Q QTN  PRQ+PQTLDSLFANMKEQRLRVLSQRQNG GA QRNG  RQQRPPWGR
Subjt:  APIRPRGFNRRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGR

Query:  GRFGN
        GR GN
Subjt:  GRFGN

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G10970.1 unknown protein; e value 4.2e-34; % identity 49.04
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAP
        KP+TTE +A+TEKKMDM+LD+IIKM K NT     K++R+ NK +KF + A ++   K QR+MDSRS +RQGA AKKRSNFQGNQFP+   VARKAA A 
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAP

Query:  IRPRGFN-RRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAR---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR
         R R +N  R  N +++RF APP Q +    G F+ K     R    Q Q N      RQ PQTLDS FANMKE+R+R+     N +       G  QQ+
Subjt:  IRPRGFN-RRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAR---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR

Query:  ---PPWGR
            PW R
Subjt:  ---PPWGR

AT4G10970.2 unknown protein; e value 4.2e-34; % identity 49.04
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAP
        KP+TTE +A+TEKKMDM+LD+IIKM K NT     K++R+ NK +KF + A ++   K QR+MDSRS +RQGA AKKRSNFQGNQFP+   VARKAA A 
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAP

Query:  IRPRGFN-RRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAR---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR
         R R +N  R  N +++RF APP Q +    G F+ K     R    Q Q N      RQ PQTLDS FANMKE+R+R+     N +       G  QQ+
Subjt:  IRPRGFN-RRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAR---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR

Query:  ---PPWGR
            PW R
Subjt:  ---PPWGR

AT4G10970.3 unknown protein; e value 4.2e-34; % identity 49.04
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAP
        KP+TTE +A+TEKKMDM+LD+IIKM K NT     K++R+ NK +KF + A ++   K QR+MDSRS +RQGA AKKRSNFQGNQFP+   VARKAA A 
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAP

Query:  IRPRGFN-RRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAR---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR
         R R +N  R  N +++RF APP Q +    G F+ K     R    Q Q N      RQ PQTLDS FANMKE+R+R+     N +       G  QQ+
Subjt:  IRPRGFN-RRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAR---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR

Query:  ---PPWGR
            PW R
Subjt:  ---PPWGR

AT4G10970.4 unknown protein; e value 4.2e-34; % identity 49.04
Query:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAP
        KP+TTE +A+TEKKMDM+LD+IIKM K NT     K++R+ NK +KF + A ++   K QR+MDSRS +RQGA AKKRSNFQGNQFP+   VARKAA A 
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAP

Query:  IRPRGFN-RRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAR---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR
         R R +N  R  N +++RF APP Q +    G F+ K     R    Q Q N      RQ PQTLDS FANMKE+R+R+     N +       G  QQ+
Subjt:  IRPRGFN-RRAPNWSKTRFDAPPVQRKPFTNGTFIPKVAIAAR---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR

Query:  ---PPWGR
            PW R
Subjt:  ---PPWGR

AT4G10970.5 unknown protein; e value 1.2e-25; % identity 44.93
Query:  MDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAPIRPRGFN-RRAPN-
        MDM+LD+IIKM K NT     K++R+ NK +KF + A ++   K QR+MDSRS +RQGA AKKRSNFQGNQFP+   VARKAA A  R R +N  R  N 
Subjt:  MDMALDDIIKMSK-NTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAPIRPRGFN-RRAPN-

Query:  ------------WSKTRFDAPPVQRKPFTNGTFIPKVAIAAR---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR-
                    W   RF APP Q +    G F+ K     R    Q Q N      RQ PQTLDS FANMKE+R+R+     N +       G  QQ+ 
Subjt:  ------------WSKTRFDAPPVQRKPFTNGTFIPKVAIAAR---AQPQTN---TMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQR-

Query:  --PPWGR
           PW R
Subjt:  --PPWGR
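
Each alignment above has a middle line between `Query:` and `Subjt:`. Assuming the usual BLAST-style convention, a letter on that line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Under that assumption, the % identity can be recomputed roughly as sketched below. The function name `percent_identity` is hypothetical, and the inputs are the alignment strings with the `Query:` label and leading indentation already removed.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline shows an identical residue.

    Assumes BLAST-style midlines: a letter marks an identity, '+' a
    conservative substitution, and a space a mismatch or gap.
    """
    if not query:
        raise ValueError("empty alignment")
    # Scraped midlines often lose trailing spaces; pad back to full width.
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline[: len(query)] if ch.isalpha())
    return 100.0 * identities / len(query)


# Last alignment row of the XP_022941743.1 hit, label prefix removed.
print(round(percent_identity("GRFGN", "GR GN"), 2))  # 80.0
```

The reported % identity covers the whole alignment, so to reproduce it the rows of one hit would first be concatenated (all query rows together, all middle rows together) before calling the function.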


Sequences
CDS sequence
ATGCGGACTCCAAATCCTAATCAAACCTCCTTAAATATTTTTCTACGTGAAGTATCTTCGCAACGACGTCGTTCCTTCCACCCTATATATTACCTTGCTTTGCCCTATTT
TTTTCACCCCGCGTCTCCTTTTCCTTCGGCGGTGAGGAATCAACTCTCCCGCGAACCTCTGGAATTGTTCCGTTCAGGATCTTCGGAAGATTTCTGCCCTAATTTTCTTC
GTGATCTTTCTCCGATCGTTTTCTCGAGCGACGAATTATCTTCTCGTCAAGAGATGGCGGCTAAGCCACTTACTACTGAGGCAATTGCCATAACCGAGAAGAAGATGGAC
ATGGCTTTAGACGACATTATCAAAATGTCCAAAAATACTGGAAACAAAACTAGAAAGCAAAGACGGCTTCCGAACAAAACGCAGAAATTTCCTAATAATGCTACTCAGGA
TAGACCTAGGAAGTTGCAGCGATTTATGGACTCAAGATCTTCTCTAAGACAGGGGGCTTTGGCCAAAAAAAGGTCAAATTTTCAAGGGAATCAGTTTCCTTTGGCAGCAG
AGGTTGCAAGAAAGGCTGCAGTTGCTCCAATTCGCCCTAGAGGTTTTAATCGGAGGGCACCAAATTGGAGTAAGACAAGGTTTGATGCTCCACCGGTACAGAGGAAGCCT
TTTACTAATGGAACCTTTATTCCCAAGGTAGCAATAGCAGCACGGGCGCAGCCCCAGACAAATACAATGCCAAGACAGAGGCCACAAACACTTGACTCGTTGTTTGCCAA
CATGAAGGAGCAGAGGTTGAGGGTGTTGTCGCAGCGACAAAATGGAAATGGTGCTCCACAACGCAATGGTGGCGGTCGCCAGCAAAGACCTCCATGGGGAAGAGGCCGCT
TCGGCAACTACGCATGA
mRNA sequence
CTTCATAGGGGAAAATTGGTCGCTCCACACAGGTGAATTTGCCAAGGCTAAGGCTAACCAAGATCCGATGACTTGTCACAATTTGAGTGGTTGCTTTCAAAATGCTGAGC
TGTCATCTACCCCGTACGTAAACTCCGGGCCGCTCACAGGGACAAGCAATATCTTCGGTTCCTCCAGCTAATGCGGACTCCAAATCCTAATCAAACCTCCTTAAATATTT
TTCTACGTGAAGTATCTTCGCAACGACGTCGTTCCTTCCACCCTATATATTACCTTGCTTTGCCCTATTTTTTTCACCCCGCGTCTCCTTTTCCTTCGGCGGTGAGGAAT
CAACTCTCCCGCGAACCTCTGGAATTGTTCCGTTCAGGATCTTCGGAAGATTTCTGCCCTAATTTTCTTCGTGATCTTTCTCCGATCGTTTTCTCGAGCGACGAATTATC
TTCTCGTCAAGAGATGGCGGCTAAGCCACTTACTACTGAGGCAATTGCCATAACCGAGAAGAAGATGGACATGGCTTTAGACGACATTATCAAAATGTCCAAAAATACTG
GAAACAAAACTAGAAAGCAAAGACGGCTTCCGAACAAAACGCAGAAATTTCCTAATAATGCTACTCAGGATAGACCTAGGAAGTTGCAGCGATTTATGGACTCAAGATCT
TCTCTAAGACAGGGGGCTTTGGCCAAAAAAAGGTCAAATTTTCAAGGGAATCAGTTTCCTTTGGCAGCAGAGGTTGCAAGAAAGGCTGCAGTTGCTCCAATTCGCCCTAG
AGGTTTTAATCGGAGGGCACCAAATTGGAGTAAGACAAGGTTTGATGCTCCACCGGTACAGAGGAAGCCTTTTACTAATGGAACCTTTATTCCCAAGGTAGCAATAGCAG
CACGGGCGCAGCCCCAGACAAATACAATGCCAAGACAGAGGCCACAAACACTTGACTCGTTGTTTGCCAACATGAAGGAGCAGAGGTTGAGGGTGTTGTCGCAGCGACAA
AATGGAAATGGTGCTCCACAACGCAATGGTGGCGGTCGCCAGCAAAGACCTCCATGGGGAAGAGGCCGCTTCGGCAACTACGCATGAAGAAACTTCATGCAATGGATGGA
GAAATTTGTGTGGGAATTGTAGATGTTGCGCCCAATAGTAGTAGCCAGTTGATAGGGGACAAAAAGTAATGTAGTTGTTACCTGTCATTTTTTCTTTAACCAAACCCTTT
ACATTTTAGCATTTTCTAGCTTGATGATGATTATGGACCAAGTACTATAGTTTTGTATTTTTCAGTTTGTGTAAAGTTTCTTCTTAGTCTGTTT
Protein sequence
MRTPNPNQTSLNIFLREVSSQRRRSFHPIYYLALPYFFHPASPFPSAVRNQLSREPLELFRSGSSEDFCPNFLRDLSPIVFSSDELSSRQEMAAKPLTTEAIAITEKKMD
MALDDIIKMSKNTGNKTRKQRRLPNKTQKFPNNATQDRPRKLQRFMDSRSSLRQGALAKKRSNFQGNQFPLAAEVARKAAVAPIRPRGFNRRAPNWSKTRFDAPPVQRKP
FTNGTFIPKVAIAARAQPQTNTMPRQRPQTLDSLFANMKEQRLRVLSQRQNGNGAPQRNGGGRQQRPPWGRGRFGNYA
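
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1); the page does not say which table was used, and `translate` is a hypothetical helper. It is shown on the first 27 and last 15 nucleotides of the CDS, both of which are in frame.

```python
from itertools import product

# Standard genetic code (assumed), in the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCGGACTCCAAATCCTAATCAAACC"))  # MRTPNPNQT
print(translate("GGCAACTACGCATGA"))              # GNYA
```

Both fragments agree with the start (`MRTPNPNQT`) and end (`GNYA`) of the listed protein sequence. Passing the full CDS, with line breaks included, would translate the complete protein in the same way.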