; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002925 (gene) of Snake gourd v1 genome

Gene IDTan0002925
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein isoform 2
Genome locationLG01:10579449..10583715
RNA-Seq ExpressionTan0002925
SyntenyTan0002925
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600514.1 Nucleoside diphosphate kinase IV, chloroplastic/mitochondrial, partial [Cucurbita argyrosperma subsp. sororia]4.1e-8886.57Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKN+GNK RKQRRFPNK+QKFPN+A QDRPRKLQRF D+R+SLRQGALAKRRSNFQGNQFALATEVAR AAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV

Query:  APIRPRVFNRRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGGRQQRPPWGRARFG
        APIRPR FNRR PNW KTRVEAPPVQRKPF NG FIPK+ AP Q Q NA PRQ+PQTLDSLFANMKEQRLR LSQRQNG AQQRNG RQQRPPWGR R  
Subjt:  APIRPRVFNRRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGGRQQRPPWGRARFG

Query:  N
        N
Subjt:  N

KAG7031152.1 hypothetical protein SDJN02_05192, partial [Cucurbita argyrosperma subsp. argyrosperma]4.1e-8886.57Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKN+GNK RKQRRFPNK+QKFPN+A QDRPRKLQRF D+R+SLRQGALAKRRSNFQGNQFALATEVAR AAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV

Query:  APIRPRVFNRRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGGRQQRPPWGRARFG
        APIRPR FNRR PNW KTRVEAPPVQRKPF NG FIPK+ AP Q Q NA PRQ+PQTLDSLFANMKEQRLR LSQRQNG AQQRNG RQQRPPWGR R  
Subjt:  APIRPRVFNRRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGGRQQRPPWGRARFG

Query:  N
        N
Subjt:  N

XP_022941743.1 uncharacterized protein LOC111447019 isoform X1 [Cucurbita moschata]1.7e-8987.56Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKN+GNK RKQRRFPNK+QKFPN+A QDRPRKLQRF D+R+SLRQGALAKRRSNFQGNQFALATEVAR AAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV

Query:  APIRPRVFNRRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGGRQQRPPWGRARFG
        APIRPR FNRR PNW KTRVEAPPVQRKPF NG FIPK+ AP Q QTNA PRQ+PQTLDSLFANMKEQRLR LSQRQNG AQQRNG RQQRPPWGR R G
Subjt:  APIRPRVFNRRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGGRQQRPPWGRARFG

Query:  N
        N
Subjt:  N

XP_022982836.1 uncharacterized protein LOC111481570 isoform X1 [Cucurbita maxima]3.7e-8987.56Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV
        MA KPLTTEAIAITEKKMDMALDDIIKMSKN+GNK RKQRRFPNK+QKFPN+A QDRPRKLQRF D+R+SLRQGA AKRRSNFQGNQFALATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV

Query:  APIRPRVFNRRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGGRQQRPPWGRARFG
        APIRPR FNR  PNW KTRVEAPPVQRKPF NG FIPK+AAP Q QTNA PRQKPQTLDSLFANMKEQRLR LSQRQNG AQQRNG RQQRPPWGR R G
Subjt:  APIRPRVFNRRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGGRQQRPPWGRARFG

Query:  N
        N
Subjt:  N

XP_023528268.1 uncharacterized protein LOC111791234 isoform X1 [Cucurbita pepo subsp. pepo]5.4e-8886.57Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKN+G+K RKQRRFPNK+QKFPN+A QDRPRKLQRF D+R+SLRQGALAKRRSNFQGNQFALATEVAR AAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV

Query:  APIRPRVFNRRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGGRQQRPPWGRARFG
        APIRPR FNRR PNW KTRVEAPPVQRKP  NG FIPK+ AP Q QTNA PRQ+PQTLDSLFANMKEQRLR LSQRQNG AQQRNG RQQRPPWGR R G
Subjt:  APIRPRVFNRRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGGRQQRPPWGRARFG

Query:  N
        N
Subjt:  N

TrEMBL top hitse value%identityAlignment
A0A0A0KUX2 Uncharacterized protein5.5e-8687.13Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKN+GNK RKQRR PNK+QKFPN+A QDRPRKLQRF DSRSSLRQGALA RRSNFQGNQF LATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV

Query:  APIRPRVFNRRAPNWNKTRVEA-PPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQN-GAAQQRNGGR-QQRPPWGRA
        APIRPR F RRAPNWNKTRVEA PPV RKPFTNGNF+PKV+AP+QPQTN  PRQ+PQTLDSLFANMKEQRLR LSQRQN G AQQRNGGR QQRPPWG+ 
Subjt:  APIRPRVFNRRAPNWNKTRVEA-PPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQN-GAAQQRNGGR-QQRPPWGRA

Query:  RF
         F
Subjt:  RF

A0A5A7TPS5 Pentatricopeptide repeat (PPR) superfamily protein isoform 22.1e-8585.29Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKN+GNK RKQRR PNK+QKFPN+A QDRPRKLQRF DSRSSLRQGALA RRSNFQGNQFALATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV

Query:  APIRPRVFNRRAPNWNKTRVEA-PPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQN----GAAQQRNGGRQQRPPWG
        APIRPR F RRAPNWNKTRV+A PPV +K FTNGNF+PKV+AP+Q QTNA PRQ+PQTLDSLFANMKEQRLR LSQRQN    GA QQRNGGRQQRPPWG
Subjt:  APIRPRVFNRRAPNWNKTRVEA-PPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQN----GAAQQRNGGRQQRPPWG

Query:  RARF
        +  F
Subjt:  RARF

A0A6J1C7V3 uncharacterized protein LOC1110082182.4e-8685.85Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKN+GNK RKQRR PNK QKFPN+A QDRPRKLQRF DSRSSLRQGALAK+RSNFQGNQF LA EVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV

Query:  APIRPRVFNRRAPNWNKTRVEAPPVQRKPFTNGNFIPKV--AAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQ--NGAAQQRNGGRQQRPPWGR
        APIRPR FNRRAPNW+KTR +APPVQRKPFTNG FIPKV  AA +QPQTN MPRQ+PQTLDSLFANMKEQRLR LSQRQ  NGA Q+  GGRQQRPPWGR
Subjt:  APIRPRVFNRRAPNWNKTRVEAPPVQRKPFTNGNFIPKV--AAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQ--NGAAQQRNGGRQQRPPWGR

Query:  ARFGN
         RFGN
Subjt:  ARFGN

A0A6J1FPB6 uncharacterized protein LOC111447019 isoform X18.1e-9087.56Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV
        MAAKPLTTEAIAITEKKMDMALDDIIKMSKN+GNK RKQRRFPNK+QKFPN+A QDRPRKLQRF D+R+SLRQGALAKRRSNFQGNQFALATEVAR AAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV

Query:  APIRPRVFNRRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGGRQQRPPWGRARFG
        APIRPR FNRR PNW KTRVEAPPVQRKPF NG FIPK+ AP Q QTNA PRQ+PQTLDSLFANMKEQRLR LSQRQNG AQQRNG RQQRPPWGR R G
Subjt:  APIRPRVFNRRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGGRQQRPPWGRARFG

Query:  N
        N
Subjt:  N

A0A6J1J5N5 uncharacterized protein LOC111481570 isoform X11.8e-8987.56Show/hide
Query:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV
        MA KPLTTEAIAITEKKMDMALDDIIKMSKN+GNK RKQRRFPNK+QKFPN+A QDRPRKLQRF D+R+SLRQGA AKRRSNFQGNQFALATEVARKAAV
Subjt:  MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAV

Query:  APIRPRVFNRRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGGRQQRPPWGRARFG
        APIRPR FNR  PNW KTRVEAPPVQRKPF NG FIPK+AAP Q QTNA PRQKPQTLDSLFANMKEQRLR LSQRQNG AQQRNG RQQRPPWGR R G
Subjt:  APIRPRVFNRRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGGRQQRPPWGRARFG

Query:  N
        N
Subjt:  N

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G10970.1 unknown protein2.9e-3148.37Show/hide
Query:  KPLTTEAIAITEKKMDMALDDIIKMSKNSGNKAR-KQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAVAP
        KP+TTE +A+TEKKMDM+LD+IIKM K++ N  + K++R  NK +KF + AA++   K QR+ DSRS +RQGA AK+RSNFQGNQF + T VARKAA A 
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSKNSGNKAR-KQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAVAP

Query:  IRPRVFN-RRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPS-----QPQTN---AMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGG---RQQ
         R R +N  R  N N++R  APP Q +    G F+ K          Q Q N      RQ PQTLDS FANMKE+R+R      N +    NG    +QQ
Subjt:  IRPRVFN-RRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPS-----QPQTN---AMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGG---RQQ

Query:  RP--PWGR--ARFGN
        R   PW R   RF N
Subjt:  RP--PWGR--ARFGN

AT4G10970.2 unknown protein2.9e-3148.37Show/hide
Query:  KPLTTEAIAITEKKMDMALDDIIKMSKNSGNKAR-KQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAVAP
        KP+TTE +A+TEKKMDM+LD+IIKM K++ N  + K++R  NK +KF + AA++   K QR+ DSRS +RQGA AK+RSNFQGNQF + T VARKAA A 
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSKNSGNKAR-KQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAVAP

Query:  IRPRVFN-RRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPS-----QPQTN---AMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGG---RQQ
         R R +N  R  N N++R  APP Q +    G F+ K          Q Q N      RQ PQTLDS FANMKE+R+R      N +    NG    +QQ
Subjt:  IRPRVFN-RRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPS-----QPQTN---AMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGG---RQQ

Query:  RP--PWGR--ARFGN
        R   PW R   RF N
Subjt:  RP--PWGR--ARFGN

AT4G10970.3 unknown protein2.9e-3148.37Show/hide
Query:  KPLTTEAIAITEKKMDMALDDIIKMSKNSGNKAR-KQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAVAP
        KP+TTE +A+TEKKMDM+LD+IIKM K++ N  + K++R  NK +KF + AA++   K QR+ DSRS +RQGA AK+RSNFQGNQF + T VARKAA A 
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSKNSGNKAR-KQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAVAP

Query:  IRPRVFN-RRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPS-----QPQTN---AMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGG---RQQ
         R R +N  R  N N++R  APP Q +    G F+ K          Q Q N      RQ PQTLDS FANMKE+R+R      N +    NG    +QQ
Subjt:  IRPRVFN-RRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPS-----QPQTN---AMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGG---RQQ

Query:  RP--PWGR--ARFGN
        R   PW R   RF N
Subjt:  RP--PWGR--ARFGN

AT4G10970.4 unknown protein2.9e-3148.37Show/hide
Query:  KPLTTEAIAITEKKMDMALDDIIKMSKNSGNKAR-KQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAVAP
        KP+TTE +A+TEKKMDM+LD+IIKM K++ N  + K++R  NK +KF + AA++   K QR+ DSRS +RQGA AK+RSNFQGNQF + T VARKAA A 
Subjt:  KPLTTEAIAITEKKMDMALDDIIKMSKNSGNKAR-KQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAVAP

Query:  IRPRVFN-RRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPS-----QPQTN---AMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGG---RQQ
         R R +N  R  N N++R  APP Q +    G F+ K          Q Q N      RQ PQTLDS FANMKE+R+R      N +    NG    +QQ
Subjt:  IRPRVFN-RRAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPS-----QPQTN---AMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGG---RQQ

Query:  RP--PWGR--ARFGN
        R   PW R   RF N
Subjt:  RP--PWGR--ARFGN

AT4G10970.5 unknown protein4.2e-2243.93Show/hide
Query:  MDMALDDIIKMSKNSGNKAR-KQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAVAPIRPRVFN-RRAPN-
        MDM+LD+IIKM K++ N  + K++R  NK +KF + AA++   K QR+ DSRS +RQGA AK+RSNFQGNQF + T VARKAA A  R R +N  R  N 
Subjt:  MDMALDDIIKMSKNSGNKAR-KQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAVAPIRPRVFN-RRAPN-

Query:  ------------WNKTRVEAPPVQRKPFTNGNFIPKVAAPS-----QPQTN---AMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGG---RQQR
                    W   R  APP Q +    G F+ K          Q Q N      RQ PQTLDS FANMKE+R+R      N +    NG    +QQR
Subjt:  ------------WNKTRVEAPPVQRKPFTNGNFIPKVAAPS-----QPQTN---AMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGG---RQQR

Query:  P--PWGR--ARFGN
           PW R   RF N
Subjt:  P--PWGR--ARFGN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCTAAACCACTTACTACTGAGGCAATTGCCATAACTGAGAAGAAGATGGACATGGCCTTAGATGACATTATCAAAATGTCCAAAAATTCTGGGAATAAAGCCAG
GAAGCAAAGAAGGTTTCCGAACAAAGTGCAGAAATTCCCAAATCATGCTGCTCAAGATAGACCTAGGAAGTTGCAGCGTTTCACGGACTCAAGATCTTCCCTAAGACAGG
GGGCTTTGGCTAAAAGAAGGTCAAATTTTCAAGGGAATCAGTTTGCTTTGGCAACTGAGGTTGCAAGAAAGGCTGCAGTTGCTCCAATTCGGCCTAGAGTTTTTAACCGT
AGGGCACCCAATTGGAATAAGACAAGGGTTGAAGCTCCACCCGTTCAGAGGAAGCCATTTACTAATGGAAACTTCATTCCCAAGGTAGCTGCACCATCCCAACCACAAAC
AAATGCTATGCCGAGACAGAAGCCACAGACGCTCGACTCACTGTTTGCCAACATGAAGGAACAGAGGCTGAGGGCTTTGTCGCAGCGACAAAATGGTGCTGCACAGCAAC
GGAATGGTGGTCGCCAGCAAAGACCTCCCTGGGGAAGAGCCCGTTTTGGTAACTGA
mRNA sequenceShow/hide mRNA sequence
AATGAAACCTCCTTAATATTACGCGAACTCTCCACTGAAACGACGCCGTTGCTTCCCCTATATATACCCCCCTCCTTTGCCCTATTTTTCTACCCGAGTCTCATCTTCCT
TCGGCGGTAAGCAATTGACTCTCCCGCGAAGCTCAGGAATTCTTCCTCTCTTTCGTTTAGGATCTTCGGAAGTTCTCTGCCCTAATCTTTCACTGGTGTTTCTCCGATCG
TCTTATAGACCGAAGAATTTTCTTCTTGTCAAGTGATGGCGGCTAAACCACTTACTACTGAGGCAATTGCCATAACTGAGAAGAAGATGGACATGGCCTTAGATGACATT
ATCAAAATGTCCAAAAATTCTGGGAATAAAGCCAGGAAGCAAAGAAGGTTTCCGAACAAAGTGCAGAAATTCCCAAATCATGCTGCTCAAGATAGACCTAGGAAGTTGCA
GCGTTTCACGGACTCAAGATCTTCCCTAAGACAGGGGGCTTTGGCTAAAAGAAGGTCAAATTTTCAAGGGAATCAGTTTGCTTTGGCAACTGAGGTTGCAAGAAAGGCTG
CAGTTGCTCCAATTCGGCCTAGAGTTTTTAACCGTAGGGCACCCAATTGGAATAAGACAAGGGTTGAAGCTCCACCCGTTCAGAGGAAGCCATTTACTAATGGAAACTTC
ATTCCCAAGGTAGCTGCACCATCCCAACCACAAACAAATGCTATGCCGAGACAGAAGCCACAGACGCTCGACTCACTGTTTGCCAACATGAAGGAACAGAGGCTGAGGGC
TTTGTCGCAGCGACAAAATGGTGCTGCACAGCAACGGAATGGTGGTCGCCAGCAAAGACCTCCCTGGGGAAGAGCCCGTTTTGGTAACTGAAGGATACACGCAAAGAAAC
TTCGTGCGATGGATGAGTATTAGTGTGGGAAATGTAGATGTTGCCTGCTTGTCCACGGTGCCCGATAGCTGATAGGGGATTAAAAGGTTTCTTTTTGTCATTTTTTTTAA
CCCAAACCTTTGGCTTGTTGTTTTAGCATTTTTCAAGCTTCATTAAGGACCGAAAACTATAGATTTGTATTTTGTACCCGTTCATGTAAAGGTTCTTACTTTTTTCTCCT
TTTTAGTTCACTTTCTTCCCTCGTGACCTAATTTCTCAACTTGTTATGTTAAATCCCATTTACACTGCCGGCCCCTCGGCAGCTCTGCTTGTTCTTTGACTTGTACATAT
TGTAGCGAATAAATACCAGTTGATGTGCCGTTTGATTGCTTACAACATATAATTTATATTTCA
Protein sequenceShow/hide protein sequence
MAAKPLTTEAIAITEKKMDMALDDIIKMSKNSGNKARKQRRFPNKVQKFPNHAAQDRPRKLQRFTDSRSSLRQGALAKRRSNFQGNQFALATEVARKAAVAPIRPRVFNR
RAPNWNKTRVEAPPVQRKPFTNGNFIPKVAAPSQPQTNAMPRQKPQTLDSLFANMKEQRLRALSQRQNGAAQQRNGGRQQRPPWGRARFGN