CuGenDBv2

Spg005284 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg005284
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Pentatricopeptide repeat (PPR) superfamily protein
Genome location: scaffold11:34852313..34861152
RNA-Seq Expression: Spg005284
Synteny: Spg005284
Gene Ontology terms: NA
InterPro domains: IPR037119 - Haem oxygenase HugZ-like superfamily


Homology
GenBank top hits
XP_004152092.1 uncharacterized protein At3g49140 isoform X2 [Cucumis sativus] | e-value: 4.6e-64 | % identity: 62.39
Query:  GLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWS
        GLEGET + ESK DRSSQRS+LYRLEIMRIELFSVYGVQSE+SLQDFQDAEPDIL+HSTAEI+ERF+EKGI+CNIALKALCKKRGLHVE           
Subjt:  GLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWS

Query:  TTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS
                                                                         DAILIGVDSLGMDVRVC GTEVRTFRFPFKIRATS
Subjt:  TTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS

Query:  EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        E AAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
Subjt:  EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

XP_022137915.1 uncharacterized protein At3g49140 isoform X1 [Momordica charantia] | e-value: 1.1e-65 | % identity: 63.68
Query:  GLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWS
        GL+GE LSFESKSD+SSQRS+LYRLEIMRIELFSVYGVQ+EISLQDFQ+AEPDILVHSTAEIVE FSEKGIRCNIALKALCKKRGLHVE           
Subjt:  GLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWS

Query:  TTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS
                                                                         DAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS
Subjt:  TTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS

Query:  EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        EVAAEKQIQQLLFPRSRRKKLRSHGDG RD+VSF
Subjt:  EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

XP_031740660.1 uncharacterized protein At3g49140 isoform X1 [Cucumis sativus] | e-value: 4.6e-64 | % identity: 62.39
Query:  GLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWS
        GLEGET + ESK DRSSQRS+LYRLEIMRIELFSVYGVQSE+SLQDFQDAEPDIL+HSTAEI+ERF+EKGI+CNIALKALCKKRGLHVE           
Subjt:  GLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWS

Query:  TTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS
                                                                         DAILIGVDSLGMDVRVC GTEVRTFRFPFKIRATS
Subjt:  TTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS

Query:  EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        E AAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
Subjt:  EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

XP_038898170.1 uncharacterized protein At3g49140 isoform X1 [Benincasa hispida] | e-value: 8.1e-69 | % identity: 64.73
Query:  VISSLTPGLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNN
        VISSLT GLEGETLS ESK DRSSQRS+LYRLEIMRIELFSVYGVQSE+SLQDFQ AEPDIL+HSTAEI+ERFSEKGIRCNIALKALCKKRGLHVE    
Subjt:  VISSLTPGLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNN

Query:  WYQSQWSTTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFP
                                                                                DAILIGVDSLGMDVRVCFGTEV+TFRFP
Subjt:  WYQSQWSTTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFP

Query:  FKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        FKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
Subjt:  FKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

XP_038898179.1 uncharacterized protein At3g49140 isoform X2 [Benincasa hispida] | e-value: 4.9e-66 | % identity: 64.1
Query:  GLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWS
        GLEGETLS ESK DRSSQRS+LYRLEIMRIELFSVYGVQSE+SLQDFQ AEPDIL+HSTAEI+ERFSEKGIRCNIALKALCKKRGLHVE           
Subjt:  GLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWS

Query:  TTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS
                                                                         DAILIGVDSLGMDVRVCFGTEV+TFRFPFKIRATS
Subjt:  TTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS

Query:  EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
Subjt:  EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

TrEMBL top hits
A0A0A0KW72 Uncharacterized protein | e-value: 2.2e-64 | % identity: 62.39
Query:  GLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWS
        GLEGET + ESK DRSSQRS+LYRLEIMRIELFSVYGVQSE+SLQDFQDAEPDIL+HSTAEI+ERF+EKGI+CNIALKALCKKRGLHVE           
Subjt:  GLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWS

Query:  TTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS
                                                                         DAILIGVDSLGMDVRVC GTEVRTFRFPFKIRATS
Subjt:  TTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS

Query:  EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        E AAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
Subjt:  EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

A0A1S3BY92 uncharacterized protein At3g49140 isoform X1 | e-value: 2.1e-62 | % identity: 60.68
Query:  GLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWS
        GLEGET + E K DRSSQRS+LYRLEI+RIELFSVYGVQSE+SLQDFQDAEPDIL+HST +I+ERF+EKGI+CNIALKALCKKRGLHVE           
Subjt:  GLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWS

Query:  TTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS
                                                                         DAILIGVDSLG+DVRVCFGTEVRTFRFPFKIRATS
Subjt:  TTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS

Query:  EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        EVAAEKQIQQLLFPRSRRKKLRS+GDGLRDTVSF
Subjt:  EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

A0A5A7TTC0 Pentatricopeptide repeat (PPR) superfamily protein isoform 2 | e-value: 2.1e-62 | % identity: 60.68
Query:  GLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWS
        GLEGET + E K DRSSQRS+LYRLEI+RIELFSVYGVQSE+SLQDFQDAEPDIL+HST +I+ERF+EKGI+CNIALKALCKKRGLHVE           
Subjt:  GLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWS

Query:  TTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS
                                                                         DAILIGVDSLG+DVRVCFGTEVRTFRFPFKIRATS
Subjt:  TTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS

Query:  EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        EVAAEKQIQQLLFPRSRRKKLRS+GDGLRDTVSF
Subjt:  EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

A0A5D3CYH3 Pentatricopeptide repeat (PPR) superfamily protein isoform 2 | e-value: 2.1e-62 | % identity: 60.68
Query:  GLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWS
        GLEGET + E K DRSSQRS+LYRLEI+RIELFSVYGVQSE+SLQDFQDAEPDIL+HST +I+ERF+EKGI+CNIALKALCKKRGLHVE           
Subjt:  GLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWS

Query:  TTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS
                                                                         DAILIGVDSLG+DVRVCFGTEVRTFRFPFKIRATS
Subjt:  TTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS

Query:  EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        EVAAEKQIQQLLFPRSRRKKLRS+GDGLRDTVSF
Subjt:  EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

A0A6J1C800 uncharacterized protein At3g49140 isoform X1 | e-value: 5.3e-66 | % identity: 63.68
Query:  GLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWS
        GL+GE LSFESKSD+SSQRS+LYRLEIMRIELFSVYGVQ+EISLQDFQ+AEPDILVHSTAEIVE FSEKGIRCNIALKALCKKRGLHVE           
Subjt:  GLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWS

Query:  TTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS
                                                                         DAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS
Subjt:  TTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATS

Query:  EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        EVAAEKQIQQLLFPRSRRKKLRSHGDG RD+VSF
Subjt:  EVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

SwissProt top hits
Q0WMN5 Uncharacterized protein At3g49140 | e-value: 3.1e-15 | % identity: 27.78
Query:  GETLSFESKSDRSSQR-SSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWSTT
        G+    +S  D  ++   + Y+LE++RI+L +  G Q+E+ ++D + A+PD + H++AEI+ R  E G +   ALK+LC +   H  I            
Subjt:  GETLSFESKSDRSSQR-SSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWSTT

Query:  PERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEV
                    Q  EV                                                 LIG+DSLG D+R+C G ++ + RF F  RATSE 
Subjt:  PERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEV

Query:  AAEKQIQQLLFPRSRR
         AE QI++LLFP++ +
Subjt:  AAEKQIQQLLFPRSRR

Arabidopsis top hits
AT3G49140.1 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 2.2e-16 | % identity: 27.78
Query:  GETLSFESKSDRSSQR-SSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWSTT
        G+    +S  D  ++   + Y+LE++RI+L +  G Q+E+ ++D + A+PD + H++AEI+ R  E G +   ALK+LC +   H  I            
Subjt:  GETLSFESKSDRSSQR-SSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWSTT

Query:  PERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEV
                    Q  EV                                                 LIG+DSLG D+R+C G ++ + RF F  RATSE 
Subjt:  PERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEV

Query:  AAEKQIQQLLFPRSRR
         AE QI++LLFP++ +
Subjt:  AAEKQIQQLLFPRSRR

AT3G59300.1 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 2.9e-40 | % identity: 44.55
Query:  SKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWSTTPERSGGKG
        S+ D +   SSLYRLEI+ IEL S+YG +S ISLQDFQDAEPDILVHST+ I+ERF+ +GI  +IALKALCKK+GLH E                     
Subjt:  SKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWSTTPERSGGKG

Query:  ADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQ
                                                               +A LI VDSLGMDVRV  G +V+T RFPFK RAT+E+AAEK+I Q
Subjt:  ADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQ

Query:  LLFPRSRRKKLRSHGDGLRD
        LLFPRSRR+KL+ H + L+D
Subjt:  LLFPRSRRKKLRSHGDGLRD

AT5G24060.1 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 3.0e-13 | % identity: 26.39
Query:  GETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWSTTP
        GE  S     + S      Y+LEI+RI+L +  G Q+E+ ++D + A+PD++  ++  I+ R  E G +   AL++LC +        NN  Q++     
Subjt:  GETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWSTTP

Query:  ERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVA
                                                                      +  LIG+DSLG D+R+C G ++ T RF F IRATSE  
Subjt:  ERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVA

Query:  AEKQIQQLLFPRSRRK
        AE Q+++LLF  +  K
Subjt:  AEKQIQQLLFPRSRRK

AT5G24060.2 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 3.0e-13 | % identity: 26.39
Query:  GETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWSTTP
        GE  S     + S      Y+LEI+RI+L +  G Q+E+ ++D + A+PD++  ++  I+ R  E G +   AL++LC +        NN  Q++     
Subjt:  GETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEIPNNWYQSQWSTTP

Query:  ERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVA
                                                                      +  LIG+DSLG D+R+C G ++ T RF F IRATSE  
Subjt:  ERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVA

Query:  AEKQIQQLLFPRSRRK
        AE Q+++LLF  +  K
Subjt:  AEKQIQQLLFPRSRRK


Sequences
CDS sequence
ATGAAGACGAGGCGATGGCGGCAGCGACTTAAACCAGACGGCCCAGCTCCAACTTCGACGGTGGTGAAAACCCAGAGACGTGTGTATCGGTTTGACTTCGGCGAAAACCC
ACGAGAGGACGAGCGCGATGGCGGCGGCGACTTAAACCAGACGTTCGACTTCGACGGCGGCGTTCGACTTCGACGGCGGCGTTCGACTTCGACGGCAGCGTTCGACTTCG
ACTTCGACAGCGGCGTTCGACTTCGACTTCGACGGCGGCGTTCGACTTCGGTGAAAACCCACGAGAGGAGAGGCGTTTTTGTAGTCAGAGCCAGCCAGTTTTTCATTAAA
CAGGCATGCGAGTCTTCAACTGCAGAAAAGTCGCTTTCTGCAATTGCTGCCGAAATGGGGCAGCTAAATAATGAAATTCAAGAGCACCGTAGGGTGCTAAACTATCTTTT
TAGGAGTGTGAGAACAATCGATCCCGCTCGGAAAGAAGCGCGCATTCGCGCTTTCAGACAACGGGTCGAGGATATGGAGGGGAGGCAGCAAGCACTCATAGTGCAGGCTA
TTGAATTCTCCTTTACCAATCTATCTAAGCTAAAGGTGTCTCTTGACCTTTCAAAAGAAAAGAGTTCCTCCATCATTGATCATTCAAAGTCTAATTTAGATTCTGTAGAG
GAATCTCGTGTTCGTCTTTTGACTCAATCCACAAATCAATTGGAGTCTCCCACGAAAGCTTCTAAGCACTATAGGCTTTTAGTATTGTTGGAGTTCTTCCCAGTCATCTC
GTCTTTAACCCCAGGTTTAGAAGGTGAAACCTTGAGCTTTGAGTCCAAAAGTGATAGAAGCAGCCAAAGATCCTCTCTCTACAGGTTGGAGATAATGAGAATTGAGCTCT
TCTCTGTGTATGGAGTTCAGTCTGAAATTAGTTTGCAAGATTTTCAAGATGCTGAACCTGATATCCTTGTGCACTCTACTGCGGAAATTGTAGAGCGTTTCAGTGAGAAG
GGTATTAGGTGCAATATTGCCCTTAAAGCTCTTTGCAAAAAGAGGGGTCTTCATGTTGAGATACCTAACAATTGGTATCAGAGCCAGTGGTCGACGACTCCAGAGAGGAG
TGGCGGCAAAGGGGCTGATTGGAACCAGCTGAGTGAAGTGTCGCCGTGCACAGCCACCGAGACGTCAGCTTTCCGAGGAGGGGCAATTGTTAGGGACCAAGGCAGTATGA
GATGCCTTTGTCCCACATTGGAAAGATATGGGCTCCTAATGGGCTCCTCACCCCTCTGCCCTCAGGCTGATGCCATTTTGATCGGAGTCGATAGTCTTGGCATGGATGTG
AGGGTATGTTTTGGAACAGAAGTACGGACGTTCCGATTTCCCTTTAAAATCCGGGCAACATCTGAAGTTGCAGCAGAGAAGCAGATTCAGCAACTCTTGTTCCCACGATC
TCGTCGTAAAAAATTACGAAGTCATGGGGATGGATTGAGAGATACTGTGAGTTTTTAG
mRNA sequence
ATGAAGACGAGGCGATGGCGGCAGCGACTTAAACCAGACGGCCCAGCTCCAACTTCGACGGTGGTGAAAACCCAGAGACGTGTGTATCGGTTTGACTTCGGCGAAAACCC
ACGAGAGGACGAGCGCGATGGCGGCGGCGACTTAAACCAGACGTTCGACTTCGACGGCGGCGTTCGACTTCGACGGCGGCGTTCGACTTCGACGGCAGCGTTCGACTTCG
ACTTCGACAGCGGCGTTCGACTTCGACTTCGACGGCGGCGTTCGACTTCGGTGAAAACCCACGAGAGGAGAGGCGTTTTTGTAGTCAGAGCCAGCCAGTTTTTCATTAAA
CAGGCATGCGAGTCTTCAACTGCAGAAAAGTCGCTTTCTGCAATTGCTGCCGAAATGGGGCAGCTAAATAATGAAATTCAAGAGCACCGTAGGGTGCTAAACTATCTTTT
TAGGAGTGTGAGAACAATCGATCCCGCTCGGAAAGAAGCGCGCATTCGCGCTTTCAGACAACGGGTCGAGGATATGGAGGGGAGGCAGCAAGCACTCATAGTGCAGGCTA
TTGAATTCTCCTTTACCAATCTATCTAAGCTAAAGGTGTCTCTTGACCTTTCAAAAGAAAAGAGTTCCTCCATCATTGATCATTCAAAGTCTAATTTAGATTCTGTAGAG
GAATCTCGTGTTCGTCTTTTGACTCAATCCACAAATCAATTGGAGTCTCCCACGAAAGCTTCTAAGCACTATAGGCTTTTAGTATTGTTGGAGTTCTTCCCAGTCATCTC
GTCTTTAACCCCAGGTTTAGAAGGTGAAACCTTGAGCTTTGAGTCCAAAAGTGATAGAAGCAGCCAAAGATCCTCTCTCTACAGGTTGGAGATAATGAGAATTGAGCTCT
TCTCTGTGTATGGAGTTCAGTCTGAAATTAGTTTGCAAGATTTTCAAGATGCTGAACCTGATATCCTTGTGCACTCTACTGCGGAAATTGTAGAGCGTTTCAGTGAGAAG
GGTATTAGGTGCAATATTGCCCTTAAAGCTCTTTGCAAAAAGAGGGGTCTTCATGTTGAGATACCTAACAATTGGTATCAGAGCCAGTGGTCGACGACTCCAGAGAGGAG
TGGCGGCAAAGGGGCTGATTGGAACCAGCTGAGTGAAGTGTCGCCGTGCACAGCCACCGAGACGTCAGCTTTCCGAGGAGGGGCAATTGTTAGGGACCAAGGCAGTATGA
GATGCCTTTGTCCCACATTGGAAAGATATGGGCTCCTAATGGGCTCCTCACCCCTCTGCCCTCAGGCTGATGCCATTTTGATCGGAGTCGATAGTCTTGGCATGGATGTG
AGGGTATGTTTTGGAACAGAAGTACGGACGTTCCGATTTCCCTTTAAAATCCGGGCAACATCTGAAGTTGCAGCAGAGAAGCAGATTCAGCAACTCTTGTTCCCACGATC
TCGTCGTAAAAAATTACGAAGTCATGGGGATGGATTGAGAGATACTGTGAGTTTTTAG
Protein sequence
MKTRRWRQRLKPDGPAPTSTVVKTQRRVYRFDFGENPREDERDGGGDLNQTFDFDGGVRLRRRRSTSTAAFDFDFDSGVRLRLRRRRSTSVKTHERRGVFVVRASQFFIK
QACESSTAEKSLSAIAAEMGQLNNEIQEHRRVLNYLFRSVRTIDPARKEARIRAFRQRVEDMEGRQQALIVQAIEFSFTNLSKLKVSLDLSKEKSSSIIDHSKSNLDSVE
ESRVRLLTQSTNQLESPTKASKHYRLLVLLEFFPVISSLTPGLEGETLSFESKSDRSSQRSSLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEK
GIRCNIALKALCKKRGLHVEIPNNWYQSQWSTTPERSGGKGADWNQLSEVSPCTATETSAFRGGAIVRDQGSMRCLCPTLERYGLLMGSSPLCPQADAILIGVDSLGMDV
RVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF