; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg015802 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg015802
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptiontRNA_edit domain-containing protein
Genome locationscaffold6:44459065..44467201
RNA-Seq ExpressionSpg015802
SyntenySpg015802
Gene Ontology termsGO:0106074 - aminoacyl-tRNA metabolism involved in translational fidelity (biological process)
GO:0002161 - aminoacyl-tRNA editing activity (molecular function)
GO:0004812 - aminoacyl-tRNA ligase activity (molecular function)
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain
IPR036754 - YbaK/aminoacyl-tRNA synthetase-associated domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041095.1 YbaK/aminoacyl-tRNA synthetase-associated domain isoform 2 [Cucumis melo var. makuwa]7.5e-9183.96Show/hide
Query:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV
        MG+E+EAMAE+NSMEALAKLEHRQTLIL+RIS LEL YLPNLDS PS VP +GAD DTVARLSAILRTNAVNDFSFKRVPSDYYDW LEARRDVL+AAS+
Subjt:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV

Query:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG
        DHLCKSIVL             VNTQAPS+VVDCSDRNNSKYY+VVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG
Subjt:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG

Query:  MKTDIPGRGAIS
        MKTDIP   ++S
Subjt:  MKTDIPGRGAIS

TYK12014.1 YbaK/aminoacyl-tRNA synthetase-associated domain isoform 2 [Cucumis melo var. makuwa]1.3e-9085.92Show/hide
Query:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV
        MG+E+EAMAE+NSMEALAKLEHRQTLIL+RIS LEL YLPNLDS PS VP +GAD DTVARLSAILRTNAVNDFSFKRVPSDYYDW LEARRDVL+AAS+
Subjt:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV

Query:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG
        DHLCKSIVL             VNTQAPS+VVDCSDRNNSKYY+VVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG
Subjt:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG

Query:  MKTDIP
        MKTDIP
Subjt:  MKTDIP

XP_008448863.1 PREDICTED: uncharacterized protein LOC103490895 [Cucumis melo]3.7e-9085.44Show/hide
Query:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV
        MG+E+EAMAE+NSMEALAKLEHRQTLIL+RIS LEL YLPNLDS PS VP +GAD DTVARLSAILRTNAVNDFSFKRVPSDYYDW LEARRDVL+AAS+
Subjt:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV

Query:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG
        DHLCKSIVL             VNTQAPS+VVDCSDRNNSKYY+VVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETS KLTGYEHNGVTCIG
Subjt:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG

Query:  MKTDIP
        MKTDIP
Subjt:  MKTDIP

XP_008448863.1 PREDICTED: uncharacterized protein LOC103490895 [Cucumis melo]7.2e-1784.31Show/hide
Query:  IREELKVILDEAIVKLNPDYFWLGGGEIDLKLGIRTSEFVNFVKPFIVKCS
        ++ ++ VILDEAIVKLNPDYFWLGGGEIDLKLGIRTSEF+NFVKPFI+KCS
Subjt:  IREELKVILDEAIVKLNPDYFWLGGGEIDLKLGIRTSEFVNFVKPFIVKCS

XP_008448863.1 PREDICTED: uncharacterized protein LOC103490895 [Cucumis melo]1.1e-8985.44Show/hide
Query:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV
        MGTE+EAMAE+N MEALAKLEHRQTLIL+RISKLELAYLPNLDSA S VP +G D DTVARLSAILRTN+VNDFSFKRVPSDYYDW LEARRDVL+AASV
Subjt:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV

Query:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG
        DHLCKSIVL             VNTQAPS+VVDCSDRNNSKYY+VVVQYTAKFNAE VKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG
Subjt:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG

Query:  MKTDIP
        M+TDIP
Subjt:  MKTDIP

XP_022951583.1 uncharacterized protein LOC111454353 [Cucurbita moschata]1.0e-1578.43Show/hide
Query:  IREELKVILDEAIVKLNPDYFWLGGGEIDLKLGIRTSEFVNFVKPFIVKCS
        ++ ++ VILDEAI KL+PDYFWLGGGE+DLKLGIRTSEF+NFVKPFI+KCS
Subjt:  IREELKVILDEAIVKLNPDYFWLGGGEIDLKLGIRTSEFVNFVKPFIVKCS

XP_038903466.1 uncharacterized protein LOC120090048 [Benincasa hispida]3.2e-1786.27Show/hide
Query:  IREELKVILDEAIVKLNPDYFWLGGGEIDLKLGIRTSEFVNFVKPFIVKCS
        +R ++ VILDEAIVKLNPDYFWLGGGEIDLKLGIRTSEF+NFVKPFI+KCS
Subjt:  IREELKVILDEAIVKLNPDYFWLGGGEIDLKLGIRTSEFVNFVKPFIVKCS

XP_038903466.1 uncharacterized protein LOC120090048 [Benincasa hispida]7.3e-8682.52Show/hide
Query:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV
        MGTE  AMAERNSMEALAKLE+RQTLI+NRISKLELAYLPNLDS PSP+   GAD DTVARLS ILRTNAVNDFSFKRVPSDYYDW+LEARRDVLNAASV
Subjt:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV

Query:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG
        DHLCKSIVL             VNTQA S+VVDCS+RNNSKYYVVVVQY+AKFNAE V+SFLYSLNDGKI KKKFNLRLAPEE SA+LTGY HN VTCIG
Subjt:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG

Query:  MKTDIP
        MKTDIP
Subjt:  MKTDIP

TrEMBL top hitse value%identityAlignment
A0A1S3BLB4 uncharacterized protein LOC1034908951.8e-9085.44Show/hide
Query:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV
        MG+E+EAMAE+NSMEALAKLEHRQTLIL+RIS LEL YLPNLDS PS VP +GAD DTVARLSAILRTNAVNDFSFKRVPSDYYDW LEARRDVL+AAS+
Subjt:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV

Query:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG
        DHLCKSIVL             VNTQAPS+VVDCSDRNNSKYY+VVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETS KLTGYEHNGVTCIG
Subjt:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG

Query:  MKTDIP
        MKTDIP
Subjt:  MKTDIP

A0A1S3BLB4 uncharacterized protein LOC1034908953.5e-1784.31Show/hide
Query:  IREELKVILDEAIVKLNPDYFWLGGGEIDLKLGIRTSEFVNFVKPFIVKCS
        ++ ++ VILDEAIVKLNPDYFWLGGGEIDLKLGIRTSEF+NFVKPFI+KCS
Subjt:  IREELKVILDEAIVKLNPDYFWLGGGEIDLKLGIRTSEFVNFVKPFIVKCS

A0A1S3BLB4 uncharacterized protein LOC1034908953.5e-8682.52Show/hide
Query:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV
        MGTE  AMAERNSMEALAKLE+RQTLI+NRISKLELAYLPNLDS PSP+   GAD DTVARLS ILRTNAVNDFSFKRVPSDYYDW+LEARRDVLNAASV
Subjt:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV

Query:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG
        DHLCKSIVL             VNTQA S+VVDCS+RNNSKYYVVVVQY+AKFNAE V+SFLYSLNDGKI KKKFNLRLAPEE SA+LTGY HN VTCIG
Subjt:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG

Query:  MKTDIP
        MKTDIP
Subjt:  MKTDIP

A0A5A7TI57 YbaK/aminoacyl-tRNA synthetase-associated domain isoform 23.6e-9183.96Show/hide
Query:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV
        MG+E+EAMAE+NSMEALAKLEHRQTLIL+RIS LEL YLPNLDS PS VP +GAD DTVARLSAILRTNAVNDFSFKRVPSDYYDW LEARRDVL+AAS+
Subjt:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV

Query:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG
        DHLCKSIVL             VNTQAPS+VVDCSDRNNSKYY+VVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG
Subjt:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG

Query:  MKTDIPGRGAIS
        MKTDIP   ++S
Subjt:  MKTDIPGRGAIS

A0A5D3CLG6 YbaK/aminoacyl-tRNA synthetase-associated domain isoform 26.2e-9185.92Show/hide
Query:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV
        MG+E+EAMAE+NSMEALAKLEHRQTLIL+RIS LEL YLPNLDS PS VP +GAD DTVARLSAILRTNAVNDFSFKRVPSDYYDW LEARRDVL+AAS+
Subjt:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV

Query:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG
        DHLCKSIVL             VNTQAPS+VVDCSDRNNSKYY+VVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG
Subjt:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG

Query:  MKTDIP
        MKTDIP
Subjt:  MKTDIP

A0A6J1GI39 uncharacterized protein LOC1114543535.0e-1678.43Show/hide
Query:  IREELKVILDEAIVKLNPDYFWLGGGEIDLKLGIRTSEFVNFVKPFIVKCS
        ++ ++ VILDEAI KL+PDYFWLGGGE+DLKLGIRTSEF+NFVKPFI+KCS
Subjt:  IREELKVILDEAIVKLNPDYFWLGGGEIDLKLGIRTSEFVNFVKPFIVKCS

A0A6J1GI39 uncharacterized protein LOC1114543535.6e-8480.58Show/hide
Query:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV
        MGTE  A+AERNSMEALAKLE+RQTLI+NRISKLELAYLPNLDS PSP+  + AD DTV+RLS ILRTNAVNDFSF RVPSDYYDW+LEARRDVLNAASV
Subjt:  MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASV

Query:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG
        DHLCKSIVL             VNTQA S+VVDCS+RNNSKYYVVVVQY+AKFNAE V+SFLYSLNDGKI KKKFNLRLAPEE SA+LTGY HN VTCIG
Subjt:  DHLCKSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIG

Query:  MKTDIP
        MKTDIP
Subjt:  MKTDIP

A0A6J1KXM4 uncharacterized protein LOC1114984835.0e-1678.43Show/hide
Query:  IREELKVILDEAIVKLNPDYFWLGGGEIDLKLGIRTSEFVNFVKPFIVKCS
        ++ ++ VILDEAI KL+PDYFWLGGGE+DLKLGIRTSEF+NFVKPFI+KCS
Subjt:  IREELKVILDEAIVKLNPDYFWLGGGEIDLKLGIRTSEFVNFVKPFIVKCS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G45063.1 copper ion binding;electron carriers4.2e-0731.37Show/hide
Query:  KFKYWTLS-PSGCRLCIKDGEDIDHIFLHCEFARKAWAFSARLLGITFCMPRRVED---WLPDGLNAWNMKKKAKVIVGCAFRATMWLLWKERNARTFED
        +F  W +  PS C LC    E   H+F  C F+ + W+F      +T   PR  +D   WL          KK   I+  A++A+++ +W+ERN R   +
Subjt:  KFKYWTLS-PSGCRLCIKDGEDIDHIFLHCEFARKAWAFSARLLGITFCMPRRVED---WLPDGLNAWNMKKKAKVIVGCAFRATMWLLWKERNARTFED

Query:  KS
        KS
Subjt:  KS

AT4G05095.1 BEST Arabidopsis thaliana protein match is: RNA-directed DNA polymerase (reverse transcriptase)-related family protein (TAIR:AT4G04650.1)6.7e-0530.93Show/hide
Query:  WTLS-PSGCRLCIKDGEDIDHIFLHCEFARKAWAFSARLLGITFCMPRRVEDWLPDGLNAWNMKKKAKVIVGCAFRATMWLLWKERNARTFEDKSNS
        W LS PS C LC  + E  +H+F  C  +   W        +T   P    D L   L+  +      +I+   F+A+++ LWKERN R  +  S S
Subjt:  WTLS-PSGCRLCIKDGEDIDHIFLHCEFARKAWAFSARLLGITFCMPRRVEDWLPDGLNAWNMKKKAKVIVGCAFRATMWLLWKERNARTFEDKSNS

AT4G16510.1 YbaK/aminoacyl-tRNA synthetase-associated domain6.4e-6465.84Show/hide
Query:  MAERNSME-ALAKLEHRQTLILNRISKLELAYLP-NLDSAPSP-VPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASVDHLC
        M +   ME A A+LE  Q  IL++IS LE ++LP N  +APSP +P +  + +TV RLS IL++  VNDF FKRV +DYYDW LE+RRDVL A+SVDHLC
Subjt:  MAERNSME-ALAKLEHRQTLILNRISKLELAYLP-NLDSAPSP-VPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASVDHLC

Query:  KSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIGMKTD
        KSIVL             VNTQA SN++DCSD NNSKYYVVVVQYTA+FNAEAVK FLYSLN+GKI KK+FNLRLAPEETS KLTG+EHNGVTCIGMKT+
Subjt:  KSIVLVRSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIGMKTD

Query:  IP
        IP
Subjt:  IP

AT4G16510.1 YbaK/aminoacyl-tRNA synthetase-associated domain2.0e-1774.51Show/hide
Query:  IREELKVILDEAIVKLNPDYFWLGGGEIDLKLGIRTSEFVNFVKPFIVKCS
        ++  + VILDEAI KL PD+FWLGGGEIDLKLG+RTSEF+ FVKPFIV CS
Subjt:  IREELKVILDEAIVKLNPDYFWLGGGEIDLKLGIRTSEFVNFVKPFIVKCS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGACAGAGACTGAAGCAATGGCAGAACGAAACTCCATGGAAGCACTAGCGAAGCTCGAACACCGCCAGACCCTAATTCTCAATCGAATTTCAAAGCTCGAGCTTGC
ATACCTTCCAAATCTCGACTCTGCACCTTCCCCTGTTCCCTTCAATGGCGCAGATGCCGACACTGTGGCTCGACTCTCTGCCATTCTCCGAACCAACGCTGTCAACGACT
TCTCCTTCAAGAGAGTGCCCTCTGATTACTACGATTGGACTCTCGAAGCCCGACGGGACGTTCTTAATGCCGCCTCCGTCGACCATCTCTGCAAGAGCATCGTTCTGGTG
CGTTCTGGAATTGGATTCTCCTTTTGCAATTTCGATGTAAATACTCAAGCCCCGTCCAATGTTGTTGATTGTAGTGATCGCAACAATTCAAAATATTATGTTGTTGTTGT
TCAGTATACTGCTAAATTCAATGCTGAAGCTGTTAAAAGCTTCCTGTATTCACTCAACGATGGCAAGATAGCAAAAAAGAAATTCAATTTGAGACTTGCTCCAGAGGAGA
CATCTGCGAAGCTAACTGGATATGAGCATAATGGAGTGACGTGCATTGGCATGAAAACAGATATTCCAGGCAGAGGTGCCATCTCCACCGAGACTGGACTCCAGCTTCCT
CATCTTTGGAAGGAGCTGAAGCCAGGTCAACTGAGAGGTCTTTTTGATCGTGAAACTAGCAATTGGGTGGCTTTAGTTGACAAGTTGAATGAGGTGCACTTGGGATCTGG
CATTGACAAAATCCTTTGGAGCTTAGAAGGATCTGGCTACTACTCCACCAATTCTTTGTTCAGTAGAAATGTGGGGAAATTGCCTAAAATTAGCTTGACCACTGCTGGTT
TGATTTGGAAGCACAAATGCCCCAAAAGGGTGAAGGTTTTTCTCTTGTCTGTGGCCTACCGTAGCTTAAACACGGATGATTATCTGCAAAGGAAGTTTAAATATTGGACC
TTGTCTCCTTCAGGGTGTAGGCTGTGCATAAAAGACGGGGAGGATATAGATCACATCTTTCTTCATTGTGAGTTTGCTAGAAAGGCTTGGGCTTTCAGTGCAAGGCTGCT
GGGTATCACTTTTTGTATGCCTAGAAGAGTGGAAGACTGGCTTCCTGATGGCTTGAATGCTTGGAACATGAAAAAGAAGGCTAAAGTCATTGTTGGTTGTGCTTTCCGGG
CGACTATGTGGCTTTTGTGGAAGGAAAGGAATGCGAGGACTTTTGAAGATAAGTCTAATTCTTTTGATATTTTTTGTGATAGTGTACAAAATACGGCCTCTTGGCAAGAA
TGCATCAGGGAGGAGCTAAAGGTGATTTTGGACGAAGCAATTGTGAAACTAAATCCTGATTATTTTTGGTTGGGTGGTGGGGAAATCGACCTGAAGTTGGGAATCAGGAC
ATCTGAGTTCGTAAACTTTGTTAAACCCTTCATTGTCAAGTGCAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGACAGAGACTGAAGCAATGGCAGAACGAAACTCCATGGAAGCACTAGCGAAGCTCGAACACCGCCAGACCCTAATTCTCAATCGAATTTCAAAGCTCGAGCTTGC
ATACCTTCCAAATCTCGACTCTGCACCTTCCCCTGTTCCCTTCAATGGCGCAGATGCCGACACTGTGGCTCGACTCTCTGCCATTCTCCGAACCAACGCTGTCAACGACT
TCTCCTTCAAGAGAGTGCCCTCTGATTACTACGATTGGACTCTCGAAGCCCGACGGGACGTTCTTAATGCCGCCTCCGTCGACCATCTCTGCAAGAGCATCGTTCTGGTG
CGTTCTGGAATTGGATTCTCCTTTTGCAATTTCGATGTAAATACTCAAGCCCCGTCCAATGTTGTTGATTGTAGTGATCGCAACAATTCAAAATATTATGTTGTTGTTGT
TCAGTATACTGCTAAATTCAATGCTGAAGCTGTTAAAAGCTTCCTGTATTCACTCAACGATGGCAAGATAGCAAAAAAGAAATTCAATTTGAGACTTGCTCCAGAGGAGA
CATCTGCGAAGCTAACTGGATATGAGCATAATGGAGTGACGTGCATTGGCATGAAAACAGATATTCCAGGCAGAGGTGCCATCTCCACCGAGACTGGACTCCAGCTTCCT
CATCTTTGGAAGGAGCTGAAGCCAGGTCAACTGAGAGGTCTTTTTGATCGTGAAACTAGCAATTGGGTGGCTTTAGTTGACAAGTTGAATGAGGTGCACTTGGGATCTGG
CATTGACAAAATCCTTTGGAGCTTAGAAGGATCTGGCTACTACTCCACCAATTCTTTGTTCAGTAGAAATGTGGGGAAATTGCCTAAAATTAGCTTGACCACTGCTGGTT
TGATTTGGAAGCACAAATGCCCCAAAAGGGTGAAGGTTTTTCTCTTGTCTGTGGCCTACCGTAGCTTAAACACGGATGATTATCTGCAAAGGAAGTTTAAATATTGGACC
TTGTCTCCTTCAGGGTGTAGGCTGTGCATAAAAGACGGGGAGGATATAGATCACATCTTTCTTCATTGTGAGTTTGCTAGAAAGGCTTGGGCTTTCAGTGCAAGGCTGCT
GGGTATCACTTTTTGTATGCCTAGAAGAGTGGAAGACTGGCTTCCTGATGGCTTGAATGCTTGGAACATGAAAAAGAAGGCTAAAGTCATTGTTGGTTGTGCTTTCCGGG
CGACTATGTGGCTTTTGTGGAAGGAAAGGAATGCGAGGACTTTTGAAGATAAGTCTAATTCTTTTGATATTTTTTGTGATAGTGTACAAAATACGGCCTCTTGGCAAGAA
TGCATCAGGGAGGAGCTAAAGGTGATTTTGGACGAAGCAATTGTGAAACTAAATCCTGATTATTTTTGGTTGGGTGGTGGGGAAATCGACCTGAAGTTGGGAATCAGGAC
ATCTGAGTTCGTAAACTTTGTTAAACCCTTCATTGTCAAGTGCAGTTAG
Protein sequenceShow/hide protein sequence
MGTETEAMAERNSMEALAKLEHRQTLILNRISKLELAYLPNLDSAPSPVPFNGADADTVARLSAILRTNAVNDFSFKRVPSDYYDWTLEARRDVLNAASVDHLCKSIVLV
RSGIGFSFCNFDVNTQAPSNVVDCSDRNNSKYYVVVVQYTAKFNAEAVKSFLYSLNDGKIAKKKFNLRLAPEETSAKLTGYEHNGVTCIGMKTDIPGRGAISTETGLQLP
HLWKELKPGQLRGLFDRETSNWVALVDKLNEVHLGSGIDKILWSLEGSGYYSTNSLFSRNVGKLPKISLTTAGLIWKHKCPKRVKVFLLSVAYRSLNTDDYLQRKFKYWT
LSPSGCRLCIKDGEDIDHIFLHCEFARKAWAFSARLLGITFCMPRRVEDWLPDGLNAWNMKKKAKVIVGCAFRATMWLLWKERNARTFEDKSNSFDIFCDSVQNTASWQE
CIREELKVILDEAIVKLNPDYFWLGGGEIDLKLGIRTSEFVNFVKPFIVKCS