; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020012 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020012
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSM-ATX domain-containing protein
Genome locationtig00153446:895059..908061
RNA-Seq ExpressionSgr020012
SyntenySgr020012
Gene Ontology termsGO:0034063 - stress granule assembly (biological process)
GO:0010494 - cytoplasmic stress granule (cellular component)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR045117 - Ataxin2-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600769.1 Polyadenylate-binding protein-interacting protein 4, partial [Cucurbita argyrosperma subsp. sororia]4.1e-21282.74Show/hide
Query:  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNG
        GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKS+A  DNE+M N P SLLPA+ETKTCMESFKEGSQ+NQTSDL+QDQNG
Subjt:  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNG

Query:  FAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTSA
        FAHGS+PT+TGK  DVRQLL DN ENN+ DAQQK+ER NCKKPEGVTDAAINWR+DPDNQLK+EQDDHGQEFD+HK  N DRVQSSI SEKPC ER  SA
Subjt:  FAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTSA

Query:  NTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNS-PVVPVAVAQPEV
        NT+ NAYSV VST S SSVDSSMDSC SSITST D+APSH SESNKS KEFKLNPRAKLFSPS  N + A  A  + A++AYISNNS PVVP AV QPE+
Subjt:  NTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNS-PVVPVAVAQPEV

Query:  EFSPFVPRSSVP-AKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSP
        +FSPFVPRSSVP AKFVPYGNS+AGF GNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGPTFGPPNS AVMV RFGQLVY+ PVSHDLAQG TVVSPV P
Subjt:  EFSPFVPRSSVP-AKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSP

Query:  CPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKF
        CPLLTTQPAQYPKHQ      GTAAA   QALQFCVPPPFMA+GHQPL+AVPNHIPILQPSFPLNRPM +PG+NAFF TKF
Subjt:  CPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKF

XP_022151013.1 uncharacterized protein LOC111019035 isoform X1 [Momordica charantia]7.9e-23288.17Show/hide
Query:  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQ-TSDLVQDQN
        GVVLKKARMTKKGKRNVNVDDG VIDTLIVLSGDLVQVVATEVLLPAGSFSKSL GYDNEAMANVP+SLLP  E KTCMESFKEGSQ+NQ +S+LVQDQN
Subjt:  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQ-TSDLVQDQN

Query:  GFAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTS
        GFAHGSVPTITGKH DVRQLLRDN E+NQGDAQQKRERINCKKPE  TDAAINWRQDPDNQLKRE+DDH QEFD+HK VNVDRVQSSISSEKPCIERPTS
Subjt:  GFAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTS

Query:  ANTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEV
        ANT+ NA+SV VST SLSS+DSSMDSCHSSITST D+A  H SESNKS+KEFKLNPRAKLFSPSV N+M+A+PA  +VA+VAYISN+SPVVPVAVAQPEV
Subjt:  ANTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEV

Query:  EFSPFVPRSSV-PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSP
        EFSPFVPRSSV PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSP
Subjt:  EFSPFVPRSSV-PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSP

Query:  CPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKFT
        CPLLTTQPAQYPKHQ +        AAA QALQFCVPPPFMA+GHQPL+AVPNHIPILQPSFPLNRPMQ+PGSNAFF+TKFT
Subjt:  CPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKFT

XP_022151014.1 uncharacterized protein LOC111019035 isoform X2 [Momordica charantia]5.5e-23388.36Show/hide
Query:  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNG
        GVVLKKARMTKKGKRNVNVDDG VIDTLIVLSGDLVQVVATEVLLPAGSFSKSL GYDNEAMANVP+SLLP  E KTCMESFKEGSQ+NQ S+LVQDQNG
Subjt:  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNG

Query:  FAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTSA
        FAHGSVPTITGKH DVRQLLRDN E+NQGDAQQKRERINCKKPE  TDAAINWRQDPDNQLKRE+DDH QEFD+HK VNVDRVQSSISSEKPCIERPTSA
Subjt:  FAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTSA

Query:  NTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEVE
        NT+ NA+SV VST SLSS+DSSMDSCHSSITST D+A  H SESNKS+KEFKLNPRAKLFSPSV N+M+A+PA  +VA+VAYISN+SPVVPVAVAQPEVE
Subjt:  NTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEVE

Query:  FSPFVPRSSV-PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPC
        FSPFVPRSSV PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPC
Subjt:  FSPFVPRSSV-PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPC

Query:  PLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKFT
        PLLTTQPAQYPKHQ +        AAA QALQFCVPPPFMA+GHQPL+AVPNHIPILQPSFPLNRPMQ+PGSNAFF+TKFT
Subjt:  PLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKFT

XP_022151016.1 uncharacterized protein LOC111019035 isoform X4 [Momordica charantia]1.2e-22787.97Show/hide
Query:  MTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQ-TSDLVQDQNGFAHGSVP
        MTKKGKRNVNVDDG VIDTLIVLSGDLVQVVATEVLLPAGSFSKSL GYDNEAMANVP+SLLP  E KTCMESFKEGSQ+NQ +S+LVQDQNGFAHGSVP
Subjt:  MTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQ-TSDLVQDQNGFAHGSVP

Query:  TITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTSANTSLNAY
        TITGKH DVRQLLRDN E+NQGDAQQKRERINCKKPE  TDAAINWRQDPDNQLKRE+DDH QEFD+HK VNVDRVQSSISSEKPCIERPTSANT+ NA+
Subjt:  TITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTSANTSLNAY

Query:  SVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEVEFSPFVPR
        SV VST SLSS+DSSMDSCHSSITST D+A  H SESNKS+KEFKLNPRAKLFSPSV N+M+A+PA  +VA+VAYISN+SPVVPVAVAQPEVEFSPFVPR
Subjt:  SVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEVEFSPFVPR

Query:  SSV-PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQP
        SSV PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQP
Subjt:  SSV-PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQP

Query:  AQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKFT
        AQYPKHQ +        AAA QALQFCVPPPFMA+GHQPL+AVPNHIPILQPSFPLNRPMQ+PGSNAFF+TKFT
Subjt:  AQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKFT

XP_022943019.1 polyadenylate-binding protein-interacting protein 4-like [Cucurbita moschata]4.1e-21283.2Show/hide
Query:  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNG
        GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLA  DNE+M N P SLLPA+ETKTCMESFKEGSQ+NQTSDLVQDQNG
Subjt:  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNG

Query:  FAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTSA
        FAHGS+PT+TGK  DVRQLL DN ENN+GDAQQK+ER NCKKP+GVTDAAINWR+DPDNQLK+EQDDHGQEFD+HK  N DRVQSSI SEKPC ER  SA
Subjt:  FAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTSA

Query:  NTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNS-PVVPVAVAQPEV
        NT+ NAYSV VST S SSVDSSMDSC SSITST D+APSH SESNKS KEFKLNPRAKLFSPS  N M A  A  + A++AYISNNS PVVP AV QPE+
Subjt:  NTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNS-PVVPVAVAQPEV

Query:  EFSPFVPRSSVP-AKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSP
        +FSPFVPRSSVP AKFVPYGNS+AGF GNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGPTFGPPNS AVMV RFGQLVY+ PVSHDLAQG TVVSPV P
Subjt:  EFSPFVPRSSVP-AKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSP

Query:  CPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQP-SFPLNRPMQVPGSNAFFNTKF
        CPLLTTQPAQYPKHQ      GTAAA   QALQFCVPPPFMA+GHQPL+AVPNHIPILQP SFPLNRPM +PG+NAFF TKF
Subjt:  CPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQP-SFPLNRPMQVPGSNAFFNTKF

TrEMBL top hitse value%identityAlignment
A0A6J1DB05 uncharacterized protein LOC111019035 isoform X13.8e-23288.17Show/hide
Query:  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQ-TSDLVQDQN
        GVVLKKARMTKKGKRNVNVDDG VIDTLIVLSGDLVQVVATEVLLPAGSFSKSL GYDNEAMANVP+SLLP  E KTCMESFKEGSQ+NQ +S+LVQDQN
Subjt:  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQ-TSDLVQDQN

Query:  GFAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTS
        GFAHGSVPTITGKH DVRQLLRDN E+NQGDAQQKRERINCKKPE  TDAAINWRQDPDNQLKRE+DDH QEFD+HK VNVDRVQSSISSEKPCIERPTS
Subjt:  GFAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTS

Query:  ANTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEV
        ANT+ NA+SV VST SLSS+DSSMDSCHSSITST D+A  H SESNKS+KEFKLNPRAKLFSPSV N+M+A+PA  +VA+VAYISN+SPVVPVAVAQPEV
Subjt:  ANTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEV

Query:  EFSPFVPRSSV-PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSP
        EFSPFVPRSSV PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSP
Subjt:  EFSPFVPRSSV-PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSP

Query:  CPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKFT
        CPLLTTQPAQYPKHQ +        AAA QALQFCVPPPFMA+GHQPL+AVPNHIPILQPSFPLNRPMQ+PGSNAFF+TKFT
Subjt:  CPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKFT

A0A6J1DBR8 uncharacterized protein LOC111019035 isoform X22.6e-23388.36Show/hide
Query:  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNG
        GVVLKKARMTKKGKRNVNVDDG VIDTLIVLSGDLVQVVATEVLLPAGSFSKSL GYDNEAMANVP+SLLP  E KTCMESFKEGSQ+NQ S+LVQDQNG
Subjt:  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNG

Query:  FAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTSA
        FAHGSVPTITGKH DVRQLLRDN E+NQGDAQQKRERINCKKPE  TDAAINWRQDPDNQLKRE+DDH QEFD+HK VNVDRVQSSISSEKPCIERPTSA
Subjt:  FAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTSA

Query:  NTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEVE
        NT+ NA+SV VST SLSS+DSSMDSCHSSITST D+A  H SESNKS+KEFKLNPRAKLFSPSV N+M+A+PA  +VA+VAYISN+SPVVPVAVAQPEVE
Subjt:  NTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEVE

Query:  FSPFVPRSSV-PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPC
        FSPFVPRSSV PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPC
Subjt:  FSPFVPRSSV-PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPC

Query:  PLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKFT
        PLLTTQPAQYPKHQ +        AAA QALQFCVPPPFMA+GHQPL+AVPNHIPILQPSFPLNRPMQ+PGSNAFF+TKFT
Subjt:  PLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKFT

A0A6J1DD98 uncharacterized protein LOC111019035 isoform X45.7e-22887.97Show/hide
Query:  MTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQ-TSDLVQDQNGFAHGSVP
        MTKKGKRNVNVDDG VIDTLIVLSGDLVQVVATEVLLPAGSFSKSL GYDNEAMANVP+SLLP  E KTCMESFKEGSQ+NQ +S+LVQDQNGFAHGSVP
Subjt:  MTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQ-TSDLVQDQNGFAHGSVP

Query:  TITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTSANTSLNAY
        TITGKH DVRQLLRDN E+NQGDAQQKRERINCKKPE  TDAAINWRQDPDNQLKRE+DDH QEFD+HK VNVDRVQSSISSEKPCIERPTSANT+ NA+
Subjt:  TITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTSANTSLNAY

Query:  SVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEVEFSPFVPR
        SV VST SLSS+DSSMDSCHSSITST D+A  H SESNKS+KEFKLNPRAKLFSPSV N+M+A+PA  +VA+VAYISN+SPVVPVAVAQPEVEFSPFVPR
Subjt:  SVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEVEFSPFVPR

Query:  SSV-PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQP
        SSV PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQP
Subjt:  SSV-PAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQP

Query:  AQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKFT
        AQYPKHQ +        AAA QALQFCVPPPFMA+GHQPL+AVPNHIPILQPSFPLNRPMQ+PGSNAFF+TKFT
Subjt:  AQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKFT

A0A6J1FT17 polyadenylate-binding protein-interacting protein 4-like2.0e-21283.2Show/hide
Query:  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNG
        GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLA  DNE+M N P SLLPA+ETKTCMESFKEGSQ+NQTSDLVQDQNG
Subjt:  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNG

Query:  FAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTSA
        FAHGS+PT+TGK  DVRQLL DN ENN+GDAQQK+ER NCKKP+GVTDAAINWR+DPDNQLK+EQDDHGQEFD+HK  N DRVQSSI SEKPC ER  SA
Subjt:  FAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTSA

Query:  NTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNS-PVVPVAVAQPEV
        NT+ NAYSV VST S SSVDSSMDSC SSITST D+APSH SESNKS KEFKLNPRAKLFSPS  N M A  A  + A++AYISNNS PVVP AV QPE+
Subjt:  NTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNS-PVVPVAVAQPEV

Query:  EFSPFVPRSSVP-AKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSP
        +FSPFVPRSSVP AKFVPYGNS+AGF GNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGPTFGPPNS AVMV RFGQLVY+ PVSHDLAQG TVVSPV P
Subjt:  EFSPFVPRSSVP-AKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSP

Query:  CPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQP-SFPLNRPMQVPGSNAFFNTKF
        CPLLTTQPAQYPKHQ      GTAAA   QALQFCVPPPFMA+GHQPL+AVPNHIPILQP SFPLNRPM +PG+NAFF TKF
Subjt:  CPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQP-SFPLNRPMQVPGSNAFFNTKF

A0A6J1IG89 polyadenylate-binding protein-interacting protein 4-like6.3e-21182.99Show/hide
Query:  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNG
        GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLA  DNE+M N P SLLPA+ETKTCMESFKEGSQ+NQTSDLVQDQNG
Subjt:  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNG

Query:  FAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTSA
        FAHGS+PT+TGK  DVRQLL DN ENN+GDAQQK+E  NCKKPEGVTDAAINWR+DPDNQLK+EQDDHGQEFD+HK  N DRVQSSI SEKPC ER  SA
Subjt:  FAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTSA

Query:  NTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNS-PVVPVAVAQPEV
        NT+ NAYSV VST S SSVDSSMDSC SSITST D+APSH SESNKS KEFKLNPRAKLFSPS  N M A  A  + A++AYISNNS PVVP AV QPE+
Subjt:  NTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNS-PVVPVAVAQPEV

Query:  EFSPFVPRSSVP-AKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSP
        +FSPFVPRSSVP AKFVPYGNS+AGF GNVAQFSQPMVGHVGTRTQP+RYVGQYPLQAGPTFGP NS AVMV RFGQLVY+ PVSHDLAQG TVVSPV P
Subjt:  EFSPFVPRSSVP-AKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSP

Query:  CPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQP-SFPLNRPMQVPGSNAFFNTKF
        CPLLTTQPAQYPKHQ      GTAAA   QALQFCVPPPFMA+GHQPL+AVPNHIPILQP SFPLNRPM +PG+NAFF TKF
Subjt:  CPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQP-SFPLNRPMQVPGSNAFFNTKF

SwissProt top hitse value%identityAlignment
P93290 Uncharacterized mitochondrial protein AtMg002405.3e-0537.5Show/hide
Query:  LYLTISRPDITF--YKLSQFVSKPCKSHLSAAHHLLRYLKASPGQGVFLPASSSFQISFPFCNTNFSTLRET
        +YLTI+RPD+TF   +LSQF S    + + A + +L Y+K + GQG+F  A+S  Q+   F ++++++  +T
Subjt:  LYLTISRPDITF--YKLSQFVSKPCKSHLSAAHHLLRYLKASPGQGVFLPASSSFQISFPFCNTNFSTLRET

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.7e-0937.65Show/hide
Query:  MDPNLQLHASDNDYLQDPSVYRRLIGRLLYLTISRPDITF--YKLSQFVSKPCKSHLSAAHHLLRYLKASPGQGVFLPASSSFQI
        M P+ +L       L DP+ YR ++G L YL  +RPDI++   +LSQF+  P + HL A   +LRYL  +P  G+FL   ++  +
Subjt:  MDPNLQLHASDNDYLQDPSVYRRLIGRLLYLTISRPDITF--YKLSQFVSKPCKSHLSAAHHLLRYLKASPGQGVFLPASSSFQI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.1e-0836.14Show/hide
Query:  PNLQLHASDNDYLQDPSVYRRLIGRLLYLTISRPDITF--YKLSQFVSKPCKSHLSAAHHLLRYLKASPGQGVFLPASSSFQI
        P L LH+     L DP+ YR ++G L YL  +RPD+++   +LSQ++  P   H +A   +LRYL  +P  G+FL   ++  +
Subjt:  PNLQLHASDNDYLQDPSVYRRLIGRLLYLTISRPDITF--YKLSQFVSKPCKSHLSAAHHLLRYLKASPGQGVFLPASSSFQI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.6e-1239Show/hide
Query:  MDPNLQLHASDNDYLQDPSVYRRLIGRLLYLTISRPDITF--YKLSQFVSKPCKSHLSAAHHLLRYLKASPGQGVFLPASSSFQISFPFCNTNFSTLRET
        MDP++   A       D   YRRLIGRL+YL I+R DI+F   KLSQF   P  +H  A   +L Y+K + GQG+F  + +  Q+   F + +F + ++T
Subjt:  MDPNLQLHASDNDYLQDPSVYRRLIGRLLYLTISRPDITF--YKLSQFVSKPCKSHLSAAHHLLRYLKASPGQGVFLPASSSFQISFPFCNTNFSTLRET

AT4G26990.1 unknown protein9.8e-4734.94Show/hide
Query:  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNG
        G+VLK AR+TKKG    NV  G V+DTL++LS  +VQ++A  V LP+     ++   +NE         LP SE + C          N+++++     G
Subjt:  GVVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNG

Query:  FAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIER----
        F H                        Q  AQ  +  +                               Q  +++++ N+D +QSS SS     ER    
Subjt:  FAHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIER----

Query:  -------PTSANTSLNAYSVCVSTCSLSS----VDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAA-SMVASVAYIS
                  +N   NA +   ST +L S    VD +++ C   + +++    S   ++ K  KEFKLNP AK+FSPS T  +S +P     V ++AYI 
Subjt:  -------PTSANTSLNAYSVCVSTCSLSS----VDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAA-SMVASVAYIS

Query:  NNSPVVPVAVA-QPEVEFSPFVPRSSVPAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQY-PLQAGPTFGPPNSQAVMVGRFGQLVYVHPV
        +N+P++PV  A  PEV  +P+VP++  P+KFVPYGN  AG      QF Q M+G    R QP RY  QY  +QA P    P+ Q VMV R GQLVYV  V
Subjt:  NNSPVVPVAVA-QPEVEFSPFVPRSSVPAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQY-PLQAGPTFGPPNSQAVMVGRFGQLVYVHPV

Query:  SHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPIL-QPSFPLNRPMQVPGSNAFFNTKF
        S DL QGT  +SP+  CPL T Q  QY KHQ           AA Q L  CV  PF   G QP   +P   P + QP FP N+PM V   N F+ TKF
Subjt:  SHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPIL-QPSFPLNRPMQVPGSNAFFNTKF

AT5G54920.1 unknown protein5.4e-5333.87Show/hide
Query:  VVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNGF
        +VLK A++TKKG+   NV+ G +++TL++LS ++VQ+VA  V     S S ++AG   E      +S +  S       S K     N+  +  + +N  
Subjt:  VVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNGF

Query:  AHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSE------KPCIE
              T+T   G+    +++    ++   +     +N ++  GV                R   +  +  D+H+  NV+   SS S +      KP IE
Subjt:  AHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSE------KPCIE

Query:  RPTSANTSLNAY--------SVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPA--ASMVASVAYISN
        +      S N +        S   S+   ++VD + +     + ST  L P+  ++ +K AKEFKLNP AK FSPS+   +++  A    +VA++ Y+ +
Subjt:  RPTSANTSLNAY--------SVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPA--ASMVASVAYISN

Query:  NSPVVPVAVA-QPEVEFSPFVPRSSVPAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQY-PLQAGPTFGPPNSQAVMVGRFGQLVYVHPVS
        N+P++PV  A QPE+  SPF+  +S P+KFVPY N   G  G  + F Q MVG    R QP R+  QY  +Q  P    PN Q VMVGR GQL+Y+ P+S
Subjt:  NSPVVPVAVA-QPEVEFSPFVPRSSVPAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQY-PLQAGPTFGPPNSQAVMVGRFGQLVYVHPVS

Query:  HDLAQGTTVVSPVSPCPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKF
         DL QG    S + P PL T Q  QYPKHQ        +  A  Q +    P PF ANGHQP + +P  IP++Q  FP+NR M +P  N F+ TKF
Subjt:  HDLAQGTTVVSPVSPCPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKF

AT5G54920.2 unknown protein3.1e-5333.87Show/hide
Query:  VVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNGF
        +VLK A++TKKG+   NV+ G +++TL++LS ++VQ+VA  V     S S ++AG   E      +S +  S       S K     N+  +  + +N  
Subjt:  VVLKKARMTKKGKRNVNVDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNGF

Query:  AHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKR---ERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSE------KP
              T+T   G+    +++    ++    Q +     +N ++  GV                R   +  +  D+H+  NV+   SS S +      KP
Subjt:  AHGSVPTITGKHGDVRQLLRDNAENNQGDAQQKR---ERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSE------KP

Query:  CIERPTSANTSLNAY--------SVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPA--ASMVASVAY
         IE+      S N +        S   S+   ++VD + +     + ST  L P+  ++ +K AKEFKLNP AK FSPS+   +++  A    +VA++ Y
Subjt:  CIERPTSANTSLNAY--------SVCVSTCSLSSVDSSMDSCHSSITSTTDLAPSHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPA--ASMVASVAY

Query:  ISNNSPVVPVAVA-QPEVEFSPFVPRSSVPAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQY-PLQAGPTFGPPNSQAVMVGRFGQLVYVH
        + +N+P++PV  A QPE+  SPF+  +S P+KFVPY N   G  G  + F Q MVG    R QP R+  QY  +Q  P    PN Q VMVGR GQL+Y+ 
Subjt:  ISNNSPVVPVAVA-QPEVEFSPFVPRSSVPAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRYVGQY-PLQAGPTFGPPNSQAVMVGRFGQLVYVH

Query:  PVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKF
        P+S DL QG    S + P PL T Q  QYPKHQ        +  A  Q +    P PF ANGHQP + +P  IP++Q  FP+NR M +P  N F+ TKF
Subjt:  PVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQPSFPLNRPMQVPGSNAFFNTKF

ATMG00240.1 Gag-Pol-related retrotransposon family protein3.8e-0637.5Show/hide
Query:  LYLTISRPDITF--YKLSQFVSKPCKSHLSAAHHLLRYLKASPGQGVFLPASSSFQISFPFCNTNFSTLRET
        +YLTI+RPD+TF   +LSQF S    + + A + +L Y+K + GQG+F  A+S  Q+   F ++++++  +T
Subjt:  LYLTISRPDITF--YKLSQFVSKPCKSHLSAAHHLLRYLKASPGQGVFLPASSSFQISFPFCNTNFSTLRET


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCCAACTTGCAGCTTCATGCTTCTGACAATGATTATTTGCAGGATCCTTCTGTTTACAGGAGATTAATTGGCAGATTATTGTATTTGACCATTTCTCGTCCTGA
TATAACTTTTTACAAGTTGAGCCAATTTGTGTCCAAGCCGTGCAAGTCCCACCTATCTGCTGCCCACCATTTATTGCGATATTTAAAGGCTTCGCCAGGACAAGGTGTTT
TCCTCCCAGCTTCTTCTTCCTTCCAGATCTCGTTTCCCTTCTGCAACACCAACTTTTCGACACTCAGAGAAACACCACCTCCACCGGCGGATCCATCAGCTAACACCCAC
ACAGAAATTGGCATATGGGTTGCAGAAACAGGGAGTTTTCCGAAGATGACACTTCCTCTTCTACGCTTAGCGAGGCTTTGCTCTTTGCCACCATGTGCCTCATTGGCCTC
CCAGTTGAGGTTCACGTTAAAGATGGCTCTGTCTATTGCGGCATCTTTCACACTGCCTGTGTGGAGAATGAATATGAATTCTACTTTTTGTATTCATTCTTATGTGAATT
TGTGCCTTGGTTTCGCCCGTGAGCGAGTGGTTTTGATTTTATCGTTTACGTATGGCGGTGTTGTTCTGAAGAAAGCAAGGATGACAAAAAAGGGTAAAAGGAATGTGAAT
GTGGACGATGGAGTTGTAATCGATACTCTTATTGTTCTTTCCGGTGATCTTGTCCAAGTTGTTGCGACGGAAGTTCTACTTCCGGCTGGTAGTTTTTCCAAAAGTTTGGC
TGGTTATGATAATGAAGCCATGGCCAATGTTCCTATTTCATTGCTTCCAGCTTCAGAGACTAAGACATGTATGGAGTCATTCAAGGAGGGGAGTCAGATGAATCAAACAA
GCGACTTGGTCCAAGATCAGAATGGGTTTGCTCATGGTTCAGTGCCTACGATAACTGGGAAGCATGGTGATGTTAGACAGCTTTTGCGAGATAATGCTGAGAACAACCAG
GGAGATGCACAGCAGAAAAGGGAAAGGATCAATTGCAAAAAGCCTGAAGGTGTCACTGATGCTGCAATCAATTGGAGACAGGACCCAGATAACCAATTAAAAAGGGAGCA
GGATGATCATGGTCAGGAATTTGACATTCATAAAAGAGTCAATGTTGATCGAGTTCAATCCTCTATATCGAGTGAGAAACCATGCATTGAAAGACCCACCTCAGCCAACA
CTTCTCTGAATGCATATTCAGTTTGTGTCTCAACTTGTTCACTTTCGTCCGTAGATAGTAGCATGGACTCGTGTCATAGTTCTATAACATCAACGACCGACTTGGCCCCT
TCTCATGTTTCAGAATCCAATAAAAGTGCCAAGGAATTTAAGCTGAATCCAAGAGCCAAACTCTTCTCTCCATCTGTTACCAATAGCATGTCAGCAACTCCTGCAGCTTC
AATGGTTGCAAGCGTGGCTTACATTTCAAACAACTCACCAGTAGTACCTGTGGCTGTTGCTCAGCCAGAGGTGGAGTTCAGTCCTTTTGTACCTCGTTCATCTGTGCCTG
CCAAGTTTGTCCCTTATGGCAACTCAATAGCTGGATTTGGTGGCAATGTTGCTCAATTTTCCCAACCTATGGTGGGACATGTAGGAACCAGGACGCAGCCAGTTAGATAT
GTTGGTCAGTATCCTCTCCAGGCTGGTCCAACCTTTGGGCCCCCAAACTCACAAGCAGTTATGGTCGGACGTTTTGGGCAACTTGTTTATGTTCACCCAGTCTCGCATGA
CTTGGCTCAAGGTACAACAGTCGTCTCACCGGTATCACCTTGCCCTTTGTTGACAACACAGCCAGCTCAATATCCAAAACATCAAGCGTCCTCCACAAATGCAGGAACTG
CAGCAGCAGCAGCAGTACAAGCATTGCAGTTTTGCGTTCCTCCACCATTTATGGCCAATGGACACCAGCCGCTCTCCGCAGTGCCAAACCACATTCCAATTTTGCAGCCC
TCCTTCCCCCTCAATCGCCCAATGCAAGTCCCAGGATCTAATGCATTCTTCAACACCAAGTTCACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATCCCAACTTGCAGCTTCATGCTTCTGACAATGATTATTTGCAGGATCCTTCTGTTTACAGGAGATTAATTGGCAGATTATTGTATTTGACCATTTCTCGTCCTGA
TATAACTTTTTACAAGTTGAGCCAATTTGTGTCCAAGCCGTGCAAGTCCCACCTATCTGCTGCCCACCATTTATTGCGATATTTAAAGGCTTCGCCAGGACAAGGTGTTT
TCCTCCCAGCTTCTTCTTCCTTCCAGATCTCGTTTCCCTTCTGCAACACCAACTTTTCGACACTCAGAGAAACACCACCTCCACCGGCGGATCCATCAGCTAACACCCAC
ACAGAAATTGGCATATGGGTTGCAGAAACAGGGAGTTTTCCGAAGATGACACTTCCTCTTCTACGCTTAGCGAGGCTTTGCTCTTTGCCACCATGTGCCTCATTGGCCTC
CCAGTTGAGGTTCACGTTAAAGATGGCTCTGTCTATTGCGGCATCTTTCACACTGCCTGTGTGGAGAATGAATATGAATTCTACTTTTTGTATTCATTCTTATGTGAATT
TGTGCCTTGGTTTCGCCCGTGAGCGAGTGGTTTTGATTTTATCGTTTACGTATGGCGGTGTTGTTCTGAAGAAAGCAAGGATGACAAAAAAGGGTAAAAGGAATGTGAAT
GTGGACGATGGAGTTGTAATCGATACTCTTATTGTTCTTTCCGGTGATCTTGTCCAAGTTGTTGCGACGGAAGTTCTACTTCCGGCTGGTAGTTTTTCCAAAAGTTTGGC
TGGTTATGATAATGAAGCCATGGCCAATGTTCCTATTTCATTGCTTCCAGCTTCAGAGACTAAGACATGTATGGAGTCATTCAAGGAGGGGAGTCAGATGAATCAAACAA
GCGACTTGGTCCAAGATCAGAATGGGTTTGCTCATGGTTCAGTGCCTACGATAACTGGGAAGCATGGTGATGTTAGACAGCTTTTGCGAGATAATGCTGAGAACAACCAG
GGAGATGCACAGCAGAAAAGGGAAAGGATCAATTGCAAAAAGCCTGAAGGTGTCACTGATGCTGCAATCAATTGGAGACAGGACCCAGATAACCAATTAAAAAGGGAGCA
GGATGATCATGGTCAGGAATTTGACATTCATAAAAGAGTCAATGTTGATCGAGTTCAATCCTCTATATCGAGTGAGAAACCATGCATTGAAAGACCCACCTCAGCCAACA
CTTCTCTGAATGCATATTCAGTTTGTGTCTCAACTTGTTCACTTTCGTCCGTAGATAGTAGCATGGACTCGTGTCATAGTTCTATAACATCAACGACCGACTTGGCCCCT
TCTCATGTTTCAGAATCCAATAAAAGTGCCAAGGAATTTAAGCTGAATCCAAGAGCCAAACTCTTCTCTCCATCTGTTACCAATAGCATGTCAGCAACTCCTGCAGCTTC
AATGGTTGCAAGCGTGGCTTACATTTCAAACAACTCACCAGTAGTACCTGTGGCTGTTGCTCAGCCAGAGGTGGAGTTCAGTCCTTTTGTACCTCGTTCATCTGTGCCTG
CCAAGTTTGTCCCTTATGGCAACTCAATAGCTGGATTTGGTGGCAATGTTGCTCAATTTTCCCAACCTATGGTGGGACATGTAGGAACCAGGACGCAGCCAGTTAGATAT
GTTGGTCAGTATCCTCTCCAGGCTGGTCCAACCTTTGGGCCCCCAAACTCACAAGCAGTTATGGTCGGACGTTTTGGGCAACTTGTTTATGTTCACCCAGTCTCGCATGA
CTTGGCTCAAGGTACAACAGTCGTCTCACCGGTATCACCTTGCCCTTTGTTGACAACACAGCCAGCTCAATATCCAAAACATCAAGCGTCCTCCACAAATGCAGGAACTG
CAGCAGCAGCAGCAGTACAAGCATTGCAGTTTTGCGTTCCTCCACCATTTATGGCCAATGGACACCAGCCGCTCTCCGCAGTGCCAAACCACATTCCAATTTTGCAGCCC
TCCTTCCCCCTCAATCGCCCAATGCAAGTCCCAGGATCTAATGCATTCTTCAACACCAAGTTCACCTGA
Protein sequenceShow/hide protein sequence
MDPNLQLHASDNDYLQDPSVYRRLIGRLLYLTISRPDITFYKLSQFVSKPCKSHLSAAHHLLRYLKASPGQGVFLPASSSFQISFPFCNTNFSTLRETPPPPADPSANTH
TEIGIWVAETGSFPKMTLPLLRLARLCSLPPCASLASQLRFTLKMALSIAASFTLPVWRMNMNSTFCIHSYVNLCLGFARERVVLILSFTYGGVVLKKARMTKKGKRNVN
VDDGVVIDTLIVLSGDLVQVVATEVLLPAGSFSKSLAGYDNEAMANVPISLLPASETKTCMESFKEGSQMNQTSDLVQDQNGFAHGSVPTITGKHGDVRQLLRDNAENNQ
GDAQQKRERINCKKPEGVTDAAINWRQDPDNQLKREQDDHGQEFDIHKRVNVDRVQSSISSEKPCIERPTSANTSLNAYSVCVSTCSLSSVDSSMDSCHSSITSTTDLAP
SHVSESNKSAKEFKLNPRAKLFSPSVTNSMSATPAASMVASVAYISNNSPVVPVAVAQPEVEFSPFVPRSSVPAKFVPYGNSIAGFGGNVAQFSQPMVGHVGTRTQPVRY
VGQYPLQAGPTFGPPNSQAVMVGRFGQLVYVHPVSHDLAQGTTVVSPVSPCPLLTTQPAQYPKHQASSTNAGTAAAAAVQALQFCVPPPFMANGHQPLSAVPNHIPILQP
SFPLNRPMQVPGSNAFFNTKFT