CuGenDBv2

Lag0017420 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0017420
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: SpoU_methylase domain-containing protein
Genome location: chr5:3488717..3508599
RNA-Seq Expression: Lag0017420
Synteny: Lag0017420
Gene Ontology terms:
GO:0030488 - tRNA methylation (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0016423 - tRNA (guanine) methyltransferase activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG7030164.1 putative methyltransferase TARBP1, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 0.0e+00; % identity: 88.89)
Query:  TRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEA
        TRVVFLQQLTSLARKKS GRVGLISLSECIASAASI G +N++EGECF+GSSLSAQ DLI +S G  MELLDDLRFVVESSKQHFNPSYR+QVCAKALEA
Subjt:  TRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEA

Query:  AASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWA
        AASVLCTSDL  E VLHFISALPREATDYGGCLRGKMQNWL GCGKK CSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWE EAKRWA
Subjt:  AASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWA

Query:  RVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFSSL
        RVVFLAVKEEHHL PILTFI + GVNICKQKSDLE +RVKFLILIMSLVQEL+LV+EKI+  N+K E+ DEFT   PSD+ S AEPTI IQK  NLFSSL
Subjt:  RVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFSSL

Query:  QVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHSES
        Q+ELVSFA +SCSIFWS VK DE ILPGSVKGKLGGPSQRRLPSSIAT VLLAVTS+KAVASVLSCCRQ R   SNN G+EFLL FLSKTVSSP +HSE+
Subjt:  QVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHSES

Query:  GAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWK-WLCLESLLSIPLHALQSGL
        GAEICLA YEALASVLQVLV EFS EALRF+ DESTI++ GVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWK WLCLESLLSIP  A+QSGL
Subjt:  GAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWK-WLCLESLLSIPLHALQSGL

Query:  NLVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSS
        NLVDNNSFLSEATLLQIFSDLVESLENAGECS+LPMLRLVRLTLWLFCKGKSGLLVTSCNG+NAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSS
Subjt:  NLVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSS

Query:  TFSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELT
        TFSERSMHLSEG PGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLA SPDPELT
Subjt:  TFSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELT

Query:  EVFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV
        EVFINTELYARVSVAVLFHKLADLA + GLSN +GS SDAVESGKLFLLELLDSVV
Subjt:  EVFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV

XP_022934778.1 uncharacterized protein LOC111441850 isoform X1 [Cucurbita moschata] (e value: 0.0e+00; % identity: 89.01)
Query:  TRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEA
        TRVVFLQ+LTSLARKKS GRVGLISLSECIASAASI G +N+ EGECF+GSSLSAQ DLI +S G  MELLDDLRFVVESSKQHFNPSYR+QVCAKALEA
Subjt:  TRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEA

Query:  AASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWA
        AASVLCTSDL  E VLHFISALPREATDYGGCLRGKMQNWL GCGKK CSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWE EAKRWA
Subjt:  AASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWA

Query:  RVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFSSL
        RVVFLAVKEEHHL PILTFI + GVNICKQKSDLE +RVKFLILIMSLVQEL+LV+EKI+  N+K E+ DEFT   PSD+ S AEPTI IQK  NLFSSL
Subjt:  RVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFSSL

Query:  QVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHSES
        Q+ELVSFA +SCSIFWS VK DE ILPGSVKGKLGGPSQRRLPSSIAT VLLAVTS+KAVASVLSCCRQ R   SNN G+EFLL FLSKTVSSP +HSE+
Subjt:  QVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHSES

Query:  GAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIPLHALQSGLN
        GAEICLA YEALASVLQVLV EFSSEALRF+ DESTIL+ GVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIP  A+QSGLN
Subjt:  GAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIPLHALQSGLN

Query:  LVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST
        LVDNNSFLSEATLLQIFSDLVESLENAGECS+LPMLRLVRLTLWLFCKGKSGLLVTSCNG+NAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST
Subjt:  LVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST

Query:  FSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELTE
        FSERSMHLSEG PGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLA SPDPELTE
Subjt:  FSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELTE

Query:  VFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV
        VFINTELYARVSVAVLFHKLADLA + GLSN +GS SD+VESGKLFLLELLDSVV
Subjt:  VFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV

XP_022934811.1 uncharacterized protein LOC111441850 isoform X2 [Cucurbita moschata] (e value: 0.0e+00; % identity: 89.01)
Query:  TRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEA
        TRVVFLQ+LTSLARKKS GRVGLISLSECIASAASI G +N+ EGECF+GSSLSAQ DLI +S G  MELLDDLRFVVESSKQHFNPSYR+QVCAKALEA
Subjt:  TRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEA

Query:  AASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWA
        AASVLCTSDL  E VLHFISALPREATDYGGCLRGKMQNWL GCGKK CSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWE EAKRWA
Subjt:  AASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWA

Query:  RVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFSSL
        RVVFLAVKEEHHL PILTFI + GVNICKQKSDLE +RVKFLILIMSLVQEL+LV+EKI+  N+K E+ DEFT   PSD+ S AEPTI IQK  NLFSSL
Subjt:  RVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFSSL

Query:  QVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHSES
        Q+ELVSFA +SCSIFWS VK DE ILPGSVKGKLGGPSQRRLPSSIAT VLLAVTS+KAVASVLSCCRQ R   SNN G+EFLL FLSKTVSSP +HSE+
Subjt:  QVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHSES

Query:  GAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIPLHALQSGLN
        GAEICLA YEALASVLQVLV EFSSEALRF+ DESTIL+ GVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIP  A+QSGLN
Subjt:  GAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIPLHALQSGLN

Query:  LVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST
        LVDNNSFLSEATLLQIFSDLVESLENAGECS+LPMLRLVRLTLWLFCKGKSGLLVTSCNG+NAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST
Subjt:  LVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST

Query:  FSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELTE
        FSERSMHLSEG PGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLA SPDPELTE
Subjt:  FSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELTE

Query:  VFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV
        VFINTELYARVSVAVLFHKLADLA + GLSN +GS SD+VESGKLFLLELLDSVV
Subjt:  VFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV

XP_023005246.1 uncharacterized protein LOC111498319 isoform X2 [Cucurbita maxima] (e value: 0.0e+00; % identity: 88.87)
Query:  TRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEA
        TRVVFLQQLTSLARKKS GRVGLISLSECIASAASI GF+N+ EGECF+GSSLSAQ DLI +S G  MELLDDLRFVVESSKQHFNPSYR+QVCAKALEA
Subjt:  TRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEA

Query:  AASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWA
        AASVLCTSDL  E VLHFISALPREATDYGGCLRGKMQNWL GCGKK CSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWE EAKRWA
Subjt:  AASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWA

Query:  RVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFSSL
        RVVFLAVKE H L  ILTFI + GVNICKQKSDLE +RVKFLILIMSLVQEL+LV+EKI+  N+K E+KDEFT  QPSD  S AEPTI IQK  NLFSSL
Subjt:  RVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFSSL

Query:  QVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHSES
        QVELVSFA +SCSIFWS VK DE ILPGSVKGKLGGPSQRRLPSSIAT VLLAVTS+KAVASVLSCCRQ R   SN+ G+EFLL FLSKTVSS  YHSE+
Subjt:  QVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHSES

Query:  GAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIPLHALQSGLN
        GAEICLA YEALASVLQVLV EFSSEALRF+ D STIL+ GVEGRPLLDSLVLTFHQHVNGILDAG+LVRTRRAVLLKWKWLCLESLLSIP  A+QSGLN
Subjt:  GAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIPLHALQSGLN

Query:  LVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST
        LVDNNSFLSEA LLQIFSDLVESLENAGECS+LPMLRLVRLTLWLFCKGKSGLLVTSCNG+NAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST
Subjt:  LVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST

Query:  FSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELTE
        FSERSMHLSEG PGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDA+TEVSLLA SPDPELTE
Subjt:  FSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELTE

Query:  VFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV
        VFINTELYARVSVAVLFHKLADLA + GLSN +GSCSDAVESGKLFLLELLDSVV
Subjt:  VFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV

XP_023539790.1 uncharacterized protein LOC111800369 [Cucurbita pepo subsp. pepo] (e value: 0.0e+00; % identity: 89.54)
Query:  TRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEA
        TRVVFLQQLTSLARKKS GRVGLISLSECIASAAS+ G  N+SEGECFDGSSLSAQ DLI  SLG  MELLDDLRFVVE+SKQHFNPSYR+QVCAKALEA
Subjt:  TRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEA

Query:  AASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWA
        AASVLCTSDL  E VLHFISALPREATDYGGCLRGKMQNWL GCGKK CSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWE EAKRWA
Subjt:  AASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWA

Query:  RVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFSSL
        RVVFLAVKEEHHL PILTFI + GVNICKQKSDLE +RVKFLILIMSLVQEL+LV+EKI+  N+K E+KDEFT  QPSD  S AEPTI IQK  NLFSSL
Subjt:  RVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFSSL

Query:  QVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHSES
        QVELVSFA +SCSIFWS VK DE ILPGSVKGKLGGPSQRRLPSSIAT VLLAVTS+KAVASVLSCCRQ R   SNN G+EFLL FLSKTVSSP YHSE+
Subjt:  QVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHSES

Query:  GAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIPLHALQSGLN
        GAEICLA YEALASVLQVLV EFSSEALRF+ D STIL+ GVEGRPLLDSLVLTFHQHVNG+LDAGVLVRTRRAVLLKWKWLCLESLLSIP  A+QSGLN
Subjt:  GAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIPLHALQSGLN

Query:  LVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST
        LVDNNSFLSEATLLQIFSDLVESLENAGECS+LPMLRLVRLTLWLFCKGKSGLLVTSCNG+NAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST
Subjt:  LVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST

Query:  FSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELTE
        FSERSMHL EG PGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLA SPDPELTE
Subjt:  FSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELTE

Query:  VFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV
        VFINTELYARVSVAVLFHKLADLA + GLSN +GS SDAVESGKLFLLELLDSVV
Subjt:  VFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DPL7 uncharacterized protein LOC111022655 isoform X5 (e value: 0.0e+00; % identity: 86.92)
Query:  SRTRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKAL
        +RT VVFLQQLTSL +KKSFGRVGLISLSECIASAASIVGF ND EGECFD      Q +LIT SLG  MELLDDLRFVV+SSKQHFNPSYR QVCAKAL
Subjt:  SRTRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKAL

Query:  EAAASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKR
        EAAASVLCTSDL LEV+L FISALPREATDYGGCLRGKMQ+WL GCGKK CSGSCCSTETKFMKSLIEFPKRF  HNHSS+ SVTYDDEELEAWEFEAKR
Subjt:  EAAASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKR

Query:  WARVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFS
        WARVVFLAVKEEHHLR ILTFIH+HGVNI KQKSDLE +RVKFLILI+SLVQEL+LVQEK    NYKCETKDE+T  QPSD+ + A PT FI+KF NLFS
Subjt:  WARVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFS

Query:  SLQVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHS
        SL  ELVSFAT SCSIFWSNVK DE  LP SVKGKLGGPSQRRLPS  ATLVLLAVTSMKA+A VLSCCRQ + +GS NFGVEFLL FL KTVSSPAY S
Subjt:  SLQVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHS

Query:  ESGAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIPLHALQSG
        ESG EI LATYEALASVLQVLVSEFSS+ L+FI DE+TI+HLGVEGR  LDSLVLTFHQHVNGILDAGVLVR+RRAVLLKWKWLCLESLLSIP    Q G
Subjt:  ESGAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIPLHALQSG

Query:  LNLVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHS
        L LVDNNSFLSEATL+QIF+DLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVT CNG+N+EMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHS
Subjt:  LNLVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHS

Query:  STFSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPEL
        STFSE+SMHLSEGG GPLKWFIEK LEEGTKSPRTFRLAALHLTGMWLS+PWTIKYYVKELKLLSLYGS+AFDEDFEAELTDY ARTEVSLLAESPDPEL
Subjt:  STFSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPEL

Query:  TEVFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV
        TEVFINTELYARVSVAVLFHKLADLADM GLSNKYGSCSDAVESGKLFLLELLDSVV
Subjt:  TEVFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV

A0A6J1F2V7 uncharacterized protein LOC111441850 isoform X2 (e value: 0.0e+00; % identity: 89.01)
Query:  TRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEA
        TRVVFLQ+LTSLARKKS GRVGLISLSECIASAASI G +N+ EGECF+GSSLSAQ DLI +S G  MELLDDLRFVVESSKQHFNPSYR+QVCAKALEA
Subjt:  TRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEA

Query:  AASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWA
        AASVLCTSDL  E VLHFISALPREATDYGGCLRGKMQNWL GCGKK CSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWE EAKRWA
Subjt:  AASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWA

Query:  RVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFSSL
        RVVFLAVKEEHHL PILTFI + GVNICKQKSDLE +RVKFLILIMSLVQEL+LV+EKI+  N+K E+ DEFT   PSD+ S AEPTI IQK  NLFSSL
Subjt:  RVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFSSL

Query:  QVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHSES
        Q+ELVSFA +SCSIFWS VK DE ILPGSVKGKLGGPSQRRLPSSIAT VLLAVTS+KAVASVLSCCRQ R   SNN G+EFLL FLSKTVSSP +HSE+
Subjt:  QVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHSES

Query:  GAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIPLHALQSGLN
        GAEICLA YEALASVLQVLV EFSSEALRF+ DESTIL+ GVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIP  A+QSGLN
Subjt:  GAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIPLHALQSGLN

Query:  LVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST
        LVDNNSFLSEATLLQIFSDLVESLENAGECS+LPMLRLVRLTLWLFCKGKSGLLVTSCNG+NAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST
Subjt:  LVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST

Query:  FSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELTE
        FSERSMHLSEG PGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLA SPDPELTE
Subjt:  FSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELTE

Query:  VFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV
        VFINTELYARVSVAVLFHKLADLA + GLSN +GS SD+VESGKLFLLELLDSVV
Subjt:  VFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV

A0A6J1F8L8 uncharacterized protein LOC111441850 isoform X1 (e value: 0.0e+00; % identity: 89.01)
Query:  TRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEA
        TRVVFLQ+LTSLARKKS GRVGLISLSECIASAASI G +N+ EGECF+GSSLSAQ DLI +S G  MELLDDLRFVVESSKQHFNPSYR+QVCAKALEA
Subjt:  TRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEA

Query:  AASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWA
        AASVLCTSDL  E VLHFISALPREATDYGGCLRGKMQNWL GCGKK CSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWE EAKRWA
Subjt:  AASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWA

Query:  RVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFSSL
        RVVFLAVKEEHHL PILTFI + GVNICKQKSDLE +RVKFLILIMSLVQEL+LV+EKI+  N+K E+ DEFT   PSD+ S AEPTI IQK  NLFSSL
Subjt:  RVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFSSL

Query:  QVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHSES
        Q+ELVSFA +SCSIFWS VK DE ILPGSVKGKLGGPSQRRLPSSIAT VLLAVTS+KAVASVLSCCRQ R   SNN G+EFLL FLSKTVSSP +HSE+
Subjt:  QVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHSES

Query:  GAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIPLHALQSGLN
        GAEICLA YEALASVLQVLV EFSSEALRF+ DESTIL+ GVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIP  A+QSGLN
Subjt:  GAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIPLHALQSGLN

Query:  LVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST
        LVDNNSFLSEATLLQIFSDLVESLENAGECS+LPMLRLVRLTLWLFCKGKSGLLVTSCNG+NAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST
Subjt:  LVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST

Query:  FSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELTE
        FSERSMHLSEG PGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLA SPDPELTE
Subjt:  FSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELTE

Query:  VFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV
        VFINTELYARVSVAVLFHKLADLA + GLSN +GS SD+VESGKLFLLELLDSVV
Subjt:  VFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV

A0A6J1KSL3 uncharacterized protein LOC111498319 isoform X1 (e value: 0.0e+00; % identity: 88.87)
Query:  TRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEA
        TRVVFLQQLTSLARKKS GRVGLISLSECIASAASI GF+N+ EGECF+GSSLSAQ DLI +S G  MELLDDLRFVVESSKQHFNPSYR+QVCAKALEA
Subjt:  TRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEA

Query:  AASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWA
        AASVLCTSDL  E VLHFISALPREATDYGGCLRGKMQNWL GCGKK CSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWE EAKRWA
Subjt:  AASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWA

Query:  RVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFSSL
        RVVFLAVKE H L  ILTFI + GVNICKQKSDLE +RVKFLILIMSLVQEL+LV+EKI+  N+K E+KDEFT  QPSD  S AEPTI IQK  NLFSSL
Subjt:  RVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFSSL

Query:  QVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHSES
        QVELVSFA +SCSIFWS VK DE ILPGSVKGKLGGPSQRRLPSSIAT VLLAVTS+KAVASVLSCCRQ R   SN+ G+EFLL FLSKTVSS  YHSE+
Subjt:  QVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHSES

Query:  GAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIPLHALQSGLN
        GAEICLA YEALASVLQVLV EFSSEALRF+ D STIL+ GVEGRPLLDSLVLTFHQHVNGILDAG+LVRTRRAVLLKWKWLCLESLLSIP  A+QSGLN
Subjt:  GAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIPLHALQSGLN

Query:  LVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST
        LVDNNSFLSEA LLQIFSDLVESLENAGECS+LPMLRLVRLTLWLFCKGKSGLLVTSCNG+NAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST
Subjt:  LVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST

Query:  FSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELTE
        FSERSMHLSEG PGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDA+TEVSLLA SPDPELTE
Subjt:  FSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELTE

Query:  VFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV
        VFINTELYARVSVAVLFHKLADLA + GLSN +GSCSDAVESGKLFLLELLDSVV
Subjt:  VFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV

A0A6J1KUF0 uncharacterized protein LOC111498319 isoform X2 (e value: 0.0e+00; % identity: 88.87)
Query:  TRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEA
        TRVVFLQQLTSLARKKS GRVGLISLSECIASAASI GF+N+ EGECF+GSSLSAQ DLI +S G  MELLDDLRFVVESSKQHFNPSYR+QVCAKALEA
Subjt:  TRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFDGSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEA

Query:  AASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWA
        AASVLCTSDL  E VLHFISALPREATDYGGCLRGKMQNWL GCGKK CSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWE EAKRWA
Subjt:  AASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWA

Query:  RVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFSSL
        RVVFLAVKE H L  ILTFI + GVNICKQKSDLE +RVKFLILIMSLVQEL+LV+EKI+  N+K E+KDEFT  QPSD  S AEPTI IQK  NLFSSL
Subjt:  RVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQPSDNWSCAEPTIFIQKFANLFSSL

Query:  QVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHSES
        QVELVSFA +SCSIFWS VK DE ILPGSVKGKLGGPSQRRLPSSIAT VLLAVTS+KAVASVLSCCRQ R   SN+ G+EFLL FLSKTVSS  YHSE+
Subjt:  QVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFGVEFLLTFLSKTVSSPAYHSES

Query:  GAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIPLHALQSGLN
        GAEICLA YEALASVLQVLV EFSSEALRF+ D STIL+ GVEGRPLLDSLVLTFHQHVNGILDAG+LVRTRRAVLLKWKWLCLESLLSIP  A+QSGLN
Subjt:  GAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLSIPLHALQSGLN

Query:  LVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST
        LVDNNSFLSEA LLQIFSDLVESLENAGECS+LPMLRLVRLTLWLFCKGKSGLLVTSCNG+NAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST
Subjt:  LVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSST

Query:  FSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELTE
        FSERSMHLSEG PGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDA+TEVSLLA SPDPELTE
Subjt:  FSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELTE

Query:  VFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV
        VFINTELYARVSVAVLFHKLADLA + GLSN +GSCSDAVESGKLFLLELLDSVV
Subjt:  VFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G17610.1 tRNA/rRNA methyltransferase (SpoU) family protein (e value: 7.4e-189; % identity: 49.42)
Query:  RTRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSE-GECFDGSSLSAQRDLITN-SLGSTMELLDDLRFVVESSKQHFNPSYRLQ-----
        R RV FL  L SLA+K+SF R G ++L +CI S A +VG   D E G   D  S +AQ     + S      +LD L+FV ESS+QHFN  YR++     
Subjt:  RTRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSE-GECFDGSSLSAQRDLITN-SLGSTMELLDDLRFVVESSKQHFNPSYRLQ-----

Query:  ----------------VCAKALEAAASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNH
                        V  K LE AASV+   ++ L  +L F+SA+PRE TD+ G LR  M  WL GC +K  S S C+  T+ + SL E+ K FTS N 
Subjt:  ----------------VCAKALEAAASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETKFMKSLIEFPKRFTSHNH

Query:  SSDASVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQ
         S     +DDE+LEAW+ + KRWARV FL + +E HL  I+ F+ ++G++  ++K+ L+    KFLI I+S++ EL+ +Q+ I+  +   ++K      +
Subjt:  SSDASVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETKDEFTFLQ

Query:  PSDNWSCAEPTIFIQKFANLFSSLQVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRT-LGS
         +      + +   +KFA +  S+  EL+ FA  SCSIFWS+  ++   LPGSV GKLGGPSQRRL     T VL AV S+K +  + S C Q  + +G 
Subjt:  PSDNWSCAEPTIFIQKFANLFSSLQVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRT-LGS

Query:  NNFGVEFLLTFLSKTVSSPAYHSESGAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAV
            + F   F   T+SS   +SE+ AEI LA +EALASVL   VS  S+ A   + ++ST+L + V+G   L   V  F +++N +L AGVLVR+RRAV
Subjt:  NNFGVEFLLTFLSKTVSSPAYHSESGAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAV

Query:  LLKWKWLCLESLLSIPLHALQSGLNLVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWI
        LL WKWLC+ESLLS+ +H L +     D  SF S+ T+  IF D+VESLENAGE S LPML+ VRL L +   GKS L     +G++ + MW+LV S WI
Subjt:  LLKWKWLCLESLLSIPLHALQSGLNLVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWI

Query:  LHVSCNKRRVAHIAALLSSVLHSSTFSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFE
        LH+SC KRRVA IAALLSSVLHSS FS + MH++E   GPLKWF+EK+LEEG KSPRT RLAALHL+G+WL +P TIKYY+KEL+LL+LYGS+AFDEDFE
Subjt:  LHVSCNKRRVAHIAALLSSVLHSSTFSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFE

Query:  AELTD-YDARTEVSLLAESPDPELTEVFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV
        AEL+D  DARTEVSLLA+SPDPELTE+FINTELYARVSVA LF KLA+LA M   +++   C DA+ +GKLFLLELLD+ V
Subjt:  AELTD-YDARTEVSLLAESPDPELTEVFINTELYARVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVV


Sequences
CDS sequence
ATGAGAAATCATTCTGTGGCAGCCATCTTTTGGTGTCGTGTGAATCCCTCTCCCTCTAGTGGGTCTGCCATTACCTCATTGTGGAAGTCTGGGGTTGGTTCCTTAAGACT
TTTGATATTCCTTTTGCCAAGCCTTAGGGGTTGTAGAGATATGATTGAGAATTCCTTCTCCACCCGCCTTTTTATGAGAAGAGGAGGTTTCTGTGGCAAGTCGGTGTGTA
GTTTTGTGGGGCCTTTGAGAAAGGTTTGGGACCATTTTGGTAGGTTGGTTGGTTTGAGCTGGTGTGTTCCTCGGTCGGTGGAAGAATGGATGAGAGGAGCCTTAACGGGA
GGAAGATTGAAAGACAAAATTAAGGTGGTTTTGCTTGGGCATGGATCCAAAGTAATCGTTTGCCGGGATCTTGGAAAGTCTGGTATTGCTCTCTTGATGTTTCTTTTCAA
AAAAAAAAAAGGTCTCAAAACTCTTCCTCTACCGGAAGCCTGTAATCATCCATACTATGGAGTATGGTCAAGGACGAGAGTGGTGTTTTTGCAACAGCTCACATCTTTGG
CCAGGAAGAAATCATTTGGTCGAGTTGGGTTGATCAGCCTATCTGAATGCATTGCTTCAGCTGCTTCAATAGTTGGATTCAATAACGATAGTGAAGGAGAGTGCTTTGAT
GGTTCTTCGCTATCAGCCCAAAGGGATTTGATAACTAATTCTCTAGGGTCCACAATGGAATTGCTGGATGATCTGAGATTTGTGGTTGAGAGCAGCAAACAACACTTCAA
TCCTAGTTATCGCCTTCAAGTTTGTGCAAAAGCTCTGGAGGCTGCTGCTTCTGTCTTGTGTACATCGGACTTGGGTCTTGAGGTTGTTCTGCATTTTATTTCAGCACTAC
CACGAGAGGCTACTGACTATGGAGGTTGCTTAAGGGGGAAAATGCAAAATTGGCTCTCAGGGTGTGGTAAGAAGCGCTGCAGTGGCAGTTGCTGCAGTACTGAGACAAAG
TTTATGAAGAGTCTTATTGAGTTCCCTAAAAGATTTACGAGTCATAATCATTCATCCGATGCTTCTGTTACTTATGACGACGAAGAATTGGAAGCATGGGAATTTGAGGC
AAAACGATGGGCAAGAGTGGTTTTTCTTGCAGTCAAGGAGGAACATCATTTAAGACCTATACTGACGTTTATTCATGACCATGGTGTAAATATTTGCAAGCAGAAGAGTG
ATTTGGAAAATGTACGTGTGAAGTTTCTAATACTTATCATGAGCTTGGTTCAAGAACTTAAATTAGTTCAGGAGAAAATTGCTCTCAGCAACTACAAATGTGAAACCAAG
GATGAGTTTACCTTCTTGCAGCCAAGTGACAATTGGAGTTGTGCAGAACCAACTATTTTTATCCAAAAATTTGCCAACCTTTTTTCGTCTCTACAGGTAGAGTTGGTTTC
TTTTGCTACCATGTCTTGTTCCATATTCTGGTCCAACGTCAAGTTGGATGAGGCAATATTACCAGGTTCTGTGAAAGGGAAACTTGGAGGCCCCAGTCAACGCCGATTAC
CATCCTCTATTGCTACTTTGGTTTTGCTAGCTGTAACATCAATGAAGGCTGTTGCATCTGTCTTATCATGTTGCAGACAGTTGAGAACCCTTGGTTCAAATAATTTTGGT
GTTGAATTTTTATTGACATTTTTGTCAAAGACTGTTTCATCTCCAGCTTATCACTCAGAGAGTGGAGCAGAAATATGTCTTGCAACATATGAAGCGCTAGCCTCTGTTCT
CCAAGTGCTTGTGTCAGAGTTTTCTTCTGAAGCTCTAAGATTTATACGGGATGAGAGTACAATCCTCCATCTAGGAGTAGAAGGAAGACCATTGTTGGACTCTCTTGTTC
TTACTTTTCATCAGCATGTAAATGGTATACTTGATGCGGGAGTATTAGTTCGAACTAGAAGGGCAGTTCTACTTAAGTGGAAGTGGCTTTGCTTAGAGTCTCTTTTATCA
ATTCCCCTTCACGCTCTTCAAAGTGGACTCAATTTAGTGGATAATAACTCCTTTTTGTCAGAGGCAACTCTTCTACAGATATTTAGTGATCTTGTTGAGAGCCTCGAGAA
TGCTGGAGAATGTTCTGTTTTACCCATGCTGAGATTGGTTAGATTGACCTTGTGGCTATTTTGCAAGGGAAAGTCTGGTCTGCTTGTTACATCGTGTAATGGCATGAATG
CAGAGATGATGTGGCGTTTGGTGCATTCCTCTTGGATATTGCACGTCAGCTGCAACAAGCGAAGGGTTGCACATATTGCTGCACTTCTGTCTTCTGTTCTGCATTCTTCC
ACATTTTCTGAAAGGAGTATGCATTTAAGTGAAGGTGGACCAGGACCTCTGAAATGGTTTATAGAAAAAATTCTTGAAGAAGGCACAAAAAGTCCTCGAACATTTCGTCT
GGCTGCATTGCATTTGACCGGCATGTGGCTCAGTCATCCATGGACCATAAAGTATTATGTTAAAGAGCTGAAACTGCTATCACTATATGGTTCTATTGCATTTGATGAAG
ATTTTGAAGCTGAATTAACTGATTATGATGCACGGACTGAAGTATCATTATTGGCAGAAAGTCCAGACCCTGAGCTCACTGAAGTGTTTATCAATACAGAATTGTATGCA
CGTGTATCAGTTGCTGTTCTGTTTCATAAACTAGCTGATTTGGCTGATATGGCGGGATTGTCTAATAAATATGGGAGTTGCTCCGATGCTGTTGAATCTGGAAAACTGTT
TCTGCTTGAGCTCCTTGATTCTGTGGTGTTAGGTTGGAACATGCTATTCACCAAAGGTGTAACAAGTCTCTCTTCACCTAAGAAGAATCCTTCTTGTTTTGATTCTGATG
GAGAATCAGATATTAGTATGAGCAGTGTGGAGTCAAATTTTCATATTCCAGAATTAGAGTTGGTTGATCAGGTTGAAGACAATTCATTCGTTGAAGCTTTTGAAAATCTG
TTTGAAGATAATGAACAAGGCGTGGTGCTACACTGTGACTTTGCTTGGAAGGCGTGGTGCTACATTGCTAGATTATTCGGCATATCGTTTTGCATTCCTAGCAGGGTTGA
TGACTGGCTTGTTGAAGGCCTTAATGCCTGGAACCTCAGAAAAAAGGCTAAGGTGGTTGCGAGCTGCGCTTTTAGGGCCACTCTCTGGAATTTATGGAAAGAAAGAAACA
ATCGTACTTTCGAGGATAAGTCTGTTAGCTTTGATATTTTTTTTATTGAAGTTGAGGAAAACATTTGTGGCTTCATCCAAATAGAGATTGATGTCAAGGATGAAAATATT
AGAAATTTTACCCTCCATTTTGGTGATATAGCTTCTATTGAACTTTCATATGATTTGATGATGGTTATCGATGATTCTAAGGTCCTTTCGATGTATGATAAGAATCTTAA
TTCAAATGGTCTTGATTTGTTAAAAGGCATTTCTTCGATTGGGCATGCAGAAGTTAATTCCCTAAGTTCTAATGAGGGTTCTATGGCGTCTCCTTCACAATTGAAATCAA
GTTTTATAAAAGCCTACCCACATTATTCTCGAAGAAAGGTCCTCCAGCCGTCGTCTTCTTCCACTGCCGTCATCGTCGCCTTCTTCCACCGCCATCACCATCGCCTTCTT
CCACCGCCATCACCGTCTTCGTCCCTCCTTCGTGCTGTCCCAGTATCTTCGACTCGACACAAAGAAACCCTAGCCTCCTTCCTCTTTCACCCTAATCCTAGCCTTCAGTC
GTCGTCTCCTTTCACCACCGTCGTTGTCGCCGTTCCCCTCGTCATCAGCTCGTGCCATTCTTGTCGTTGCCGTCGCCGTCGTGCCATTCTCGTCGTCGTCCAATTGCAAC
CCTAA
mRNA sequence
ATGAGAAATCATTCTGTGGCAGCCATCTTTTGGTGTCGTGTGAATCCCTCTCCCTCTAGTGGGTCTGCCATTACCTCATTGTGGAAGTCTGGGGTTGGTTCCTTAAGACT
TTTGATATTCCTTTTGCCAAGCCTTAGGGGTTGTAGAGATATGATTGAGAATTCCTTCTCCACCCGCCTTTTTATGAGAAGAGGAGGTTTCTGTGGCAAGTCGGTGTGTA
GTTTTGTGGGGCCTTTGAGAAAGGTTTGGGACCATTTTGGTAGGTTGGTTGGTTTGAGCTGGTGTGTTCCTCGGTCGGTGGAAGAATGGATGAGAGGAGCCTTAACGGGA
GGAAGATTGAAAGACAAAATTAAGGTGGTTTTGCTTGGGCATGGATCCAAAGTAATCGTTTGCCGGGATCTTGGAAAGTCTGGTATTGCTCTCTTGATGTTTCTTTTCAA
AAAAAAAAAAGGTCTCAAAACTCTTCCTCTACCGGAAGCCTGTAATCATCCATACTATGGAGTATGGTCAAGGACGAGAGTGGTGTTTTTGCAACAGCTCACATCTTTGG
CCAGGAAGAAATCATTTGGTCGAGTTGGGTTGATCAGCCTATCTGAATGCATTGCTTCAGCTGCTTCAATAGTTGGATTCAATAACGATAGTGAAGGAGAGTGCTTTGAT
GGTTCTTCGCTATCAGCCCAAAGGGATTTGATAACTAATTCTCTAGGGTCCACAATGGAATTGCTGGATGATCTGAGATTTGTGGTTGAGAGCAGCAAACAACACTTCAA
TCCTAGTTATCGCCTTCAAGTTTGTGCAAAAGCTCTGGAGGCTGCTGCTTCTGTCTTGTGTACATCGGACTTGGGTCTTGAGGTTGTTCTGCATTTTATTTCAGCACTAC
CACGAGAGGCTACTGACTATGGAGGTTGCTTAAGGGGGAAAATGCAAAATTGGCTCTCAGGGTGTGGTAAGAAGCGCTGCAGTGGCAGTTGCTGCAGTACTGAGACAAAG
TTTATGAAGAGTCTTATTGAGTTCCCTAAAAGATTTACGAGTCATAATCATTCATCCGATGCTTCTGTTACTTATGACGACGAAGAATTGGAAGCATGGGAATTTGAGGC
AAAACGATGGGCAAGAGTGGTTTTTCTTGCAGTCAAGGAGGAACATCATTTAAGACCTATACTGACGTTTATTCATGACCATGGTGTAAATATTTGCAAGCAGAAGAGTG
ATTTGGAAAATGTACGTGTGAAGTTTCTAATACTTATCATGAGCTTGGTTCAAGAACTTAAATTAGTTCAGGAGAAAATTGCTCTCAGCAACTACAAATGTGAAACCAAG
GATGAGTTTACCTTCTTGCAGCCAAGTGACAATTGGAGTTGTGCAGAACCAACTATTTTTATCCAAAAATTTGCCAACCTTTTTTCGTCTCTACAGGTAGAGTTGGTTTC
TTTTGCTACCATGTCTTGTTCCATATTCTGGTCCAACGTCAAGTTGGATGAGGCAATATTACCAGGTTCTGTGAAAGGGAAACTTGGAGGCCCCAGTCAACGCCGATTAC
CATCCTCTATTGCTACTTTGGTTTTGCTAGCTGTAACATCAATGAAGGCTGTTGCATCTGTCTTATCATGTTGCAGACAGTTGAGAACCCTTGGTTCAAATAATTTTGGT
GTTGAATTTTTATTGACATTTTTGTCAAAGACTGTTTCATCTCCAGCTTATCACTCAGAGAGTGGAGCAGAAATATGTCTTGCAACATATGAAGCGCTAGCCTCTGTTCT
CCAAGTGCTTGTGTCAGAGTTTTCTTCTGAAGCTCTAAGATTTATACGGGATGAGAGTACAATCCTCCATCTAGGAGTAGAAGGAAGACCATTGTTGGACTCTCTTGTTC
TTACTTTTCATCAGCATGTAAATGGTATACTTGATGCGGGAGTATTAGTTCGAACTAGAAGGGCAGTTCTACTTAAGTGGAAGTGGCTTTGCTTAGAGTCTCTTTTATCA
ATTCCCCTTCACGCTCTTCAAAGTGGACTCAATTTAGTGGATAATAACTCCTTTTTGTCAGAGGCAACTCTTCTACAGATATTTAGTGATCTTGTTGAGAGCCTCGAGAA
TGCTGGAGAATGTTCTGTTTTACCCATGCTGAGATTGGTTAGATTGACCTTGTGGCTATTTTGCAAGGGAAAGTCTGGTCTGCTTGTTACATCGTGTAATGGCATGAATG
CAGAGATGATGTGGCGTTTGGTGCATTCCTCTTGGATATTGCACGTCAGCTGCAACAAGCGAAGGGTTGCACATATTGCTGCACTTCTGTCTTCTGTTCTGCATTCTTCC
ACATTTTCTGAAAGGAGTATGCATTTAAGTGAAGGTGGACCAGGACCTCTGAAATGGTTTATAGAAAAAATTCTTGAAGAAGGCACAAAAAGTCCTCGAACATTTCGTCT
GGCTGCATTGCATTTGACCGGCATGTGGCTCAGTCATCCATGGACCATAAAGTATTATGTTAAAGAGCTGAAACTGCTATCACTATATGGTTCTATTGCATTTGATGAAG
ATTTTGAAGCTGAATTAACTGATTATGATGCACGGACTGAAGTATCATTATTGGCAGAAAGTCCAGACCCTGAGCTCACTGAAGTGTTTATCAATACAGAATTGTATGCA
CGTGTATCAGTTGCTGTTCTGTTTCATAAACTAGCTGATTTGGCTGATATGGCGGGATTGTCTAATAAATATGGGAGTTGCTCCGATGCTGTTGAATCTGGAAAACTGTT
TCTGCTTGAGCTCCTTGATTCTGTGGTGTTAGGTTGGAACATGCTATTCACCAAAGGTGTAACAAGTCTCTCTTCACCTAAGAAGAATCCTTCTTGTTTTGATTCTGATG
GAGAATCAGATATTAGTATGAGCAGTGTGGAGTCAAATTTTCATATTCCAGAATTAGAGTTGGTTGATCAGGTTGAAGACAATTCATTCGTTGAAGCTTTTGAAAATCTG
TTTGAAGATAATGAACAAGGCGTGGTGCTACACTGTGACTTTGCTTGGAAGGCGTGGTGCTACATTGCTAGATTATTCGGCATATCGTTTTGCATTCCTAGCAGGGTTGA
TGACTGGCTTGTTGAAGGCCTTAATGCCTGGAACCTCAGAAAAAAGGCTAAGGTGGTTGCGAGCTGCGCTTTTAGGGCCACTCTCTGGAATTTATGGAAAGAAAGAAACA
ATCGTACTTTCGAGGATAAGTCTGTTAGCTTTGATATTTTTTTTATTGAAGTTGAGGAAAACATTTGTGGCTTCATCCAAATAGAGATTGATGTCAAGGATGAAAATATT
AGAAATTTTACCCTCCATTTTGGTGATATAGCTTCTATTGAACTTTCATATGATTTGATGATGGTTATCGATGATTCTAAGGTCCTTTCGATGTATGATAAGAATCTTAA
TTCAAATGGTCTTGATTTGTTAAAAGGCATTTCTTCGATTGGGCATGCAGAAGTTAATTCCCTAAGTTCTAATGAGGGTTCTATGGCGTCTCCTTCACAATTGAAATCAA
GTTTTATAAAAGCCTACCCACATTATTCTCGAAGAAAGGTCCTCCAGCCGTCGTCTTCTTCCACTGCCGTCATCGTCGCCTTCTTCCACCGCCATCACCATCGCCTTCTT
CCACCGCCATCACCGTCTTCGTCCCTCCTTCGTGCTGTCCCAGTATCTTCGACTCGACACAAAGAAACCCTAGCCTCCTTCCTCTTTCACCCTAATCCTAGCCTTCAGTC
GTCGTCTCCTTTCACCACCGTCGTTGTCGCCGTTCCCCTCGTCATCAGCTCGTGCCATTCTTGTCGTTGCCGTCGCCGTCGTGCCATTCTCGTCGTCGTCCAATTGCAAC
CCTAA
Protein sequence
MRNHSVAAIFWCRVNPSPSSGSAITSLWKSGVGSLRLLIFLLPSLRGCRDMIENSFSTRLFMRRGGFCGKSVCSFVGPLRKVWDHFGRLVGLSWCVPRSVEEWMRGALTG
GRLKDKIKVVLLGHGSKVIVCRDLGKSGIALLMFLFKKKKGLKTLPLPEACNHPYYGVWSRTRVVFLQQLTSLARKKSFGRVGLISLSECIASAASIVGFNNDSEGECFD
GSSLSAQRDLITNSLGSTMELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTSDLGLEVVLHFISALPREATDYGGCLRGKMQNWLSGCGKKRCSGSCCSTETK
FMKSLIEFPKRFTSHNHSSDASVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRPILTFIHDHGVNICKQKSDLENVRVKFLILIMSLVQELKLVQEKIALSNYKCETK
DEFTFLQPSDNWSCAEPTIFIQKFANLFSSLQVELVSFATMSCSIFWSNVKLDEAILPGSVKGKLGGPSQRRLPSSIATLVLLAVTSMKAVASVLSCCRQLRTLGSNNFG
VEFLLTFLSKTVSSPAYHSESGAEICLATYEALASVLQVLVSEFSSEALRFIRDESTILHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRTRRAVLLKWKWLCLESLLS
IPLHALQSGLNLVDNNSFLSEATLLQIFSDLVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTSCNGMNAEMMWRLVHSSWILHVSCNKRRVAHIAALLSSVLHSS
TFSERSMHLSEGGPGPLKWFIEKILEEGTKSPRTFRLAALHLTGMWLSHPWTIKYYVKELKLLSLYGSIAFDEDFEAELTDYDARTEVSLLAESPDPELTEVFINTELYA
RVSVAVLFHKLADLADMAGLSNKYGSCSDAVESGKLFLLELLDSVVLGWNMLFTKGVTSLSSPKKNPSCFDSDGESDISMSSVESNFHIPELELVDQVEDNSFVEAFENL
FEDNEQGVVLHCDFAWKAWCYIARLFGISFCIPSRVDDWLVEGLNAWNLRKKAKVVASCAFRATLWNLWKERNNRTFEDKSVSFDIFFIEVEENICGFIQIEIDVKDENI
RNFTLHFGDIASIELSYDLMMVIDDSKVLSMYDKNLNSNGLDLLKGISSIGHAEVNSLSSNEGSMASPSQLKSSFIKAYPHYSRRKVLQPSSSSTAVIVAFFHRHHHRLL
PPPSPSSSLLRAVPVSSTRHKETLASFLFHPNPSLQSSSPFTTVVVAVPLVISSCHSCRCRRRRAILVVVQLQP
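
The mRNA and protein records above can be cross-checked against each other by translating the nucleotide sequence codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code and that the mRNA record starts at the ATG start codon, as it appears to here. The names `CODON_TABLE` and `translate` are illustrative and not part of the database. Only the first 24 bases of the mRNA record are used as input; the full sequence can be pasted in the same way.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a wrapped sequence
    block can be passed in directly. Codons containing characters other
    than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases of the mRNA record above.
print(translate("ATGAGAAATCATTCTGTGGCAGCC"))  # prints MRNHSVAA
```

The output matches the first eight residues of the protein record (MRNHSVAA). Applied to the last 15 bases of the mRNA record (CAATTGCAACCCTAA), the same function returns QLQP, which agrees with the end of the protein record and confirms the terminal TAA stop codon.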