; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032043 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032043
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr11:23162645..23169223
RNA-Seq ExpressionLag0032043
SyntenyLag0032043
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]1.5e-30752.45Show/hide
Query:  KKAPTKVTTSSDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTSKQPKDTLNPDVMSVMMADVDQDERMAEMERKL
        KKA +K + +SD+YTGP+TRSRS+G+ I+      A+A  I K + ES K  + +K NPL+      S +     +PDVMSVMMADV  +  MAEMERK+
Subjt:  KKAPTKVTTSSDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTSKQPKDTLNPDVMSVMMADVDQDERMAEMERKL

Query:  NLLMKAVDERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQ-SQHSASVASLSVQQLQDMIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKL
        NLLMK VDERD EIA LK Q+Q RE AESSQTP    +DK K VV E+Q  Q S SVASLSVQQLQDMI NSIRA YGGP+Q+S +YSKPYTKRIDNL++
Subjt:  NLLMKAVDERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQ-SQHSASVASLSVQQLQDMIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKL

Query:  PTGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRT-----------------
        P GYQPPKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGN F+WYTDLEP++++SWEQ+E+EFLNRFYSTRRT                 
Subjt:  PTGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRT-----------------

Query:  ----------------------------------------GIKPRTFEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNT
                                                GIKPRTFEELATRAHDMELSIASRG KD +VP+++K+KKE+KG EK+ KS +KESMVVNT
Subjt:  ----------------------------------------GIKPRTFEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNT

Query:  TLVKLSTKGKENKTNKRVETGESCRLTFKERQEK--------------------LIELLECKRSEELGKVNDPNYCKYHKVVTHPVEKCFVLKELILKLA
        T +K S KGKE +  K+ +  E  RLT KERQEK                    LI+L ECKR E+ GKV+DPNYCKYH+V++HPVEKCFVLKELIL+LA
Subjt:  TLVKLSTKGKENKTNKRVETGESCRLTFKERQEK--------------------LIELLECKRSEELGKVNDPNYCKYHKVVTHPVEKCFVLKELILKLA

Query:  RKKRIELDLDEVAQANHA--TIMANVGDQIPPISSCDYEER--LIQFGTLDHVVV---------------------------------------------
        R+KRIELDL+EVAQ NHA  TIM+         S   +E+R  L+QFGT + +VV                                             
Subjt:  RKKRIELDLDEVAQANHA--TIMANVGDQIPPISSCDYEER--LIQFGTLDHVVV---------------------------------------------

Query:  --------RCQK-------------------------------YFPRRFLDDSEGETLETVTCHIVDVVEDDDVLASSSGVVAGSGDLSSLSIKDLLSLP
                + QK                               + P+ FL D + E  E V CH ++  E++ +   S      S DLS  +++DLLSLP
Subjt:  --------RCQK-------------------------------YFPRRFLDDSEGETLETVTCHIVDVVEDDDVLASSSGVVAGSGDLSSLSIKDLLSLP

Query:  QEAKNVLINALIESDGTNIPT-SEARTCASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGILMEQLSNSKLVI
        QE K +LI+AL+ S  ++  T +      S CMSI FSDEDLLLGSK HNRPL+VSGY+REQRV RILID+GSAVNIMPKSTM QLGILM++LSNSKLVI
Subjt:  QEAKNVLINALIESDGTNIPT-SEARTCASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGILMEQLSNSKLVI

Query:  QGFNQGGQREIGMIRLELIIEDLKADTLFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKYYLKNVNTGET
        QGFNQG QR IGMIRLELII DLKA  LFHVIDS+TTYKLLLGRPWIHGNGV+TSTLHQCFKFYQDG+KKVEAD+NPFSEAESHFADAK+Y KN N  E 
Subjt:  QGFNQGGQREIGMIRLELIIEDLKADTLFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKYYLKNVNTGET

Query:  IPAEIPLIKKNDKVDLEPQANARKEVVEDVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVGDVEILKENFTTPLT
        +PAE PL K  D   L+  A    E  E   T N  KGE  TS  +   +KDE  +N+P+L+YVPLSRRKKGESPF E  + +KVGD+EI+KE+FTTPLT
Subjt:  IPAEIPLIKKNDKVDLEPQANARKEVVEDVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVGDVEILKENFTTPLT

Query:  KITRQE-------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKEADTN
        KI +QE                                                       KKLL+EG+S+P +RKGLGYKSPEP+RIT++GK K  D N
Subjt:  KITRQE-------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKEADTN

Query:  HITVEEVDDSKEKESVDQRTSVFRRIRPPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGEEESTFSTLNDTRPLAFQRLNMPIAKEENAF
        HIT+EE D++  KE  +QR SVF RIRP VAR +VF+RLS  E E +  Q   S  R SVFRRL+    +EEST   L  TRP AF+RL +   K   A 
Subjt:  HITVEEVDDSKEKESVDQRTSVFRRIRPPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGEEESTFSTLNDTRPLAFQRLNMPIAKEENAF

Query:  LAP
         AP
Subjt:  LAP

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]1.7e-30652.29Show/hide
Query:  KKAPTKVTTSSDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTSKQPKDTLNPDVMSVMMADVDQDERMAEMERKL
        KKA +K + +SD+YTGP+TRSRS+G+ I+      A+A  I K + ES K  + +K NPL+      S +     +PDVMSVMMADV  +  MAEMERK+
Subjt:  KKAPTKVTTSSDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTSKQPKDTLNPDVMSVMMADVDQDERMAEMERKL

Query:  NLLMKAVDERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQ-SQHSASVASLSVQQLQDMIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKL
        NLLMK VDERD EIA LK Q+Q RE AESSQTP    +DK K VV E+Q  Q S SVASLSVQQLQDMI +SIRA YGGP+Q+S +YSKPYTKRIDNL++
Subjt:  NLLMKAVDERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQ-SQHSASVASLSVQQLQDMIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKL

Query:  PTGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRT-----------------
        P GYQPPKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGN F+WYTDLEP++++SWEQ+E+EFLNRFYSTRRT                 
Subjt:  PTGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRT-----------------

Query:  ----------------------------------------GIKPRTFEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNT
                                                GIKPRTFEELATRAHDMELSIASRG KD +VP+++K+KKE+KG EK+ KS  KESMVVNT
Subjt:  ----------------------------------------GIKPRTFEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNT

Query:  TLVKLSTKGKENKTNKRVETGESCRLTFKERQEK--------------------LIELLECKRSEELGKVNDPNYCKYHKVVTHPVEKCFVLKELILKLA
        T +K S KGKE +  K+ +  E  RLT KERQEK                    LI+L ECKR E+ GKV+DPNYCKYH+V++HPVEKCFVLKELIL+LA
Subjt:  TLVKLSTKGKENKTNKRVETGESCRLTFKERQEK--------------------LIELLECKRSEELGKVNDPNYCKYHKVVTHPVEKCFVLKELILKLA

Query:  RKKRIELDLDEVAQANHA--TIMANVGDQIPPISSCDYEER--LIQFGTLDHVVV---------------------------------------------
        R+KRIELDL+EVAQ NHA  TIM+         S   +E+R  L+QFGT + +VV                                             
Subjt:  RKKRIELDLDEVAQANHA--TIMANVGDQIPPISSCDYEER--LIQFGTLDHVVV---------------------------------------------

Query:  --------RCQK-------------------------------YFPRRFLDDSEGETLETVTCHIVDVVEDDDVLASSSGVVAGSGDLSSLSIKDLLSLP
                + QK                               + P+ FL D + E  E V CH ++  E++ +   S      S DLS  +++DLLSLP
Subjt:  --------RCQK-------------------------------YFPRRFLDDSEGETLETVTCHIVDVVEDDDVLASSSGVVAGSGDLSSLSIKDLLSLP

Query:  QEAKNVLINALIESDGTNIPT-SEARTCASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGILMEQLSNSKLVI
        QE K +LI+AL+ S  ++  T +      S CMSI FSDEDLLLGSK HNRPL+VSGY+REQRV RILID+GSAVNIMPKSTM QLGILM++LSNSKLVI
Subjt:  QEAKNVLINALIESDGTNIPT-SEARTCASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGILMEQLSNSKLVI

Query:  QGFNQGGQREIGMIRLELIIEDLKADTLFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKYYLKNVNTGET
        QGFNQG QR IGMIRLELII DLKA  LFHVIDS+TTYKLLLGRPWIHGNGV+TSTLHQCFKFYQDG+KKVEAD+NPFSEAESHFADAK+Y KN N  E 
Subjt:  QGFNQGGQREIGMIRLELIIEDLKADTLFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKYYLKNVNTGET

Query:  IPAEIPLIKKNDKVDLEPQANARKEVVEDVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVGDVEILKENFTTPLT
        +PAE PL K  D   L+  A    E  E   T N  KGE  TS  +   +KDE  +N+P+L+YVPLSRRKKGESPF E  + +KVGD+EI+KE+FTTPLT
Subjt:  IPAEIPLIKKNDKVDLEPQANARKEVVEDVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVGDVEILKENFTTPLT

Query:  KITRQE-------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKEADTN
        KI +QE                                                       KKLL+EG+S+P +RKGLGYKSPEP+RIT++GK K  D N
Subjt:  KITRQE-------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKEADTN

Query:  HITVEEVDDSKEKESVDQRTSVFRRIRPPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGEEESTFSTLNDTRPLAFQRLNMPIAKEENAF
        HIT+EE D++  KE  +QR SVF RIRP VAR +VF+RLS  E E +  Q   +  R SVFRRL+    +EEST   L  TRP AF+RL +   K   A 
Subjt:  HITVEEVDDSKEKESVDQRTSVFRRIRPPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGEEESTFSTLNDTRPLAFQRLNMPIAKEENAF

Query:  LAP
         AP
Subjt:  LAP

XP_031740568.1 uncharacterized protein LOC116403508 [Cucumis sativus]1.3e-30652.29Show/hide
Query:  KKAPTKVTTSSDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTSKQPKDTLNPDVMSVMMADVDQDERMAEMERKL
        KKA +K + +SD+YTGP+TRSRS+G+ I+      A+A  I K + ES K  + +K NPL+      S +     +PDVMSVMMADV  +  MAEMERK+
Subjt:  KKAPTKVTTSSDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTSKQPKDTLNPDVMSVMMADVDQDERMAEMERKL

Query:  NLLMKAVDERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQ-SQHSASVASLSVQQLQDMIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKL
        NLLMK VDERD EIA LK Q+Q RE AESSQTP    +DK K VV E+Q  Q S SVASLSVQQLQDMI +SIRA YGGP+Q+S +YSKPYTKRIDNL++
Subjt:  NLLMKAVDERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQ-SQHSASVASLSVQQLQDMIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKL

Query:  PTGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRT-----------------
        P GYQPPKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGN F+WYTDLEP++++SWEQ+E+EFLNRFYSTRRT                 
Subjt:  PTGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRT-----------------

Query:  ----------------------------------------GIKPRTFEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNT
                                                GIKPRTFEELATRAHDMELSIASRG KD +VP+++K+KKE+KG EK+ KS +KESMVVNT
Subjt:  ----------------------------------------GIKPRTFEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNT

Query:  TLVKLSTKGKENKTNKRVETGESCRLTFKERQEK--------------------LIELLECKRSEELGKVNDPNYCKYHKVVTHPVEKCFVLKELILKLA
        T +K S KGKE +  K+ +  E  RLT KERQEK                    LI+L ECKR E+ GKV+DPNYCKYH+V++HPVEKCFVLKELIL+LA
Subjt:  TLVKLSTKGKENKTNKRVETGESCRLTFKERQEK--------------------LIELLECKRSEELGKVNDPNYCKYHKVVTHPVEKCFVLKELILKLA

Query:  RKKRIELDLDEVAQANHA--TIMANVGDQIPPISSCDYEER--LIQFGTLDHVVV---------------------------------------------
        R+KRIELDL+EVAQ NHA  TIM+         S   +E+R  L+QFGT + +VV                                             
Subjt:  RKKRIELDLDEVAQANHA--TIMANVGDQIPPISSCDYEER--LIQFGTLDHVVV---------------------------------------------

Query:  --------RCQK-------------------------------YFPRRFLDDSEGETLETVTCHIVDVVEDDDVLASSSGVVAGSGDLSSLSIKDLLSLP
                + QK                               + P+ FL D + E  E V CH ++  E++ +   S      S DLS  +++DLLSLP
Subjt:  --------RCQK-------------------------------YFPRRFLDDSEGETLETVTCHIVDVVEDDDVLASSSGVVAGSGDLSSLSIKDLLSLP

Query:  QEAKNVLINALIESDGTNIPT-SEARTCASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGILMEQLSNSKLVI
        QE K +LI+AL+ S  ++  T +      S CMSI FSDEDLLLGSK HNRPL+VSGY+REQRV RILID+GSAVNIMPKSTM QLGILM++LSNSKLVI
Subjt:  QEAKNVLINALIESDGTNIPT-SEARTCASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGILMEQLSNSKLVI

Query:  QGFNQGGQREIGMIRLELIIEDLKADTLFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKYYLKNVNTGET
        QGFNQG QR IGMIRLELII DLKA  LFHVIDS+TTYKLLLGRPWIHGNGV+TSTLHQCFKFYQDG+KKVEAD+NPFSEAESHFADAK+Y KN N  E 
Subjt:  QGFNQGGQREIGMIRLELIIEDLKADTLFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKYYLKNVNTGET

Query:  IPAEIPLIKKNDKVDLEPQANARKEVVEDVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVGDVEILKENFTTPLT
        +PAE PL K  D   L+  A    E  E   T N  KGE  TS  +   +KDE  +N+P+L+YVPLSRRKKGESPF E  + +KVGD+EI+KE+FTTPLT
Subjt:  IPAEIPLIKKNDKVDLEPQANARKEVVEDVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVGDVEILKENFTTPLT

Query:  KITRQE-------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKEADTN
        KI +QE                                                       KKLL+EG+S+P +RKGLGYKSPEP+RIT++GK K  D N
Subjt:  KITRQE-------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKEADTN

Query:  HITVEEVDDSKEKESVDQRTSVFRRIRPPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGEEESTFSTLNDTRPLAFQRLNMPIAKEENAF
        HIT+EE D++  KE  +QR SVF RIRP VAR +VF+RLS  E E +  Q   +  R SVFRRL+    +EEST   L  TRP AF+RL +   K   A 
Subjt:  HITVEEVDDSKEKESVDQRTSVFRRIRPPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGEEESTFSTLNDTRPLAFQRLNMPIAKEENAF

Query:  LAP
         AP
Subjt:  LAP

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]2.9e-30652.29Show/hide
Query:  KKAPTKVTTSSDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTSKQPKDTLNPDVMSVMMADVDQDERMAEMERKL
        KKA +K + +SD+YTGP+TRSRS+G+ I+      A+A  I K + ES K  + +K NPL+      S +     +PDVMSVMMADV  +  MAEMERK+
Subjt:  KKAPTKVTTSSDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTSKQPKDTLNPDVMSVMMADVDQDERMAEMERKL

Query:  NLLMKAVDERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQ-SQHSASVASLSVQQLQDMIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKL
        NLLMK VDERD EIA LK Q+Q RE AESSQTP    +DK K VV E+Q  Q S SVASLSVQQLQDMI +SIRA YGGP+Q+S +YSKPYTKRIDNL++
Subjt:  NLLMKAVDERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQ-SQHSASVASLSVQQLQDMIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKL

Query:  PTGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRT-----------------
        P GYQPPKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGN F+WYTDLEP++++SWEQ+E+EFLNRFYSTRRT                 
Subjt:  PTGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRT-----------------

Query:  ----------------------------------------GIKPRTFEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNT
                                                GIKPRTFEELATRAHDMELSIASRG KD +VP+++K+KKE+KG EK+ KS  KESMVVNT
Subjt:  ----------------------------------------GIKPRTFEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNT

Query:  TLVKLSTKGKENKTNKRVETGESCRLTFKERQEK--------------------LIELLECKRSEELGKVNDPNYCKYHKVVTHPVEKCFVLKELILKLA
        T +K S KGKE +  K+ +  E  RLT KERQEK                    LI+L ECKR E+ GKV+DPNYCKYH+V++HPVEKCFVLKELIL+LA
Subjt:  TLVKLSTKGKENKTNKRVETGESCRLTFKERQEK--------------------LIELLECKRSEELGKVNDPNYCKYHKVVTHPVEKCFVLKELILKLA

Query:  RKKRIELDLDEVAQANHA--TIMANVGDQIPPISSCDYEER--LIQFGTLDHVVV---------------------------------------------
        R+KRIELDL+EVAQ NHA  TIM+         S   +E+R  L+QFGT + +VV                                             
Subjt:  RKKRIELDLDEVAQANHA--TIMANVGDQIPPISSCDYEER--LIQFGTLDHVVV---------------------------------------------

Query:  --------RCQK-------------------------------YFPRRFLDDSEGETLETVTCHIVDVVEDDDVLASSSGVVAGSGDLSSLSIKDLLSLP
                + QK                               + P+ FL D + E  E V CH ++  E++ +   S      S DLS  +++DLLSLP
Subjt:  --------RCQK-------------------------------YFPRRFLDDSEGETLETVTCHIVDVVEDDDVLASSSGVVAGSGDLSSLSIKDLLSLP

Query:  QEAKNVLINALIESDGTNIPT-SEARTCASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGILMEQLSNSKLVI
        QE K +LI+AL+ S  ++  T +      S CMSI FSDEDLLLGSK HNRPL+VSGY+REQRV RILID+GSAVNIMPKSTM QLGILM++LSNSKLVI
Subjt:  QEAKNVLINALIESDGTNIPT-SEARTCASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGILMEQLSNSKLVI

Query:  QGFNQGGQREIGMIRLELIIEDLKADTLFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKYYLKNVNTGET
        QGFNQG QR IGMIRLELII DLKA  LFHVIDS+TTYKLLLGRPWIHGNGV+TSTLHQCFKFYQDG+KKVEAD+NPFSEAESHFADAK+Y KN N  E 
Subjt:  QGFNQGGQREIGMIRLELIIEDLKADTLFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKYYLKNVNTGET

Query:  IPAEIPLIKKNDKVDLEPQANARKEVVEDVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVGDVEILKENFTTPLT
        +PAE PL K  D   L+  A    E  E   T N  KGE  TS  +   +KDE  +N+P+L+YVPLSRRKKGESPF E  + +KVGD+EI+KE+FTTPLT
Subjt:  IPAEIPLIKKNDKVDLEPQANARKEVVEDVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVGDVEILKENFTTPLT

Query:  KITRQE-------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKEADTN
        KI +QE                                                       KKLL+EG+S+P +RKGLGYKSPEP+RIT++GK K  D N
Subjt:  KITRQE-------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKEADTN

Query:  HITVEEVDDSKEKESVDQRTSVFRRIRPPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGEEESTFSTLNDTRPLAFQRLNMPIAKEENAF
        HIT+EE D++  KE  +QR SVF RIRP VAR +VF+RLS  E E +  Q   +  R SVFRRL+    +EEST   L  TRP AF+RL +   K   A 
Subjt:  HITVEEVDDSKEKESVDQRTSVFRRIRPPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGEEESTFSTLNDTRPLAFQRLNMPIAKEENAF

Query:  LAP
         AP
Subjt:  LAP

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]4.5e-28354.04Show/hide
Query:  KKAPTKVTTSSDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTSKQPKDTLNPDVMSVMMADVDQDERMAEMERKL
        KKA +K + +SD+YTGP+TRSRS+G+ I+      A+A  I K + ES K  + +K NPL+      S +     +PDVMSVMMADV  +  MAEMERK+
Subjt:  KKAPTKVTTSSDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTSKQPKDTLNPDVMSVMMADVDQDERMAEMERKL

Query:  NLLMKAVDERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQ-SQHSASVASLSVQQLQDMIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKL
        NLLMK VDERD EIA LK Q+Q RE AESSQTP    +DK K VV E+Q  Q S SVASLSVQQLQDMI NSIRA YGGP+Q+S +YSKPYTKRIDNL++
Subjt:  NLLMKAVDERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQ-SQHSASVASLSVQQLQDMIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKL

Query:  PTGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRT-----------------
        P GYQPPKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGN F+WYTDLEP++++SWEQ+E+EFLNRFYSTRRT                 
Subjt:  PTGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRT-----------------

Query:  ----------------------------------------GIKPRTFEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNT
                                                GIKPRTFEELATRAHDMELSIASRG KD +VP+++K+KKE+KG EK+ KS +KESMVVNT
Subjt:  ----------------------------------------GIKPRTFEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNT

Query:  TLVKLSTKGKENKTNKRVETGESCRLTFKERQEK--------------------LIELLECKRSEELGKVNDPNYCKYHKVVTHPVEKCFVLKELILKLA
        T +K S KGKE +  K+ +  E  RLT KERQEK                    LI+L ECKR E+ GKV+DPNYCKYH+V++HPVEKCFVLKELIL+LA
Subjt:  TLVKLSTKGKENKTNKRVETGESCRLTFKERQEK--------------------LIELLECKRSEELGKVNDPNYCKYHKVVTHPVEKCFVLKELILKLA

Query:  RKKRIELDLDEVAQANHA--TIMANVGDQIPPISSCDYEER--LIQFGTLDHVVVRCQKYF--------PRRFLDDSEG----ETLETVTCHIVDVVEDD
        R+KRIELDL+EVAQ NHA  TIM+         S   +E+R  L+QFGT + +VV+  +           R   +D EG    E  E V CH ++  E++
Subjt:  RKKRIELDLDEVAQANHA--TIMANVGDQIPPISSCDYEER--LIQFGTLDHVVVRCQKYF--------PRRFLDDSEG----ETLETVTCHIVDVVEDD

Query:  DVLASSSGVVAGSGDLSSLSIKDLLSLPQEAKNVLINALIESDGTNIPT-SEARTCASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDG
         +   S      S DLS  +++DLLSLPQE K +LI+AL+ S  ++  T +      S CMSI FSDEDLLLGSK HNRPL+VSGY+REQRV RILID+G
Subjt:  DVLASSSGVVAGSGDLSSLSIKDLLSLPQEAKNVLINALIESDGTNIPT-SEARTCASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDG

Query:  SAVNIMPKSTMKQLGILMEQLSNSKLVIQGFNQGGQREIGMIRLELIIEDLKADTLFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVE
        SAVNIMPKSTM QLGILM++LSNSKLVIQGFNQG QR IGMIRLELII DLKA  LFHVIDS+TTYKLLLGRPWIHGNGV+TSTLHQCFKFYQDG+KKVE
Subjt:  SAVNIMPKSTMKQLGILMEQLSNSKLVIQGFNQGGQREIGMIRLELIIEDLKADTLFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVE

Query:  ADTNPFSEAESHFADAKYYL-KNVNTGETIPAEIPLIKKNDKVDLE--------PQAN-ARKEVVEDVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQ
        AD+NPFSEAESHFADA + L  +V   E       + K+ +  D+E        P    A++EV  D+  +NL +  T      PK  K           
Subjt:  ADTNPFSEAESHFADAKYYL-KNVNTGETIPAEIPLIKKNDKVDLE--------PQAN-ARKEVVEDVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQ

Query:  YVPLSRRKKGESPFTEYSE--SIKVGDVEILKENFTTPLTKITRQEKKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKEADTNHITVEEVDDSKEKES
                K    FT ++E  S+++ D             +++  +KKLL+EG+S+P +RKGLGYKSPEP+RIT++GK K  D NHIT+EE D++  KE 
Subjt:  YVPLSRRKKGESPFTEYSE--SIKVGDVEILKENFTTPLTKITRQEKKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKEADTNHITVEEVDDSKEKES

Query:  VDQRTSVFRRIRPPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGEEESTFSTLNDTRPLAFQRLNMPIAKEENAFLAP
         +QR SVF RIRP VAR +VF+RLS  E E +  Q   +  R SVFRRL+    +EEST   L  TRP AF+RL +   K   A  AP
Subjt:  VDQRTSVFRRIRPPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGEEESTFSTLNDTRPLAFQRLNMPIAKEENAFLAP

TrEMBL top hitse value%identityAlignment
A0A5A7SMM7 Ty3-gypsy retrotransposon protein2.2e-21949.89Show/hide
Query:  KKAPTKVTTSSDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTSKQPKDTLNPDVMSVMMADVDQDERMAEMERKL
        KK  +K + +SD+YTGP               TP  V       + ES K K+                + K   + DVMS+MMAD+  +  + EMERK+
Subjt:  KKAPTKVTTSSDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTSKQPKDTLNPDVMSVMMADVDQDERMAEMERKL

Query:  NLLMKAVDERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQ-SQHSASVASLSVQQLQDMIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKL
        N LMK V+ERD EI  L+ Q+Q RE  ES+QT      DK+K VV E+Q  Q S  VASLSVQQLQDMI NSI A YGG  Q+S +YSKPYTKRIDNL++
Subjt:  NLLMKAVDERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQ-SQHSASVASLSVQQLQDMIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKL

Query:  PTGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRT-----------------
        P GYQPP F QFDGK N KQHIAHFVETCEN G+RGD LVKQ VR+LK N F+WYT+L+P+ +DSWEQ+E++F NRFYSTR T                 
Subjt:  PTGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRT-----------------

Query:  ----------------------------------------GIKPRTFEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNT
                                                GIKPRTFEELATR   MELSI S G KD  VP++RK+KKE KG +KV KS  KESMVVN 
Subjt:  ----------------------------------------GIKPRTFEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNT

Query:  TLVKLSTKGKENKTNKRVETGESCRLTFKERQEK--------------------LIELLECKRSEELGKVNDPNYCKYHKVVTHPVEKCFVLKELILKLA
        T +K S K KE K  K+ +  E   +T KERQEK                    LI+LLECKR E+  K++D NYCKYH V+ HPVEKCFVLKELIL+LA
Subjt:  TLVKLSTKGKENKTNKRVETGESCRLTFKERQEK--------------------LIELLECKRSEELGKVNDPNYCKYHKVVTHPVEKCFVLKELILKLA

Query:  RKKRIELDLDEVAQANHATIMANVGDQIPPISSCDYEERLIQFGTLDHVVVRCQKYFPRRFLDDSEGETLETVTCHIVDVVEDDDV---LASSSGVVAGS
        R+K+IELDL+EVAQ NHA       D+ P +                                         VTCH ++  E++ +   L    GV   S
Subjt:  RKKRIELDLDEVAQANHATIMANVGDQIPPISSCDYEERLIQFGTLDHVVVRCQKYFPRRFLDDSEGETLETVTCHIVDVVEDDDV---LASSSGVVAGS

Query:  GDLSSLSIKDLLSLPQEAKNVLINALIESDGTNIPTSEARTCASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQL
         DLS  ++ DLLSLPQE K +LIN L+ +  ++   +        CMSI F DEDLLLGSK HNRPL+VSG +REQRV RILID+GSAVNIMPKSTM+ L
Subjt:  GDLSSLSIKDLLSLPQEAKNVLINALIESDGTNIPTSEARTCASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQL

Query:  GILMEQLSNSKLVIQGFNQGGQREIGMIRLELIIEDLKADTLFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFA
        GIL+++LSNSKLVIQGFNQG QR IGM+RLELII DLKA  LFHVIDS+TTYKLLLGR WIH NGV+TSTLHQCFKFYQD +K VE D+NPFSE +SHFA
Subjt:  GILMEQLSNSKLVIQGFNQGGQREIGMIRLELIIEDLKADTLFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFA

Query:  DAKYYLKNVNTGETIPAEIPLIKKNDKVDLEPQANARKEVVEDVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVG
        DAK+YLKN N+ E +PA+IP                               GE S S  +   + DEK SN  IL YVPLSR KKGESPF ++ + +KVG
Subjt:  DAKYYLKNVNTGETIPAEIPLIKKNDKVDLEPQANARKEVVEDVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVG

Query:  DVEILKENFTTPLTKITRQEKKLLKEGYSLPTTR
        ++++LKE FTT +TKIT+QE  +     SLP  R
Subjt:  DVEILKENFTTPLTKITRQEKKLLKEGYSLPTTR

A0A5A7TJZ7 Retrotransposon gag protein3.0e-22444.41Show/hide
Query:  KKAPTKVTTSSDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTSKQPKDTLNPDVMSVMMADVDQDERMAEMERKL
        KK  +  + +SD+YTGP+TRSRS+G+   +D     VA  I K + ES K  + +K N L+++    S + K   +PDV                     
Subjt:  KKAPTKVTTSSDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTSKQPKDTLNPDVMSVMMADVDQDERMAEMERKL

Query:  NLLMKAVDERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQSQHSASVASLSVQQLQDMIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKLP
              V+ERD EI  LK Q++ RE AESS+T          VV   D+ +++A+                                           +P
Subjt:  NLLMKAVDERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQSQHSASVASLSVQQLQDMIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKLP

Query:  TGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRR-------------------
        T     K      KG PK     F+ETCENAG+RGD LV++FVR+LKGN F+WYT+LEP+ +DSWE++E+EFLNRFYSTR                    
Subjt:  TGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRR-------------------

Query:  ----TGIKPRTFEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNTTLVKLSTKGKENKTNKRVETGESCRLTFKER--QE
             GIKP TFEELATRAHDMELSIASRG KD  +P++RK+KKE+K  EKV KS  KESMV                  K     +S      E+  ++
Subjt:  ----TGIKPRTFEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNTTLVKLSTKGKENKTNKRVETGESCRLTFKER--QE

Query:  KLIELLECKRSEELGKVNDPNYCKYHKVVTHPVEKCFVLKELILKLARKKRIELDLDEVAQANHATIMANVGDQIPPISSCDYEERLIQFGTLDHVVVRC
        +LI+L ECKR E+ GKV+DPNYCKYH+V++HPVEKCFVLKELIL+LAR+K++EL+L+EVAQ NHA +   + +   P    +  E L+QFGT + VVV+ 
Subjt:  KLIELLECKRSEELGKVNDPNYCKYHKVVTHPVEKCFVLKELILKLARKKRIELDLDEVAQANHATIMANVGDQIPPISSCDYEERLIQFGTLDHVVVRC

Query:  -QKYFP-------RRFLDDSEG----------------------------------------------ETLETVTCHIVDVVEDDDVLASSSGVVAGSGD
         Q+  P       R   +D EG                                              E L  V CH ++  E++ +   S      S D
Subjt:  -QKYFP-------RRFLDDSEG----------------------------------------------ETLETVTCHIVDVVEDDDVLASSSGVVAGSGD

Query:  LSSLSIKDLLSLPQEAKNVLINALIES--DGTNIPTSEARTCASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQL
        LS  ++ DLLSLPQ+ K +LINAL+ S    +++PT+   +    CM I FS+ DLLLGSK HNRPL+VSGY+REQRV RILID+GSA+NIMPKSTM+QL
Subjt:  LSSLSIKDLLSLPQEAKNVLINALIES--DGTNIPTSEARTCASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQL

Query:  GILMEQLSNSKLVIQGFNQGGQREIGMIRLELIIEDLKADTLFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFA
        GIL+++LSNSKLVIQGFNQG +R IGMIRLELII DLKA  LFHVID +TTYKLLL RPWIHGNGV+TS LHQCFKFYQDGIKKVEAD NPFSEAESHFA
Subjt:  GILMEQLSNSKLVIQGFNQGGQREIGMIRLELIIEDLKADTLFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFA

Query:  DAKYYLKNVNTGETIPAEIPLIKKNDKVDLEPQANARKEVVEDVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVG
        DAK+YLKN N+ E +  E+PL                              G+ STS  +   + DEK SN PIL+YVPLSR KKGESPF +  + +KVG
Subjt:  DAKYYLKNVNTGETIPAEIPLIKKNDKVDLEPQANARKEVVEDVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVG

Query:  DVEILKENFTTPLTKITRQE-------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPV
        D+E+LKE+FTTP TKIT+QE                                                       KKLL+EG+++P +RKGLGYKSPEP+
Subjt:  DVEILKENFTTPLTKITRQE-------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPV

Query:  RITRRGKAKEADTNHITVEEVDDSKEKESVDQRTSVFRRIRPPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGEEESTFSTLNDTRPLAF
        RITR+GK K  D NHITV+EVD  KEKE   QRTS F RI P VAR  VF+RLS  E E K  Q T++  R S F+RL+M   +E+        TRP AF
Subjt:  RITRRGKAKEADTNHITVEEVDDSKEKESVDQRTSVFRRIRPPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGEEESTFSTLNDTRPLAF

Query:  QRLNMPIAKEENAFLAPDVTLPSVFQRL
        +RL+M  AK++N         P +F RL
Subjt:  QRLNMPIAKEENAFLAPDVTLPSVFQRL

A0A5A7TZU9 Ribonuclease H2.2e-26745.77Show/hide
Query:  SDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTSKQPKDTL-NPDVMSVMMADVD-QDERMAEMERKLNLLMKAVD
        SD    P TRSRS+ ++  ED  P  VA  I + I +  K  + +K NP  +     S++  + +  P++MSVM+ DVD  ++RMAE+E+K+N+LMKAV+
Subjt:  SDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTSKQPKDTL-NPDVMSVMMADVD-QDERMAEMERKLNLLMKAVD

Query:  ERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQSQHSASVASLSVQQLQDMIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKLPTGYQPPKF
        ERD EIA LKN +++R+ AESS T      +K K ++ E Q Q+S S+ASLSVQQLQ+MI NSI+  YGGP Q+  LYSKPYTKRIDN+++P GYQPPKF
Subjt:  ERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQSQHSASVASLSVQQLQDMIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKLPTGYQPPKF

Query:  QQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRR---------------------------
        QQFDGKGNPKQH+AHF+ETCE AGTRGD LVKQFVRTLKGN FDWYTDLEP+++DSWEQ+ER+FLNRFYSTRR                           
Subjt:  QQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRR---------------------------

Query:  ------------------------------TGIKPRTFEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNTTLVKLSTKG
                                       GIKPRTFEELATRAHDMELSIA+RG  D +VP++RKEKKEVK T+K  K  TKE+MVV+TT +KL +  
Subjt:  ------------------------------TGIKPRTFEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNTTLVKLSTKG

Query:  KENKTNKRVETGESCRLTFKERQEK--------------------LIELLECKRSEELGKVNDPNYCKYHKVVTHPVEKCFVLKELILKLARKKRIELDL
        KE K  KR + GE  R T KERQEK                    LI+L ECKR  E+G+VNDPNYCKYH+V++HPVEKCFVLKELILKLA  K+IEL+L
Subjt:  KENKTNKRVETGESCRLTFKERQEK--------------------LIELLECKRSEELGKVNDPNYCKYHKVVTHPVEKCFVLKELILKLARKKRIELDL

Query:  DEVAQANHATIMANVGDQIPPISSCDYEERLIQFGTLDHVVV----------------------------------------------------------
        D+VAQ NHA ++     ++  I S      LIQFG+L+ VV+                                                          
Subjt:  DEVAQANHATIMANVGDQIPPISSCDYEERLIQFGTLDHVVV----------------------------------------------------------

Query:  -----RCQKYFPRRFL---DDSEGET-----------------LETVTCHIVDVVEDDDVLASSSGVVAGSGDLSSLSIKDLLSLPQEAKNVLINALIES
             R  +  PR+FL   ++SEG +                 +E V+CH     E+D   +++        DL  L I DLL+L +E K+ +I  L   
Subjt:  -----RCQKYFPRRFL---DDSEGET-----------------LETVTCHIVDVVEDDDVLASSSGVVAGSGDLSSLSIKDLLSLPQEAKNVLINALIES

Query:  DGTNIPTSEARTCASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGILMEQLSNSKLVIQGFNQGGQREIGMIR
        D + I TS A T  S CMSI FSDEDLLLGSK HNRPL+VSG++REQ++++ILID+GSAVNI+PKSTM QLGI +E+LSNSKLVIQGFNQG QR IG +R
Subjt:  DGTNIPTSEARTCASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGILMEQLSNSKLVIQGFNQGGQREIGMIR

Query:  LELIIEDLKADTLFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKYYLKNVNTGETIPAEIPLIKKNDKVD
        LE++I DL+A T+FHVIDS+TTYK+LLGRPWIH NG++TSTLHQCFKFY+ GIKKV+AD+ PF++AESHFADAK+Y K+ +  E I  E+P+ K   K +
Subjt:  LELIIEDLKADTLFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKYYLKNVNTGETIPAEIPLIKKNDKVD

Query:  LEPQANARKEVVEDVSTSNLKKGETSTSLVRPKGVK----DEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVGDVEILKENFTTPLTKITRQE-----
         E   + +    + +++    +  T T L  P+  K     ++ SN P+L+Y+PLSRRKKGESPFTE S+++ V + EILKENFT PLTKI + E     
Subjt:  LEPQANARKEVVEDVSTSNLKKGETSTSLVRPKGVK----DEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVGDVEILKENFTTPLTKITRQE-----

Query:  ------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKEADTNHITVEEV
                                                              KKL K+GYS+P +R G+GY+S EPVRIT +GKAK A+T HITVEE 
Subjt:  ------------------------------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKEADTNHITVEEV

Query:  DDSKEKESV-DQRTSVFRRIRPPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGEEESTFSTLNDTRPLAFQRLNMPIAKEENAFLAPDVT
         DS+E + V  QR+SVF RI                           S+ RPSVF+R+S  + ++ +  ST + TR  AFQRLN   AK+  +      T
Subjt:  DDSKEKESV-DQRTSVFRRIRPPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGEEESTFSTLNDTRPLAFQRLNMPIAKEENAFLAPDVT

Query:  LPSVFQRLSVTTRRDKKEQPASLMSHRLLM
          S F+RLSV+  RD+K+   S+ +   L+
Subjt:  LPSVFQRLSVTTRRDKKEQPASLMSHRLLM

A0A5D3BY54 Ty3-gypsy retrotransposon protein1.2e-22547.47Show/hide
Query:  KKAPTKVTTSSDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTSKQPKDTLNPDVMSVMMADVDQDERMAEMERKL
        KKA +  + +S++YTGP+TRSRS+G+   +D     VA  I K + ES K  + +K NPL+++   +S + K   +PDVMSVMMAD+  +  MAEMERK+
Subjt:  KKAPTKVTTSSDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTSKQPKDTLNPDVMSVMMADVDQDERMAEMERKL

Query:  NLLMKAVDERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQ-SQHSASVASLSVQQLQDMIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKL
        N LMKAV+ERD EI  L+ Q++ RE AESSQTP     DK K VV E+Q  Q S SVASLSVQQLQDMI NSIRA YGGP Q++ +YSK YTKRIDNL++
Subjt:  NLLMKAVDERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQ-SQHSASVASLSVQQLQDMIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKL

Query:  PTGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRT-----------------
        P GYQPPKFQQF+GKGNPKQHIAHF ETCENAG+RGD LV+QFV++LKGN F+WYTDLEP+ +DSWEQ+E+EFLNRFYSTRRT                 
Subjt:  PTGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRT-----------------

Query:  ---------GIKPRTFEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNTTLVKLSTKGKENKTNKRVETGESCRLTFKER
                  IKPRTFEELATRAHDMELSIASRG KD  VP +RK+KKE K  EKV KS  KESMVVNTT +K S K KE +  K+ + G    +  +  
Subjt:  ---------GIKPRTFEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNTTLVKLSTKGKENKTNKRVETGESCRLTFKER

Query:  QEKLIELLECKRSEELGKVNDPNYCKYHKVVTHPVEKCFVLKELILKLARKKRIELDLDEVAQANHATIMANVGDQIPPISSCDYEERLIQFGTLDHVVV
        +++LI+L ECKRSE+ G V+DPNYCKYH+V++HPVEK FVLKELIL+LAR+K+I+LDL+E               ++ P  S + +ERLI+       VV
Subjt:  QEKLIELLECKRSEELGKVNDPNYCKYHKVVTHPVEKCFVLKELILKLARKKRIELDLDEVAQANHATIMANVGDQIPPISSCDYEERLIQFGTLDHVVV

Query:  RCQK-------------YFPRRFLDDSEGETLETVTCHIVDVVEDDDVLASSSGVVAGSGDLSSLSIKDLLSLPQEAKNVLINALIESDGTNIPTSEART
          +K             Y   R  + ++    +  T  +  V E D     +  VV  +    +  + D      + +N  + A   S    I T E+  
Subjt:  RCQK-------------YFPRRFLDDSEGETLETVTCHIVDVVEDDDVLASSSGVVAGSGDLSSLSIKDLLSLPQEAKNVLINALIESDGTNIPTSEART

Query:  CASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGILMEQLSNSKLVIQGFNQGGQREIGMIRLELIIEDLKADT
            CMSI FSDEDLLLGSK HNRPL+VSGY++EQRV RIL+D+GSAVNIMPKSTM+QLGILME+LSNSKLVIQGFNQG QR I                
Subjt:  CASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGILMEQLSNSKLVIQGFNQGGQREIGMIRLELIIEDLKADT

Query:  LFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKYYLKNVNTGETIPAEIPLIKKNDKVDLEPQANARKEVV
                                                                   DAK+YLKN  + E +  E+PL+ K D + L+  A+  +E  
Subjt:  LFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKYYLKNVNTGETIPAEIPLIKKNDKVDLEPQANARKEVV

Query:  EDVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVGDVEILKENFTTPLTKITRQE---------------------
        +   T +  K E STS  +   V DEK SN PIL+YVPLSRRKKGESPF E+ + +KVGD+E+LKE+FTTPLTKIT+QE                     
Subjt:  EDVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVGDVEILKENFTTPLTKITRQE---------------------

Query:  ----------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKEADTNHITVEEVDDSKEKESVDQRTSVFRRIR
                                          KKLL+EG+++P +RKGLGYK PEP+RITR+GK K  D+NHITV+EVD  +EKE   QRTS F R+ 
Subjt:  ----------------------------------KKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKEADTNHITVEEVDDSKEKESVDQRTSVFRRIR

Query:  PPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGEEEST
        P VAR  VF+RLS  E E K  Q T+S  R S F+RL+M    E+ +
Subjt:  PPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGEEEST

A0A5D3C8N8 Ribonuclease H3.8e-24352.62Show/hide
Query:  MIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKLPTGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWE
        MI+NSIRA YGGP Q++ +YSKPYTKRIDNL++P GYQP KFQQF GKGNPKQHIAHFVETCENAG+RGD LV QF+R+LKGN F+WY DLEP+ +DSWE
Subjt:  MIMNSIRAHYGGPTQSSLLYSKPYTKRIDNLKLPTGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWE

Query:  QMEREFLNRFYSTRRT---------------------------------------------------------GIKPRTFEELATRAHDMELSIASRGGK
        Q+E++ LN FYS +RT                                                         GIKP TFEELATRAHDMELS ASRG K
Subjt:  QMEREFLNRFYSTRRT---------------------------------------------------------GIKPRTFEELATRAHDMELSIASRGGK

Query:  DSVVPDMRKEKKEVKGTEKVAKSATKESMVVNTTLVKLSTKGKENKTNKRVETGESCRLTFKERQEK--------------------LIELLECKRSEEL
        +  V ++RK+KKE K  EKV KS  KESM+VNTT +K   K KE +  K+ +  E   LT KERQEK                    LI+L +CKR E+ 
Subjt:  DSVVPDMRKEKKEVKGTEKVAKSATKESMVVNTTLVKLSTKGKENKTNKRVETGESCRLTFKERQEK--------------------LIELLECKRSEEL

Query:  GKVNDPNYCKYHKVVTHPVEKCFVLKELILKLARKKRIELDLDEVAQANHATIMANVGDQIPPISSCDYEERLIQFGTLDHVVVRC-QKYFP--------
        GKV+DPNY KYH+V++HP+EKCFVLKELIL+LAR+K+IELDL+EV Q NHA  +  + + + P    +  + L+QF T + VVVR  Q+  P        
Subjt:  GKVNDPNYCKYHKVVTHPVEKCFVLKELILKLARKKRIELDLDEVAQANHATIMANVGDQIPPISSCDYEERLIQFGTLDHVVVRC-QKYFP--------

Query:  -----------------RRFLDDSEGETLETVTCHIVDVVEDDDVLASSSGVVAGSGDLSSLSIKDLLSLPQEAKNVLINALIES--DGTNIPTSEARTC
                         R+FL D + E    V CH ++  E++ +   S      S DLS  ++ DLLSLPQE K +LINAL+ S    ++ PT+   + 
Subjt:  -----------------RRFLDDSEGETLETVTCHIVDVVEDDDVLASSSGVVAGSGDLSSLSIKDLLSLPQEAKNVLINALIES--DGTNIPTSEARTC

Query:  ASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGILMEQLSNSKLVIQGFNQGGQREIGMIRLELIIEDLKADTL
           CMSI FSDEDLLLGSK HNRPL+V GY+REQRV RIL+++G AVNIMPKSTM+QLGILME+L NSKLVIQGFNQG QR IGMIRLELII DLKA  L
Subjt:  ASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQLGILMEQLSNSKLVIQGFNQGGQREIGMIRLELIIEDLKADTL

Query:  FHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKYYLKNVNTGETIPAEIPLIKKNDKVDLEPQANARKEVVE
        FHVIDS+TTYKLLLG PWIHGNGV+TSTLHQCFKFYQDG KKVE D+NPFSEAESHFADAK+YLKN ++ E +  E+ L+ + D   L+  A+ +    +
Subjt:  FHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKYYLKNVNTGETIPAEIPLIKKNDKVDLEPQANARKEVVE

Query:  DVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVGDVEILKENFTTPLTKITRQEKKLLKEGYSLPTTRKGLGYKSP
           T +  K E STS  +   + +E  SN PIL+YVPLSR KKGESPF E  + +KVGD+E+LKE+FT  LTKI                T+KGLGYKSP
Subjt:  DVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVGDVEILKENFTTPLTKITRQEKKLLKEGYSLPTTRKGLGYKSP

Query:  EPVRITRRGKAKEADTNHITVEEVDDSKEKESVDQRTSVFRRIRPPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGEEESTFSTLNDTRP
        EP+RITR+GK K  D+NHITV+EVD  + KE   QRTS F RI P VAR  VF+RLS  E + K  Q T++  R S F+RL++   EE+        T+P
Subjt:  EPVRITRRGKAKEADTNHITVEEVDDSKEKESVDQRTSVFRRIRPPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGEEESTFSTLNDTRP

Query:  LAFQRLNMPIAKEENA
         AF+RL+  I K++NA
Subjt:  LAFQRLNMPIAKEENA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases5.1e-0640.28Show/hide
Query:  TKLVG-NQSTFIKGRQITDAILLANKAVDYWRCSK--KKGFLIKLDIEKAFDKITWDFIVTILHQKEYANKW
        T L+G  Q++FI GR  TD I+   +AV   R  K  K   L+KLD+EKA+D+I WD++   L    +   W
Subjt:  TKLVG-NQSTFIKGRQITDAILLANKAVDYWRCSK--KKGFLIKLDIEKAFDKITWDFIVTILHQKEYANKW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTAAAAAGTCTCAACCACAACAAGACTCCTGGACCAGATGGATTCACCAACGAATTCTTCAAAAATTTATGGAACATTCTCAAGCTTGCTATCATGGAAGTGTTCC
ACAACTTTTCAAGAAATGGGATAGTGAAACAAAATTAGTAGGGAATCAGTCGACCTTTATAAAAGGGAGACAAATCACCGATGCTATTCTTCTTGCCAACAAAGCAGTGG
ATTATTGGAGATGCTCAAAGAAAAAGGGCTTTCTGATAAAACTTGATATCGAGAAAGCTTTTGATAAAATCACTTGGGACTTTATTGTTACTATTTTGCACCAGAAAGAG
TATGCTAACAAATGGACAAATTGGATTATGGCTTGCATTTCTTCAGCAAATTACTTTGTCATCGTTAATGGAAAACCAAAAGGGAAGATTTCAGCAACTCGCGATCCAAC
CAACGTGATAGGACCAAAGAAAGCTCCCACGAAGGTTACCACATCAAGCGACTCTTACACTGGTCCTGTCACGCGTAGTCGCTCCCAAGGAGTTGAGATCAGGGAGGATC
ATACTCCTCTTGCTGTTGCAAGCAGGATCTCAAAGTTGATTGAAGAATCCTCTAAGGATAAGGTTGCAGTCAAACACAACCCACTGTTCGAATCTGTCGTTCCAACATCT
AAGCAGCCAAAGGATACACTAAATCCTGATGTGATGTCCGTCATGATGGCTGATGTAGACCAGGATGAAAGAATGGCAGAGATGGAAAGGAAACTCAATCTCTTAATGAA
GGCAGTTGATGAGAGAGATCTTGAGATTGCCTATTTGAAGAACCAGCTGCAGAACCGAGAAGTGGCTGAGTCTAGCCAGACCCCTGCTGCAGGGAAGAATGATAAACAGA
AGGTTGTCGTGCATGAGGATCAATCGCAACACTCCGCGTCAGTTGCCTCTTTGTCTGTCCAACAACTGCAAGACATGATCATGAATTCCATCAGGGCTCATTATGGTGGA
CCAACTCAAAGTTCCCTCTTGTATTCTAAACCTTATACAAAGAGGATTGACAATCTCAAACTACCCACTGGGTATCAGCCTCCCAAGTTCCAACAGTTTGATGGAAAAGG
CAACCCAAAGCAGCACATCGCTCACTTCGTTGAAACCTGTGAGAACGCTGGTACTCGGGGAGATTTTCTTGTTAAGCAATTTGTCCGAACACTGAAAGGAAACACGTTCG
ATTGGTACACTGACCTGGAGCCTGATACAATGGACAGTTGGGAACAAATGGAGAGAGAGTTTCTAAATCGCTTCTACAGTACGAGGCGAACAGGGATAAAACCCCGGACC
TTTGAAGAATTGGCGACTCGCGCTCATGATATGGAGTTGAGCATCGCTAGTAGAGGGGGCAAAGATTCCGTAGTCCCAGACATGAGGAAAGAAAAGAAGGAAGTCAAGGG
CACTGAGAAAGTTGCAAAGAGTGCCACGAAAGAATCCATGGTCGTCAACACGACCCTTGTGAAGCTTTCCACCAAAGGAAAGGAAAACAAAACAAATAAGCGAGTGGAGA
CAGGCGAGAGTTGTCGCCTAACTTTCAAAGAAAGACAGGAAAAGCTAATTGAGCTCCTAGAGTGCAAACGATCAGAAGAGTTAGGGAAAGTCAATGATCCTAACTATTGC
AAGTATCACAAAGTCGTTACTCACCCGGTAGAGAAGTGTTTTGTATTAAAGGAGCTAATCCTTAAGCTTGCTCGCAAAAAGAGAATTGAGTTGGACCTTGATGAGGTGGC
TCAAGCAAATCACGCAACAATAATGGCGAATGTAGGGGATCAAATTCCCCCAATCTCTTCATGTGATTATGAAGAGAGGCTGATTCAATTCGGGACCCTTGATCACGTAG
TGGTTCGATGCCAAAAATACTTCCCAAGGCGCTTTCTCGATGATAGTGAAGGGGAGACACTTGAAACTGTCACGTGTCACATTGTGGACGTGGTGGAGGATGATGACGTC
CTCGCTAGCTCATCGGGAGTGGTGGCAGGTTCAGGAGACTTATCCTCCTTAAGCATAAAGGACTTACTGTCACTTCCTCAGGAAGCTAAAAACGTCCTTATTAACGCTTT
GATAGAGTCTGATGGTACAAACATCCCAACCTCCGAGGCACGTACATGTGCTTCGTGTTGCATGTCTATCGGATTTTCTGATGAAGATTTGCTGCTAGGGTCAAAGCCTC
ACAATAGACCTTTATTCGTCTCAGGATACATTCGAGAACAGAGGGTTAGTCGGATCCTCATCGATGATGGATCAGCTGTCAATATAATGCCTAAGTCAACCATGAAGCAG
TTAGGCATCTTGATGGAGCAACTATCAAACAGTAAATTGGTGATCCAAGGTTTCAACCAGGGCGGACAACGGGAAATTGGCATGATACGCCTTGAACTCATCATTGAAGA
CTTAAAAGCTGATACTTTGTTCCATGTAATAGATTCTAAGACCACTTATAAGCTGTTGCTAGGTCGTCCTTGGATCCATGGTAATGGAGTCATAACCTCTACATTGCACC
AATGCTTCAAATTCTATCAAGATGGCATCAAGAAAGTTGAGGCTGACACTAACCCATTTTCAGAAGCTGAGTCTCATTTTGCTGATGCAAAGTATTATTTGAAGAATGTA
AACACAGGAGAAACCATACCTGCAGAAATCCCTTTGATAAAAAAGAATGATAAAGTCGATTTAGAGCCACAAGCAAATGCAAGAAAGGAAGTTGTTGAAGATGTGAGTAC
TTCCAACCTGAAAAAGGGTGAAACGTCTACAAGTTTAGTGAGACCTAAGGGTGTAAAGGATGAGAAATGTTCAAATTCACCTATCCTACAATACGTCCCCTTGTCTCGAC
GTAAAAAGGGTGAGTCACCTTTCACTGAATATTCAGAAAGCATAAAGGTCGGTGACGTCGAAATTTTGAAGGAAAATTTCACTACGCCTCTTACAAAGATCACAAGACAA
GAGAAGAAGCTTCTAAAAGAGGGTTATAGTTTGCCAACAACGAGAAAAGGGCTTGGATATAAGTCGCCAGAACCGGTTCGCATAACAAGAAGAGGGAAGGCGAAAGAGGC
AGACACAAATCATATAACAGTAGAGGAGGTTGATGACTCAAAAGAAAAAGAGAGTGTCGACCAACGAACTTCTGTTTTTAGGCGCATCAGGCCACCAGTTGCTCGTGTTT
TAGTCTTTCAGAGATTAAGTGGCAATGAAACGGAGGAGAAAAGCACACAACCTACCAATAGCTCCACTCGACCTTCAGTTTTTCGAAGGTTAAGTATGCCCGTTGGGGAG
GAAGAGAGTACATTTTCAACTTTGAATGACACTCGACCTTTAGCTTTTCAAAGGTTAAATATGCCCATTGCGAAAGAAGAAAATGCATTTTTAGCTCCGGATGTGACTCT
GCCATCGGTTTTTCAGAGGTTAAGTGTTACCACAAGAAGAGACAAAAAAGAACAGCCTGCTTCATTGATGTCTCATCGACTTCTGATGTGGAATTTTCCTTACCTCAAGC
GCACGGGTCAAGTAATAATAAAGTGTTCGTATAAACGAGTGTCGTCCTCTGGATTGGATTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTAAAAAGTCTCAACCACAACAAGACTCCTGGACCAGATGGATTCACCAACGAATTCTTCAAAAATTTATGGAACATTCTCAAGCTTGCTATCATGGAAGTGTTCC
ACAACTTTTCAAGAAATGGGATAGTGAAACAAAATTAGTAGGGAATCAGTCGACCTTTATAAAAGGGAGACAAATCACCGATGCTATTCTTCTTGCCAACAAAGCAGTGG
ATTATTGGAGATGCTCAAAGAAAAAGGGCTTTCTGATAAAACTTGATATCGAGAAAGCTTTTGATAAAATCACTTGGGACTTTATTGTTACTATTTTGCACCAGAAAGAG
TATGCTAACAAATGGACAAATTGGATTATGGCTTGCATTTCTTCAGCAAATTACTTTGTCATCGTTAATGGAAAACCAAAAGGGAAGATTTCAGCAACTCGCGATCCAAC
CAACGTGATAGGACCAAAGAAAGCTCCCACGAAGGTTACCACATCAAGCGACTCTTACACTGGTCCTGTCACGCGTAGTCGCTCCCAAGGAGTTGAGATCAGGGAGGATC
ATACTCCTCTTGCTGTTGCAAGCAGGATCTCAAAGTTGATTGAAGAATCCTCTAAGGATAAGGTTGCAGTCAAACACAACCCACTGTTCGAATCTGTCGTTCCAACATCT
AAGCAGCCAAAGGATACACTAAATCCTGATGTGATGTCCGTCATGATGGCTGATGTAGACCAGGATGAAAGAATGGCAGAGATGGAAAGGAAACTCAATCTCTTAATGAA
GGCAGTTGATGAGAGAGATCTTGAGATTGCCTATTTGAAGAACCAGCTGCAGAACCGAGAAGTGGCTGAGTCTAGCCAGACCCCTGCTGCAGGGAAGAATGATAAACAGA
AGGTTGTCGTGCATGAGGATCAATCGCAACACTCCGCGTCAGTTGCCTCTTTGTCTGTCCAACAACTGCAAGACATGATCATGAATTCCATCAGGGCTCATTATGGTGGA
CCAACTCAAAGTTCCCTCTTGTATTCTAAACCTTATACAAAGAGGATTGACAATCTCAAACTACCCACTGGGTATCAGCCTCCCAAGTTCCAACAGTTTGATGGAAAAGG
CAACCCAAAGCAGCACATCGCTCACTTCGTTGAAACCTGTGAGAACGCTGGTACTCGGGGAGATTTTCTTGTTAAGCAATTTGTCCGAACACTGAAAGGAAACACGTTCG
ATTGGTACACTGACCTGGAGCCTGATACAATGGACAGTTGGGAACAAATGGAGAGAGAGTTTCTAAATCGCTTCTACAGTACGAGGCGAACAGGGATAAAACCCCGGACC
TTTGAAGAATTGGCGACTCGCGCTCATGATATGGAGTTGAGCATCGCTAGTAGAGGGGGCAAAGATTCCGTAGTCCCAGACATGAGGAAAGAAAAGAAGGAAGTCAAGGG
CACTGAGAAAGTTGCAAAGAGTGCCACGAAAGAATCCATGGTCGTCAACACGACCCTTGTGAAGCTTTCCACCAAAGGAAAGGAAAACAAAACAAATAAGCGAGTGGAGA
CAGGCGAGAGTTGTCGCCTAACTTTCAAAGAAAGACAGGAAAAGCTAATTGAGCTCCTAGAGTGCAAACGATCAGAAGAGTTAGGGAAAGTCAATGATCCTAACTATTGC
AAGTATCACAAAGTCGTTACTCACCCGGTAGAGAAGTGTTTTGTATTAAAGGAGCTAATCCTTAAGCTTGCTCGCAAAAAGAGAATTGAGTTGGACCTTGATGAGGTGGC
TCAAGCAAATCACGCAACAATAATGGCGAATGTAGGGGATCAAATTCCCCCAATCTCTTCATGTGATTATGAAGAGAGGCTGATTCAATTCGGGACCCTTGATCACGTAG
TGGTTCGATGCCAAAAATACTTCCCAAGGCGCTTTCTCGATGATAGTGAAGGGGAGACACTTGAAACTGTCACGTGTCACATTGTGGACGTGGTGGAGGATGATGACGTC
CTCGCTAGCTCATCGGGAGTGGTGGCAGGTTCAGGAGACTTATCCTCCTTAAGCATAAAGGACTTACTGTCACTTCCTCAGGAAGCTAAAAACGTCCTTATTAACGCTTT
GATAGAGTCTGATGGTACAAACATCCCAACCTCCGAGGCACGTACATGTGCTTCGTGTTGCATGTCTATCGGATTTTCTGATGAAGATTTGCTGCTAGGGTCAAAGCCTC
ACAATAGACCTTTATTCGTCTCAGGATACATTCGAGAACAGAGGGTTAGTCGGATCCTCATCGATGATGGATCAGCTGTCAATATAATGCCTAAGTCAACCATGAAGCAG
TTAGGCATCTTGATGGAGCAACTATCAAACAGTAAATTGGTGATCCAAGGTTTCAACCAGGGCGGACAACGGGAAATTGGCATGATACGCCTTGAACTCATCATTGAAGA
CTTAAAAGCTGATACTTTGTTCCATGTAATAGATTCTAAGACCACTTATAAGCTGTTGCTAGGTCGTCCTTGGATCCATGGTAATGGAGTCATAACCTCTACATTGCACC
AATGCTTCAAATTCTATCAAGATGGCATCAAGAAAGTTGAGGCTGACACTAACCCATTTTCAGAAGCTGAGTCTCATTTTGCTGATGCAAAGTATTATTTGAAGAATGTA
AACACAGGAGAAACCATACCTGCAGAAATCCCTTTGATAAAAAAGAATGATAAAGTCGATTTAGAGCCACAAGCAAATGCAAGAAAGGAAGTTGTTGAAGATGTGAGTAC
TTCCAACCTGAAAAAGGGTGAAACGTCTACAAGTTTAGTGAGACCTAAGGGTGTAAAGGATGAGAAATGTTCAAATTCACCTATCCTACAATACGTCCCCTTGTCTCGAC
GTAAAAAGGGTGAGTCACCTTTCACTGAATATTCAGAAAGCATAAAGGTCGGTGACGTCGAAATTTTGAAGGAAAATTTCACTACGCCTCTTACAAAGATCACAAGACAA
GAGAAGAAGCTTCTAAAAGAGGGTTATAGTTTGCCAACAACGAGAAAAGGGCTTGGATATAAGTCGCCAGAACCGGTTCGCATAACAAGAAGAGGGAAGGCGAAAGAGGC
AGACACAAATCATATAACAGTAGAGGAGGTTGATGACTCAAAAGAAAAAGAGAGTGTCGACCAACGAACTTCTGTTTTTAGGCGCATCAGGCCACCAGTTGCTCGTGTTT
TAGTCTTTCAGAGATTAAGTGGCAATGAAACGGAGGAGAAAAGCACACAACCTACCAATAGCTCCACTCGACCTTCAGTTTTTCGAAGGTTAAGTATGCCCGTTGGGGAG
GAAGAGAGTACATTTTCAACTTTGAATGACACTCGACCTTTAGCTTTTCAAAGGTTAAATATGCCCATTGCGAAAGAAGAAAATGCATTTTTAGCTCCGGATGTGACTCT
GCCATCGGTTTTTCAGAGGTTAAGTGTTACCACAAGAAGAGACAAAAAAGAACAGCCTGCTTCATTGATGTCTCATCGACTTCTGATGTGGAATTTTCCTTACCTCAAGC
GCACGGGTCAAGTAATAATAAAGTGTTCGTATAAACGAGTGTCGTCCTCTGGATTGGATTTTTAA
Protein sequenceShow/hide protein sequence
MSKKSQPQQDSWTRWIHQRILQKFMEHSQACYHGSVPQLFKKWDSETKLVGNQSTFIKGRQITDAILLANKAVDYWRCSKKKGFLIKLDIEKAFDKITWDFIVTILHQKE
YANKWTNWIMACISSANYFVIVNGKPKGKISATRDPTNVIGPKKAPTKVTTSSDSYTGPVTRSRSQGVEIREDHTPLAVASRISKLIEESSKDKVAVKHNPLFESVVPTS
KQPKDTLNPDVMSVMMADVDQDERMAEMERKLNLLMKAVDERDLEIAYLKNQLQNREVAESSQTPAAGKNDKQKVVVHEDQSQHSASVASLSVQQLQDMIMNSIRAHYGG
PTQSSLLYSKPYTKRIDNLKLPTGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDFLVKQFVRTLKGNTFDWYTDLEPDTMDSWEQMEREFLNRFYSTRRTGIKPRT
FEELATRAHDMELSIASRGGKDSVVPDMRKEKKEVKGTEKVAKSATKESMVVNTTLVKLSTKGKENKTNKRVETGESCRLTFKERQEKLIELLECKRSEELGKVNDPNYC
KYHKVVTHPVEKCFVLKELILKLARKKRIELDLDEVAQANHATIMANVGDQIPPISSCDYEERLIQFGTLDHVVVRCQKYFPRRFLDDSEGETLETVTCHIVDVVEDDDV
LASSSGVVAGSGDLSSLSIKDLLSLPQEAKNVLINALIESDGTNIPTSEARTCASCCMSIGFSDEDLLLGSKPHNRPLFVSGYIREQRVSRILIDDGSAVNIMPKSTMKQ
LGILMEQLSNSKLVIQGFNQGGQREIGMIRLELIIEDLKADTLFHVIDSKTTYKLLLGRPWIHGNGVITSTLHQCFKFYQDGIKKVEADTNPFSEAESHFADAKYYLKNV
NTGETIPAEIPLIKKNDKVDLEPQANARKEVVEDVSTSNLKKGETSTSLVRPKGVKDEKCSNSPILQYVPLSRRKKGESPFTEYSESIKVGDVEILKENFTTPLTKITRQ
EKKLLKEGYSLPTTRKGLGYKSPEPVRITRRGKAKEADTNHITVEEVDDSKEKESVDQRTSVFRRIRPPVARVLVFQRLSGNETEEKSTQPTNSSTRPSVFRRLSMPVGE
EESTFSTLNDTRPLAFQRLNMPIAKEENAFLAPDVTLPSVFQRLSVTTRRDKKEQPASLMSHRLLMWNFPYLKRTGQVIIKCSYKRVSSSGLDF