; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031654 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031654
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr11:11438358..11444712
RNA-Seq ExpressionLag0031654
SyntenyLag0031654
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]4.9e-23250.62Show/hide
Query:  TEESSRENTTSSDSYTGPITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDTLNLDVMSVMMADVDQDERMAEMERK
        +++++ +++ +SD+YTGPIT SRS+GI I+      A+A  I K + ES K  + +K+N L+  +   S +     + DVMSVMMADV  +  MAEMERK
Subjt:  TEESSRENTTSSDSYTGPITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDTLNLDVMSVMMADVDQDERMAEMERK

Query:  LNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQP-QHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLR
        +NLLMK VDERD EIA LK Q+Q RET ESSQTP V  +DKGK V++E+QP Q STSVASLSVQQLQDMI NSIRAQYGGP+Q+S +YSKPYTKR DNLR
Subjt:  LNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQP-QHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLR

Query:  LSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEP
        + +GYQPPKFQQFDGKGNPKQ +AHFVET ENAG+  D LV+QF R+LKGN F+WYT+ EPE+ +SWEQ+E+EFLN FYS RRTVSM ELTNTKQRKGEP
Subjt:  LSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEP

Query:  VIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVN
        VIDYINRWRALSLDCKDRL E+S+VEMC QGMHWGLLYILQGIKPRTFEEL T AHDMELSIASRG+KD +VP++ K+KKE+KG EK+ KS +KESMVVN
Subjt:  VIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVN

Query:  TTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKL
        TTPLKF SKGKE++ +K+ +  ERR LTLKER+E +Y F DSD+ DMLEQLLEKQLI+LP+CKR E+ GKV+DPNYCKYH+V+S+ VEKCFVLKELIL+L
Subjt:  TTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKL

Query:  AREKKIELDLD------------------------------KFGTLDSVVVQFQKEATMKGSQEKNDSIEDENEGF-------------------SDEDF
        AREK+IELDL+                              +FGT + +VVQF +E + +  Q +   IE+++EG+                   S +++
Subjt:  AREKKIELDLD------------------------------KFGTLDSVVVQFQKEATMKGSQEKNDSIEDENEGF-------------------SDEDF

Query:  LLGS---------KPHNRPLFISGYIREQKVSRILIDDGSAVNIMPKSTMKQLGISMGRTIKLTSLDCSLSHLKPPAPPPPLSCAAVLHRSLIARLPLLH
          G+         K H   L  +  +   +  R++    +  + +PKS +          +   +++ +   + PP        +  L R  +  L  L 
Subjt:  LLGS---------KPHNRPLFISGYIREQKVSRILIDDGSAVNIMPKSTMKQLGISMGRTIKLTSLDCSLSHLKPPAPPPPLSCAAVLHRSLIARLPLLH

Query:  IADYRRPHGVARRRPERRATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARVYRTRDRVLLNE-
                      P+   TI+   L +       T T++ +       +  F    LL   +  +   R + VS +         R  R  DR+L++  
Subjt:  IADYRRPHGVARRRPERRATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARVYRTRDRVLLNE-

Query:  ----------PWIFLRIVFRILRS---LQG-----QRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVE
                   W    ++  +  S   +QG     QR IGMIRLELIIGDLKA  LFHVIDS+TTYKLLLGRPWIH NGV+TSTL+QCFKFYQD +KKVE
Subjt:  ----------PWIFLRIVFRILRS---LQG-----QRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVE

Query:  ANTNPFSKAESHFVDAKFYMKSDNTGEIIPAEIPLIKKNDKLHLEPSTDARKGAVEDVKASDLKKGETSTSLVK
        A++NPFS+AESHF DAKFY K++N  E++PAE PL K  D   L+  + A     E  +  +  KGE  TS  K
Subjt:  ANTNPFSKAESHFVDAKFYMKSDNTGEIIPAEIPLIKKNDKLHLEPSTDARKGAVEDVKASDLKKGETSTSLVK

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]2.4e-23150.51Show/hide
Query:  TEESSRENTTSSDSYTGPITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDTLNLDVMSVMMADVDQDERMAEMERK
        +++++ +++ +SD+YTGPIT SRS+GI I+      A+A  I K + ES K  + +K+N L+  +   S +     + DVMSVMMADV  +  MAEMERK
Subjt:  TEESSRENTTSSDSYTGPITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDTLNLDVMSVMMADVDQDERMAEMERK

Query:  LNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQP-QHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLR
        +NLLMK VDERD EIA LK Q+Q RET ESSQTP V  +DKGK V++E+QP Q STSVASLSVQQLQDMI +SIRAQYGGP+Q+S +YSKPYTKR DNLR
Subjt:  LNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQP-QHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLR

Query:  LSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEP
        + +GYQPPKFQQFDGKGNPKQ +AHFVET ENAG+  D LV+QF R+LKGN F+WYT+ EPE+ +SWEQ+E+EFLN FYS RRTVSM ELTNTKQRKGEP
Subjt:  LSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEP

Query:  VIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVN
        VIDYINRWRALSLDCKDRL E+S+VEMC QGMHWGLLYILQGIKPRTFEEL T AHDMELSIASRG+KD +VP++ K+KKE+KG EK+ KS  KESMVVN
Subjt:  VIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVN

Query:  TTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKL
        TTPLKF SKGKE++ +K+ +  ERR LTLKER+E +Y F DSD+ DMLEQLLEKQLI+LP+CKR E+ GKV+DPNYCKYH+V+S+ VEKCFVLKELIL+L
Subjt:  TTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKL

Query:  AREKKIELDLD------------------------------KFGTLDSVVVQFQKEATMKGSQEKNDSIEDENEGF-------------------SDEDF
        AREK+IELDL+                              +FGT + +VVQF +E + +  Q +   IE+++EG+                   S +++
Subjt:  AREKKIELDLD------------------------------KFGTLDSVVVQFQKEATMKGSQEKNDSIEDENEGF-------------------SDEDF

Query:  LLGS---------KPHNRPLFISGYIREQKVSRILIDDGSAVNIMPKSTMKQLGISMGRTIKLTSLDCSLSHLKPPAPPPPLSCAAVLHRSLIARLPLLH
          G+         K H   L  +  +   +  R++    +  + +PKS +          +   +++ +   + PP        +  L R  +  L  L 
Subjt:  LLGS---------KPHNRPLFISGYIREQKVSRILIDDGSAVNIMPKSTMKQLGISMGRTIKLTSLDCSLSHLKPPAPPPPLSCAAVLHRSLIARLPLLH

Query:  IADYRRPHGVARRRPERRATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARVYRTRDRVLLNE-
                      P+   TI+   L +       T T++ +       +  F    LL   +  +   R + VS +         R  R  DR+L++  
Subjt:  IADYRRPHGVARRRPERRATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARVYRTRDRVLLNE-

Query:  ----------PWIFLRIVFRILRS---LQG-----QRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVE
                   W    ++  +  S   +QG     QR IGMIRLELIIGDLKA  LFHVIDS+TTYKLLLGRPWIH NGV+TSTL+QCFKFYQD +KKVE
Subjt:  ----------PWIFLRIVFRILRS---LQG-----QRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVE

Query:  ANTNPFSKAESHFVDAKFYMKSDNTGEIIPAEIPLIKKNDKLHLEPSTDARKGAVEDVKASDLKKGETSTSLVK
        A++NPFS+AESHF DAKFY K++N  E++PAE PL K  D   L+  + A     E  +  +  KGE  TS  K
Subjt:  ANTNPFSKAESHFVDAKFYMKSDNTGEIIPAEIPLIKKNDKLHLEPSTDARKGAVEDVKASDLKKGETSTSLVK

XP_031740568.1 uncharacterized protein LOC116403508 [Cucumis sativus]1.9e-23150.51Show/hide
Query:  TEESSRENTTSSDSYTGPITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDTLNLDVMSVMMADVDQDERMAEMERK
        +++++ +++ +SD+YTGPIT SRS+GI I+      A+A  I K + ES K  + +K+N L+  +   S +     + DVMSVMMADV  +  MAEMERK
Subjt:  TEESSRENTTSSDSYTGPITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDTLNLDVMSVMMADVDQDERMAEMERK

Query:  LNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQP-QHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLR
        +NLLMK VDERD EIA LK Q+Q RET ESSQTP V  +DKGK V++E+QP Q STSVASLSVQQLQDMI +SIRAQYGGP+Q+S +YSKPYTKR DNLR
Subjt:  LNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQP-QHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLR

Query:  LSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEP
        + +GYQPPKFQQFDGKGNPKQ +AHFVET ENAG+  D LV+QF R+LKGN F+WYT+ EPE+ +SWEQ+E+EFLN FYS RRTVSM ELTNTKQRKGEP
Subjt:  LSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEP

Query:  VIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVN
        VIDYINRWRALSLDCKDRL E+S+VEMC QGMHWGLLYILQGIKPRTFEEL T AHDMELSIASRG+KD +VP++ K+KKE+KG EK+ KS +KESMVVN
Subjt:  VIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVN

Query:  TTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKL
        TTPLKF SKGKE++ +K+ +  ERR LTLKER+E +Y F DSD+ DMLEQLLEKQLI+LP+CKR E+ GKV+DPNYCKYH+V+S+ VEKCFVLKELIL+L
Subjt:  TTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKL

Query:  AREKKIELDLD------------------------------KFGTLDSVVVQFQKEATMKGSQEKNDSIEDENEGF-------------------SDEDF
        AREK+IELDL+                              +FGT + +VVQF +E + +  Q +   IE+++EG+                   S +++
Subjt:  AREKKIELDLD------------------------------KFGTLDSVVVQFQKEATMKGSQEKNDSIEDENEGF-------------------SDEDF

Query:  LLGS---------KPHNRPLFISGYIREQKVSRILIDDGSAVNIMPKSTMKQLGISMGRTIKLTSLDCSLSHLKPPAPPPPLSCAAVLHRSLIARLPLLH
          G+         K H   L  +  +   +  R++    +  + +PKS +          +   +++ +   + PP        +  L R  +  L  L 
Subjt:  LLGS---------KPHNRPLFISGYIREQKVSRILIDDGSAVNIMPKSTMKQLGISMGRTIKLTSLDCSLSHLKPPAPPPPLSCAAVLHRSLIARLPLLH

Query:  IADYRRPHGVARRRPERRATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARVYRTRDRVLLNE-
                      P+   TI+   L +       T T++ +       +  F    LL   +  +   R + VS +         R  R  DR+L++  
Subjt:  IADYRRPHGVARRRPERRATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARVYRTRDRVLLNE-

Query:  ----------PWIFLRIVFRILRS---LQG-----QRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVE
                   W    ++  +  S   +QG     QR IGMIRLELIIGDLKA  LFHVIDS+TTYKLLLGRPWIH NGV+TSTL+QCFKFYQD +KKVE
Subjt:  ----------PWIFLRIVFRILRS---LQG-----QRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVE

Query:  ANTNPFSKAESHFVDAKFYMKSDNTGEIIPAEIPLIKKNDKLHLEPSTDARKGAVEDVKASDLKKGETSTSLVK
        A++NPFS+AESHF DAKFY K++N  E++PAE PL K  D   L+  + A     E  +  +  KGE  TS  K
Subjt:  ANTNPFSKAESHFVDAKFYMKSDNTGEIIPAEIPLIKKNDKLHLEPSTDARKGAVEDVKASDLKKGETSTSLVK

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]4.2e-23150.51Show/hide
Query:  TEESSRENTTSSDSYTGPITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDTLNLDVMSVMMADVDQDERMAEMERK
        +++++ +++ +SD+YTGPIT SRS+GI I+      A+A  I K + ES K  + +K+N L+  +   S +     + DVMSVMMADV  +  MAEMERK
Subjt:  TEESSRENTTSSDSYTGPITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDTLNLDVMSVMMADVDQDERMAEMERK

Query:  LNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQP-QHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLR
        +NLLMK VDERD EIA LK Q+Q RET ESSQTP V  +DKGK V++E+QP Q STSVASLSVQQLQDMI +SIRAQYGGP+Q+S +YSKPYTKR DNLR
Subjt:  LNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQP-QHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLR

Query:  LSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEP
        + +GYQPPKFQQFDGKGNPKQ +AHFVET ENAG+  D LV+QF R+LKGN F+WYT+ EPE+ +SWEQ+E+EFLN FYS RRTVSM ELTNTKQRKGEP
Subjt:  LSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEP

Query:  VIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVN
        VIDYINRWRALSLDCKDRL E+S+VEMC QGMHWGLLYILQGIKPRTFEEL T AHDMELSIASRG+KD +VP++ K+KKE+KG EK+ KS  KESMVVN
Subjt:  VIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVN

Query:  TTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKL
        TTPLKF SKGKE++ +K+ +  ERR LTLKER+E +Y F DSD+ DMLEQLLEKQLI+LP+CKR E+ GKV+DPNYCKYH+V+S+ VEKCFVLKELIL+L
Subjt:  TTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKL

Query:  AREKKIELDLD------------------------------KFGTLDSVVVQFQKEATMKGSQEKNDSIEDENEGF-------------------SDEDF
        AREK+IELDL+                              +FGT + +VVQF +E + +  Q +   IE+++EG+                   S +++
Subjt:  AREKKIELDLD------------------------------KFGTLDSVVVQFQKEATMKGSQEKNDSIEDENEGF-------------------SDEDF

Query:  LLGS---------KPHNRPLFISGYIREQKVSRILIDDGSAVNIMPKSTMKQLGISMGRTIKLTSLDCSLSHLKPPAPPPPLSCAAVLHRSLIARLPLLH
          G+         K H   L  +  +   +  R++    +  + +PKS +          +   +++ +   + PP        +  L R  +  L  L 
Subjt:  LLGS---------KPHNRPLFISGYIREQKVSRILIDDGSAVNIMPKSTMKQLGISMGRTIKLTSLDCSLSHLKPPAPPPPLSCAAVLHRSLIARLPLLH

Query:  IADYRRPHGVARRRPERRATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARVYRTRDRVLLNE-
                      P+   TI+   L +       T T++ +       +  F    LL   +  +   R + VS +         R  R  DR+L++  
Subjt:  IADYRRPHGVARRRPERRATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARVYRTRDRVLLNE-

Query:  ----------PWIFLRIVFRILRS---LQG-----QRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVE
                   W    ++  +  S   +QG     QR IGMIRLELIIGDLKA  LFHVIDS+TTYKLLLGRPWIH NGV+TSTL+QCFKFYQD +KKVE
Subjt:  ----------PWIFLRIVFRILRS---LQG-----QRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVE

Query:  ANTNPFSKAESHFVDAKFYMKSDNTGEIIPAEIPLIKKNDKLHLEPSTDARKGAVEDVKASDLKKGETSTSLVK
        A++NPFS+AESHF DAKFY K++N  E++PAE PL K  D   L+  + A     E  +  +  KGE  TS  K
Subjt:  ANTNPFSKAESHFVDAKFYMKSDNTGEIIPAEIPLIKKNDKLHLEPSTDARKGAVEDVKASDLKKGETSTSLVK

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]6.2e-23550.73Show/hide
Query:  TEESSRENTTSSDSYTGPITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDTLNLDVMSVMMADVDQDERMAEMERK
        +++++ +++ +SD+YTGPIT SRS+GI I+      A+A  I K + ES K  + +K+N L+  +   S +     + DVMSVMMADV  +  MAEMERK
Subjt:  TEESSRENTTSSDSYTGPITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDTLNLDVMSVMMADVDQDERMAEMERK

Query:  LNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQP-QHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLR
        +NLLMK VDERD EIA LK Q+Q RET ESSQTP V  +DKGK V++E+QP Q STSVASLSVQQLQDMI NSIRAQYGGP+Q+S +YSKPYTKR DNLR
Subjt:  LNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQP-QHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLR

Query:  LSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEP
        + +GYQPPKFQQFDGKGNPKQ +AHFVET ENAG+  D LV+QF R+LKGN F+WYT+ EPE+ +SWEQ+E+EFLN FYS RRTVSM ELTNTKQRKGEP
Subjt:  LSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEP

Query:  VIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVN
        VIDYINRWRALSLDCKDRL E+S+VEMC QGMHWGLLYILQGIKPRTFEEL T AHDMELSIASRG+KD +VP++ K+KKE+KG EK+ KS +KESMVVN
Subjt:  VIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVN

Query:  TTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKL
        TTPLKF SKGKE++ +K+ +  ERR LTLKER+E +Y F DSD+ DMLEQLLEKQLI+LP+CKR E+ GKV+DPNYCKYH+V+S+ VEKCFVLKELIL+L
Subjt:  TTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKL

Query:  AREKKIELDLD------------------------------KFGTLDSVVVQFQKEATMKGSQEKNDSIEDENEG-------------------------
        AREK+IELDL+                              +FGT + +VVQF +E + +  Q +   IE+++EG                         
Subjt:  AREKKIELDLD------------------------------KFGTLDSVVVQFQKEATMKGSQEKNDSIEDENEG-------------------------

Query:  ------------------------------------------------------------FSDEDFLLGSKPHNRPLFISGYIREQKVSRILIDDGSAVN
                                                                    FSDED LLGSK HNRPL++SGY+REQ+V RILID+GSAVN
Subjt:  ------------------------------------------------------------FSDEDFLLGSKPHNRPLFISGYIREQKVSRILIDDGSAVN

Query:  IMPKSTMKQLGISMGRTIKLTSLDCSLSHLKPPAPPPPLSCAAVLHRSLIARLPLLHIADYRRPHGVARRRPERRATIVFFLLRSVSLPFGLTLTVSLFT
        IMPKSTM QLGI M            LS+ K                                                                     
Subjt:  IMPKSTMKQLGISMGRTIKLTSLDCSLSHLKPPAPPPPLSCAAVLHRSLIARLPLLHIADYRRPHGVARRRPERRATIVFFLLRSVSLPFGLTLTVSLFT

Query:  VAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARVYRTRDRVLLNEPWIFLRIVFRILRSLQGQRVIGMIRLELIIGDLKADTLFHVIDS
                                    +V+  F                                       QR IGMIRLELIIGDLKA  LFHVIDS
Subjt:  VAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARVYRTRDRVLLNEPWIFLRIVFRILRSLQGQRVIGMIRLELIIGDLKADTLFHVIDS

Query:  KTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVEANTNPFSKAESHFVDAKF
        +TTYKLLLGRPWIH NGV+TSTL+QCFKFYQD +KKVEA++NPFS+AESHF DA F
Subjt:  KTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVEANTNPFSKAESHFVDAKF

TrEMBL top hitse value%identityAlignment
A0A5A7SMM7 Ty3-gypsy retrotransposon protein2.1e-19647.43Show/hide
Query:  TEESSRENTTSSDSYTGPITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDTLNLDVMSVMMADVDQDERMAEMERK
        +++++ +++ +SD+YTGP               TP  V       + ES            FKS I + K+     +LDVMS+MMAD+  +  + EMERK
Subjt:  TEESSRENTTSSDSYTGPITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDTLNLDVMSVMMADVDQDERMAEMERK

Query:  LNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQP-QHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLR
        +N LMK V+ERD EI  L+ Q+Q RET ES+QT  V   DK K V++E+QP Q S  VASLSVQQLQDMI NSI AQYGG  Q+S +YSKPYTKR DNLR
Subjt:  LNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQP-QHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLR

Query:  LSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEP
        + +GYQPP F QFDGK N KQ IAHFVET EN G+  D LVKQ  R+LK N F+WYTN +PE  DSWEQ+E++F N FYS R TVSM ELTNTKQRKGEP
Subjt:  LSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEP

Query:  VIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVN
        V +YIN+WRALSLDC+DRL E+S+VEM  QGMHWGLLYILQGIKPRTFEEL T    MELSI S G+KD  VP++ K+KKE KG +KV KS  KESMVVN
Subjt:  VIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVN

Query:  TTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKL
         TPLKF SK KE K  K+ +  ER  +TLKER+E +Y F DSD+ DMLEQLLEKQLI+L +CKR E+  K++D NYCKYH V+ + VEKCFVLKELIL+L
Subjt:  TTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKL

Query:  AREKKIELDLDKFGTL---------------------DSVVVQFQKE-------------------------------ATMKGSQEKNDSIEDE----NE
        AREKKIELDL++                         +S+  +  KE                               +T   S     + E      + 
Subjt:  AREKKIELDLDKFGTL---------------------DSVVVQFQKE-------------------------------ATMKGSQEKNDSIEDE----NE

Query:  GFSDEDFLLGSKPHNRPLFISGYIREQKVSRILIDDGSAVNIMPKSTMKQLGISMGRTIKLTSLDCSLSHLKPPAPPPPLSCAAVLHRSLIARLPLLHIA
         F DED LLGSK HNRPL++SG +REQ+V RILID+GSAVNIMPKSTM+ LGI          L   LS+ K                            
Subjt:  GFSDEDFLLGSKPHNRPLFISGYIREQKVSRILIDDGSAVNIMPKSTMKQLGISMGRTIKLTSLDCSLSHLKPPAPPPPLSCAAVLHRSLIARLPLLHIA

Query:  DYRRPHGVARRRPERRATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARVYRTRDRVLLNEPWI
                                                                             +V+  F                         
Subjt:  DYRRPHGVARRRPERRATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARVYRTRDRVLLNEPWI

Query:  FLRIVFRILRSLQGQRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVEANTNPFSKAESHFVDAKFYMK
                      QRVIGM+RLELIIGDLKA  LFHVIDS+TTYKLLLGR WIH NGV+TSTL+QCFKFYQDD+K VE ++NPFS+ +SHF DAKFY+K
Subjt:  FLRIVFRILRSLQGQRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVEANTNPFSKAESHFVDAKFYMK

Query:  SDNTGEIIPAEIP
        + N+ E +PA+IP
Subjt:  SDNTGEIIPAEIP

A0A5A7TZU9 Ribonuclease H3.4e-21046.63Show/hide
Query:  SSRENTTSSDSYTG--PITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDTL-NLDVMSVMMADVD-QDERMAEMER
        +S+ NT+ + S  G  P T SRS+ I+  ED  PF VA  I + I +  K  + +K+N         S++ N+ +   ++MSVM+ DVD  ++RMAE+E+
Subjt:  SSRENTTSSDSYTG--PITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDTL-NLDVMSVMMADVD-QDERMAEMER

Query:  KLNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQPQHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLR
        K+N+LMKAV+ERD EIA LKN +++R+  ESS T  +   +KGK +M+E QPQ+STS+ASLSVQQLQ+MI NSI+ QYGGP Q+  LYSKPYTKR DN+R
Subjt:  KLNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQPQHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLR

Query:  LSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEP
        +  GYQPPKFQQFDGKGNPKQ +AHF+ET E AGT  DLLVKQF RTLKGN FDWYT+ EPE+ DSWEQ+ER+FLN FYS RR VSM ELT TKQRKGEP
Subjt:  LSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEP

Query:  VIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVN
        VIDYINRWRALSLDCKDRL E+S+VEMC QGMHWGLLYILQGIKPRTFEEL T AHDMELSIA+RG+ D +VP++ KEKKEVK T+K  K  TKE+MVV+
Subjt:  VIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVN

Query:  TTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKL
        TTPLK  S  KE K +K+ + GE+R  TLKER+E +Y F DSD+PDML+QLLEKQLI+LP+CKR  E+G+VNDPNYCKYH+V+S+ VEKCFVLKELILKL
Subjt:  TTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKL

Query:  AREKKIELDLD-------------------------KFGTLDSVVVQFQKEATMK------GSQEKNDSIEDENEGFSDEDFLLGSKPHNRPLFISGYIR
        A +KKIEL+LD                         +FG+L+ VV+    E          G +E+   +++  EG++    L+  +   +  F     +
Subjt:  AREKKIELDLD-------------------------KFGTLDSVVVQFQKEATMK------GSQEKNDSIEDENEGFSDEDFLLGSKPHNRPLFISGYIR

Query:  EQKVSRILIDDGSAVNIMPKSTMKQL--GISMGRTIKLTSLDCSLSHLKPPAPPPPLSCAAVLHRSLIARLPLLHIADYRRP--------HGVARRRPER
        E    R     G +     +   ++    I     +        L    P   P  +        +     P   + +  +P        + +     E 
Subjt:  EQKVSRILIDDGSAVNIMPKSTMKQL--GISMGRTIKLTSLDCSLSHLKPPAPPPPLSCAAVLHRSLIARLPLLHIADYRRP--------HGVARRRPER

Query:  RATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARV-----YRTRDRVLLNEPWIFLRIVFRILR
        + TI+  L           +T    T      + +F    LL   +  +   R + VS F   + L +  +          +  +N+  I +  +     
Subjt:  RATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARV-----YRTRDRVLLNEPWIFLRIVFRILR

Query:  SLQG-----QRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVEANTNPFSKAESHFVDAKFYMKSDNTG
         +QG     QR IG +RLE++IGDL+A T+FHVIDS+TTYK+LLGRPWIH+NG++TSTL+QCFKFY+  IKKV+A++ PF+KAESHF DAKFY KS++  
Subjt:  SLQG-----QRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVEANTNPFSKAESHFVDAKFYMKSDNTG

Query:  EIIPAEIPLIK---KNDKLHLEPSTDARKGAVEDVKASDL--------KKGETSTSLVKPVTPPP
        EII  E+P+ K   KN++  +     ++  A+   +  +L         + E   +L K V+ PP
Subjt:  EIIPAEIPLIK---KNDKLHLEPSTDARKGAVEDVKASDL--------KKGETSTSLVKPVTPPP

A0A5A7VE63 Uncharacterized protein2.9e-18549.25Show/hide
Query:  MREDQPQHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLRLSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFR
        M+E QPQ+STS+ASLSVQQLQ+MI NSI+ QYGGP Q+  LYSKPYTKR DN+R+  GYQ PKFQQFDGKGNPKQ +AHF+ET E AGT  DLLVKQF +
Subjt:  MREDQPQHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLRLSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFR

Query:  TLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEPVIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPR
        TLKGN FDWYT+ EPE+ DSWEQ+ER+FLN FYS RR VSM ELT TKQRKGEPVIDYINRWRALSLDCKDRL E+S+VEMC QGMHWGLLYILQGIKPR
Subjt:  TLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEPVIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPR

Query:  TFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVNTTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPD
        TFEEL T AHDMELSIA+RG+ D +VP++ KEKKEVK T+KV K ATKE+MVV+TTPLK  S  KE K +K  + GE+R  TLKER+E +Y F DSD+PD
Subjt:  TFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVNTTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPD

Query:  MLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKLAREKKIELDLDKFGTLDSVVVQFQKEATMK--------GSQEK---
        ML+QLLEKQLI+LP+CKR  E+G+VNDPNYCKYH+V+S+ VEKCFVLKELILKLA +KKIEL++D     +   V  Q ++ +         GS E    
Subjt:  MLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKLAREKKIELDLDKFGTLDSVVVQFQKEATMK--------GSQEK---

Query:  NDSIED-ENEGF-----SDEDFLLGSKPHNRPLFISGYIREQKVSRILIDDGSAVNIMPKSTMKQLGISMGRTIKLTSLDCSLSHLKPPA----------
          S+ED +N  F      +E+  + +      L      R+Q  S+             KS  +    ++ + + +      LS  + P           
Subjt:  NDSIED-ENEGF-----SDEDFLLGSKPHNRPLFISGYIREQKVSRILIDDGSAVNIMPKSTMKQLGISMGRTIKLTSLDCSLSHLKPPA----------

Query:  PPPPLSCAAVLHRSLIARLPLLHIADYRRP--------HGVARRRPERRATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLV---RLLKSVRRFS
        P   +SC         A  P   + +  +P        + +     E + TI+  L         ++  V+  T+A  ++  +        LL+S     
Subjt:  PPPPLSCAAVLHRSLIARLPLLHIADYRRP--------HGVARRRPERRATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLV---RLLKSVRRFS

Query:  DLQRDVVVSVFRSVECLVK--ARVYRTRDRVLLNEPWIFLRIVFRILRSLQG-----QRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDN
         L     +   +  + L+   + VY    +  +N+  I +  +      +QG     QR IG +RLE++IGDL+A T+FHVIDSKTTYK+LLGRPWIH+N
Subjt:  DLQRDVVVSVFRSVECLVK--ARVYRTRDRVLLNEPWIFLRIVFRILRSLQG-----QRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDN

Query:  GVITSTLYQCFKFYQDDIKKVEANTNPFSKAESHFVDAKFYMKSDNTGEIIPAEIPLIKKNDKLHLEPSTDARKGAVEDVKASDLKKGETSTSLVKPV
        G++TSTL+QCFKFY+  IKKV+A++ PF+KAESHF DAKFY KS++  EII  E+P+ K   K   E  T  +      + +    +  T T L  PV
Subjt:  GVITSTLYQCFKFYQDDIKKVEANTNPFSKAESHFVDAKFYMKSDNTGEIIPAEIPLIKKNDKLHLEPSTDARKGAVEDVKASDLKKGETSTSLVKPV

A0A5D3BIH8 Uncharacterized protein1.1e-18944.04Show/hide
Query:  SSRENTTSSDSYTG--PITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDTL-NLDVMSVMMADVD-QDERMAEMER
        +S+ NT+ + S  G  P T SRS+ I+  ED  PF VA  I K I +  K  + +K+N         S++ N+ +   ++MSVM+ DVD  ++RM  +E+
Subjt:  SSRENTTSSDSYTG--PITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDTL-NLDVMSVMMADVD-QDERMAEMER

Query:  KLNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQPQHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLR
        K+N+ MKAV+ERD EIA LKN +++R+  ESS T  +   +KGK +M+E QPQ+STS+ASLSVQQLQ+MI NSI+ QYGGP Q+                
Subjt:  KLNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQPQHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLR

Query:  LSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEP
               P F     +GNPKQ +AHF+ET E AGT  DLLVKQF RTLKGN FDWYT+ EPE+ DSWEQ+ER+FLN FYS RR VSM ELT TKQRKGEP
Subjt:  LSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEP

Query:  VIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVN
        VIDYINRWRALSLDCKDRL E+S+VEMC QGMHWGLLYILQGIKPRTFEEL T AHDMELSIA+RG+ D +VP++ KEKKEVK T+K  K  TKE+MVV+
Subjt:  VIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVN

Query:  TTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKL
        TTPLK  S  KE K +K+ + GE+R  TLKER+E +Y F DSD+PDML+QLLEKQLI+LP+CKR  E+G+VNDPNYCKYH+V+S+ VEKCFVLKELILKL
Subjt:  TTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKL

Query:  AREKKIELDLD-------------------------KFGTLDSVVVQFQKEATMK------GSQEKNDSIEDENEGFSDEDFLLGSKPHNRPLFISGYIR
        A +KKIEL+LD                         +FG+L+ VV+    E          G +E+   +++  EG++    L+  +   +  F     +
Subjt:  AREKKIELDLD-------------------------KFGTLDSVVVQFQKEATMK------GSQEKNDSIEDENEGFSDEDFLLGSKPHNRPLFISGYIR

Query:  EQKVSRILIDDGSAVNIMPKSTMKQL--GISMGRTIKLTSLDCSLSHLKPPAPPPPLSCAAVLHRSLIARLPLLHIADYRRP--------HGVARRRPER
        E    R     G +     +   ++    I     +        L    P   P  +        +     P   + +  +P        + +     E 
Subjt:  EQKVSRILIDDGSAVNIMPKSTMKQL--GISMGRTIKLTSLDCSLSHLKPPAPPPPLSCAAVLHRSLIARLPLLHIADYRRP--------HGVARRRPER

Query:  RATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARV-----YRTRDRVLLNEPWIFLRIVFRILR
        + TI+  L           +T    T      + +F    LL   +  +   R + VS F   + L +  +          +  +N+  I +  +     
Subjt:  RATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARV-----YRTRDRVLLNEPWIFLRIVFRILR

Query:  SLQG-----QRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVEANTNPFSKAESHFVDAKFYMKSDNTG
         +QG     QR IG +RLE++IGDL+A T+FHVIDS+TTYK+LLGRPWIH+NG++TSTL+QCFKFY+  IKKV+A++ PF+KAESHF DAKFY KS++  
Subjt:  SLQG-----QRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVEANTNPFSKAESHFVDAKFYMKSDNTG

Query:  EIIPAEIPLIK---KNDKLHLEPSTDARKGAVEDVKASDL--------KKGETSTSLVKPVTPPP
        EII  E+P+ K   KN++  +     ++  A+   +  +L         + E   +L K V+ PP
Subjt:  EIIPAEIPLIK---KNDKLHLEPSTDARKGAVEDVKASDL--------KKGETSTSLVKPVTPPP

A0A5D3BPP3 Ty3-gypsy retrotransposon protein2.6e-19446.88Show/hide
Query:  TEESSRENTTSSDSYTGPITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDTLNLDVMSVMMADVDQDERMAEMERK
        +++++ +++ +SD+YTGP               TP  V       + ES            FKS I + K+ +     DVMS+MMAD+  +  + E+E+K
Subjt:  TEESSRENTTSSDSYTGPITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDTLNLDVMSVMMADVDQDERMAEMERK

Query:  LNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQP-QHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLR
        +N LMK V+ERD EI  L+ Q+Q RET ES+QT  V   DK K V++E+QP Q S  VASLSVQQLQDMI NSI AQYGG  Q+S +YSKPYTKR DNLR
Subjt:  LNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQP-QHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSLLYSKPYTKRFDNLR

Query:  LSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEP
        + +GYQPP F QFDGK N KQ IAHFVET EN G+  D LVKQ  R+LK N F+WYTN +PE  DSWEQ+E++F N FYS R TVSM ELTNTKQ+KGEP
Subjt:  LSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQRKGEP

Query:  VIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVN
        V DYIN+WRALSLDC+DRL E+S+VEM  QGMHWGLLYILQGIKPRTFEEL T    MELSI S G+KD  VP++ K+KKE KG +KV KS  KESMVVN
Subjt:  VIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVN

Query:  TTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKL
         TPLKF SK KE K  K+ +  ER  +TLKER+E +Y F DSD+ DMLEQLLEKQLI+LP+CKR E+  K++D NYCKYH V+ + VEKCFVLKELIL+L
Subjt:  TTPLKFSSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKL

Query:  AREKKIELDLDKFGTL---------------------DSVVVQFQKEATMKGSQEK------------------NDSIEDENEG----------------
        AREKKIELDL++                         +S+  +  KE  +     +                  ND +   +                  
Subjt:  AREKKIELDLDKFGTL---------------------DSVVVQFQKEATMKGSQEK------------------NDSIEDENEG----------------

Query:  -FSDEDFLLGSKPHNRPLFISGYIREQKVSRILIDDGSAVNIMPKSTMKQLGISMGRTIKLTSLDCSLSHLKPPAPPPPLSCAAVLHRSLIARLPLLHIA
         F DED LLGSK HNRPL++ G +REQ+V +ILI++GSAVNIMPKSTM+ LGI          L   LS+ K                            
Subjt:  -FSDEDFLLGSKPHNRPLFISGYIREQKVSRILIDDGSAVNIMPKSTMKQLGISMGRTIKLTSLDCSLSHLKPPAPPPPLSCAAVLHRSLIARLPLLHIA

Query:  DYRRPHGVARRRPERRATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARVYRTRDRVLLNEPWI
                                                                             +V+  F                         
Subjt:  DYRRPHGVARRRPERRATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARVYRTRDRVLLNEPWI

Query:  FLRIVFRILRSLQGQRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVEANTNPFSKAESHFVDAKFYMK
                      QRVIGM+RLELIIGDLKA  LFHVIDS+TTYKLLLGRPWIH NGV+TSTL+QCFKFYQDD+K VE ++NPFS+ +SHF DAKFY+K
Subjt:  FLRIVFRILRSLQGQRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVEANTNPFSKAESHFVDAKFYMK

Query:  SDNTGEIIPAEIP
        + N+ E +PA+IP
Subjt:  SDNTGEIIPAEIP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TTGAAGACTCCTTCAAGATTGGAAGACTTCAAGCTCCAAGAATCCATTGAAGCTTCAGACTCGAGATCAAGTGTTCCCGCCCTCGAATCCAACCAACGTGATGGCACCGA
AGAAAGCTCCCGCGAAAATACAACATCGAGCGACTCTTATACGGGTCCTATCACTCATAGTCGTTCCCAAGGAATAGAGATCAAGGAGGATCACACTCCTTTCGCTGTTG
CAAGTAGGATCTCAAAGTTGATTGAAGAATCCTCTAAGGACAAGGTTGCAGTCAAAGACAACCTGTTGTTCAAATCTTTCATTCCAACATCTAAGCAGCCAAATGATACA
CTAAATCTTGATGTGATGTCCGTCATGATGGCTGATGTAGACCAGGATGAAAGAATGGCAGAGATGGAAAGAAAACTCAATCTCTTAATGAAGGCAGTTGATGAAAGAGA
TCTTGAGATTGCCTATTTGAAAAACCAGCTGCAAAACCGAGAAACGCCTGAGTCTAGCCAGACCCCTGCTGTAGGAAAGAATGATAAGGGGAAGACTGTCATGCGTGAGG
ACCAACCACAACACTCCACGTCAGTTGCCTCTTTGTCCGTCCAACAACTACAAGACATGATCATGAATTCAATCAGAGCTCAATATGGTGGACCCACTCAAAGTTCCCTC
TTGTATTCCAAACCTTATACAAAGAGGTTTGACAATCTCAGACTGTCCATTGGGTATCAGCCTCCCAAGTTCCAACAGTTTGACGGAAAGGGCAACCCAAAACAGCAGAT
CGCCCACTTCGTTGAAACCTACGAGAACGCTGGTACTTGGAGAGATTTGCTTGTTAAGCAATTTTTCCGAACACTGAAAGGAAACACGTTCGATTGGTACACTAACTCGG
AGCCTGAGACAAATGACAGTTGGGAGCAGATTGAGAGAGAGTTTCTGAATCATTTCTACAGTATGAGGCGAACGGTTAGTATGACGGAACTCACAAATACAAAGCAACGA
AAGGGTGAGCCAGTCATTGACTACATCAATCGTTGGAGAGCATTGAGTCTTGACTGCAAAGATAGACTCTTTGAAGTGTCCTCTGTTGAGATGTGCATCCAAGGCATGCA
TTGGGGACTTCTTTACATTCTGCAGGGGATAAAACCTCGAACCTTTGAAGAATTAGTAACTCACGCTCATGACATGGAGTTGAGCATCGCTAGTAGAGGGAGCAAAGATC
CCGTAGTCCCAGACATGGGGAAAGAAAAGAAGGAAGTCAAGGGCACTGAGAAAGTTACGAAGAGTGCCACGAAAGAATCCATGGTCGTCAACACGACCCCTTTGAAATTT
TCCTCCAAAGGAAAAGAAAGTAAAACCGATAAGCAAATGGAGACAGGCGAAAGACGTCACCTAACTTTAAAAGAAAGACGGGAAAATATGTATCGGTTTTCAGACTCCGA
TGTCCCAGATATGTTGGAACAACTGTTAGAGAAGCAATTAATTGAACTCCCCGATTGCAAACGACTTGAAGAGTTAGGGAAAGTCAATGATCCTAATTACTGCAAATATC
ACCAAGTTGTTAGCTACACAGTGGAGAAATGTTTTGTATTGAAGGAGCTAATCCTTAAGTTGGCTCGTGAGAAGAAAATTGAGTTGGACCTTGATAAGTTTGGGACCCTT
GATTCCGTAGTGGTTCAATTCCAAAAAGAAGCCACAATGAAGGGATCCCAAGAAAAAAATGATTCCATCGAAGATGAGAACGAAGGATTTTCTGATGAAGATTTTCTGTT
AGGGTCGAAGCCTCACAATAGACCTTTATTCATCTCGGGATACATCCGAGAACAGAAGGTTAGTCGGATCCTCATTGATGATGGATCAGCTGTCAACATAATGCCTAAGT
CAACCATGAAGCAGTTAGGCATCTCGATGGGAAGAACTATCAAACTAACATCTCTCGACTGCTCTCTCTCACATCTGAAACCACCTGCGCCGCCGCCTCCTTTGTCTTGT
GCCGCCGTTCTCCATCGCAGTCTCATCGCACGCCTCCCTCTGTTGCACATCGCCGATTACCGCCGCCCTCACGGAGTTGCCCGCCGTCGCCCAGAACGTCGCGCCACAAT
CGTTTTTTTTCTCCTTCGGTCAGTCTCTCTCCCTTTCGGTCTCACTCTCACAGTCTCCCTCTTTACTGTAGCTCCACGAGCAGCCACTGCCGCCTTCGGGCTCGTTCGCC
TTCTTAAATCTGTTCGTCGCTTTTCAGATCTACAGCGTGATGTGGTGGTTTCAGTGTTCAGAAGTGTTGAATGTCTGGTAAAGGCAAGAGTGTACAGAACGCGTGACAGA
GTGCTGCTGAACGAGCCATGGATTTTCCTTAGGATAGTCTTCCGCATTTTACGGTCGTTACAGGGACAACGAGTGATTGGTATGATACGCCTCGAGCTCATCATTGGAGA
TTTGAAGGCCGATACTTTGTTTCATGTGATAGATTCCAAGACCACTTATAAGCTGCTATTAGGTCGTCCTTGGATCCATGACAATGGAGTGATAACTTCTACATTGTACC
AGTGCTTTAAATTTTATCAAGATGACATTAAGAAAGTTGAAGCTAACACTAACCCATTCTCAAAAGCTGAATCCCATTTCGTTGATGCAAAATTTTATATGAAGAGCGAC
AACACAGGGGAAATTATACCTGCAGAAATCCCCTTGATAAAAAAGAATGATAAGCTCCATTTAGAGCCATCAACTGATGCAAGAAAGGGAGCTGTTGAAGATGTGAAGGC
ATCTGACCTGAAAAAGGGTGAAACATCTACAAGCCTTGTGAAACCTGTCACACCCCCTCCCGGACCACCTGCTAGCTTAGCCCGAAAGACGGCGTGA
mRNA sequenceShow/hide mRNA sequence
TTGAAGACTCCTTCAAGATTGGAAGACTTCAAGCTCCAAGAATCCATTGAAGCTTCAGACTCGAGATCAAGTGTTCCCGCCCTCGAATCCAACCAACGTGATGGCACCGA
AGAAAGCTCCCGCGAAAATACAACATCGAGCGACTCTTATACGGGTCCTATCACTCATAGTCGTTCCCAAGGAATAGAGATCAAGGAGGATCACACTCCTTTCGCTGTTG
CAAGTAGGATCTCAAAGTTGATTGAAGAATCCTCTAAGGACAAGGTTGCAGTCAAAGACAACCTGTTGTTCAAATCTTTCATTCCAACATCTAAGCAGCCAAATGATACA
CTAAATCTTGATGTGATGTCCGTCATGATGGCTGATGTAGACCAGGATGAAAGAATGGCAGAGATGGAAAGAAAACTCAATCTCTTAATGAAGGCAGTTGATGAAAGAGA
TCTTGAGATTGCCTATTTGAAAAACCAGCTGCAAAACCGAGAAACGCCTGAGTCTAGCCAGACCCCTGCTGTAGGAAAGAATGATAAGGGGAAGACTGTCATGCGTGAGG
ACCAACCACAACACTCCACGTCAGTTGCCTCTTTGTCCGTCCAACAACTACAAGACATGATCATGAATTCAATCAGAGCTCAATATGGTGGACCCACTCAAAGTTCCCTC
TTGTATTCCAAACCTTATACAAAGAGGTTTGACAATCTCAGACTGTCCATTGGGTATCAGCCTCCCAAGTTCCAACAGTTTGACGGAAAGGGCAACCCAAAACAGCAGAT
CGCCCACTTCGTTGAAACCTACGAGAACGCTGGTACTTGGAGAGATTTGCTTGTTAAGCAATTTTTCCGAACACTGAAAGGAAACACGTTCGATTGGTACACTAACTCGG
AGCCTGAGACAAATGACAGTTGGGAGCAGATTGAGAGAGAGTTTCTGAATCATTTCTACAGTATGAGGCGAACGGTTAGTATGACGGAACTCACAAATACAAAGCAACGA
AAGGGTGAGCCAGTCATTGACTACATCAATCGTTGGAGAGCATTGAGTCTTGACTGCAAAGATAGACTCTTTGAAGTGTCCTCTGTTGAGATGTGCATCCAAGGCATGCA
TTGGGGACTTCTTTACATTCTGCAGGGGATAAAACCTCGAACCTTTGAAGAATTAGTAACTCACGCTCATGACATGGAGTTGAGCATCGCTAGTAGAGGGAGCAAAGATC
CCGTAGTCCCAGACATGGGGAAAGAAAAGAAGGAAGTCAAGGGCACTGAGAAAGTTACGAAGAGTGCCACGAAAGAATCCATGGTCGTCAACACGACCCCTTTGAAATTT
TCCTCCAAAGGAAAAGAAAGTAAAACCGATAAGCAAATGGAGACAGGCGAAAGACGTCACCTAACTTTAAAAGAAAGACGGGAAAATATGTATCGGTTTTCAGACTCCGA
TGTCCCAGATATGTTGGAACAACTGTTAGAGAAGCAATTAATTGAACTCCCCGATTGCAAACGACTTGAAGAGTTAGGGAAAGTCAATGATCCTAATTACTGCAAATATC
ACCAAGTTGTTAGCTACACAGTGGAGAAATGTTTTGTATTGAAGGAGCTAATCCTTAAGTTGGCTCGTGAGAAGAAAATTGAGTTGGACCTTGATAAGTTTGGGACCCTT
GATTCCGTAGTGGTTCAATTCCAAAAAGAAGCCACAATGAAGGGATCCCAAGAAAAAAATGATTCCATCGAAGATGAGAACGAAGGATTTTCTGATGAAGATTTTCTGTT
AGGGTCGAAGCCTCACAATAGACCTTTATTCATCTCGGGATACATCCGAGAACAGAAGGTTAGTCGGATCCTCATTGATGATGGATCAGCTGTCAACATAATGCCTAAGT
CAACCATGAAGCAGTTAGGCATCTCGATGGGAAGAACTATCAAACTAACATCTCTCGACTGCTCTCTCTCACATCTGAAACCACCTGCGCCGCCGCCTCCTTTGTCTTGT
GCCGCCGTTCTCCATCGCAGTCTCATCGCACGCCTCCCTCTGTTGCACATCGCCGATTACCGCCGCCCTCACGGAGTTGCCCGCCGTCGCCCAGAACGTCGCGCCACAAT
CGTTTTTTTTCTCCTTCGGTCAGTCTCTCTCCCTTTCGGTCTCACTCTCACAGTCTCCCTCTTTACTGTAGCTCCACGAGCAGCCACTGCCGCCTTCGGGCTCGTTCGCC
TTCTTAAATCTGTTCGTCGCTTTTCAGATCTACAGCGTGATGTGGTGGTTTCAGTGTTCAGAAGTGTTGAATGTCTGGTAAAGGCAAGAGTGTACAGAACGCGTGACAGA
GTGCTGCTGAACGAGCCATGGATTTTCCTTAGGATAGTCTTCCGCATTTTACGGTCGTTACAGGGACAACGAGTGATTGGTATGATACGCCTCGAGCTCATCATTGGAGA
TTTGAAGGCCGATACTTTGTTTCATGTGATAGATTCCAAGACCACTTATAAGCTGCTATTAGGTCGTCCTTGGATCCATGACAATGGAGTGATAACTTCTACATTGTACC
AGTGCTTTAAATTTTATCAAGATGACATTAAGAAAGTTGAAGCTAACACTAACCCATTCTCAAAAGCTGAATCCCATTTCGTTGATGCAAAATTTTATATGAAGAGCGAC
AACACAGGGGAAATTATACCTGCAGAAATCCCCTTGATAAAAAAGAATGATAAGCTCCATTTAGAGCCATCAACTGATGCAAGAAAGGGAGCTGTTGAAGATGTGAAGGC
ATCTGACCTGAAAAAGGGTGAAACATCTACAAGCCTTGTGAAACCTGTCACACCCCCTCCCGGACCACCTGCTAGCTTAGCCCGAAAGACGGCGTGA
Protein sequenceShow/hide protein sequence
MKTPSRLEDFKLQESIEASDSRSSVPALESNQRDGTEESSRENTTSSDSYTGPITHSRSQGIEIKEDHTPFAVASRISKLIEESSKDKVAVKDNLLFKSFIPTSKQPNDT
LNLDVMSVMMADVDQDERMAEMERKLNLLMKAVDERDLEIAYLKNQLQNRETPESSQTPAVGKNDKGKTVMREDQPQHSTSVASLSVQQLQDMIMNSIRAQYGGPTQSSL
LYSKPYTKRFDNLRLSIGYQPPKFQQFDGKGNPKQQIAHFVETYENAGTWRDLLVKQFFRTLKGNTFDWYTNSEPETNDSWEQIEREFLNHFYSMRRTVSMTELTNTKQR
KGEPVIDYINRWRALSLDCKDRLFEVSSVEMCIQGMHWGLLYILQGIKPRTFEELVTHAHDMELSIASRGSKDPVVPDMGKEKKEVKGTEKVTKSATKESMVVNTTPLKF
SSKGKESKTDKQMETGERRHLTLKERRENMYRFSDSDVPDMLEQLLEKQLIELPDCKRLEELGKVNDPNYCKYHQVVSYTVEKCFVLKELILKLAREKKIELDLDKFGTL
DSVVVQFQKEATMKGSQEKNDSIEDENEGFSDEDFLLGSKPHNRPLFISGYIREQKVSRILIDDGSAVNIMPKSTMKQLGISMGRTIKLTSLDCSLSHLKPPAPPPPLSC
AAVLHRSLIARLPLLHIADYRRPHGVARRRPERRATIVFFLLRSVSLPFGLTLTVSLFTVAPRAATAAFGLVRLLKSVRRFSDLQRDVVVSVFRSVECLVKARVYRTRDR
VLLNEPWIFLRIVFRILRSLQGQRVIGMIRLELIIGDLKADTLFHVIDSKTTYKLLLGRPWIHDNGVITSTLYQCFKFYQDDIKKVEANTNPFSKAESHFVDAKFYMKSD
NTGEIIPAEIPLIKKNDKLHLEPSTDARKGAVEDVKASDLKKGETSTSLVKPVTPPPGPPASLARKTA