CuGenDBv2

Clc06G11005 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc06G11005
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Ty3/gypsy retrotransposon protein
Genome location: ClcChr06:19411668..19417102
RNA-Seq Expression: Clc06G11005
Synteny: Clc06G11005
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR000477 - Reverse transcriptase domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0050511.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]  e value: 1.3e-174  % identity: 44.44
Query:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG
        ME+LV EML+SGIIR + SPYSS VLLV+K+                                           D + G HQI+M P D+EKTAFRTHEG
Subjt:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG

Query:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-
        H EFLVMPFGLTNAPSTFQALMN+VFKPYLR F+LV   DIL YS+ +EEH+QHL          ELY N  KC F + RI YLGH IS + +EADPEK 
Subjt:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-

Query:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH
                                                                A  +LK AM+ L VL +PDFS  FEIE+DASGFG GAVL Q R 
Subjt:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH

Query:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE
        P+AYFS TLS  DR + VYEREL+ VV+AVQ+WRPYLLG+KF V+TDQ++LKYLLEQ+V+QPQ+Q+   KLL                   LSR+    +
Subjt:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE

Query:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVVS-KSSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ
        L  +T P L+DV ++KEE   DP L EII+ +      + +++L + +LK+KGRLVVS KS+LLP+ILHTYHDSVFGGHSGFLR YKRL GE+YW+GMK+
Subjt:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVVS-KSSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ

Query:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV
        DV +Y  E                LL+PL IP AIW D+SMDFIEGLPKS+G+DVI VVV                    VA+ F+ EVVRLHGYP+SIV
Subjt:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV

Query:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL--------------------------HSGTRGCIGQNI-----------GDQ
        SDRDKVF S FW+ELFRLA                   +   SVE  L                          HS       Q +           GD 
Subjt:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL--------------------------HSGTRGCIGQNI-----------GDQ

Query:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI
         T NS LDQQL+ RD+ LG LKEHL+  QERMKK AD KRREVEF+ GD+V+LKLRP       K R   LS +                       A+I
Subjt:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI

Query:  HP--------------KVMQLELPMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY
        HP              + +Q   P  ++  EW+ +P+EVYGYRKNPST  W  LI WKGL          T     F  F   DKV+LE ES+ RPPI++
Subjt:  HP--------------KVMQLELPMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY

KAA0058186.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]  e value: 1.3e-174  % identity: 44.56
Query:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG
        ME+LV EMLSSGIIR + SPYSS VLLV+KK                                           D + G HQI+M P D+EKTAFRTHEG
Subjt:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG

Query:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-
        H EFLVMPFGLTNAPSTFQALMN+VFKPYLR F+LV   DIL YS+ ++EHVQHL          ELY N  KC F + RI YLGH IS + +EADPEK 
Subjt:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-

Query:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH
                                                                A  +LK AM+ L VL +PDF+  FEIE+DASG G GAVL Q R 
Subjt:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH

Query:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE
        P+AYFS TLS  DR + +YEREL+ VV+AVQ+WRPYLLG+KF V+TDQ++LK+LLEQ+V+QPQ+Q+   KLL                   LSR+   V 
Subjt:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE

Query:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVV-SKSSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ
        L  +T P ++DV +IK+E   DP L E I +L  +   + +++LH+ +LK+KGRLV+ SKS+LLP+ILHTYHDSVFGGHSGFLR YKRL GE+YW+GMK+
Subjt:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVV-SKSSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ

Query:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV
        D+ +Y  E                LL+PL IPYAIW D+SMDFIEGLPKS+G+DVIFVVV                    VA+ FV EVVRLHGYP+SIV
Subjt:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV

Query:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL--------------------------HSGTRGCIGQNI-----------GDQ
        SDRDKVF S FW+ELFRLA                   +   S+E  L                          HS       Q +           GD 
Subjt:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL--------------------------HSGTRGCIGQNI-----------GDQ

Query:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI
         T NS LDQQL+ RD+ LG LKEHL+  QERMKK AD KRREVEF+ GDLV+LKLRP       K R   LS +                     T+A+I
Subjt:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI

Query:  HP--KVMQLEL------------PMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY
        HP   V QL+             P  +   EW+  P+EVY YRKNP+T  W  LI WKGL          T     F  F   DKV+LE ES+ RPPI++
Subjt:  HP--KVMQLEL------------PMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY

TYK06572.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]  e value: 1.6e-174  % identity: 44.44
Query:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG
        ME+LV EML+SGIIR + SPYSS VLLV+K+                                           D + G HQI+M P D+EKTAFRTHEG
Subjt:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG

Query:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-
        H EFLVMPFGLTNAPSTFQALMN+VFKPYLR F+LV   DIL YS+ +EEH QHL          ELY N  KC F + RI YLGH IS + +EADPEK 
Subjt:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-

Query:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH
                                                                A  +LK AM+ L VL +PDFS  FEIE+DASGFG GAVL Q R 
Subjt:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH

Query:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE
        P+AYFS TLS  DR + VYEREL+ VV+AVQ+WRPYLLG+KF V+TDQ++LKYLLEQ+V+QPQ+Q+   KLL                   LSR+    +
Subjt:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE

Query:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVVS-KSSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ
        L  +T P L+DV ++KEE   DP L EII+ +      + +++L + +LK+KGRLVVS KS+LLP+ILHTYHDSVFGGHSGFLR YKRL GE+YW+GMK+
Subjt:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVVS-KSSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ

Query:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV
        DV +Y  E                LL+PL IP AIW D+SMDFIEGLPKS+G+DVI VVV                    VA+ F+ EVVRLHGYP+SIV
Subjt:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV

Query:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL--------------------------HSGTRGCIGQNI-----------GDQ
        SDRDKVF S FW+ELFRLA                   +   SVE  L                          HS       Q +           GD 
Subjt:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL--------------------------HSGTRGCIGQNI-----------GDQ

Query:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI
         T NS LDQQL+ RD+ LG LKEHL+  QERMKK AD KRREVEF+ GD+V+LKLRP       K R   LS +                       A+I
Subjt:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI

Query:  HP--------------KVMQLELPMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY
        HP              + +Q   P  ++  EW+ +P+EVYGYRKNPST  W  LI WKGL          T     F  F   DKV+LE ES+ RPPI++
Subjt:  HP--------------KVMQLELPMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY

TYK23090.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]  e value: 3.7e-174  % identity: 44.33
Query:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG
        ME+LV EMLSSGIIR + SPYSS VLLV+KK                                           D + G HQI+MHP D+EKTAFRTHEG
Subjt:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG

Query:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-
        H EF+VMPFGLTNAPSTFQALMN+VFKP+LR F+LV   DIL YSK +EEH QHL         +ELYAN  KC F + RI YLGH IS K +E DPEK 
Subjt:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-

Query:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH
                                                                A E+LK AM+ L VL +PDF+  FEIE+DASGFG GAVL+Q + 
Subjt:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH

Query:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE
        P+AYFS  LST DR + VYERELM VV AVQ+WRPYLLG+KF V+TDQ++LK+LLEQ+VIQPQ+Q+   KLL                   LSR+     
Subjt:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE

Query:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVVSK-SSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ
        L  LT P LLDV VI++EV  DP L EI+  +      + +++ H+ +LK+KGRLV+SK S+L+P+I+HTYHDSVFGGHSGFLR YKR+ GELYW+GMK+
Subjt:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVVSK-SSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ

Query:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV
        DV+KY  E                LLLPL IP AIW D+SMDFIEGLPKS G++VI VVV                    VA+VFV EVVRLHG+PKSIV
Subjt:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV

Query:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL----------------------HSGTRGCIG---------------QNIGDQ
        SDRDK+F S FW E+F+LA                   +   SVE  L                      ++   G IG                  G+ 
Subjt:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL----------------------HSGTRGCIG---------------QNIGDQ

Query:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI
         T NS LDQQLR RDV+LG LKEHLR  QERMKKFAD KRR+VEF+ GD V+LKLRP       K R   LS +                     ++A+I
Subjt:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI

Query:  HP--------------KVMQLELPMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY
        HP              K +Q   P  ++  EW+ +P+E+Y YRKN +T  W  L+ WKGL                F  F   DKV+LE ES+ RPPI++
Subjt:  HP--------------KVMQLELPMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY

TYK24654.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]  e value: 1.6e-174  % identity: 44.44
Query:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG
        ME+LV EML+SGIIR + SPYSS VLLV+K+                                           D + G HQI+M P D+EKTAFRTHEG
Subjt:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG

Query:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-
        H EFLVMPFGLTNAPSTFQALMN+VFKPYLR F+LV   DIL YS+ +EEH QHL          ELY N  KC F + RI YLGH IS + +EADPEK 
Subjt:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-

Query:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH
                                                                A  +LK AM+ L VL +PDFS  FEIE+DASGFG GAVL Q R 
Subjt:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH

Query:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE
        P+AYFS TLS  DR + VYEREL+ VV+AVQ+WRPYLLG+KF V+TDQ++LKYLLEQ+V+QPQ+Q+   KLL                   LSR+    +
Subjt:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE

Query:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVVS-KSSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ
        L  +T P L+DV ++KEE   DP L EII+ +      + +++L + +LK+KGRLVVS KS+LLP+ILHTYHDSVFGGHSGFLR YKRL GE+YW+GMK+
Subjt:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVVS-KSSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ

Query:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV
        DV +Y  E                LL+PL IP AIW D+SMDFIEGLPKS+G+DVI VVV                    VA+ F+ EVVRLHGYP+SIV
Subjt:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV

Query:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL--------------------------HSGTRGCIGQNI-----------GDQ
        SDRDKVF S FW+ELFRLA                   +   SVE  L                          HS       Q +           GD 
Subjt:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL--------------------------HSGTRGCIGQNI-----------GDQ

Query:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI
         T NS LDQQL+ RD+ LG LKEHL+  QERMKK AD KRREVEF+ GD+V+LKLRP       K R   LS +                       A+I
Subjt:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI

Query:  HP--------------KVMQLELPMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY
        HP              + +Q   P  ++  EW+ +P+EVYGYRKNPST  W  LI WKGL          T     F  F   DKV+LE ES+ RPPI++
Subjt:  HP--------------KVMQLELPMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UAE4 Ty3/gypsy retrotransposon protein  e value: 6.1e-175  % identity: 44.44
Query:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG
        ME+LV EML+SGIIR + SPYSS VLLV+K+                                           D + G HQI+M P D+EKTAFRTHEG
Subjt:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG

Query:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-
        H EFLVMPFGLTNAPSTFQALMN+VFKPYLR F+LV   DIL YS+ +EEH+QHL          ELY N  KC F + RI YLGH IS + +EADPEK 
Subjt:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-

Query:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH
                                                                A  +LK AM+ L VL +PDFS  FEIE+DASGFG GAVL Q R 
Subjt:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH

Query:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE
        P+AYFS TLS  DR + VYEREL+ VV+AVQ+WRPYLLG+KF V+TDQ++LKYLLEQ+V+QPQ+Q+   KLL                   LSR+    +
Subjt:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE

Query:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVVS-KSSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ
        L  +T P L+DV ++KEE   DP L EII+ +      + +++L + +LK+KGRLVVS KS+LLP+ILHTYHDSVFGGHSGFLR YKRL GE+YW+GMK+
Subjt:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVVS-KSSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ

Query:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV
        DV +Y  E                LL+PL IP AIW D+SMDFIEGLPKS+G+DVI VVV                    VA+ F+ EVVRLHGYP+SIV
Subjt:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV

Query:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL--------------------------HSGTRGCIGQNI-----------GDQ
        SDRDKVF S FW+ELFRLA                   +   SVE  L                          HS       Q +           GD 
Subjt:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL--------------------------HSGTRGCIGQNI-----------GDQ

Query:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI
         T NS LDQQL+ RD+ LG LKEHL+  QERMKK AD KRREVEF+ GD+V+LKLRP       K R   LS +                       A+I
Subjt:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI

Query:  HP--------------KVMQLELPMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY
        HP              + +Q   P  ++  EW+ +P+EVYGYRKNPST  W  LI WKGL          T     F  F   DKV+LE ES+ RPPI++
Subjt:  HP--------------KVMQLELPMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY

A0A5A7UVI2 Ty3/gypsy retrotransposon protein  e value: 6.1e-175  % identity: 44.56
Query:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG
        ME+LV EMLSSGIIR + SPYSS VLLV+KK                                           D + G HQI+M P D+EKTAFRTHEG
Subjt:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG

Query:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-
        H EFLVMPFGLTNAPSTFQALMN+VFKPYLR F+LV   DIL YS+ ++EHVQHL          ELY N  KC F + RI YLGH IS + +EADPEK 
Subjt:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-

Query:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH
                                                                A  +LK AM+ L VL +PDF+  FEIE+DASG G GAVL Q R 
Subjt:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH

Query:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE
        P+AYFS TLS  DR + +YEREL+ VV+AVQ+WRPYLLG+KF V+TDQ++LK+LLEQ+V+QPQ+Q+   KLL                   LSR+   V 
Subjt:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE

Query:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVV-SKSSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ
        L  +T P ++DV +IK+E   DP L E I +L  +   + +++LH+ +LK+KGRLV+ SKS+LLP+ILHTYHDSVFGGHSGFLR YKRL GE+YW+GMK+
Subjt:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVV-SKSSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ

Query:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV
        D+ +Y  E                LL+PL IPYAIW D+SMDFIEGLPKS+G+DVIFVVV                    VA+ FV EVVRLHGYP+SIV
Subjt:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV

Query:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL--------------------------HSGTRGCIGQNI-----------GDQ
        SDRDKVF S FW+ELFRLA                   +   S+E  L                          HS       Q +           GD 
Subjt:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL--------------------------HSGTRGCIGQNI-----------GDQ

Query:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI
         T NS LDQQL+ RD+ LG LKEHL+  QERMKK AD KRREVEF+ GDLV+LKLRP       K R   LS +                     T+A+I
Subjt:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI

Query:  HP--KVMQLEL------------PMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY
        HP   V QL+             P  +   EW+  P+EVY YRKNP+T  W  LI WKGL          T     F  F   DKV+LE ES+ RPPI++
Subjt:  HP--KVMQLEL------------PMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY

A0A5D3C5N7 Ty3/gypsy retrotransposon protein  e value: 8.0e-175  % identity: 44.44
Query:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG
        ME+LV EML+SGIIR + SPYSS VLLV+K+                                           D + G HQI+M P D+EKTAFRTHEG
Subjt:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG

Query:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-
        H EFLVMPFGLTNAPSTFQALMN+VFKPYLR F+LV   DIL YS+ +EEH QHL          ELY N  KC F + RI YLGH IS + +EADPEK 
Subjt:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-

Query:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH
                                                                A  +LK AM+ L VL +PDFS  FEIE+DASGFG GAVL Q R 
Subjt:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH

Query:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE
        P+AYFS TLS  DR + VYEREL+ VV+AVQ+WRPYLLG+KF V+TDQ++LKYLLEQ+V+QPQ+Q+   KLL                   LSR+    +
Subjt:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE

Query:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVVS-KSSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ
        L  +T P L+DV ++KEE   DP L EII+ +      + +++L + +LK+KGRLVVS KS+LLP+ILHTYHDSVFGGHSGFLR YKRL GE+YW+GMK+
Subjt:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVVS-KSSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ

Query:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV
        DV +Y  E                LL+PL IP AIW D+SMDFIEGLPKS+G+DVI VVV                    VA+ F+ EVVRLHGYP+SIV
Subjt:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV

Query:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL--------------------------HSGTRGCIGQNI-----------GDQ
        SDRDKVF S FW+ELFRLA                   +   SVE  L                          HS       Q +           GD 
Subjt:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL--------------------------HSGTRGCIGQNI-----------GDQ

Query:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI
         T NS LDQQL+ RD+ LG LKEHL+  QERMKK AD KRREVEF+ GD+V+LKLRP       K R   LS +                       A+I
Subjt:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI

Query:  HP--------------KVMQLELPMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY
        HP              + +Q   P  ++  EW+ +P+EVYGYRKNPST  W  LI WKGL          T     F  F   DKV+LE ES+ RPPI++
Subjt:  HP--------------KVMQLELPMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY

A0A5D3DI73 Ty3/gypsy retrotransposon protein  e value: 1.8e-174  % identity: 44.33
Query:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG
        ME+LV EMLSSGIIR + SPYSS VLLV+KK                                           D + G HQI+MHP D+EKTAFRTHEG
Subjt:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG

Query:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-
        H EF+VMPFGLTNAPSTFQALMN+VFKP+LR F+LV   DIL YSK +EEH QHL         +ELYAN  KC F + RI YLGH IS K +E DPEK 
Subjt:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-

Query:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH
                                                                A E+LK AM+ L VL +PDF+  FEIE+DASGFG GAVL+Q + 
Subjt:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH

Query:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE
        P+AYFS  LST DR + VYERELM VV AVQ+WRPYLLG+KF V+TDQ++LK+LLEQ+VIQPQ+Q+   KLL                   LSR+     
Subjt:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE

Query:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVVSK-SSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ
        L  LT P LLDV VI++EV  DP L EI+  +      + +++ H+ +LK+KGRLV+SK S+L+P+I+HTYHDSVFGGHSGFLR YKR+ GELYW+GMK+
Subjt:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVVSK-SSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ

Query:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV
        DV+KY  E                LLLPL IP AIW D+SMDFIEGLPKS G++VI VVV                    VA+VFV EVVRLHG+PKSIV
Subjt:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV

Query:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL----------------------HSGTRGCIG---------------QNIGDQ
        SDRDK+F S FW E+F+LA                   +   SVE  L                      ++   G IG                  G+ 
Subjt:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL----------------------HSGTRGCIG---------------QNIGDQ

Query:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI
         T NS LDQQLR RDV+LG LKEHLR  QERMKKFAD KRR+VEF+ GD V+LKLRP       K R   LS +                     ++A+I
Subjt:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI

Query:  HP--------------KVMQLELPMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY
        HP              K +Q   P  ++  EW+ +P+E+Y YRKN +T  W  L+ WKGL                F  F   DKV+LE ES+ RPPI++
Subjt:  HP--------------KVMQLELPMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY

A0A5D3DM31 Ty3/gypsy retrotransposon protein  e value: 8.0e-175  % identity: 44.44
Query:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG
        ME+LV EML+SGIIR + SPYSS VLLV+K+                                           D + G HQI+M P D+EKTAFRTHEG
Subjt:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK-------------------------------------------DREDGDHQIKMHPADVEKTAFRTHEG

Query:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-
        H EFLVMPFGLTNAPSTFQALMN+VFKPYLR F+LV   DIL YS+ +EEH QHL          ELY N  KC F + RI YLGH IS + +EADPEK 
Subjt:  HNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHL---------NELYANRTKCQFMRGRIEYLGHLISGKWVEADPEK-

Query:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH
                                                                A  +LK AM+ L VL +PDFS  FEIE+DASGFG GAVL Q R 
Subjt:  -------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRH

Query:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE
        P+AYFS TLS  DR + VYEREL+ VV+AVQ+WRPYLLG+KF V+TDQ++LKYLLEQ+V+QPQ+Q+   KLL                   LSR+    +
Subjt:  PIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLL----------------MRYLSRMPEGVE

Query:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVVS-KSSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ
        L  +T P L+DV ++KEE   DP L EII+ +      + +++L + +LK+KGRLVVS KS+LLP+ILHTYHDSVFGGHSGFLR YKRL GE+YW+GMK+
Subjt:  LANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGRLVVS-KSSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQ

Query:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV
        DV +Y  E                LL+PL IP AIW D+SMDFIEGLPKS+G+DVI VVV                    VA+ F+ EVVRLHGYP+SIV
Subjt:  DVKKYVGE----------------LLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIV

Query:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL--------------------------HSGTRGCIGQNI-----------GDQ
        SDRDKVF S FW+ELFRLA                   +   SVE  L                          HS       Q +           GD 
Subjt:  SDRDKVFSSQFWQELFRLA------------------LICVVSVENVL--------------------------HSGTRGCIGQNI-----------GDQ

Query:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI
         T NS LDQQL+ RD+ LG LKEHL+  QERMKK AD KRREVEF+ GD+V+LKLRP       K R   LS +                       A+I
Subjt:  ATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRP-------KGRKTGLSTEI--------------------TSASI

Query:  HP--------------KVMQLELPMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY
        HP              + +Q   P  ++  EW+ +P+EVYGYRKNPST  W  LI WKGL          T     F  F   DKV+LE ES+ RPPI++
Subjt:  HP--------------KVMQLELPMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGL--------LLTIFSNNFLLFTSRDKVNLEGESNVRPPIVY

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6  e value: 1.8e-35  % identity: 28.77
Query:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK------------------------------------------------DREDGDHQIKMHPADVEKTAF
        +E  + +ML+ GIIRT+ SPY+S + +V KK                                                D   G HQI+M P  V KTAF
Subjt:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK------------------------------------------------DREDGDHQIKMHPADVEKTAF

Query:  RTHEGHNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHLN---------ELYANRTKCQFMRGRIEYLGHLISGKWVEA
         T  GH E+L MPFGL NAP+TFQ  MN + +P L    LV   DI+ +S  L+EH+Q L           L     KC+F++    +LGH+++   ++ 
Subjt:  RTHEGHNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHLN---------ELYANRTKCQFMRGRIEYLGHLISGKWVEA

Query:  DPEK----------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGA
        +PEK                                                           A ++LK  + +  +L +PDF+++F + TDAS    GA
Subjt:  DPEK----------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGA

Query:  VLMQKRHPIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQK
        VL Q  HP++Y S TL+  +      E+EL+ +V A + +R YLLG+ F + +D + L +L   K
Subjt:  VLMQKRHPIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYLLEQK

P0CT34 Transposon Tf2-1 polyprotein    5.9e-34    23.07
Query:  KKDREDGDHQIKMHPADVEKTAFRTHEGHNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHLNE---------LYANRT
        K D +   H I++   D  K AFR   G  E+LVMP+G++ AP+ FQ  +N +      + ++    DIL +SK   EHV+H+ +         L  N+ 
Subjt:  KKDREDGDHQIKMHPADVEKTAFRTHEGHNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHLNE---------LYANRT

Query:  KCQFMRGRIEYLGHLISGK---------------------------------------------------------WVEADPEKEASERLKIAMVKLSVL
        KC+F + +++++G+ IS K                                                         W     + +A E +K  +V   VL
Subjt:  KCQFMRGRIEYLGHLISGK---------------------------------------------------------WVEADPEKEASERLKIAMVKLSVL

Query:  ILPDFSEQFEIETDASGFGFGAVLMQKR-----HPIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLG--KKFVVRTDQKALKYLLEQKVIQPQH
           DFS++  +ETDAS    GAVL QK      +P+ Y+S  +S +     V ++E++ ++ +++ WR YL    + F + TD + L   +  +  +P++
Subjt:  ILPDFSEQFEIETDASGFGFGAVLMQKR-----HPIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLG--KKFVVRTDQKALKYLLEQKVIQPQH

Query:  QRVE--NKLLMRYLSRMPEGVELANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKY------------------------KGR
        +R+      L  +   +      AN     L  +V   E +   PK  E     +V++ ++++   ++V+ +Y                         G 
Subjt:  QRVE--NKLLMRYLSRMPEGVELANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKY------------------------KGR

Query:  LVVSKSSLL--------PSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQDVKKYVGEL----------------LLPLSIPYAIWEDVSMDFIEGL
        L+ SK  +L         +I+  YH+     H G   +   ++    W+G+++ +++YV                   L P+      WE +SMDFI  L
Subjt:  LVVSKSSLL--------PSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQDVKKYVGEL----------------LLPLSIPYAIWEDVSMDFIEGL

Query:  PKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIVSDRDKVFSSQFWQEL---------FRL-------------------A
        P+S GY+ +FVVV                     A +F   V+   G PK I++D D +F+SQ W++          F L                    
Subjt:  PKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIVSDRDKVFSSQFWQEL---------FRL-------------------A

Query:  LICVVSV----------------ENVLHSGTRGCIGQNIGDQATSNSNL---------DQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREV-EFEVG
        L CV S                  N +HS T+    + +   + + S L         D+  +    +   +KEHL     +MKK+ D K +E+ EF+ G
Subjt:  LICVVSV----------------ENVLHSGTRGCIGQNIGDQATSNSNL---------DQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREV-EFEVG

Query:  DLVYLKLRPKG
        DLV +K    G
Subjt:  DLVYLKLRPKG

P0CT35 Transposon Tf2-2 polyprotein    5.9e-34    23.07
Query:  KKDREDGDHQIKMHPADVEKTAFRTHEGHNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHLNE---------LYANRT
        K D +   H I++   D  K AFR   G  E+LVMP+G++ AP+ FQ  +N +      + ++    DIL +SK   EHV+H+ +         L  N+ 
Subjt:  KKDREDGDHQIKMHPADVEKTAFRTHEGHNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHLNE---------LYANRT

Query:  KCQFMRGRIEYLGHLISGK---------------------------------------------------------WVEADPEKEASERLKIAMVKLSVL
        KC+F + +++++G+ IS K                                                         W     + +A E +K  +V   VL
Subjt:  KCQFMRGRIEYLGHLISGK---------------------------------------------------------WVEADPEKEASERLKIAMVKLSVL

Query:  ILPDFSEQFEIETDASGFGFGAVLMQKR-----HPIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLG--KKFVVRTDQKALKYLLEQKVIQPQH
           DFS++  +ETDAS    GAVL QK      +P+ Y+S  +S +     V ++E++ ++ +++ WR YL    + F + TD + L   +  +  +P++
Subjt:  ILPDFSEQFEIETDASGFGFGAVLMQKR-----HPIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLG--KKFVVRTDQKALKYLLEQKVIQPQH

Query:  QRVE--NKLLMRYLSRMPEGVELANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKY------------------------KGR
        +R+      L  +   +      AN     L  +V   E +   PK  E     +V++ ++++   ++V+ +Y                         G 
Subjt:  QRVE--NKLLMRYLSRMPEGVELANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKY------------------------KGR

Query:  LVVSKSSLL--------PSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQDVKKYVGEL----------------LLPLSIPYAIWEDVSMDFIEGL
        L+ SK  +L         +I+  YH+     H G   +   ++    W+G+++ +++YV                   L P+      WE +SMDFI  L
Subjt:  LVVSKSSLL--------PSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQDVKKYVGEL----------------LLPLSIPYAIWEDVSMDFIEGL

Query:  PKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIVSDRDKVFSSQFWQEL---------FRL-------------------A
        P+S GY+ +FVVV                     A +F   V+   G PK I++D D +F+SQ W++          F L                    
Subjt:  PKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIVSDRDKVFSSQFWQEL---------FRL-------------------A

Query:  LICVVSV----------------ENVLHSGTRGCIGQNIGDQATSNSNL---------DQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREV-EFEVG
        L CV S                  N +HS T+    + +   + + S L         D+  +    +   +KEHL     +MKK+ D K +E+ EF+ G
Subjt:  LICVVSV----------------ENVLHSGTRGCIGQNIGDQATSNSNL---------DQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREV-EFEVG

Query:  DLVYLKLRPKG
        DLV +K    G
Subjt:  DLVYLKLRPKG

P0CT41 Transposon Tf2-12 polyprotein    5.9e-34    23.07
Query:  KKDREDGDHQIKMHPADVEKTAFRTHEGHNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHLNE---------LYANRT
        K D +   H I++   D  K AFR   G  E+LVMP+G++ AP+ FQ  +N +      + ++    DIL +SK   EHV+H+ +         L  N+ 
Subjt:  KKDREDGDHQIKMHPADVEKTAFRTHEGHNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHLNE---------LYANRT

Query:  KCQFMRGRIEYLGHLISGK---------------------------------------------------------WVEADPEKEASERLKIAMVKLSVL
        KC+F + +++++G+ IS K                                                         W     + +A E +K  +V   VL
Subjt:  KCQFMRGRIEYLGHLISGK---------------------------------------------------------WVEADPEKEASERLKIAMVKLSVL

Query:  ILPDFSEQFEIETDASGFGFGAVLMQKR-----HPIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLG--KKFVVRTDQKALKYLLEQKVIQPQH
           DFS++  +ETDAS    GAVL QK      +P+ Y+S  +S +     V ++E++ ++ +++ WR YL    + F + TD + L   +  +  +P++
Subjt:  ILPDFSEQFEIETDASGFGFGAVLMQKR-----HPIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLG--KKFVVRTDQKALKYLLEQKVIQPQH

Query:  QRVE--NKLLMRYLSRMPEGVELANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKY------------------------KGR
        +R+      L  +   +      AN     L  +V   E +   PK  E     +V++ ++++   ++V+ +Y                         G 
Subjt:  QRVE--NKLLMRYLSRMPEGVELANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKY------------------------KGR

Query:  LVVSKSSLL--------PSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQDVKKYVGEL----------------LLPLSIPYAIWEDVSMDFIEGL
        L+ SK  +L         +I+  YH+     H G   +   ++    W+G+++ +++YV                   L P+      WE +SMDFI  L
Subjt:  LVVSKSSLL--------PSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQDVKKYVGEL----------------LLPLSIPYAIWEDVSMDFIEGL

Query:  PKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIVSDRDKVFSSQFWQEL---------FRL-------------------A
        P+S GY+ +FVVV                     A +F   V+   G PK I++D D +F+SQ W++          F L                    
Subjt:  PKSQGYDVIFVVV--------------------VADVFVNEVVRLHGYPKSIVSDRDKVFSSQFWQEL---------FRL-------------------A

Query:  LICVVSV----------------ENVLHSGTRGCIGQNIGDQATSNSNL---------DQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREV-EFEVG
        L CV S                  N +HS T+    + +   + + S L         D+  +    +   +KEHL     +MKK+ D K +E+ EF+ G
Subjt:  LICVVSV----------------ENVLHSGTRGCIGQNIGDQATSNSNL---------DQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREV-EFEVG

Query:  DLVYLKLRPKG
        DLV +K    G
Subjt:  DLVYLKLRPKG

P20825 Retrovirus-related Pol polyprotein from transposon 297    1.2e-34    27.98
Query:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK------------------------------------------------DREDGDHQIKMHPADVEKTAF
        +E  V EML+ G+IR + SPY+S   +V KK                                                D   G HQI+M    + KTAF
Subjt:  MEKLVSEMLSSGIIRTNASPYSSLVLLVKKK------------------------------------------------DREDGDHQIKMHPADVEKTAF

Query:  RTHEGHNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHLNELYANRT---------KCQFMRGRIEYLGHLISGKWVEA
         T  GH E+L MPFGL NAP+TFQ  MN + +P L    LV   DI+ +S  L EH+  +  ++             KC+F++    +LGH+++   ++ 
Subjt:  RTHEGHNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQHLNELYANRT---------KCQFMRGRIEYLGHLISGKWVEA

Query:  DPEK----------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGA
        +P K                                                          EA E+LK  +++  +L LPDF ++F + TDAS    GA
Subjt:  DPEK----------------------------------------------------------EASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGA

Query:  VLMQKRHPIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYL
        VL Q  HPI++ S TL+  +      E+EL+ +V A + +R YLLG++F++ +D + L++L
Subjt:  VLMQKRHPIAYFSHTLSTSDRGKLVYERELMVVVMAVQQWRPYLLGKKFVVRTDQKALKYL

Arabidopsis top hits    e value    %identity    Alignment
No hits found

Sequences
CDS sequence
ATGGAAAAGTTGGTATCTGAAATGTTGTCGTCAGGGATTATAAGAACTAACGCGAGTCCTTACTCTAGTCTAGTGCTATTGGTGAAGAAAAAGGACAGAGAGGATGGAGA
TCATCAAATCAAGATGCATCCAGCTGATGTGGAGAAGACTGCTTTTCGTACCCATGAAGGTCATAACGAGTTCTTAGTGATGCCTTTTGGGTTAACAAATGCACCTTCAA
CTTTTCAGGCATTGATGAACAAGGTGTTTAAACCATATTTGAGGAATTTTCTATTGGTACTTTCTTACGATATCTTAGACTACAGCAAGGACTTGGAGGAGCATGTGCAA
CATTTGAATGAGTTGTATGCTAATCGTACAAAGTGTCAGTTTATGCGTGGGAGAATTGAATATTTGGGACATTTGATCTCTGGAAAATGGGTGGAGGCTGATCCAGAGAA
GGAAGCTTCTGAAAGATTGAAAATTGCCATGGTGAAGTTATCGGTATTGATTCTACCAGATTTCAGCGAGCAGTTTGAGATTGAAACTGATGCTTCGGGTTTTGGTTTTG
GTGCCGTTTTGATGCAGAAGAGACATCCAATTGCTTATTTTAGTCATACGTTGTCAACTAGTGATCGGGGCAAGCTAGTGTATGAAAGGGAATTGATGGTTGTGGTCATG
GCGGTGCAACAATGGCGACCATATTTGTTGGGGAAAAAGTTTGTGGTGAGGACAGATCAAAAAGCTCTCAAGTATTTGTTGGAGCAAAAGGTGATTCAACCACAACATCA
AAGGGTGGAGAATAAGCTGCTGATGCGTTATCTATCTAGAATGCCTGAAGGTGTAGAACTGGCTAATCTGACCGTGCCAACGTTGTTAGATGTGGTTGTAATCAAGGAAG
AAGTATGCTGGGATCCTAAGTTGATGGAAATAATTCAAAAATTGTGGGTTGATGAAGATAATGTCTCTAATTTCTCCTTACATCGGGTAATGCTGAAATACAAAGGTCGG
CTGGTCGTTTCAAAGTCATCTTTGTTGCCGTCAATACTTCATACTTATCATGATTCGGTGTTTGGTGGGCATTCGGGGTTCTTACGTATGTATAAAAGATTGATTGGAGA
GTTATATTGGGAGGGTATGAAGCAAGACGTGAAAAAATATGTGGGAGAGTTGTTATTACCGTTGAGTATACCATATGCTATTTGGGAAGATGTATCCATGGATTTTATCG
AGGGATTACCGAAGTCTCAAGGGTATGATGTTATTTTTGTGGTGGTTGTTGCTGATGTATTTGTAAATGAGGTTGTACGTCTTCATGGCTATCCTAAGTCAATTGTTTCT
GACAGAGATAAGGTGTTTTCGAGTCAATTTTGGCAGGAATTGTTTCGTTTGGCTCTTATTTGCGTTGTTTCTGTGGAGAATGTCCTACATAGTGGTACTCGTGGTTGCAT
TGGGCAGAATATTGGTGATCAAGCTACTTCCAACTCCAATCTGGACCAGCAGTTGCGTGCAAGAGATGTAATGTTGGGCGTTTTGAAAGAGCATTTGAGAGACGTTCAAG
AGAGGATGAAGAAGTTTGCGGATAGGAAGAGGAGGGAGGTTGAATTTGAGGTAGGTGATCTAGTATACCTGAAACTTCGTCCCAAGGGTCGAAAAACTGGCTTATCGACT
GAAATTACCAGTGCTTCTATTCACCCGAAGGTCATGCAATTAGAGTTGCCTATGTTTTCAAAGTTGTTTGAATGGGTGATTGAGCCTCAGGAGGTGTATGGCTATCGCAA
GAATCCATCAACAGGTGGTTGGGTGGTGCTGATTGGGTGGAAAGGGCTATTATTGACGATTTTCAGCAACAATTTCCTGCTTTTCACCTCGAGGGACAAGGTGAATTTGG
AGGGGGAAAGTAATGTTAGGCCTCCAATTGTGTATCAACTCTCCTCCCCCCTCTTCTCCCTCCTCCACCTCCCCTCCGGTGATTCTCCTCCTTACCGACAGCTTCCGTAC
TTGCCAACCAAGACTCCGTTATTAGATAGTCGCCTTGTCAAGCATTCGTCATGGAAGGAAGTGCATAACGACTCGGGAGCAATGATATTATCAAAATTCAAGAGTAAGAT
GGGTTGGAATCTTAGATGTGTAGTTTGGCCCGTTACCGGCGGTCGTTTCATTATTCATGTACCAGGAGGGACAGCGCAACAAGGATGGACTGAATTTTTGGGGATGATTA
ATAGCTTCATTATTCGAATTGGGTGTTATACAGACTTTTCAGCAAAAAAGGAGTTGCTTCAGCCGAGCTTTGTAGACGTTTTAACAAAAGGAGAGAGCCATCATAATCAT
TCTGCTCAGGATTTCACTCTAGGGCAACCAAACTCCTCTGCTTTTCACAATAAACAGAGCACATATCTTGGATGGATCAAAAAGGAAAAAGAAGTGTTTAAGGAAGATTT
CGAAAATTTGTGGGTGGTTTCAAGGTTGTTTGCTTTCAATGAATGGAAAGATATTGTCAGTTTTTTGGAGGGTCTCTATCAAACTAAAATCAGCATTAACCCTCTGTTTG
CAGATAAGGCCTTGATTAAAATATCAAATTTAGAAAAGATCATCGTAACACCTGGAAGATGGTTTAATTATCGTAAGTTCCATCTTTTATTTGAAAAATGGAATTCAATT
CAACATAGTAGGCCTACTTGCATAAAAGGTTATGGAGGTTGGTTGATAAATAGAAACTTGCCGCTAGAATACTGGTGCAGAGCAACCTTTGAAACTATCGGATCCCATTT
CGGAGTCATGGAAGGTCCGCCTCACGCTCAAAAGGATTTATTTTTTAAAGATTTCATTAATCCAATCGACCAAAACCGGTTGAGTAAAGTTGCAGAGGATGAAGAGGCAT
GTAGTTTGATGCTGCCGGAAAGCTCGAAAAAGGTTGACTATCCCGAGCCAGTTCCAAAGTTTCGAGATGAAATGATTAGTTTCGAAGAAATTGAAGAAGTCGAAAAGTCA
AGCAAAACAGTTCAAATTCCAGCGTCCACTGATCTCCTCTTAATGGAAAATGAGGAAATTAATGCTTCTCGGGTACTTTCATCCCAAGTTTCAACAAGTTTTTCGGGGGA
GAGCATGCCAACTGATTTCGAAGAGGCCATTGACTTTCCAACCGAAGGATTCGAGGAGGCTGCTGGCATTCCAACCACGGGAATCAAGGCTAGCATTGAAAGGAATTATG
ATCTCACGAATCAACCACCTGATTTATTAGGCAAATATCCTTCTCACATCAATCCCAATTTCTGCCTCACCCGATTAGGTCTGAAAACCCCACTTCTCCTCCTTTTCTCT
TCAAGAAATCCACTCTTACCTTGCCCTAATTCAAATTCTACATTTTTAAGAAGTACTCGTTGCATCTCACATGGTTCTAATCTAAGAATTGAAAAAGATTCCAACATTGG
TTCAATAGCAAGTGTTAGCAGTGAGGATTTAGATGATTTTGAAACATTAGATCATCACGAGGTTTCGGATCAACTTAAAGTTACTGGAGAAGATATCAATGCCTTGTTCT
CTCAGAATGAACCTCCTAACGAATTAGAATTCAGCAACCATTCTTTCATTTCCCCTAATCGTCCGAAATTGCCGGCACATTTGGCATCCTTCGTTGAAGATTGTGGTCAT
CTTTGGATAGACCATGAAGCCTCTCGTAATTTAGAAAAGTTGAGTTTCATTGAAGCTGGGATTTATTTTCGCCTTTCCTTATCAAATTACTGCTTGGATTCATGGTGTTT
CGGTGATCATTTGGAGGTCTTATCGACCAAAGAAGGGGTTACAACGGCTTGA
mRNA sequence
ATGGAAAAGTTGGTATCTGAAATGTTGTCGTCAGGGATTATAAGAACTAACGCGAGTCCTTACTCTAGTCTAGTGCTATTGGTGAAGAAAAAGGACAGAGAGGATGGAGA
TCATCAAATCAAGATGCATCCAGCTGATGTGGAGAAGACTGCTTTTCGTACCCATGAAGGTCATAACGAGTTCTTAGTGATGCCTTTTGGGTTAACAAATGCACCTTCAA
CTTTTCAGGCATTGATGAACAAGGTGTTTAAACCATATTTGAGGAATTTTCTATTGGTACTTTCTTACGATATCTTAGACTACAGCAAGGACTTGGAGGAGCATGTGCAA
CATTTGAATGAGTTGTATGCTAATCGTACAAAGTGTCAGTTTATGCGTGGGAGAATTGAATATTTGGGACATTTGATCTCTGGAAAATGGGTGGAGGCTGATCCAGAGAA
GGAAGCTTCTGAAAGATTGAAAATTGCCATGGTGAAGTTATCGGTATTGATTCTACCAGATTTCAGCGAGCAGTTTGAGATTGAAACTGATGCTTCGGGTTTTGGTTTTG
GTGCCGTTTTGATGCAGAAGAGACATCCAATTGCTTATTTTAGTCATACGTTGTCAACTAGTGATCGGGGCAAGCTAGTGTATGAAAGGGAATTGATGGTTGTGGTCATG
GCGGTGCAACAATGGCGACCATATTTGTTGGGGAAAAAGTTTGTGGTGAGGACAGATCAAAAAGCTCTCAAGTATTTGTTGGAGCAAAAGGTGATTCAACCACAACATCA
AAGGGTGGAGAATAAGCTGCTGATGCGTTATCTATCTAGAATGCCTGAAGGTGTAGAACTGGCTAATCTGACCGTGCCAACGTTGTTAGATGTGGTTGTAATCAAGGAAG
AAGTATGCTGGGATCCTAAGTTGATGGAAATAATTCAAAAATTGTGGGTTGATGAAGATAATGTCTCTAATTTCTCCTTACATCGGGTAATGCTGAAATACAAAGGTCGG
CTGGTCGTTTCAAAGTCATCTTTGTTGCCGTCAATACTTCATACTTATCATGATTCGGTGTTTGGTGGGCATTCGGGGTTCTTACGTATGTATAAAAGATTGATTGGAGA
GTTATATTGGGAGGGTATGAAGCAAGACGTGAAAAAATATGTGGGAGAGTTGTTATTACCGTTGAGTATACCATATGCTATTTGGGAAGATGTATCCATGGATTTTATCG
AGGGATTACCGAAGTCTCAAGGGTATGATGTTATTTTTGTGGTGGTTGTTGCTGATGTATTTGTAAATGAGGTTGTACGTCTTCATGGCTATCCTAAGTCAATTGTTTCT
GACAGAGATAAGGTGTTTTCGAGTCAATTTTGGCAGGAATTGTTTCGTTTGGCTCTTATTTGCGTTGTTTCTGTGGAGAATGTCCTACATAGTGGTACTCGTGGTTGCAT
TGGGCAGAATATTGGTGATCAAGCTACTTCCAACTCCAATCTGGACCAGCAGTTGCGTGCAAGAGATGTAATGTTGGGCGTTTTGAAAGAGCATTTGAGAGACGTTCAAG
AGAGGATGAAGAAGTTTGCGGATAGGAAGAGGAGGGAGGTTGAATTTGAGGTAGGTGATCTAGTATACCTGAAACTTCGTCCCAAGGGTCGAAAAACTGGCTTATCGACT
GAAATTACCAGTGCTTCTATTCACCCGAAGGTCATGCAATTAGAGTTGCCTATGTTTTCAAAGTTGTTTGAATGGGTGATTGAGCCTCAGGAGGTGTATGGCTATCGCAA
GAATCCATCAACAGGTGGTTGGGTGGTGCTGATTGGGTGGAAAGGGCTATTATTGACGATTTTCAGCAACAATTTCCTGCTTTTCACCTCGAGGGACAAGGTGAATTTGG
AGGGGGAAAGTAATGTTAGGCCTCCAATTGTGTATCAACTCTCCTCCCCCCTCTTCTCCCTCCTCCACCTCCCCTCCGGTGATTCTCCTCCTTACCGACAGCTTCCGTAC
TTGCCAACCAAGACTCCGTTATTAGATAGTCGCCTTGTCAAGCATTCGTCATGGAAGGAAGTGCATAACGACTCGGGAGCAATGATATTATCAAAATTCAAGAGTAAGAT
GGGTTGGAATCTTAGATGTGTAGTTTGGCCCGTTACCGGCGGTCGTTTCATTATTCATGTACCAGGAGGGACAGCGCAACAAGGATGGACTGAATTTTTGGGGATGATTA
ATAGCTTCATTATTCGAATTGGGTGTTATACAGACTTTTCAGCAAAAAAGGAGTTGCTTCAGCCGAGCTTTGTAGACGTTTTAACAAAAGGAGAGAGCCATCATAATCAT
TCTGCTCAGGATTTCACTCTAGGGCAACCAAACTCCTCTGCTTTTCACAATAAACAGAGCACATATCTTGGATGGATCAAAAAGGAAAAAGAAGTGTTTAAGGAAGATTT
CGAAAATTTGTGGGTGGTTTCAAGGTTGTTTGCTTTCAATGAATGGAAAGATATTGTCAGTTTTTTGGAGGGTCTCTATCAAACTAAAATCAGCATTAACCCTCTGTTTG
CAGATAAGGCCTTGATTAAAATATCAAATTTAGAAAAGATCATCGTAACACCTGGAAGATGGTTTAATTATCGTAAGTTCCATCTTTTATTTGAAAAATGGAATTCAATT
CAACATAGTAGGCCTACTTGCATAAAAGGTTATGGAGGTTGGTTGATAAATAGAAACTTGCCGCTAGAATACTGGTGCAGAGCAACCTTTGAAACTATCGGATCCCATTT
CGGAGTCATGGAAGGTCCGCCTCACGCTCAAAAGGATTTATTTTTTAAAGATTTCATTAATCCAATCGACCAAAACCGGTTGAGTAAAGTTGCAGAGGATGAAGAGGCAT
GTAGTTTGATGCTGCCGGAAAGCTCGAAAAAGGTTGACTATCCCGAGCCAGTTCCAAAGTTTCGAGATGAAATGATTAGTTTCGAAGAAATTGAAGAAGTCGAAAAGTCA
AGCAAAACAGTTCAAATTCCAGCGTCCACTGATCTCCTCTTAATGGAAAATGAGGAAATTAATGCTTCTCGGGTACTTTCATCCCAAGTTTCAACAAGTTTTTCGGGGGA
GAGCATGCCAACTGATTTCGAAGAGGCCATTGACTTTCCAACCGAAGGATTCGAGGAGGCTGCTGGCATTCCAACCACGGGAATCAAGGCTAGCATTGAAAGGAATTATG
ATCTCACGAATCAACCACCTGATTTATTAGGCAAATATCCTTCTCACATCAATCCCAATTTCTGCCTCACCCGATTAGGTCTGAAAACCCCACTTCTCCTCCTTTTCTCT
TCAAGAAATCCACTCTTACCTTGCCCTAATTCAAATTCTACATTTTTAAGAAGTACTCGTTGCATCTCACATGGTTCTAATCTAAGAATTGAAAAAGATTCCAACATTGG
TTCAATAGCAAGTGTTAGCAGTGAGGATTTAGATGATTTTGAAACATTAGATCATCACGAGGTTTCGGATCAACTTAAAGTTACTGGAGAAGATATCAATGCCTTGTTCT
CTCAGAATGAACCTCCTAACGAATTAGAATTCAGCAACCATTCTTTCATTTCCCCTAATCGTCCGAAATTGCCGGCACATTTGGCATCCTTCGTTGAAGATTGTGGTCAT
CTTTGGATAGACCATGAAGCCTCTCGTAATTTAGAAAAGTTGAGTTTCATTGAAGCTGGGATTTATTTTCGCCTTTCCTTATCAAATTACTGCTTGGATTCATGGTGTTT
CGGTGATCATTTGGAGGTCTTATCGACCAAAGAAGGGGTTACAACGGCTTGA
Protein sequence
MEKLVSEMLSSGIIRTNASPYSSLVLLVKKKDREDGDHQIKMHPADVEKTAFRTHEGHNEFLVMPFGLTNAPSTFQALMNKVFKPYLRNFLLVLSYDILDYSKDLEEHVQ
HLNELYANRTKCQFMRGRIEYLGHLISGKWVEADPEKEASERLKIAMVKLSVLILPDFSEQFEIETDASGFGFGAVLMQKRHPIAYFSHTLSTSDRGKLVYERELMVVVM
AVQQWRPYLLGKKFVVRTDQKALKYLLEQKVIQPQHQRVENKLLMRYLSRMPEGVELANLTVPTLLDVVVIKEEVCWDPKLMEIIQKLWVDEDNVSNFSLHRVMLKYKGR
LVVSKSSLLPSILHTYHDSVFGGHSGFLRMYKRLIGELYWEGMKQDVKKYVGELLLPLSIPYAIWEDVSMDFIEGLPKSQGYDVIFVVVVADVFVNEVVRLHGYPKSIVS
DRDKVFSSQFWQELFRLALICVVSVENVLHSGTRGCIGQNIGDQATSNSNLDQQLRARDVMLGVLKEHLRDVQERMKKFADRKRREVEFEVGDLVYLKLRPKGRKTGLST
EITSASIHPKVMQLELPMFSKLFEWVIEPQEVYGYRKNPSTGGWVVLIGWKGLLLTIFSNNFLLFTSRDKVNLEGESNVRPPIVYQLSSPLFSLLHLPSGDSPPYRQLPY
LPTKTPLLDSRLVKHSSWKEVHNDSGAMILSKFKSKMGWNLRCVVWPVTGGRFIIHVPGGTAQQGWTEFLGMINSFIIRIGCYTDFSAKKELLQPSFVDVLTKGESHHNH
SAQDFTLGQPNSSAFHNKQSTYLGWIKKEKEVFKEDFENLWVVSRLFAFNEWKDIVSFLEGLYQTKISINPLFADKALIKISNLEKIIVTPGRWFNYRKFHLLFEKWNSI
QHSRPTCIKGYGGWLINRNLPLEYWCRATFETIGSHFGVMEGPPHAQKDLFFKDFINPIDQNRLSKVAEDEEACSLMLPESSKKVDYPEPVPKFRDEMISFEEIEEVEKS
SKTVQIPASTDLLLMENEEINASRVLSSQVSTSFSGESMPTDFEEAIDFPTEGFEEAAGIPTTGIKASIERNYDLTNQPPDLLGKYPSHINPNFCLTRLGLKTPLLLLFS
SRNPLLPCPNSNSTFLRSTRCISHGSNLRIEKDSNIGSIASVSSEDLDDFETLDHHEVSDQLKVTGEDINALFSQNEPPNELEFSNHSFISPNRPKLPAHLASFVEDCGH
LWIDHEASRNLEKLSFIEAGIYFRLSLSNYCLDSWCFGDHLEVLSTKEGVTTA