CuGenDBv2

Lag0006769 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0006769
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr6:45678367..45680574
RNA-Seq Expression: Lag0006769
Synteny: Lag0006769
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum] (e-value: 2.1e-99; identity: 46.17%)
Query:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR
        +GASNHVT       + TE+ GK  +   NG +L I   G++ L +     NL  +L VPNI KNL+SVSKLA DN + VEF EN C VKDK++GK +L+
Subjt:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR

Query:  GVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEA----------
        G+  +GLY  +G K                        SAFV   SV  +     WHRRLGHP+ KV +   + C +    ++   FCEA          
Subjt:  GVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEA----------

Query:  ----------------------LPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ
                               PI    GFKYY+ FVDDFSRF WIYPLKQKSE + A F+ F  + +NQFN  IK +Q D GGEY  + KL  E G+Q
Subjt:  ----------------------LPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ

Query:  TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFH
         R++ PYTSQQNGRAERKHRHI E  LTLLAQ  MPL YWW+AF  A  LIN LPSQV   +SP  +M  K  D+  L+TFGC CY CL+PY  HK Q+H
Subjt:  TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFH

Query:  TEKCAYLGPSPIHKGHICLSSSGRVYVSRHV
        T +C +LG S  HKG+ CL+S GR+++SRHV
Subjt:  TEKCAYLGPSPIHKGHICLSSSGRVYVSRHV

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum] (e-value: 3.3e-92; identity: 42.92%)
Query:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR
        +GA+NHVT   +      E+ GK  +   NG +LKI   G+  L+      NL  VL VP I KNL+SVSKL  DN + VEF  N C VKDK++G+ LL+
Subjt:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR

Query:  GVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEAL---------
        G   +GL                       Y ++N E   ++   SV  +     WHR+LGHP+ KV +   + C +    +++  FCEA          
Subjt:  GVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEAL---------

Query:  ----------PISLV-------------KGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ
                  P++L+              GFKYY+ F+DDFSRF WI+PLKQKS+ + A F+ F  + +NQFN  IK +Q D GGEY  + K+  E G+Q
Subjt:  ----------PISLV-------------KGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ

Query:  TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFH
         R++ PYTSQQNGRAERKHRH+ E  LTLLAQ  MPLRYWW+AF  A  LIN LPS V   +SP  +M+ +  D++AL+ FGC CY CL+PY  HK QFH
Subjt:  TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFH

Query:  TEKCAYLGPSPIHKGHICLSSSGRVYVSRHV
        T +C ++G S  HKG+ C++S GR++VSRHV
Subjt:  TEKCAYLGPSPIHKGHICLSSSGRVYVSRHV

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] (e-value: 4.6e-94; identity: 46.19%)
Query:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR
        +GASNHVT D N +    E +GK  ++  NG+ LKI   G++ L       NL+ +L VP I KNL+S+SKL  DN ++VEFH+  C VKDK++G+ LL 
Subjt:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR

Query:  GVFSEGLYWFNGAKTTA-----IDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVN---EKSKFCE---
        G   +GLY   G  T+      +  S   T   K+   N+  L+   V    NI  S          P E  FE F + C      N   + S  C    
Subjt:  GVFSEGLYWFNGAKTTA-----IDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVN---EKSKFCE---

Query:  ----------ALPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQTRLTRPYTSQQ
                    PIS V GFKYY+LF+DD+SRF WIYPLKQKS+   A F+ F  +V+NQFN  IK LQ D GGE+  + K+  + G+Q R + PYTS Q
Subjt:  ----------ALPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQTRLTRPYTSQQ

Query:  NGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSP
        NGRAERKHRH+VE+ LTLLAQ  MPL YWW+AF  A  LIN LP+QVI  KSP + ++ K+ D++A++TFGC CY CL+PY  HK QFHT KC +LG S 
Subjt:  NGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSP

Query:  IHKGHICLSSSGRVYVSRHV
         HKG+ CL+S+GR+++SRHV
Subjt:  IHKGHICLSSSGRVYVSRHV

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense] (e-value: 4.6e-94; identity: 43.62%)
Query:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR
        +GASNHVT   +   N +E+ GK  +   NG +L+I   G++ L +     NL  +L VP I KNL+SVSKLA DN + VEF EN C VKDK++GK +LR
Subjt:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR

Query:  GVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEA----------
        G+  +GL                       Y ++  + SA+     V+I  S   WHR+LGHP+ KV +   + C +    +++  FCEA          
Subjt:  GVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEA----------

Query:  ----------------------LPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ
                               PI    GFKYY+ F+DDF+RF WIYPLKQKS+  A  F+ F  MV+NQF+  IK +Q D GGEY  + K   E G+Q
Subjt:  ----------------------LPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ

Query:  TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFH
         R++ PYTSQQNGRAERKHRHI E  LTLLAQ  MPL YWW+AF  A  LIN LPS V + KSP  +++ +  D+++L+ FGC CY  L+PY  HK QFH
Subjt:  TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFH

Query:  TEKCAYLGPSPIHKGHICLSSSGRVYVSRHV
        T +C +LG S  HKG+ C++S GR+++SRHV
Subjt:  TEKCAYLGPSPIHKGHICLSSSGRVYVSRHV

PNY02796.1 copia protein (gag-int-pol protein), partial [Trifolium pratense] (e-value: 2.1e-94; identity: 44.55%)
Query:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR
        +GASNHVT   +   + T + GK  +   NG +LKI   G+  L       NL  VL VP I KNL+SVSKL  DN + VEF  + C VKDK++GK LL+
Subjt:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR

Query:  GVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEA----------
        G   EGLY  +       ++SS +  D   Y           V  S         WHR+LGHP+ KV +   + C +    +++ KFCEA          
Subjt:  GVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEA----------

Query:  ----------------------LPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ
                               PI    GFKYY+ F+DDFSRF WIYPLKQKSE + A F  F T+V+NQFN  IK +Q D GGEY  + KL  E G+Q
Subjt:  ----------------------LPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ

Query:  TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFH
         R++ PYTSQQNGRAERKHRH+ E  LT+LAQ  MPL YWW+AF  +  LIN LPS +     P  ++Y K  D+S L+ FGC CY CL+PY  HK QFH
Subjt:  TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFH

Query:  TEKCAYLGPSPIHKGHICLSSSGRVYVSRHV
        T +C +LG S  HKG+ C++S GR++VSRHV
Subjt:  TEKCAYLGPSPIHKGHICLSSSGRVYVSRHV

TrEMBL top hits (e-value, % identity, alignment)
A0A151S6M8 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 2.2e-94; identity: 46.19%)
Query:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR
        +GASNHVT D N +    E +GK  ++  NG+ LKI   G++ L       NL+ +L VP I KNL+S+SKL  DN ++VEFH+  C VKDK++G+ LL 
Subjt:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR

Query:  GVFSEGLYWFNGAKTTA-----IDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVN---EKSKFCE---
        G   +GLY   G  T+      +  S   T   K+   N+  L+   V    NI  S          P E  FE F + C      N   + S  C    
Subjt:  GVFSEGLYWFNGAKTTA-----IDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVN---EKSKFCE---

Query:  ----------ALPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQTRLTRPYTSQQ
                    PIS V GFKYY+LF+DD+SRF WIYPLKQKS+   A F+ F  +V+NQFN  IK LQ D GGE+  + K+  + G+Q R + PYTS Q
Subjt:  ----------ALPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQTRLTRPYTSQQ

Query:  NGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSP
        NGRAERKHRH+VE+ LTLLAQ  MPL YWW+AF  A  LIN LP+QVI  KSP + ++ K+ D++A++TFGC CY CL+PY  HK QFHT KC +LG S 
Subjt:  NGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSP

Query:  IHKGHICLSSSGRVYVSRHV
         HKG+ CL+S+GR+++SRHV
Subjt:  IHKGHICLSSSGRVYVSRHV

A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment) (e-value: 2.2e-94; identity: 43.62%)
Query:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR
        +GASNHVT   +   N +E+ GK  +   NG +L+I   G++ L +     NL  +L VP I KNL+SVSKLA DN + VEF EN C VKDK++GK +LR
Subjt:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR

Query:  GVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEA----------
        G+  +GL                       Y ++  + SA+     V+I  S   WHR+LGHP+ KV +   + C +    +++  FCEA          
Subjt:  GVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEA----------

Query:  ----------------------LPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ
                               PI    GFKYY+ F+DDF+RF WIYPLKQKS+  A  F+ F  MV+NQF+  IK +Q D GGEY  + K   E G+Q
Subjt:  ----------------------LPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ

Query:  TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFH
         R++ PYTSQQNGRAERKHRHI E  LTLLAQ  MPL YWW+AF  A  LIN LPS V + KSP  +++ +  D+++L+ FGC CY  L+PY  HK QFH
Subjt:  TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFH

Query:  TEKCAYLGPSPIHKGHICLSSSGRVYVSRHV
        T +C +LG S  HKG+ C++S GR+++SRHV
Subjt:  TEKCAYLGPSPIHKGHICLSSSGRVYVSRHV

A0A2K3NIC3 Copia protein (Gag-int-pol protein) (Fragment) (e-value: 1.0e-94; identity: 44.55%)
Query:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR
        +GASNHVT   +   + T + GK  +   NG +LKI   G+  L       NL  VL VP I KNL+SVSKL  DN + VEF  + C VKDK++GK LL+
Subjt:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR

Query:  GVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEA----------
        G   EGLY  +       ++SS +  D   Y           V  S         WHR+LGHP+ KV +   + C +    +++ KFCEA          
Subjt:  GVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEA----------

Query:  ----------------------LPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ
                               PI    GFKYY+ F+DDFSRF WIYPLKQKSE + A F  F T+V+NQFN  IK +Q D GGEY  + KL  E G+Q
Subjt:  ----------------------LPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ

Query:  TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFH
         R++ PYTSQQNGRAERKHRH+ E  LT+LAQ  MPL YWW+AF  +  LIN LPS +     P  ++Y K  D+S L+ FGC CY CL+PY  HK QFH
Subjt:  TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFH

Query:  TEKCAYLGPSPIHKGHICLSSSGRVYVSRHV
        T +C +LG S  HKG+ C++S GR++VSRHV
Subjt:  TEKCAYLGPSPIHKGHICLSSSGRVYVSRHV

A0A2Z6MBG6 Integrase catalytic domain-containing protein (e-value: 1.0e-99; identity: 46.17%)
Query:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR
        +GASNHVT       + TE+ GK  +   NG +L I   G++ L +     NL  +L VPNI KNL+SVSKLA DN + VEF EN C VKDK++GK +L+
Subjt:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR

Query:  GVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEA----------
        G+  +GLY  +G K                        SAFV   SV  +     WHRRLGHP+ KV +   + C +    ++   FCEA          
Subjt:  GVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEA----------

Query:  ----------------------LPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ
                               PI    GFKYY+ FVDDFSRF WIYPLKQKSE + A F+ F  + +NQFN  IK +Q D GGEY  + KL  E G+Q
Subjt:  ----------------------LPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ

Query:  TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFH
         R++ PYTSQQNGRAERKHRHI E  LTLLAQ  MPL YWW+AF  A  LIN LPSQV   +SP  +M  K  D+  L+TFGC CY CL+PY  HK Q+H
Subjt:  TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFH

Query:  TEKCAYLGPSPIHKGHICLSSSGRVYVSRHV
        T +C +LG S  HKG+ CL+S GR+++SRHV
Subjt:  TEKCAYLGPSPIHKGHICLSSSGRVYVSRHV

A0A2Z6P4D5 Integrase catalytic domain-containing protein (e-value: 1.6e-92; identity: 42.92%)
Query:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR
        +GA+NHVT   +      E+ GK  +   NG +LKI   G+  L+      NL  VL VP I KNL+SVSKL  DN + VEF  N C VKDK++G+ LL+
Subjt:  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLR

Query:  GVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEAL---------
        G   +GL                       Y ++N E   ++   SV  +     WHR+LGHP+ KV +   + C +    +++  FCEA          
Subjt:  GVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEAL---------

Query:  ----------PISLV-------------KGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ
                  P++L+              GFKYY+ F+DDFSRF WI+PLKQKS+ + A F+ F  + +NQFN  IK +Q D GGEY  + K+  E G+Q
Subjt:  ----------PISLV-------------KGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ

Query:  TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFH
         R++ PYTSQQNGRAERKHRH+ E  LTLLAQ  MPLRYWW+AF  A  LIN LPS V   +SP  +M+ +  D++AL+ FGC CY CL+PY  HK QFH
Subjt:  TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFH

Query:  TEKCAYLGPSPIHKGHICLSSSGRVYVSRHV
        T +C ++G S  HKG+ C++S GR++VSRHV
Subjt:  TEKCAYLGPSPIHKGHICLSSSGRVYVSRHV

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein (e-value: 1.7e-19; identity: 24.02%)
Query:  ILIAGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKD------
        +L +GAS+H+  D +   +  E     +++ +   +   A          + +  LE VL       NL+SV +L ++ G+ +EF ++   +        
Subjt:  ILIAGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKD------

Query:  KVSGKELLRGVFSEGLYWFNGAKTTAIDI---SSSTTNDNKIYSI------------NNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCY
        K SG      V +   Y  N        +        +D K+  I            NN ELS  +    +N            G  +   F+    + +
Subjt:  KVSGKELLRGVFSEGLYWFNGAKTTAIDI---SSSTTNDNKIYSI------------NNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCY

Query:  LSYRV-NEKSKFCEALPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYA--RILKLCNECGVQTRLTR
        +   +    S  C  +    +    Y+++FVD F+ +   Y +K KS+   ++F  F    +  FN  +  L  DNG EY    + + C + G+   LT 
Subjt:  LSYRV-NEKSKFCEALPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYA--RILKLCNECGVQTRLTR

Query:  PYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVI--NGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEK
        P+T Q NG +ER  R I E   T+++   +   +W +A L A+ LIN +PS+ +  + K+P E+ + K      LR FG T Y  ++  Q  KF   + K
Subjt:  PYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVI--NGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEK

Query:  CAYLGPSP
          ++G  P
Subjt:  CAYLGPSP

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 4.9e-30; identity: 27.2%)
Query:  VSASNGSQLKIAFVGNACLSAG-NMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLRGVFSEGLYWFNGAKTTAIDISSST
        V   N S  KIA +G+ C+         L+ V  VP++  NL+S   L RD       ++ + + K  +    + +GV    LY  N             
Subjt:  VSASNGSQLKIAFVGNACLSAG-NMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLRGVFSEGLYWFNGAKTTAIDISSST

Query:  TNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSY------------------RVNEK--------------SKFCEALP
                I   EL+A          +S+ +WH+R+GH SEK  +  A++  +SY                  RV+ +              S  C  + 
Subjt:  TNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSY------------------RVNEK--------------SKFCEALP

Query:  ISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYA--RILKLCNECGVQTRLTRPYTSQQNGRAERKHRHI
        I  + G KY++ F+DD SR +W+Y LK K +    +F  F  +V+ +    +K L+ DNGGEY      + C+  G++   T P T Q NG AER +R I
Subjt:  ISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYA--RILKLCNECGVQTRLTRPYTSQQNGRAERKHRHI

Query:  VETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLG
        VE   ++L    +P  +W +A   A  LIN  PS  +  + P  +   K + +S L+ FGC  ++ +   Q  K    +  C ++G
Subjt:  VETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLG

Q12491 Transposon Ty2-B Gag-Pol polyprotein (e-value: 3.7e-14; identity: 23.94%)
Query:  LILIAGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGK
        L++ +GAS  +    + L + T       V A     + I  +GN   +  N      K L  PNI  +L+S+S+LA  N +   F  N     ++  G 
Subjt:  LILIAGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGK

Query:  ELLRGVFSEGLYWFNGAKTTAIDISSSTTND-NKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEA----L
         L   V     YW +        IS  T N+ NK  S+N                    + HR LGH + +  +   ++  ++Y      ++  A     
Subjt:  ELLRGVFSEGLYWFNGAKTTAIDISSSTTND-NKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEA----L

Query:  PISL---------VKGFK------------------------------YYILFVDDFSRFVWIYPL-KQKSEALAALFLHFTTMVKNQFNSSIKALQPDN
        P  L         VKG +                              Y+I F D+ +RF W+YPL  ++ E++  +F      +KNQFN+ +  +Q D 
Subjt:  PISL---------VKGFK------------------------------YYILFVDDFSRFVWIYPL-KQKSEALAALFLHFTTMVKNQFNSSIKALQPDN

Query:  GGEYAR--ILKLCNECGVQTRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTF
        G EY    + K     G+    T    S+ +G AER +R ++    TLL  + +P   W+ A   ++++ N L S   N KS  +      +D + +  F
Subjt:  GGEYAR--ILKLCNECGVQTRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTF

Query:  G
        G
Subjt:  G

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e-value: 2.0e-68; identity: 34.63%)
Query:  ILIAGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKE
        +L +GA++H+T+D+NNL+    Y G + V  ++GS + I+  G+  LS  +   NL  +L VPNI KNL+SV +L   NGV VEF      VKD  +G  
Subjt:  ILIAGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKE

Query:  LLRGVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCE--------
        LL+G   + LY +  A +  + + +S ++                 +HS         WH RLGHP+  +  S      LS  +N   KF          
Subjt:  LLRGVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCE--------

Query:  -------------------------ALPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNE
                                 + PI     ++YY++FVD F+R+ W+YPLKQKS+ +   F+ F  +++N+F + I     DNGGE+  + +  ++
Subjt:  -------------------------ALPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNE

Query:  CGVQTRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHK
         G+    + P+T + NG +ERKHRHIVET LTLL+  S+P  YW  AF VA  LIN LP+ ++  +SP + ++G S ++  LR FGC CY  LRPY  HK
Subjt:  CGVQTRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHK

Query:  FQFHTEKCAYLGPSPIHKGHICLS-SSGRVYVSRHV
            + +C +LG S     ++CL   + R+Y+SRHV
Subjt:  FQFHTEKCAYLGPSPIHKGHICLS-SSGRVYVSRHV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e-value: 1.3e-67; identity: 35.4%)
Query:  ILIAGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKE
        +L +GA++H+T+D+NNL+    Y G + V  ++GS + I   G+A L   +   +L KVL VPNI KNL+SV +L   N V VEF      VKD  +G  
Subjt:  ILIAGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKE

Query:  LLRGVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQR-----------------CYL--S
        LL+G   + LY +  A + A+ + +S  +                 +HS         WH RLGHPS  +  S                     C++  S
Subjt:  LLRGVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQR-----------------CYL--S

Query:  YRVN------EKSKFCEAL-------PISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNEC
        ++V         SK  E +       PI  +  ++YY++FVD F+R+ W+YPLKQKS+ +   F+ F ++V+N+F + I  L  DNGGE+  +    ++ 
Subjt:  YRVN------EKSKFCEAL-------PISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNEC

Query:  GVQTRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKF
        G+    + P+T + NG +ERKHRHIVE  LTLL+  S+P  YW  AF VA  LIN LP+ ++  +SP + ++G+  ++  L+ FGC CY  LRPY  HK 
Subjt:  GVQTRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKF

Query:  QFHTEKCAYLGPSPIHKGHICLS-SSGRVYVSRHV
        +  +++CA++G S     ++CL   +GR+Y SRHV
Subjt:  QFHTEKCAYLGPSPIHKGHICLS-SSGRVYVSRHV

Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGATTCAAGGGAGACAACGAGGATTTGGAGGAAGGCCTTTATGGAAAATCAGAATGCAAATCCGTTTGTTGCTTCCCCTGAAACTGTTGTTGATCCTAATTGCCGGAGC
TTCCAATCATGTGACTACTGACTACAACAACTTGGCTAATCCAACCGAATATGAAGGTAAAGAACAAGTATCTGCTAGTAATGGTAGTCAACTTAAAATAGCTTTTGTTG
GTAATGCTTGTCTATCGGCCGGAAATATGAAGTTTAATTTGGAAAAAGTTTTGTGTGTTCCAAATATAGTTAAAAATCTTGTTAGTGTATCCAAGTTGGCTAGAGATAAT
GGTGTTTTTGTGGAATTTCATGAAAATTTTTGTGTTGTTAAGGACAAGGTTTCGGGCAAGGAGTTGCTGAGAGGGGTGTTCAGTGAAGGGCTCTATTGGTTTAATGGTGC
AAAGACAACTGCAATAGATATATCTAGTTCAACTACTAACGACAACAAAATCTATAGTATAAATAATGCTGAGCTTTCTGCTTTTGTTGTGTCTCATTCAGTAAATATTA
CTTTGTCTATAGTAATATGGCATAGGCGTCTTGGCCATCCATCAGAAAAAGTTTTTGAATCATTTGCTCAACGATGTTATCTCTCATATAGAGTTAATGAAAAGTCCAAG
TTTTGTGAGGCCTTACCCATAAGTTTAGTTAAAGGGTTTAAATATTACATTCTGTTTGTTGACGATTTTAGCCGTTTTGTATGGATTTATCCACTGAAACAGAAAAGTGA
AGCATTAGCAGCATTATTTCTACACTTTACAACAATGGTGAAAAATCAGTTTAATAGTAGTATAAAGGCTTTACAACCAGATAATGGCGGGGAATATGCTAGAATACTTA
AACTGTGTAATGAATGTGGTGTTCAAACAAGACTCACCCGTCCCTACACCTCTCAGCAAAACGGTAGAGCAGAAAGAAAGCATAGGCACATTGTGGAGACCGATTTAACA
CTTCTTGCCCAAACATCAATGCCTCTAAGGTACTGGTGGGATGCGTTTTTAGTTGCCTCTATGCTAATAAATGGTCTTCCTTCACAGGTCATTAATGGAAAATCTCCAAT
GGAGATAATGTATGGCAAGAGCATTGATTTTAGTGCACTCAGGACATTCGGTTGTACGTGTTATTCCTGTCTCCGTCCATATCAAACACACAAATTTCAGTTTCATACTG
AAAAGTGTGCTTACTTGGGGCCAAGTCCGATTCATAAAGGTCATATATGCCTCAGTTCCAGCGGTCGGGTGTATGTTTCGCGCCATGTCTCTTCTGTTATTTTCTTAAGT
TGGCTTCCTATCCGGGGGATATTCCAACCAAGTCCAAGTATTCTAGCTCCCGAGCCCCATAATTCTCTATTTGTAAAATGCATGGTGTTTTTACTAGAAATACCAAATGG
GTGCTTTGTGATGGCAGTAGACTCCGATTTTGGCATGACAGTTGGATTGGTAATGGCCCGCTTAAAGAGGCACATCCGTGCATGTTTCTTCGCTATCACCATCAACAAGG
ACCTCCTAGTCTCCGAAGATCTCTCGGAGCTTCCCGCCCACGTCCCCTTCCTTGGGGCCTGA
mRNA sequence
ATGATTCAAGGGAGACAACGAGGATTTGGAGGAAGGCCTTTATGGAAAATCAGAATGCAAATCCGTTTGTTGCTTCCCCTGAAACTGTTGTTGATCCTAATTGCCGGAGC
TTCCAATCATGTGACTACTGACTACAACAACTTGGCTAATCCAACCGAATATGAAGGTAAAGAACAAGTATCTGCTAGTAATGGTAGTCAACTTAAAATAGCTTTTGTTG
GTAATGCTTGTCTATCGGCCGGAAATATGAAGTTTAATTTGGAAAAAGTTTTGTGTGTTCCAAATATAGTTAAAAATCTTGTTAGTGTATCCAAGTTGGCTAGAGATAAT
GGTGTTTTTGTGGAATTTCATGAAAATTTTTGTGTTGTTAAGGACAAGGTTTCGGGCAAGGAGTTGCTGAGAGGGGTGTTCAGTGAAGGGCTCTATTGGTTTAATGGTGC
AAAGACAACTGCAATAGATATATCTAGTTCAACTACTAACGACAACAAAATCTATAGTATAAATAATGCTGAGCTTTCTGCTTTTGTTGTGTCTCATTCAGTAAATATTA
CTTTGTCTATAGTAATATGGCATAGGCGTCTTGGCCATCCATCAGAAAAAGTTTTTGAATCATTTGCTCAACGATGTTATCTCTCATATAGAGTTAATGAAAAGTCCAAG
TTTTGTGAGGCCTTACCCATAAGTTTAGTTAAAGGGTTTAAATATTACATTCTGTTTGTTGACGATTTTAGCCGTTTTGTATGGATTTATCCACTGAAACAGAAAAGTGA
AGCATTAGCAGCATTATTTCTACACTTTACAACAATGGTGAAAAATCAGTTTAATAGTAGTATAAAGGCTTTACAACCAGATAATGGCGGGGAATATGCTAGAATACTTA
AACTGTGTAATGAATGTGGTGTTCAAACAAGACTCACCCGTCCCTACACCTCTCAGCAAAACGGTAGAGCAGAAAGAAAGCATAGGCACATTGTGGAGACCGATTTAACA
CTTCTTGCCCAAACATCAATGCCTCTAAGGTACTGGTGGGATGCGTTTTTAGTTGCCTCTATGCTAATAAATGGTCTTCCTTCACAGGTCATTAATGGAAAATCTCCAAT
GGAGATAATGTATGGCAAGAGCATTGATTTTAGTGCACTCAGGACATTCGGTTGTACGTGTTATTCCTGTCTCCGTCCATATCAAACACACAAATTTCAGTTTCATACTG
AAAAGTGTGCTTACTTGGGGCCAAGTCCGATTCATAAAGGTCATATATGCCTCAGTTCCAGCGGTCGGGTGTATGTTTCGCGCCATGTCTCTTCTGTTATTTTCTTAAGT
TGGCTTCCTATCCGGGGGATATTCCAACCAAGTCCAAGTATTCTAGCTCCCGAGCCCCATAATTCTCTATTTGTAAAATGCATGGTGTTTTTACTAGAAATACCAAATGG
GTGCTTTGTGATGGCAGTAGACTCCGATTTTGGCATGACAGTTGGATTGGTAATGGCCCGCTTAAAGAGGCACATCCGTGCATGTTTCTTCGCTATCACCATCAACAAGG
ACCTCCTAGTCTCCGAAGATCTCTCGGAGCTTCCCGCCCACGTCCCCTTCCTTGGGGCCTGA
Protein sequence
MIQGRQRGFGGRPLWKIRMQIRLLLPLKLLLILIAGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDN
GVFVEFHENFCVVKDKVSGKELLRGVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSK
FCEALPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQTRLTRPYTSQQNGRAERKHRHIVETDLT
LLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSPIHKGHICLSSSGRVYVSRHVSSVIFLS
WLPIRGIFQPSPSILAPEPHNSLFVKCMVFLLEIPNGCFVMAVDSDFGMTVGLVMARLKRHIRACFFAITINKDLLVSEDLSELPAHVPFLGA
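
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, not part of the original record. It assumes the standard nuclear genetic code, and the names `CODON_TABLE`, `translate`, `cds_start` and `cds_end` are illustrative. Only the first 60 and last 24 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the record uses the standard nuclear code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt and last 24 nt of the Lag0006769 CDS shown above.
cds_start = "ATGATTCAAGGGAGACAACGAGGATTTGGAGGAAGGCCTTTATGGAAAATCAGAATGCAA"
cds_end = "CACGTCCCCTTCCTTGGGGCCTGA"

print(translate(cds_start))  # MIQGRQRGFGGRPLWKIRMQ
print(translate(cds_end))    # HVPFLGA
```

Both translated fragments agree with the start (`MIQGRQRGFGGRPLWKIRMQ...`) and end (`...HVPFLGA`) of the protein sequence in the record.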