CuGenDBv2

CSPI01G32150 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G32150
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Ty3/gypsy retrotransposon protein
Genome location: Chr1:26926908..26932030
RNA-Seq Expression: CSPI01G32150
Synteny: CSPI01G32150
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0016020 - membrane (cellular component)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR016197 - Chromo-like domain superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
  IPR043502 - DNA/RNA polymerase superfamily
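
The genome location in this record (Chr1:26926908..26932030) can be turned into a chromosome name and a span length with a few lines of Python. This is only a minimal sketch: the function names parse_location and span_length are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr1:26926908..26932030")
print(chrom, span_length(start, end))  # prints: Chr1 5123
```

Under that assumption the gene model spans 5,123 bp on Chr1; if the coordinates are instead 0-based half-open, the length would be 5,122 bp.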


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0049776.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] (e-value: 1.1e-226, identity: 59.63%)
Query:  SQEAFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQPSLKF
        ++ AF +L++AMMTLPVLA+PDFN PFE+E+D+ G+GVGAVL+Q+K+P+A++S  L  RDRA+PVYERELMAVV AVQRWRPYLLGR F VKTDQ SLKF
Subjt:  SQEAFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQPSLKF

Query:  LLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKIQKE--EVTNYTLQQGMLRYKG
        LLEQRVIQPQYQ+WIAKLLGYSFEV+YKPG ENKA D LSR+ PT HLNQLT P L+D++VI++EV KD  L+EI+S I+++  E+ +YT  QG+L++KG
Subjt:  LLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKIQKE--EVTNYTLQQGMLRYKG

Query:  RLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSS----------------
        RLV++K S+LIPTIMH YHDSV GG+SGFLRTYKR+ GEL+       + +Y       ++NK+ AL+PAGLL+P+EIP  + S                
Subjt:  RLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSS----------------

Query:  --------------------IPFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLCCFCG
                             P+ AK VAE+F+KE+VRLHGFP+SIVSD DKIF+S+FW E+ +LAGTKLNRS++YHPQ DGQT+V+N+SVE YL CFCG
Subjt:  --------------------IPFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLCCFCG

Query:  ERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRR------------
        E+P++W +W+ WAEYWYNTT+  S+G++PFQAVYGR PP LI YG+ ETPNSTLD+QL++RDV LGALK+HLR+AQ++MK +AD+KRR            
Subjt:  ERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRR------------

Query:  --HTHRQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKNE-KEVWEV
            +RQ S+RK RNEKLSPKYFGPY+IL+RIG +AYKLELP++A IHP+FH+SQLKKA G     + L PF+ E HEW+  P+E+Y Y+KN+  + WE 
Subjt:  --HTHRQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKNE-KEVWEV

Query:  LMSWKGLLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY
        L+ WKGL  HE TWENY D +  FP+FHLEDKV LE+E + RPPI+  Y
Subjt:  LMSWKGLLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY

KAA0050511.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] (e-value: 1.3e-227, identity: 59.42%)
Query:  WGSQE--AFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQP
        WG +E  AF +L++AMMTLPVL +PDF+ PFE+E+D+ G+GVGAVL Q ++P+A++S TL +RDR++PVYEREL+AVVLAVQRWRPYLLGR F VKTDQ 
Subjt:  WGSQE--AFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQP

Query:  SLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKIQKE--EVTNYTLQQGML
        SLK+LLEQRV+QPQYQKW+AKLLGYSFEVVY+PG ENKA D LSR+ PT  LNQ+T P LID+++++EE  +D  L+EII  I+++  E+ +YTLQQG+L
Subjt:  SLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKIQKE--EVTNYTLQQGML

Query:  RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSS------------
        ++KGRLV++  S+L+PTI+H YHDSV GG+SGFLRTYKRLTGE++       + RY       +RNK+ ALTPAGLL+P+EIP  + S            
Subjt:  RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSS------------

Query:  ------------------------IPFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLC
                                 PF AK+VAE F+KE+VRLHG+P+SIVSD DK+FLS+FW+EL RLAGTKLNRS++YHPQ DGQT+V+N+SVE YL 
Subjt:  ------------------------IPFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLC

Query:  CFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRRHT------
        CFCGE+P+EW +W+ WAEYWYNTT+  S+G++PFQAVYGR PP LIYYGD ETPNSTLD+QLK+RD+ LGALK+HL++AQ++MK  AD KRR        
Subjt:  CFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRRHT------

Query:  --------HRQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKN-EKE
                +RQ S+RK RNEKLSPKYFGPYR+L+RIG +AY+LELP  A IHP+FH+SQLKKA G     + L P++ ENHEW+  P+EVYGY+KN    
Subjt:  --------HRQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKN-EKE

Query:  VWEVLMSWKGLLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY
         WE L+SWKGL  HE TWE+  D +  FP+FHLEDKV LE+E + RPPI+  Y
Subjt:  VWEVLMSWKGLLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY

KAA0066077.1 Transposon Tf2-9 polyprotein [Cucumis melo var. makuwa] (e-value: 6.7e-253, identity: 68.12%)
Query:  FVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQPSLKFLLEQ
        F RL++AMMTLPVLALPDF+ PFE++ D+ GY VG VLMQNKRPIAF+SHTL +RDRAKPVYERELMAVVLAVQRWRPYLLGR F+VK DQ SLKFLLEQ
Subjt:  FVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQPSLKFLLEQ

Query:  RVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKI-QKEEVTNYTLQQGMLRYKGRLVIA
        RVIQPQY KWIAKLLGYSFEV+YKPG ENKA D LSRVP  V LNQLT P LIDLK+IREEV +D++LK+II +I ++EEV  YT+Q GML+YKGR+VIA
Subjt:  RVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKI-QKEEVTNYTLQQGMLRYKGRLVIA

Query:  KNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSSI--------------------
        K+S+LIPTI+H YHDSV  G+SGFLRTYKRLTGELF       + +Y       +RNK+L+L+PAGLL P+EIPSR+                       
Subjt:  KNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSSI--------------------

Query:  ----------------PFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLCCFCGERPKE
                        PF A+IVAE+F+KE+VRLHGFP+SIVSD DK+FLS+FW+EL RLAGTKLNRSTTYHPQ DGQT+V++RSVE YL CFCGERPKE
Subjt:  ----------------PFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLCCFCGERPKE

Query:  WLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRRHT--------------H
        W+KWI WAEYWYN T+Q+SLGVSPFQAVYGRTP  L+ YGD  T NSTLDEQLK+RD+ALGALK+HLR+AQ KMK+YAD+KRRH               +
Subjt:  WLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRRHT--------------H

Query:  RQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKNEKEVWEVLMSWKG
        RQVSMRK RNEKLSPKYFGPY++LK+IG +AY+LELP +ATIHP+FHISQLK+AFG+C N + L P++TE HEWLAVPDE +GYQKN K  WEVLMSWKG
Subjt:  RQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKNEKEVWEVLMSWKG

Query:  LLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY
        L  HE TWE Y DFQQSF D+H+ED+ KLE+ECNVRPPIIH Y
Subjt:  LLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY

TYK06572.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] (e-value: 1.3e-227, identity: 59.42%)
Query:  WGSQE--AFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQP
        WG +E  AF +L++AMMTLPVL +PDF+ PFE+E+D+ G+GVGAVL Q ++P+A++S TL +RDR++PVYEREL+AVVLAVQRWRPYLLGR F VKTDQ 
Subjt:  WGSQE--AFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQP

Query:  SLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKIQKE--EVTNYTLQQGML
        SLK+LLEQRV+QPQYQKW+AKLLGYSFEVVY+PG ENKA D LSR+ PT  LNQ+T P LID+++++EE  +D  L+EII  I+++  E+ +YTLQQG+L
Subjt:  SLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKIQKE--EVTNYTLQQGML

Query:  RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSS------------
        ++KGRLV++  S+L+PTI+H YHDSV GG+SGFLRTYKRLTGE++       + RY       +RNK+ ALTPAGLL+P+EIP  + S            
Subjt:  RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSS------------

Query:  ------------------------IPFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLC
                                 PF AK+VAE F+KE+VRLHG+P+SIVSD DK+FLS+FW+EL RLAGTKLNRS++YHPQ DGQT+V+N+SVE YL 
Subjt:  ------------------------IPFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLC

Query:  CFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRRHT------
        CFCGE+P+EW +W+ WAEYWYNTT+  S+G++PFQAVYGR PP LIYYGD ETPNSTLD+QLK+RD+ LGALK+HL++AQ++MK  AD KRR        
Subjt:  CFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRRHT------

Query:  --------HRQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKN-EKE
                +RQ S+RK RNEKLSPKYFGPYR+L+RIG +AY+LELP  A IHP+FH+SQLKKA G     + L P++ ENHEW+  P+EVYGY+KN    
Subjt:  --------HRQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKN-EKE

Query:  VWEVLMSWKGLLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY
         WE L+SWKGL  HE TWE+  D +  FP+FHLEDKV LE+E + RPPI+  Y
Subjt:  VWEVLMSWKGLLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY

TYK24654.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] (e-value: 1.3e-227, identity: 59.42%)
Query:  WGSQE--AFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQP
        WG +E  AF +L++AMMTLPVL +PDF+ PFE+E+D+ G+GVGAVL Q ++P+A++S TL +RDR++PVYEREL+AVVLAVQRWRPYLLGR F VKTDQ 
Subjt:  WGSQE--AFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQP

Query:  SLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKIQKE--EVTNYTLQQGML
        SLK+LLEQRV+QPQYQKW+AKLLGYSFEVVY+PG ENKA D LSR+ PT  LNQ+T P LID+++++EE  +D  L+EII  I+++  E+ +YTLQQG+L
Subjt:  SLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKIQKE--EVTNYTLQQGML

Query:  RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSS------------
        ++KGRLV++  S+L+PTI+H YHDSV GG+SGFLRTYKRLTGE++       + RY       +RNK+ ALTPAGLL+P+EIP  + S            
Subjt:  RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSS------------

Query:  ------------------------IPFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLC
                                 PF AK+VAE F+KE+VRLHG+P+SIVSD DK+FLS+FW+EL RLAGTKLNRS++YHPQ DGQT+V+N+SVE YL 
Subjt:  ------------------------IPFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLC

Query:  CFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRRHT------
        CFCGE+P+EW +W+ WAEYWYNTT+  S+G++PFQAVYGR PP LIYYGD ETPNSTLD+QLK+RD+ LGALK+HL++AQ++MK  AD KRR        
Subjt:  CFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRRHT------

Query:  --------HRQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKN-EKE
                +RQ S+RK RNEKLSPKYFGPYR+L+RIG +AY+LELP  A IHP+FH+SQLKKA G     + L P++ ENHEW+  P+EVYGY+KN    
Subjt:  --------HRQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKN-EKE

Query:  VWEVLMSWKGLLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY
         WE L+SWKGL  HE TWE+  D +  FP+FHLEDKV LE+E + RPPI+  Y
Subjt:  VWEVLMSWKGLLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7TDM4 Ty3/gypsy retrotransposon protein (e-value: 5.2e-227, identity: 60.09%)
Query:  SQEAFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQPSLKF
        ++ AF +L++AMMTLPVLA+PDFN PFE+E+D+ G+GVGAVL+Q KRP+A++S  L MRDRA+PVYEREL+AVV AVQRWRPYLLGR F VKTDQ SLKF
Subjt:  SQEAFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQPSLKF

Query:  LLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKIQKE--EVTNYTLQQGMLRYKG
        LLEQRVIQPQYQ+WIAKLLGYSFEV+YKPG ENKA D LSR+ PT HLNQLT P L+D++VI++EV KD  L+EIIS I+++  E+ +YT  QG+L++KG
Subjt:  LLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKIQKE--EVTNYTLQQGMLRYKG

Query:  RLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSS----------------
        RLV++K S+LIPTIMH YHDSV GG+SGFLRTYKR+ GEL+       + +Y       ++NK+ AL+PAGLL+P+EIP  + S                
Subjt:  RLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSS----------------

Query:  --------------------IPFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLCCFCG
                             P+ AK VAE+F+KE+VRLHGFP+SIV D DKIFLS+FW E+ RLAGTKLNRS++YHPQ DGQT+V+N+SVE YL CFC 
Subjt:  --------------------IPFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLCCFCG

Query:  ERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRR------------
        E+P+EW +W+ WAEYWYNTT+  S+G+SPFQAVYGR PP LI YG+ ETPNSTLD+QL++RDV LGALK+HL++AQ++MK +AD+KRR            
Subjt:  ERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRR------------

Query:  --HTHRQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKNE-KEVWEV
            +RQ S+RK RNEKLSPKYFGPYRIL+RIG +AYKLELP++A IHP+FH+SQLKKA G+    + L P++ E HEW+  P+E+Y Y+KN+  + WE 
Subjt:  --HTHRQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKNE-KEVWEV

Query:  LMSWKGLLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY
        L+ WKGL  HE TWENY D +  FP+FHLEDKV LE+E + RPPI+  Y
Subjt:  LMSWKGLLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY

A0A5A7UAE4 Ty3/gypsy retrotransposon protein (e-value: 6.1e-228, identity: 59.42%)
Query:  WGSQE--AFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQP
        WG +E  AF +L++AMMTLPVL +PDF+ PFE+E+D+ G+GVGAVL Q ++P+A++S TL +RDR++PVYEREL+AVVLAVQRWRPYLLGR F VKTDQ 
Subjt:  WGSQE--AFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQP

Query:  SLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKIQKE--EVTNYTLQQGML
        SLK+LLEQRV+QPQYQKW+AKLLGYSFEVVY+PG ENKA D LSR+ PT  LNQ+T P LID+++++EE  +D  L+EII  I+++  E+ +YTLQQG+L
Subjt:  SLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKIQKE--EVTNYTLQQGML

Query:  RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSS------------
        ++KGRLV++  S+L+PTI+H YHDSV GG+SGFLRTYKRLTGE++       + RY       +RNK+ ALTPAGLL+P+EIP  + S            
Subjt:  RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSS------------

Query:  ------------------------IPFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLC
                                 PF AK+VAE F+KE+VRLHG+P+SIVSD DK+FLS+FW+EL RLAGTKLNRS++YHPQ DGQT+V+N+SVE YL 
Subjt:  ------------------------IPFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLC

Query:  CFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRRHT------
        CFCGE+P+EW +W+ WAEYWYNTT+  S+G++PFQAVYGR PP LIYYGD ETPNSTLD+QLK+RD+ LGALK+HL++AQ++MK  AD KRR        
Subjt:  CFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRRHT------

Query:  --------HRQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKN-EKE
                +RQ S+RK RNEKLSPKYFGPYR+L+RIG +AY+LELP  A IHP+FH+SQLKKA G     + L P++ ENHEW+  P+EVYGY+KN    
Subjt:  --------HRQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKN-EKE

Query:  VWEVLMSWKGLLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY
         WE L+SWKGL  HE TWE+  D +  FP+FHLEDKV LE+E + RPPI+  Y
Subjt:  VWEVLMSWKGLLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY

A0A5A7VG68 Transposon Tf2-9 polyprotein (e-value: 3.2e-253, identity: 68.12%)
Query:  FVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQPSLKFLLEQ
        F RL++AMMTLPVLALPDF+ PFE++ D+ GY VG VLMQNKRPIAF+SHTL +RDRAKPVYERELMAVVLAVQRWRPYLLGR F+VK DQ SLKFLLEQ
Subjt:  FVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQPSLKFLLEQ

Query:  RVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKI-QKEEVTNYTLQQGMLRYKGRLVIA
        RVIQPQY KWIAKLLGYSFEV+YKPG ENKA D LSRVP  V LNQLT P LIDLK+IREEV +D++LK+II +I ++EEV  YT+Q GML+YKGR+VIA
Subjt:  RVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKI-QKEEVTNYTLQQGMLRYKGRLVIA

Query:  KNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSSI--------------------
        K+S+LIPTI+H YHDSV  G+SGFLRTYKRLTGELF       + +Y       +RNK+L+L+PAGLL P+EIPSR+                       
Subjt:  KNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSSI--------------------

Query:  ----------------PFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLCCFCGERPKE
                        PF A+IVAE+F+KE+VRLHGFP+SIVSD DK+FLS+FW+EL RLAGTKLNRSTTYHPQ DGQT+V++RSVE YL CFCGERPKE
Subjt:  ----------------PFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLCCFCGERPKE

Query:  WLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRRHT--------------H
        W+KWI WAEYWYN T+Q+SLGVSPFQAVYGRTP  L+ YGD  T NSTLDEQLK+RD+ALGALK+HLR+AQ KMK+YAD+KRRH               +
Subjt:  WLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRRHT--------------H

Query:  RQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKNEKEVWEVLMSWKG
        RQVSMRK RNEKLSPKYFGPY++LK+IG +AY+LELP +ATIHP+FHISQLK+AFG+C N + L P++TE HEWLAVPDE +GYQKN K  WEVLMSWKG
Subjt:  RQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKNEKEVWEVLMSWKG

Query:  LLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY
        L  HE TWE Y DFQQSF D+H+ED+ KLE+ECNVRPPIIH Y
Subjt:  LLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY

A0A5D3C5N7 Ty3/gypsy retrotransposon protein (e-value: 6.1e-228, identity: 59.42%)
Query:  WGSQE--AFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQP
        WG +E  AF +L++AMMTLPVL +PDF+ PFE+E+D+ G+GVGAVL Q ++P+A++S TL +RDR++PVYEREL+AVVLAVQRWRPYLLGR F VKTDQ 
Subjt:  WGSQE--AFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQP

Query:  SLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKIQKE--EVTNYTLQQGML
        SLK+LLEQRV+QPQYQKW+AKLLGYSFEVVY+PG ENKA D LSR+ PT  LNQ+T P LID+++++EE  +D  L+EII  I+++  E+ +YTLQQG+L
Subjt:  SLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKIQKE--EVTNYTLQQGML

Query:  RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSS------------
        ++KGRLV++  S+L+PTI+H YHDSV GG+SGFLRTYKRLTGE++       + RY       +RNK+ ALTPAGLL+P+EIP  + S            
Subjt:  RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSS------------

Query:  ------------------------IPFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLC
                                 PF AK+VAE F+KE+VRLHG+P+SIVSD DK+FLS+FW+EL RLAGTKLNRS++YHPQ DGQT+V+N+SVE YL 
Subjt:  ------------------------IPFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLC

Query:  CFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRRHT------
        CFCGE+P+EW +W+ WAEYWYNTT+  S+G++PFQAVYGR PP LIYYGD ETPNSTLD+QLK+RD+ LGALK+HL++AQ++MK  AD KRR        
Subjt:  CFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRRHT------

Query:  --------HRQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKN-EKE
                +RQ S+RK RNEKLSPKYFGPYR+L+RIG +AY+LELP  A IHP+FH+SQLKKA G     + L P++ ENHEW+  P+EVYGY+KN    
Subjt:  --------HRQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKN-EKE

Query:  VWEVLMSWKGLLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY
         WE L+SWKGL  HE TWE+  D +  FP+FHLEDKV LE+E + RPPI+  Y
Subjt:  VWEVLMSWKGLLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY

A0A5D3DM31 Ty3/gypsy retrotransposon protein (e-value: 6.1e-228, identity: 59.42%)
Query:  WGSQE--AFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQP
        WG +E  AF +L++AMMTLPVL +PDF+ PFE+E+D+ G+GVGAVL Q ++P+A++S TL +RDR++PVYEREL+AVVLAVQRWRPYLLGR F VKTDQ 
Subjt:  WGSQE--AFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQP

Query:  SLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKIQKE--EVTNYTLQQGML
        SLK+LLEQRV+QPQYQKW+AKLLGYSFEVVY+PG ENKA D LSR+ PT  LNQ+T P LID+++++EE  +D  L+EII  I+++  E+ +YTLQQG+L
Subjt:  SLKFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKIQKE--EVTNYTLQQGML

Query:  RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSS------------
        ++KGRLV++  S+L+PTI+H YHDSV GG+SGFLRTYKRLTGE++       + RY       +RNK+ ALTPAGLL+P+EIP  + S            
Subjt:  RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELF-------LGRY-------ERNKALALTPAGLLVPVEIPSRVSS------------

Query:  ------------------------IPFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLC
                                 PF AK+VAE F+KE+VRLHG+P+SIVSD DK+FLS+FW+EL RLAGTKLNRS++YHPQ DGQT+V+N+SVE YL 
Subjt:  ------------------------IPFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHPQMDGQTKVINRSVEIYLC

Query:  CFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRRHT------
        CFCGE+P+EW +W+ WAEYWYNTT+  S+G++PFQAVYGR PP LIYYGD ETPNSTLD+QLK+RD+ LGALK+HL++AQ++MK  AD KRR        
Subjt:  CFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRRHT------

Query:  --------HRQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKN-EKE
                +RQ S+RK RNEKLSPKYFGPYR+L+RIG +AY+LELP  A IHP+FH+SQLKKA G     + L P++ ENHEW+  P+EVYGY+KN    
Subjt:  --------HRQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKN-EKE

Query:  VWEVLMSWKGLLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY
         WE L+SWKGL  HE TWE+  D +  FP+FHLEDKV LE+E + RPPI+  Y
Subjt:  VWEVLMSWKGLLRHEGTWENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPY

SwissProt top hits (e-value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein (e-value: 6.7e-38, identity: 25.43%)
Query:  EAFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNK-----RPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLG--RAFIVKTDQ
        +A   ++Q +++ PVL   DF+    +ETD+    VGAVL Q        P+ +YS  +        V ++E++A++ +++ WR YL      F + TD 
Subjt:  EAFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNK-----RPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLG--RAFIVKTDQ

Query:  PSL--KFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSR-------VPPTVHLNQLTTPNLIDL-----KVIREEVEKDEHLKEIISKIQK
         +L  +   E      +  +W   L  ++FE+ Y+PG  N   D LSR       +P     N +   N I +       +  E   D  L  +++   K
Subjt:  PSL--KFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSR-------VPPTVHLNQLTTPNLIDL-----KVIREEVEKDEHLKEIISKIQK

Query:  EEVTNYTLQQGML-RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELFLGRY-------------------ERNKALALTPAGLLVPV-
            N  L+ G+L   K ++++  ++ L  TI+  YH+     + G       L   + L R+                   + NK+    P G L P+ 
Subjt:  EEVTNYTLQQGML-RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELFLGRY-------------------ERNKALALTPAGLLVPV-

Query:  --EIPSRVSSIPFV---------------------------------AKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYH
          E P    S+ F+                                 A+  A +F + ++   G P+ I++D D IF S  W++        +  S  Y 
Subjt:  --EIPSRVSSIPFV---------------------------------AKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYH

Query:  PQMDGQTKVINRSVEIYLCCFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETP--NSTLDEQLKERDVALGALKDHLRIA
        PQ DGQT+  N++VE  L C C   P  W+  I   +  YN     +  ++PF+ V+ R  PAL      E P  +   DE  +E       +K+HL   
Subjt:  PQMDGQTKVINRSVEIYLCCFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETP--NSTLDEQLKERDVALGALKDHLRIA

Query:  QKKMKSYADMKRRHTHR-------QVSMRK----WRNEKLSPKYFGPYRILKRIGPIAYKLELPTSA--TIHPIFHISQLKK
          KMK Y DMK +            V   K     ++ KL+P + GP+ +L++ GP  Y+L+LP S        FH+S L+K
Subjt:  QKKMKSYADMKRRHTHR-------QVSMRK----WRNEKLSPKYFGPYRILKRIGPIAYKLELPTSA--TIHPIFHISQLKK

P0CT35 Transposon Tf2-2 polyprotein (e-value: 6.7e-38, identity: 25.43%)
Query:  EAFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNK-----RPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLG--RAFIVKTDQ
        +A   ++Q +++ PVL   DF+    +ETD+    VGAVL Q        P+ +YS  +        V ++E++A++ +++ WR YL      F + TD 
Subjt:  EAFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNK-----RPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLG--RAFIVKTDQ

Query:  PSL--KFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSR-------VPPTVHLNQLTTPNLIDL-----KVIREEVEKDEHLKEIISKIQK
         +L  +   E      +  +W   L  ++FE+ Y+PG  N   D LSR       +P     N +   N I +       +  E   D  L  +++   K
Subjt:  PSL--KFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSR-------VPPTVHLNQLTTPNLIDL-----KVIREEVEKDEHLKEIISKIQK

Query:  EEVTNYTLQQGML-RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELFLGRY-------------------ERNKALALTPAGLLVPV-
            N  L+ G+L   K ++++  ++ L  TI+  YH+     + G       L   + L R+                   + NK+    P G L P+ 
Subjt:  EEVTNYTLQQGML-RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELFLGRY-------------------ERNKALALTPAGLLVPV-

Query:  --EIPSRVSSIPFV---------------------------------AKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYH
          E P    S+ F+                                 A+  A +F + ++   G P+ I++D D IF S  W++        +  S  Y 
Subjt:  --EIPSRVSSIPFV---------------------------------AKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYH

Query:  PQMDGQTKVINRSVEIYLCCFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETP--NSTLDEQLKERDVALGALKDHLRIA
        PQ DGQT+  N++VE  L C C   P  W+  I   +  YN     +  ++PF+ V+ R  PAL      E P  +   DE  +E       +K+HL   
Subjt:  PQMDGQTKVINRSVEIYLCCFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETP--NSTLDEQLKERDVALGALKDHLRIA

Query:  QKKMKSYADMKRRHTHR-------QVSMRK----WRNEKLSPKYFGPYRILKRIGPIAYKLELPTSA--TIHPIFHISQLKK
          KMK Y DMK +            V   K     ++ KL+P + GP+ +L++ GP  Y+L+LP S        FH+S L+K
Subjt:  QKKMKSYADMKRRHTHR-------QVSMRK----WRNEKLSPKYFGPYRILKRIGPIAYKLELPTSA--TIHPIFHISQLKK

P0CT36 Transposon Tf2-3 polyprotein (e-value: 6.7e-38, identity: 25.43%)
Query:  EAFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNK-----RPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLG--RAFIVKTDQ
        +A   ++Q +++ PVL   DF+    +ETD+    VGAVL Q        P+ +YS  +        V ++E++A++ +++ WR YL      F + TD 
Subjt:  EAFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNK-----RPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLG--RAFIVKTDQ

Query:  PSL--KFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSR-------VPPTVHLNQLTTPNLIDL-----KVIREEVEKDEHLKEIISKIQK
         +L  +   E      +  +W   L  ++FE+ Y+PG  N   D LSR       +P     N +   N I +       +  E   D  L  +++   K
Subjt:  PSL--KFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSR-------VPPTVHLNQLTTPNLIDL-----KVIREEVEKDEHLKEIISKIQK

Query:  EEVTNYTLQQGML-RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELFLGRY-------------------ERNKALALTPAGLLVPV-
            N  L+ G+L   K ++++  ++ L  TI+  YH+     + G       L   + L R+                   + NK+    P G L P+ 
Subjt:  EEVTNYTLQQGML-RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELFLGRY-------------------ERNKALALTPAGLLVPV-

Query:  --EIPSRVSSIPFV---------------------------------AKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYH
          E P    S+ F+                                 A+  A +F + ++   G P+ I++D D IF S  W++        +  S  Y 
Subjt:  --EIPSRVSSIPFV---------------------------------AKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYH

Query:  PQMDGQTKVINRSVEIYLCCFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETP--NSTLDEQLKERDVALGALKDHLRIA
        PQ DGQT+  N++VE  L C C   P  W+  I   +  YN     +  ++PF+ V+ R  PAL      E P  +   DE  +E       +K+HL   
Subjt:  PQMDGQTKVINRSVEIYLCCFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETP--NSTLDEQLKERDVALGALKDHLRIA

Query:  QKKMKSYADMKRRHTHR-------QVSMRK----WRNEKLSPKYFGPYRILKRIGPIAYKLELPTSA--TIHPIFHISQLKK
          KMK Y DMK +            V   K     ++ KL+P + GP+ +L++ GP  Y+L+LP S        FH+S L+K
Subjt:  QKKMKSYADMKRRHTHR-------QVSMRK----WRNEKLSPKYFGPYRILKRIGPIAYKLELPTSA--TIHPIFHISQLKK

P0CT41 Transposon Tf2-12 polyprotein (e-value: 6.7e-38, identity: 25.43%)
Query:  EAFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNK-----RPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLG--RAFIVKTDQ
        +A   ++Q +++ PVL   DF+    +ETD+    VGAVL Q        P+ +YS  +        V ++E++A++ +++ WR YL      F + TD 
Subjt:  EAFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNK-----RPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLG--RAFIVKTDQ

Query:  PSL--KFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSR-------VPPTVHLNQLTTPNLIDL-----KVIREEVEKDEHLKEIISKIQK
         +L  +   E      +  +W   L  ++FE+ Y+PG  N   D LSR       +P     N +   N I +       +  E   D  L  +++   K
Subjt:  PSL--KFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSR-------VPPTVHLNQLTTPNLIDL-----KVIREEVEKDEHLKEIISKIQK

Query:  EEVTNYTLQQGML-RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELFLGRY-------------------ERNKALALTPAGLLVPV-
            N  L+ G+L   K ++++  ++ L  TI+  YH+     + G       L   + L R+                   + NK+    P G L P+ 
Subjt:  EEVTNYTLQQGML-RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELFLGRY-------------------ERNKALALTPAGLLVPV-

Query:  --EIPSRVSSIPFV---------------------------------AKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYH
          E P    S+ F+                                 A+  A +F + ++   G P+ I++D D IF S  W++        +  S  Y 
Subjt:  --EIPSRVSSIPFV---------------------------------AKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYH

Query:  PQMDGQTKVINRSVEIYLCCFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETP--NSTLDEQLKERDVALGALKDHLRIA
        PQ DGQT+  N++VE  L C C   P  W+  I   +  YN     +  ++PF+ V+ R  PAL      E P  +   DE  +E       +K+HL   
Subjt:  PQMDGQTKVINRSVEIYLCCFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETP--NSTLDEQLKERDVALGALKDHLRIA

Query:  QKKMKSYADMKRRHTHR-------QVSMRK----WRNEKLSPKYFGPYRILKRIGPIAYKLELPTSA--TIHPIFHISQLKK
          KMK Y DMK +            V   K     ++ KL+P + GP+ +L++ GP  Y+L+LP S        FH+S L+K
Subjt:  QKKMKSYADMKRRHTHR-------QVSMRK----WRNEKLSPKYFGPYRILKRIGPIAYKLELPTSA--TIHPIFHISQLKK

Q9UR07 Transposon Tf2-11 polyprotein (e-value: 6.7e-38, identity: 25.43%)
Query:  EAFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNK-----RPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLG--RAFIVKTDQ
        +A   ++Q +++ PVL   DF+    +ETD+    VGAVL Q        P+ +YS  +        V ++E++A++ +++ WR YL      F + TD 
Subjt:  EAFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNK-----RPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLG--RAFIVKTDQ

Query:  PSL--KFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSR-------VPPTVHLNQLTTPNLIDL-----KVIREEVEKDEHLKEIISKIQK
         +L  +   E      +  +W   L  ++FE+ Y+PG  N   D LSR       +P     N +   N I +       +  E   D  L  +++   K
Subjt:  PSL--KFLLEQRVIQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSR-------VPPTVHLNQLTTPNLIDL-----KVIREEVEKDEHLKEIISKIQK

Query:  EEVTNYTLQQGML-RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELFLGRY-------------------ERNKALALTPAGLLVPV-
            N  L+ G+L   K ++++  ++ L  TI+  YH+     + G       L   + L R+                   + NK+    P G L P+ 
Subjt:  EEVTNYTLQQGML-RYKGRLVIAKNSSLIPTIMHIYHDSVLGGYSGFLRTYKRLTGELFLGRY-------------------ERNKALALTPAGLLVPV-

Query:  --EIPSRVSSIPFV---------------------------------AKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYH
          E P    S+ F+                                 A+  A +F + ++   G P+ I++D D IF S  W++        +  S  Y 
Subjt:  --EIPSRVSSIPFV---------------------------------AKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYH

Query:  PQMDGQTKVINRSVEIYLCCFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETP--NSTLDEQLKERDVALGALKDHLRIA
        PQ DGQT+  N++VE  L C C   P  W+  I   +  YN     +  ++PF+ V+ R  PAL      E P  +   DE  +E       +K+HL   
Subjt:  PQMDGQTKVINRSVEIYLCCFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETP--NSTLDEQLKERDVALGALKDHLRIA

Query:  QKKMKSYADMKRRHTHR-------QVSMRK----WRNEKLSPKYFGPYRILKRIGPIAYKLELPTSA--TIHPIFHISQLKK
          KMK Y DMK +            V   K     ++ KL+P + GP+ +L++ GP  Y+L+LP S        FH+S L+K
Subjt:  QKKMKSYADMKRRHTHR-------QVSMRK----WRNEKLSPKYFGPYRILKRIGPIAYKLELPTSA--TIHPIFHISQLKK

Arabidopsis top hits    e value    %identity    Alignment
AT1G75340.1 Zinc finger C-x8-C-x5-C-x3-H type family protein    4.5e-05    36.63
Query:  SQTF--GNFSTLSGFDIKNAGSNIFSSAALT--------NLPPTNANSSASGQIAPNAQLVN-----KLQQENSSVDVDIWMKEKWVPGEIPEMAPPDAV
        S TF   +F    GF      +NIF  +  T        N    N N   +   AP     N     +LQ     VD  IW+KEKW PGEIPE APPDA 
Subjt:  SQTF--GNFSTLSGFDIKNAGSNIFSSAALT--------NLPPTNANSSASGQIAPNAQLVN-----KLQQENSSVDVDIWMKEKWVPGEIPEMAPPDAV

Query:  I
        +
Subjt:  I

AT1G75340.2 Zinc finger C-x8-C-x5-C-x3-H type family protein    4.5e-05    36.63
Query:  SQTF--GNFSTLSGFDIKNAGSNIFSSAALT--------NLPPTNANSSASGQIAPNAQLVN-----KLQQENSSVDVDIWMKEKWVPGEIPEMAPPDAV
        S TF   +F    GF      +NIF  +  T        N    N N   +   AP     N     +LQ     VD  IW+KEKW PGEIPE APPDA 
Subjt:  SQTF--GNFSTLSGFDIKNAGSNIFSSAALT--------NLPPTNANSSASGQIAPNAQLVN-----KLQQENSSVDVDIWMKEKWVPGEIPEMAPPDAV

Query:  I
        +
Subjt:  I


Sequences
CDS sequence
ATGGACTGGGGAAGCCAAGAAGCTTTCGTAAGGTTGCAACAAGCAATGATGACTCTGCCTGTTTTGGCACTACCAGATTTTAATACACCATTTGAAGTTGAAACAGATTC
ATTGGGCTATGGAGTGGGAGCAGTCCTAATGCAGAATAAGAGACCAATTGCTTTTTATAGCCACACATTAGTTATGAGGGACCGTGCCAAACCAGTCTATGAGAGGGAGT
TGATGGCAGTAGTGTTGGCAGTACAACGTTGGCGACCATATCTATTAGGAAGGGCATTTATAGTCAAGACTGATCAGCCCTCTCTTAAATTTCTGCTTGAACAGAGGGTG
ATACAGCCACAATACCAGAAGTGGATTGCAAAATTACTTGGCTACTCATTTGAAGTGGTTTATAAGCCCGGTTTCGAGAACAAGGCAGAGGATACCCTATCTCGAGTACC
ACCTACTGTACATCTCAACCAATTAACAACCCCTAATTTGATTGATTTGAAGGTTATAAGGGAGGAGGTTGAAAAGGATGAACACTTGAAGGAGATAATCAGTAAGATAC
AAAAAGAAGAAGTAACAAATTATACTTTGCAACAAGGGATGCTCCGGTACAAAGGAAGATTAGTGATTGCGAAGAACTCTTCCTTAATTCCTACTATTATGCACATTTAC
CATGATTCTGTCCTTGGGGGCTATTCAGGGTTCTTAAGAACTTACAAGAGGCTAACAGGAGAACTATTTTTGGGAAGGTATGAACGAAATAAAGCATTAGCACTCACGCC
TGCAGGGTTATTGGTTCCAGTGGAGATACCGAGTAGAGTCTCAAGCATCCCTTTTGTTGCCAAGATTGTGGCAGAATTATTTATGAAGGAGATAGTAAGGTTGCATGGCT
TTCCACAATCAATTGTTTCTGACTGTGATAAGATTTTTCTGAGCAATTTCTGGAGGGAACTACTCCGTTTGGCAGGCACTAAATTGAATCGGAGCACCACTTATCATCCT
CAAATGGATGGTCAGACAAAGGTTATTAACAGATCAGTGGAGATTTACTTATGTTGCTTTTGTGGGGAGAGACCGAAAGAATGGTTAAAATGGATTCCTTGGGCTGAATA
CTGGTATAACACTACATTCCAACGATCATTGGGAGTGTCACCTTTTCAAGCTGTATATGGACGAACACCACCAGCCCTTATATATTATGGAGACTGGGAAACTCCAAATT
CCACACTAGATGAGCAACTTAAAGAAAGAGATGTAGCCTTGGGTGCTTTAAAAGATCATTTACGAATAGCCCAAAAAAAGATGAAGAGTTATGCAGATATGAAGAGAAGA
CATACCCATAGACAGGTTTCTATGAGGAAGTGGAGGAATGAGAAGCTGTCACCTAAATATTTCGGTCCTTACCGAATTTTGAAGAGAATTGGCCCTATCGCGTATAAGTT
GGAATTACCTACATCAGCAACTATTCACCCTATTTTCCATATTTCACAGCTGAAGAAAGCCTTCGGGGAGTGCACAAATAAGGAGGAATTGGTACCATTTTTGACTGAGA
ATCACGAGTGGCTAGCTGTACCTGATGAGGTCTACGGATACCAAAAGAATGAGAAAGAAGTTTGGGAAGTTTTGATGAGTTGGAAGGGACTGCTGCGTCATGAGGGAACT
TGGGAAAATTATGATGATTTCCAACAGTCCTTCCCTGATTTCCACCTTGAGGACAAGGTGAAACTGGAGCAGGAATGCAATGTTAGACCACCCATAATACACCCGTACAT
GTTTATCAACTTCAATATGTTTCATCTTGTCATGATGGACTGGATTGGACTGTGGGCAATGGAAATTGCTGCTTTGTTTTTGTGCTACCTGCCTAATGCAGTCAATGTAA
ACTTCAAGATTGAGATGGTCATGTCTTTGAAACAATATGCATTTTCCATGAGTGGAGTTGGTAGCCAAACATTTGGGAACTTCTCTACTCTAAGTGGCTTTGACATAAAA
AATGCTGGAAGTAATATTTTCTCTTCAGCAGCCCTAACAAATCTCCCTCCAACGAATGCAAACTCAAGTGCCAGTGGACAAATTGCACCAAATGCCCAATTGGTAAATAA
GTTACAGCAAGAAAATAGTTCCGTGGATGTTGACATTTGGATGAAAGAGAAATGGGTTCCTGGAGAGATACCGGAAATGGCTCCTCCTGATGCAGTTATTCAGTAA
mRNA sequence
ATGGACTGGGGAAGCCAAGAAGCTTTCGTAAGGTTGCAACAAGCAATGATGACTCTGCCTGTTTTGGCACTACCAGATTTTAATACACCATTTGAAGTTGAAACAGATTC
ATTGGGCTATGGAGTGGGAGCAGTCCTAATGCAGAATAAGAGACCAATTGCTTTTTATAGCCACACATTAGTTATGAGGGACCGTGCCAAACCAGTCTATGAGAGGGAGT
TGATGGCAGTAGTGTTGGCAGTACAACGTTGGCGACCATATCTATTAGGAAGGGCATTTATAGTCAAGACTGATCAGCCCTCTCTTAAATTTCTGCTTGAACAGAGGGTG
ATACAGCCACAATACCAGAAGTGGATTGCAAAATTACTTGGCTACTCATTTGAAGTGGTTTATAAGCCCGGTTTCGAGAACAAGGCAGAGGATACCCTATCTCGAGTACC
ACCTACTGTACATCTCAACCAATTAACAACCCCTAATTTGATTGATTTGAAGGTTATAAGGGAGGAGGTTGAAAAGGATGAACACTTGAAGGAGATAATCAGTAAGATAC
AAAAAGAAGAAGTAACAAATTATACTTTGCAACAAGGGATGCTCCGGTACAAAGGAAGATTAGTGATTGCGAAGAACTCTTCCTTAATTCCTACTATTATGCACATTTAC
CATGATTCTGTCCTTGGGGGCTATTCAGGGTTCTTAAGAACTTACAAGAGGCTAACAGGAGAACTATTTTTGGGAAGGTATGAACGAAATAAAGCATTAGCACTCACGCC
TGCAGGGTTATTGGTTCCAGTGGAGATACCGAGTAGAGTCTCAAGCATCCCTTTTGTTGCCAAGATTGTGGCAGAATTATTTATGAAGGAGATAGTAAGGTTGCATGGCT
TTCCACAATCAATTGTTTCTGACTGTGATAAGATTTTTCTGAGCAATTTCTGGAGGGAACTACTCCGTTTGGCAGGCACTAAATTGAATCGGAGCACCACTTATCATCCT
CAAATGGATGGTCAGACAAAGGTTATTAACAGATCAGTGGAGATTTACTTATGTTGCTTTTGTGGGGAGAGACCGAAAGAATGGTTAAAATGGATTCCTTGGGCTGAATA
CTGGTATAACACTACATTCCAACGATCATTGGGAGTGTCACCTTTTCAAGCTGTATATGGACGAACACCACCAGCCCTTATATATTATGGAGACTGGGAAACTCCAAATT
CCACACTAGATGAGCAACTTAAAGAAAGAGATGTAGCCTTGGGTGCTTTAAAAGATCATTTACGAATAGCCCAAAAAAAGATGAAGAGTTATGCAGATATGAAGAGAAGA
CATACCCATAGACAGGTTTCTATGAGGAAGTGGAGGAATGAGAAGCTGTCACCTAAATATTTCGGTCCTTACCGAATTTTGAAGAGAATTGGCCCTATCGCGTATAAGTT
GGAATTACCTACATCAGCAACTATTCACCCTATTTTCCATATTTCACAGCTGAAGAAAGCCTTCGGGGAGTGCACAAATAAGGAGGAATTGGTACCATTTTTGACTGAGA
ATCACGAGTGGCTAGCTGTACCTGATGAGGTCTACGGATACCAAAAGAATGAGAAAGAAGTTTGGGAAGTTTTGATGAGTTGGAAGGGACTGCTGCGTCATGAGGGAACT
TGGGAAAATTATGATGATTTCCAACAGTCCTTCCCTGATTTCCACCTTGAGGACAAGGTGAAACTGGAGCAGGAATGCAATGTTAGACCACCCATAATACACCCGTACAT
GTTTATCAACTTCAATATGTTTCATCTTGTCATGATGGACTGGATTGGACTGTGGGCAATGGAAATTGCTGCTTTGTTTTTGTGCTACCTGCCTAATGCAGTCAATGTAA
ACTTCAAGATTGAGATGGTCATGTCTTTGAAACAATATGCATTTTCCATGAGTGGAGTTGGTAGCCAAACATTTGGGAACTTCTCTACTCTAAGTGGCTTTGACATAAAA
AATGCTGGAAGTAATATTTTCTCTTCAGCAGCCCTAACAAATCTCCCTCCAACGAATGCAAACTCAAGTGCCAGTGGACAAATTGCACCAAATGCCCAATTGGTAAATAA
GTTACAGCAAGAAAATAGTTCCGTGGATGTTGACATTTGGATGAAAGAGAAATGGGTTCCTGGAGAGATACCGGAAATGGCTCCTCCTGATGCAGTTATTCAGTAACACC
GGCGGATTTAGTATAGAGCCTCCTTCAACTTTATTGTCTTTATATAATATGTATAAACAAAACCATTTAATGCTTAATATTTATAGATAGTTTAGTGGTAACTATGCTTG
GTTTAGGGTTTTCTGGATCCGCCACTGTTCAATAACCCATATACTAAAATACGTACGTGGAAATGCATGTACTGAGGTACCTAGATTTGATAGCAAGATTTAGTATTAGT
AGCGGTTTAGGAAATATGAATTTATTAACTTGAAACTTTCTGAGTTTAAAAGGGGAATTCTTAGAATTTTGCAATAATATATTAAATGACACAAGATCC
Protein sequence
MDWGSQEAFVRLQQAMMTLPVLALPDFNTPFEVETDSLGYGVGAVLMQNKRPIAFYSHTLVMRDRAKPVYERELMAVVLAVQRWRPYLLGRAFIVKTDQPSLKFLLEQRV
IQPQYQKWIAKLLGYSFEVVYKPGFENKAEDTLSRVPPTVHLNQLTTPNLIDLKVIREEVEKDEHLKEIISKIQKEEVTNYTLQQGMLRYKGRLVIAKNSSLIPTIMHIY
HDSVLGGYSGFLRTYKRLTGELFLGRYERNKALALTPAGLLVPVEIPSRVSSIPFVAKIVAELFMKEIVRLHGFPQSIVSDCDKIFLSNFWRELLRLAGTKLNRSTTYHP
QMDGQTKVINRSVEIYLCCFCGERPKEWLKWIPWAEYWYNTTFQRSLGVSPFQAVYGRTPPALIYYGDWETPNSTLDEQLKERDVALGALKDHLRIAQKKMKSYADMKRR
HTHRQVSMRKWRNEKLSPKYFGPYRILKRIGPIAYKLELPTSATIHPIFHISQLKKAFGECTNKEELVPFLTENHEWLAVPDEVYGYQKNEKEVWEVLMSWKGLLRHEGT
WENYDDFQQSFPDFHLEDKVKLEQECNVRPPIIHPYMFINFNMFHLVMMDWIGLWAMEIAALFLCYLPNAVNVNFKIEMVMSLKQYAFSMSGVGSQTFGNFSTLSGFDIK
NAGSNIFSSAALTNLPPTNANSSASGQIAPNAQLVNKLQQENSSVDVDIWMKEKWVPGEIPEMAPPDAVIQ