; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039869 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039869
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr13:530237..535904
RNA-Seq ExpressionLag0039869
SyntenyLag0039869
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]1.9e-24136.79Show/hide
Query:  TIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSE
        ++KLDR N+ LWK+L LP++R  KL+G++ G   CPE+F+ ++ +S N                           N  +  W A DQ LLGW+ NSMT+E
Subjt:  TIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSE

Query:  VATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQDRAGISW
        +ATQ++  E +++LW   Q L G  +R++  YL+  F   RKG  KM DYL  MKN  D L  AG+PV+T  LI Q L GLD EYNPVV  + D+  +SW
Subjt:  VATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQDRAGISW

Query:  SEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRG-AGGRGRGRARGYGSFSNKPVCQVCGKTGHTALMCYQ
         ++QA+LL FE R+E  NNL    +L+  A+ N+AN       R+   G  +NN +   N RG  GGRGRG+       S K  CQVCG + H A+ C+ 
Subjt:  SEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRG-AGGRGRGRARGYGSFSNKPVCQVCGKTGHTALMCYQ

Query:  RFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAFIGNS
        RF+K +       N   G ++Q + N      AF+A+QN         +V D +WY DSGASNHVT       +  E+ GK S+ +G+G KL I   G+S
Subjt:  RFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAFIGNS

Query:  CLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPICNLVNSKDINNELSGFV
         L    K+  L  +L VP+I KNL+SVSKLA DN + VEF +  C VKDK TGKV+LKG+L +GLY+  G K                        S FV
Subjt:  CLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPICNLVNSKDINNELSGFV

Query:  LSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFIDDFS
                  K  WH+RLGHP+ +VL  V++SC + +  +++F FCEACQYGK H LPF +S+SHA +  +LVHTD+WGPAP+ + +GFKYY+ F+DDFS
Subjt:  LSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFIDDFS

Query:  RFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDA
        RF WIYPLK+K+E ++AF+ F  L +NQFN  IK +Q D GGEY  + +L  E G++ R+SCP+TSQQNGRAERKHRH+ E GLTLLAQA MPL +W +A
Subjt:  RFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDA

Query:  FLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLFISRHVRFNEEEFPFASG
        F  AV LIN LPSQV   +SP  L+  K+ D++ L+ FG AC+PCL+ Y  HK QYHT +CV+LG S  HKG+KCL+S GR+FISRHV FNE+ FPF  G
Subjt:  FLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLFISRHVRFNEEEFPFASG

Query:  FQQVVTTAASSSNV-SPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVAENHCSASESS
        F    +   ++ NV S S PL           T     +  S  +   E P  T +  S DV                + D+  +    +E++ +  E+ 
Subjt:  FQQVVTTAASSSNV-SPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVAENHCSASESS

Query:  ASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKL---WLTHACTDWSVTEPTRIADALATPQWREAMD--------------------------------
          TQ  ++          H + TR K+GI KPKL    LT    D    EP    +AL+ P W+EAM                                 
Subjt:  ASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKL---WLTHACTDWSVTEPTRIADALATPQWREAMD--------------------------------

Query:  -------------------------------IDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAIYGL
                                         PV+KAST+R+ILSIAV   W++RQLD NNAFLNG L E V+M QP G+ D+   N++CKL KAIYGL
Subjt:  -------------------------------IDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAIYGL

Query:  KQAPRAWNNTLKATLLSWGYS--------IIFKYRD----------------------------------------------------------------
        KQAPRAW ++LK  LL+WG+          + K +D                                                                
Subjt:  KQAPRAWNNTLKATLLSWGYS--------IIFKYRD----------------------------------------------------------------

Query:  --------------------------------------------------------------SSYRS---------------------------------
                                                                      S Y S                                 
Subjt:  --------------------------------------------------------------SSYRS---------------------------------

Query:  ------------------------------------------------SNISALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHAR
                                                        S   ALA  + E+ W+R L+TE+  P   K +LWCDN+SA ALA+NPV HAR
Subjt:  ------------------------------------------------SNISALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHAR

Query:  TKHIEIDVHFVRDQVLQGKLEVRYIPSNEQPADCLTKTLSHSQFAYLRCKLGVAELP
        +KHIEIDVH++RDQVLQ ++ V Y+P+ +Q ADCLTK LSH++F+ LR KLGV   P
Subjt:  TKHIEIDVHFVRDQVLQGKLEVRYIPSNEQPADCLTKTLSHSQFAYLRCKLGVAELP

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]4.8e-22435.03Show/hide
Query:  TTSFNSPPLNQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWV
        +++ NSP  N  L  I ++KLDR N+ LWK+L L ++R  KL+G++ G + CPE+FV +A  S                          +++NP +  W+
Subjt:  TTSFNSPPLNQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWV

Query:  AVDQLLLGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDE
        A DQ LLGWL NSM  ++ATQ++  E +++LW   Q L G  +++   YL+  F  +RKG  KM +YL  MKN +D L  AGSP++   L+ Q L GLD 
Subjt:  AVDQLLLGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDE

Query:  EYNPVVAMIQDRAGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRD-SGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKP
        EYNPVV  + D+  +SW ++QA+LL FE RL+  NN  S L+L+  AS N AN  +  GN+ N            RGN R +  RG    RG G  SN  
Subjt:  EYNPVVAMIQDRAGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRD-SGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKP

Query:  VCQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKES
         CQVC  TGH A+ C  RF++ +           G N     + QG  +AF         +A+P    D  WY DSGA+NHVT   +      E+ GK S
Subjt:  VCQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKES

Query:  VTIGDGNKLKIAFIGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPI
        + +G+G KLKI   G++ L     N  L  VL VP I KNL+SVSKL  DN + VEF    C VKDK TG+ LLKG L +GLY+          +N+ P 
Subjt:  VTIGDGNKLKIAFIGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPI

Query:  CNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPL
        C  ++ K+                      WH++LGHP+ +VL  V+K CN+ +S ++ F FCEACQ+GK H LPF  S+SH  +   L+H+D+WGPAP+
Subjt:  CNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPL

Query:  DSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETG
         S +GFKYY+ FIDDFSRF WI+PLK+K++ + AF+ F  L +NQFN  IK +Q D GGEY  + ++  E G++ R+SCP+TSQQNGRAERKHRHV E G
Subjt:  DSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETG

Query:  LTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLF
        LTLLAQA MPL++W +AF  AV LIN LPS V   +SP  L++ ++ D+ AL+ FG AC+PCL+ Y  HK Q+HT +CV++G S  HKG+KC++S GR+F
Subjt:  LTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLF

Query:  ISRHVRFNEEEFPFASGF----QQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHE
        +SRHV FNE  FPF  GF      + T   +SS + P+               T Q+   P    + ++    T S  S+D      ++      + T+ 
Subjt:  ISRHVRFNEEEFPFASGF----QQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHE

Query:  DSPPSQVEVAENHCSASESSASTQSSNLPPTPSTVQAG-HPMVTRGKAGIFKPKL-WLTHACTDWSVTEPTRIADALATPQWREAMD-------------
         S  +Q   A+N   + + + ST +  +        +  H M TR K GI KPK+ ++  A TD    EP  + +AL  P W+EAMD             
Subjt:  DSPPSQVEVAENHCSASESSASTQSSNLPPTPSTVQAG-HPMVTRGKAGIFKPKL-WLTHACTDWSVTEPTRIADALATPQWREAMD-------------

Query:  --------------------------------------------------IDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGY
                                                            PVVK+ST+R+IL+IAV   W++RQLD NNAFLNG+L E V+M QP GY
Subjt:  --------------------------------------------------IDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGY

Query:  EDAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGY-------SIIF--------------------------------------------------
         DA   N++CKL KAIYGLKQAPRAW ++L++TL++WG+       S+ F                                                  
Subjt:  EDAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGY-------SIIF--------------------------------------------------

Query:  ----------------KY----------------------------------------------------------------------------------
                        KY                                                                                  
Subjt:  ----------------KY----------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------RDSSYRS--------SNISALALASTEVIWLRQ--
                                                                          +S YRS        S  S   L S+E   L    
Subjt:  -----------------------------------------------------------------RDSSYRS--------SNISALALASTEVIWLRQ--

Query:  ----LMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQVLQGKLEVRYIPSNEQPADCLTKTLSHSQFAYLRCKLGVAELPS
            L+ E+  P   K VLWCDN+SA ALA+NPV HAR+KHIEID+H++RDQVL+ K+ + Y+P+ +Q ADCLTK L H++F  +R KLGV   PS
Subjt:  ----LMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQVLQGKLEVRYIPSNEQPADCLTKTLSHSQFAYLRCKLGVAELPS

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]2.9e-21342.73Show/hide
Query:  MTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQDRA
        MT EVATQ++  E +Q++W   Q L G  +R+   +L+  F ++RKG  KM +YL  MK  AD+L  AGS V+T  L++Q L GLD EYNP+V  + D+ 
Subjt:  MTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQDRA

Query:  GISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVN--MANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKPVCQVCGKTGHTA
         ++W EMQA+LL +E RLE  NN +S+L+L+P ++++  + N R   N      GG+ N G         GGRGRGRA       ++ VCQVC K GH A
Subjt:  GISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVN--MANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKPVCQVCGKTGHTA

Query:  LMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIA
          CY RFNK ++G    QN  E  + ++ +           N N N+ VA+P TV D +WY DSGASNHVT D N +    E +GK  +T+G+G  LKI 
Subjt:  LMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIA

Query:  FIGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPICNLVNSKDINNE
          G+S L  + K+  LK +L VP I KNL+S+SKL  DN+++VEFHD  C VKDK TG++LL+G + +GLY+  G    +T TN+ P             
Subjt:  FIGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPICNLVNSKDINNE

Query:  LSGFVLSSTLNVAIS-KAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYIL
                  +V  S K  WH++LGHP+ +VL  V+K CN+  S  E+F+FCEACQ+GK+H LPF NS S A +  DLVH+D+WGPAP+ SV+GFKYY+L
Subjt:  LSGFVLSSTLNVAIS-KAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYIL

Query:  FIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPL
        F+DD+SRF WIYPLK+K++  +AF+ F  LV+NQFN  IK LQ D GGE+  + ++  + G+++R SCP+TS QNGRAERKHRHVVE+GLTLLAQA MPL
Subjt:  FIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPL

Query:  KFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLFISRHVRFNEEE
         +W +AF  AV LIN LP+QV+  KSP + L+ K  D+ A++ FG AC+PCL+ Y  HK Q+HT KCV+LG S  HKG+KCL+S+GR+FISRHV FNE  
Subjt:  KFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLFISRHVRFNEEE

Query:  FPFASGFQQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVAENH--
        FPF  GF   + T   +  ++    L F                     +SP      T SN + + Q L    +  S+    H      QVE AEN   
Subjt:  FPFASGFQQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVAENH--

Query:  CSASESSASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKLWLTHAC-TDWSVTEPTRIADALATPQWREAM----------------------------
          A+ S  +  +S +     ++   H M TR K GI KPK     A        EP    +AL  P+W++AM                            
Subjt:  CSASESSASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKLWLTHAC-TDWSVTEPTRIADALATPQWREAM----------------------------

Query:  -------------------------------DID----PVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDK
                                       D D    PV+KA T+R+ILSIAV   W++RQ+D NNAFLNG L E V+MRQP G+ D     ++CKL K
Subjt:  -------------------------------DID----PVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDK

Query:  AIYGLKQAPRAWNNTLKATLLSWGY
        AIYGLKQAPR+W + L+  LL WG+
Subjt:  AIYGLKQAPRAWNNTLKATLLSWGY

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]7.1e-22041.23Show/hide
Query:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYN
        L    ++KLDR NF LWK+L LP++R  K +G++ G   CP++FV     S++ T                       ++NP Y+ W A DQ LLGWL N
Subjt:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYN

Query:  SMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQDR
        SMT ++ATQV+  E +++LW   Q L G  +R+   YL+  F  + K   KM  YL  MKN AD L  AGSP+++  L+ Q L GLD EYNPVV  + D+
Subjt:  SMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQDR

Query:  AGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKPVCQVCGKTGHTAL
          ISW + QA+LL FE RL+  NN     +++  AS N A+  +SG  +   +G R   G+   N RG  G GRGRAR   S   +P+CQ+CGK GHTA 
Subjt:  AGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKPVCQVCGKTGHTAL

Query:  MCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAF
         CY RF+K +              + +   G+G  +AF         VA+P    D  WY DSGASNHVT     L +  E  GK S+ +G+G KLKI  
Subjt:  MCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAF

Query:  IGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPICNLVNSKDINNEL
         G++ L     +  L+ VL VP I KNL+SVSKL  DN   VEF + +C VKDK TGK LLKG L +GLY+    K   T+ +     +L          
Subjt:  IGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPICNLVNSKDINNEL

Query:  SGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFI
                      K IWH++LGHP+ +VL+ V+K  N+ +S ++ F FCEACQ+GK H LPF  S+SHA +  DL+HTD+WGPAP+ S + FKYY+ F+
Subjt:  SGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFI

Query:  DDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKF
        DDFSRF WI+PLK+K+E + AF  F  LV+NQFN  IK ++ D GGEY  + +   + G++ ++SCP+TSQQNGRAERKHRHV E GLTLLAQA MPL +
Subjt:  DDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKF

Query:  WVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLFISRHVRFNEEEFP
        W +AF  AV LIN LPS V   +SP  L++ K+ D+ AL+ FG AC+PCL+ Y  HK Q+HT +CV+LG S  HKG+KC++S GR+F+SRHV FNE  FP
Subjt:  WVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLFISRHVRFNEEEFP

Query:  FASGFQQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVAENHCSAS
        F  GF   + T      V+   P+ F   S P   TT    E     V   E P     N+  D Q + ++    +              E A       
Subjt:  FASGFQQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVAENHCSAS

Query:  ESSASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKL---WLTHACTDWSVTEPTRIADALATPQWREAMDID---------------------------
         S   T+++  PP    +   H M TR KAG++KPKL    LT    +    EP  +++AL+ P+W  AMD +                           
Subjt:  ESSASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKL---WLTHACTDWSVTEPTRIADALATPQWREAMDID---------------------------

Query:  ------------------------------------PVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAI
                                            PVVK+ST+R+ILSIAV   W++RQLD NNAFLNG L E V+M QP GY D    +++C+L+KAI
Subjt:  ------------------------------------PVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAI

Query:  YGLKQAPRAWNNTLKATLLSWGY
        YGLKQAPRAW + L+ TLLSWG+
Subjt:  YGLKQAPRAWNNTLKATLLSWGY

PNY01489.1 copia-like polyprotein, partial [Trifolium pratense]4.3e-20138.8Show/hide
Query:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYN
        L  I ++KLDR N+ LWK+L LP++R  K +G++ G   CPE+FV +A  S                          +++NP ++ W+A DQ LLGWL N
Subjt:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYN

Query:  SMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQDR
        SM  ++ATQ++  E +++LW   Q L G  +++   YL+  F  +RKG  KM +YL  MKN +D L  +GSP++   L+ Q L GLD EYNPVV  + D+
Subjt:  SMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQDR

Query:  AGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKPVCQVCGKTGHTAL
          +SW ++QA+LL FE RL+  NN  S L+L+  AS N AN  +           R N    RGN R +  RG    RG G  SN   CQVC  TGHTA+
Subjt:  AGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKPVCQVCGKTGHTAL

Query:  MCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAF
         C  RF++ +           G N     + QG  +AF         VA+P    D  WY DSGASNHVT   +      E+ GK S+ +G+G KLKI  
Subjt:  MCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAF

Query:  IGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPICNLVNSKDINNEL
         G++ L        L  VL VP I KNL+SVSKL  DN +FVEF    C VKDK TG+ LLKG L +GLY+   +  V+  +N+ P C  ++ K+     
Subjt:  IGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPICNLVNSKDINNEL

Query:  SGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFI
                         WH++LGHP+ +VL+ V+K CN+ +S ++ F FCEACQ+GK H LPF +S+SH  +   L+H+D+WGPAP+ S +GFKYY+ FI
Subjt:  SGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFI

Query:  DDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKF
        DDFSRF WI+PLK+K++ + AF+ F  L +NQFN  IK +Q D GGEY  + ++  E G++ R+SCP+TSQQNGRAERKHRHVVE GLTLLAQA MPL++
Subjt:  DDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKF

Query:  WVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLFISRHVRFNEEEFP
        W +AF  AV LIN L S V   +SP  L++ ++ D+ AL+ FG AC+PCL+ Y  HK Q+HT +CV++G S  HKG     + G    S +   N+++  
Subjt:  WVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLFISRHVRFNEEEFP

Query:  FASGFQQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVAENHCSAS
          +  Q   +T +S +N                    +++ +   + V+ N        N ST                   +D        +E+  S +
Subjt:  FASGFQQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVAENHCSAS

Query:  ESSASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKL-WLTHACTDWSVTEPTRIADALATPQWREAMD-------------------------------
         +  S Q ++   T +     H + TR K GI KPKL ++    TD    EP  + +AL  P W+EAMD                               
Subjt:  ESSASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKL-WLTHACTDWSVTEPTRIADALATPQWREAMD-------------------------------

Query:  --------------------------------IDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAIYG
                                          PVVK+ST+R+IL+IAV   W++RQLD NNAFLNG+L E V+M QP GY D    N++CKL KAIYG
Subjt:  --------------------------------IDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAIYG

Query:  LKQAPRAWNNTLKATLLSWGY
        LKQAPRAW ++L++TL++WG+
Subjt:  LKQAPRAWNNTLKATLLSWGY

TrEMBL top hitse value%identityAlignment
A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)3.4e-22041.23Show/hide
Query:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYN
        L    ++KLDR NF LWK+L LP++R  K +G++ G   CP++FV     S++ T                       ++NP Y+ W A DQ LLGWL N
Subjt:  LNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYN

Query:  SMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQDR
        SMT ++ATQV+  E +++LW   Q L G  +R+   YL+  F  + K   KM  YL  MKN AD L  AGSP+++  L+ Q L GLD EYNPVV  + D+
Subjt:  SMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQDR

Query:  AGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKPVCQVCGKTGHTAL
          ISW + QA+LL FE RL+  NN     +++  AS N A+  +SG  +   +G R   G+   N RG  G GRGRAR   S   +P+CQ+CGK GHTA 
Subjt:  AGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKPVCQVCGKTGHTAL

Query:  MCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAF
         CY RF+K +              + +   G+G  +AF         VA+P    D  WY DSGASNHVT     L +  E  GK S+ +G+G KLKI  
Subjt:  MCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAF

Query:  IGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPICNLVNSKDINNEL
         G++ L     +  L+ VL VP I KNL+SVSKL  DN   VEF + +C VKDK TGK LLKG L +GLY+    K   T+ +     +L          
Subjt:  IGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPICNLVNSKDINNEL

Query:  SGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFI
                      K IWH++LGHP+ +VL+ V+K  N+ +S ++ F FCEACQ+GK H LPF  S+SHA +  DL+HTD+WGPAP+ S + FKYY+ F+
Subjt:  SGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFI

Query:  DDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKF
        DDFSRF WI+PLK+K+E + AF  F  LV+NQFN  IK ++ D GGEY  + +   + G++ ++SCP+TSQQNGRAERKHRHV E GLTLLAQA MPL +
Subjt:  DDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKF

Query:  WVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLFISRHVRFNEEEFP
        W +AF  AV LIN LPS V   +SP  L++ K+ D+ AL+ FG AC+PCL+ Y  HK Q+HT +CV+LG S  HKG+KC++S GR+F+SRHV FNE  FP
Subjt:  WVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLFISRHVRFNEEEFP

Query:  FASGFQQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVAENHCSAS
        F  GF   + T      V+   P+ F   S P   TT    E     V   E P     N+  D Q + ++    +              E A       
Subjt:  FASGFQQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVAENHCSAS

Query:  ESSASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKL---WLTHACTDWSVTEPTRIADALATPQWREAMDID---------------------------
         S   T+++  PP    +   H M TR KAG++KPKL    LT    +    EP  +++AL+ P+W  AMD +                           
Subjt:  ESSASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKL---WLTHACTDWSVTEPTRIADALATPQWREAMDID---------------------------

Query:  ------------------------------------PVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAI
                                            PVVK+ST+R+ILSIAV   W++RQLD NNAFLNG L E V+M QP GY D    +++C+L+KAI
Subjt:  ------------------------------------PVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAI

Query:  YGLKQAPRAWNNTLKATLLSWGY
        YGLKQAPRAW + L+ TLLSWG+
Subjt:  YGLKQAPRAWNNTLKATLLSWGY

A0A2Z6MBG6 Integrase catalytic domain-containing protein9.3e-24236.79Show/hide
Query:  TIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSE
        ++KLDR N+ LWK+L LP++R  KL+G++ G   CPE+F+ ++ +S N                           N  +  W A DQ LLGW+ NSMT+E
Subjt:  TIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSE

Query:  VATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQDRAGISW
        +ATQ++  E +++LW   Q L G  +R++  YL+  F   RKG  KM DYL  MKN  D L  AG+PV+T  LI Q L GLD EYNPVV  + D+  +SW
Subjt:  VATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQDRAGISW

Query:  SEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRG-AGGRGRGRARGYGSFSNKPVCQVCGKTGHTALMCYQ
         ++QA+LL FE R+E  NNL    +L+  A+ N+AN       R+   G  +NN +   N RG  GGRGRG+       S K  CQVCG + H A+ C+ 
Subjt:  SEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRG-AGGRGRGRARGYGSFSNKPVCQVCGKTGHTALMCYQ

Query:  RFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAFIGNS
        RF+K +       N   G ++Q + N      AF+A+QN         +V D +WY DSGASNHVT       +  E+ GK S+ +G+G KL I   G+S
Subjt:  RFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAFIGNS

Query:  CLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPICNLVNSKDINNELSGFV
         L    K+  L  +L VP+I KNL+SVSKLA DN + VEF +  C VKDK TGKV+LKG+L +GLY+  G K                        S FV
Subjt:  CLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPICNLVNSKDINNELSGFV

Query:  LSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFIDDFS
                  K  WH+RLGHP+ +VL  V++SC + +  +++F FCEACQYGK H LPF +S+SHA +  +LVHTD+WGPAP+ + +GFKYY+ F+DDFS
Subjt:  LSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFIDDFS

Query:  RFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDA
        RF WIYPLK+K+E ++AF+ F  L +NQFN  IK +Q D GGEY  + +L  E G++ R+SCP+TSQQNGRAERKHRH+ E GLTLLAQA MPL +W +A
Subjt:  RFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDA

Query:  FLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLFISRHVRFNEEEFPFASG
        F  AV LIN LPSQV   +SP  L+  K+ D++ L+ FG AC+PCL+ Y  HK QYHT +CV+LG S  HKG+KCL+S GR+FISRHV FNE+ FPF  G
Subjt:  FLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLFISRHVRFNEEEFPFASG

Query:  FQQVVTTAASSSNV-SPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVAENHCSASESS
        F    +   ++ NV S S PL           T     +  S  +   E P  T +  S DV                + D+  +    +E++ +  E+ 
Subjt:  FQQVVTTAASSSNV-SPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVAENHCSASESS

Query:  ASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKL---WLTHACTDWSVTEPTRIADALATPQWREAMD--------------------------------
          TQ  ++          H + TR K+GI KPKL    LT    D    EP    +AL+ P W+EAM                                 
Subjt:  ASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKL---WLTHACTDWSVTEPTRIADALATPQWREAMD--------------------------------

Query:  -------------------------------IDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAIYGL
                                         PV+KAST+R+ILSIAV   W++RQLD NNAFLNG L E V+M QP G+ D+   N++CKL KAIYGL
Subjt:  -------------------------------IDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAIYGL

Query:  KQAPRAWNNTLKATLLSWGYS--------IIFKYRD----------------------------------------------------------------
        KQAPRAW ++LK  LL+WG+          + K +D                                                                
Subjt:  KQAPRAWNNTLKATLLSWGYS--------IIFKYRD----------------------------------------------------------------

Query:  --------------------------------------------------------------SSYRS---------------------------------
                                                                      S Y S                                 
Subjt:  --------------------------------------------------------------SSYRS---------------------------------

Query:  ------------------------------------------------SNISALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHAR
                                                        S   ALA  + E+ W+R L+TE+  P   K +LWCDN+SA ALA+NPV HAR
Subjt:  ------------------------------------------------SNISALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHAR

Query:  TKHIEIDVHFVRDQVLQGKLEVRYIPSNEQPADCLTKTLSHSQFAYLRCKLGVAELP
        +KHIEIDVH++RDQVLQ ++ V Y+P+ +Q ADCLTK LSH++F+ LR KLGV   P
Subjt:  TKHIEIDVHFVRDQVLQGKLEVRYIPSNEQPADCLTKTLSHSQFAYLRCKLGVAELP

A0A2Z6P4D5 Integrase catalytic domain-containing protein2.3e-22435.03Show/hide
Query:  TTSFNSPPLNQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWV
        +++ NSP  N  L  I ++KLDR N+ LWK+L L ++R  KL+G++ G + CPE+FV +A  S                          +++NP +  W+
Subjt:  TTSFNSPPLNQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWV

Query:  AVDQLLLGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDE
        A DQ LLGWL NSM  ++ATQ++  E +++LW   Q L G  +++   YL+  F  +RKG  KM +YL  MKN +D L  AGSP++   L+ Q L GLD 
Subjt:  AVDQLLLGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDE

Query:  EYNPVVAMIQDRAGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRD-SGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKP
        EYNPVV  + D+  +SW ++QA+LL FE RL+  NN  S L+L+  AS N AN  +  GN+ N            RGN R +  RG    RG G  SN  
Subjt:  EYNPVVAMIQDRAGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRD-SGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKP

Query:  VCQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKES
         CQVC  TGH A+ C  RF++ +           G N     + QG  +AF         +A+P    D  WY DSGA+NHVT   +      E+ GK S
Subjt:  VCQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKES

Query:  VTIGDGNKLKIAFIGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPI
        + +G+G KLKI   G++ L     N  L  VL VP I KNL+SVSKL  DN + VEF    C VKDK TG+ LLKG L +GLY+          +N+ P 
Subjt:  VTIGDGNKLKIAFIGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPI

Query:  CNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPL
        C  ++ K+                      WH++LGHP+ +VL  V+K CN+ +S ++ F FCEACQ+GK H LPF  S+SH  +   L+H+D+WGPAP+
Subjt:  CNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPL

Query:  DSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETG
         S +GFKYY+ FIDDFSRF WI+PLK+K++ + AF+ F  L +NQFN  IK +Q D GGEY  + ++  E G++ R+SCP+TSQQNGRAERKHRHV E G
Subjt:  DSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETG

Query:  LTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLF
        LTLLAQA MPL++W +AF  AV LIN LPS V   +SP  L++ ++ D+ AL+ FG AC+PCL+ Y  HK Q+HT +CV++G S  HKG+KC++S GR+F
Subjt:  LTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLF

Query:  ISRHVRFNEEEFPFASGF----QQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHE
        +SRHV FNE  FPF  GF      + T   +SS + P+               T Q+   P    + ++    T S  S+D      ++      + T+ 
Subjt:  ISRHVRFNEEEFPFASGF----QQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAELSPQSSLMPTHE

Query:  DSPPSQVEVAENHCSASESSASTQSSNLPPTPSTVQAG-HPMVTRGKAGIFKPKL-WLTHACTDWSVTEPTRIADALATPQWREAMD-------------
         S  +Q   A+N   + + + ST +  +        +  H M TR K GI KPK+ ++  A TD    EP  + +AL  P W+EAMD             
Subjt:  DSPPSQVEVAENHCSASESSASTQSSNLPPTPSTVQAG-HPMVTRGKAGIFKPKL-WLTHACTDWSVTEPTRIADALATPQWREAMD-------------

Query:  --------------------------------------------------IDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGY
                                                            PVVK+ST+R+IL+IAV   W++RQLD NNAFLNG+L E V+M QP GY
Subjt:  --------------------------------------------------IDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGY

Query:  EDAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGY-------SIIF--------------------------------------------------
         DA   N++CKL KAIYGLKQAPRAW ++L++TL++WG+       S+ F                                                  
Subjt:  EDAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGY-------SIIF--------------------------------------------------

Query:  ----------------KY----------------------------------------------------------------------------------
                        KY                                                                                  
Subjt:  ----------------KY----------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------RDSSYRS--------SNISALALASTEVIWLRQ--
                                                                          +S YRS        S  S   L S+E   L    
Subjt:  -----------------------------------------------------------------RDSSYRS--------SNISALALASTEVIWLRQ--

Query:  ----LMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQVLQGKLEVRYIPSNEQPADCLTKTLSHSQFAYLRCKLGVAELPS
            L+ E+  P   K VLWCDN+SA ALA+NPV HAR+KHIEID+H++RDQVL+ K+ + Y+P+ +Q ADCLTK L H++F  +R KLGV   PS
Subjt:  ----LMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQVLQGKLEVRYIPSNEQPADCLTKTLSHSQFAYLRCKLGVAELPS

A0A803PM38 Uncharacterized protein4.3e-23937.24Show/hide
Query:  PPLNQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLL
        P     LNQ   +KLDR NF LW+ +   I+R ++L+G+L G  P P++F+ +       T  +   +S G             ++NP +E W+  DQLL
Subjt:  PPLNQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLL

Query:  LGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPVV
        LGWLY SMT  +A +VMG +++  LW A++ELFG  S+A+ D  R   Q +RKG   M DYLR  +  AD L  AG P     L+S VL GLD EY P+V
Subjt:  LGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPVV

Query:  AMIQDRAGISWSEMQAELLVFEKRLELQNNLKSSLSLS---PGASVNMAN-----SRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNK
         +I+ R   +W ++Q  LL  + ++E  ++   S  L+      S ++AN       + GN  N + GG +NN       RG GGR  G          +
Subjt:  AMIQDRAGISWSEMQAELLVFEKRLELQNNLKSSLSLS---PGASVNMAN-----SRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNK

Query:  PVCQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKE
        P CQVCGK GH+A  CY R                                                          GASNH+T++ N +    EY GKE
Subjt:  PVCQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKE

Query:  SVTIGDGNKLKIAFIG-NSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTT-DTNE
         VT+ +GN+L I  IG  S          LK++L VPSI KNL+S+SKL  DN V VEF    C VKDK+TG+V+LKG L +GLY+FD   + T+  +N 
Subjt:  SVTIGDGNKLKIAFIG-NSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTT-DTNE

Query:  LPICNLVNSKDINNELSGFVLSSTLNVAIS--KAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLW
           C    S  + + +   V     N  +   K  WH+RLGHPS  VL +V+   N+  ++N S  FC+ACQ GKSH+LPF  +   A    +LVHTD+W
Subjt:  LPICNLVNSKDINNELSGFVLSSTLNVAIS--KAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLW

Query:  GPAPLDSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRH
        GP+P+ S   F+YYI FIDDFSR+ WIYPLK K+EAL AFV F  LV+NQFNS +K +QTD GGEY    +  ++ G+  +  CPHTS QNGRAERKHRH
Subjt:  GPAPLDSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRH

Query:  VVETGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSS
        +VE GLTLLAQA +P K+W DAF  AV LIN LP+ VL  K+P E+L+ ++ D++ L+VFG +CFPCLR YQ HKFQ+H+ KCV LG S  HKG+KCLSS
Subjt:  VVETGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSS

Query:  SGRLFISRHVRFNEEEFPFASGFQQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNE----CPPTT----PSNSSTDVQPLPAELSP
        +GRL+ISR V FNE+EFPF SGF   + T    + VS  +P W ++ S  + +++ QN    S G +  +      PTT    P  S+         +S 
Subjt:  SGRLFISRHVRFNEEEFPFASGFQQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNE----CPPTT----PSNSSTDVQPLPAELSP

Query:  QSSLMPTHEDSPPSQVEVAENHCSASESSASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKLWLTHACTDWSVTEPTRIADALATPQWREAMD------
          ++    +       +      +A     S    NL    ST    HPM+TR KAGIFKPK +LT      + +EP  I +AL    W  AM       
Subjt:  QSSLMPTHEDSPPSQVEVAENHCSASESSASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKLWLTHACTDWSVTEPTRIADALATPQWREAMD------

Query:  ---------------------------------------------------------IDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVY
                                                                   PV+KAST+R++LSIAVTK W++RQLD NNAFLNG + ED+Y
Subjt:  ---------------------------------------------------------IDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVY

Query:  MRQPPGYEDAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGY------SIIFKYRDSSY-------------------------------------
        M+QP G+ED    N+VCKL K+IYGL+QAPRAW + LKATL SW +      S +F  + SSY                                     
Subjt:  MRQPPGYEDAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGY------SIIFKYRDSSY-------------------------------------

Query:  -----------------------------------------------------------------------------------------------RSSNI
                                                                                                       RSS  
Subjt:  -----------------------------------------------------------------------------------------------RSSNI

Query:  S---ALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQVLQGKLEVRYIPSNEQPADCLTKTLSHSQFAYLR
        S   ALA  + E+ W++ L+ E+ FP  +  ++WCDN+ A ALA+NPV+HARTKHIEID+HFVRD++++ KLEVRYIPS+EQ ADCLTK+L+H    +L 
Subjt:  S---ALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQVLQGKLEVRYIPSNEQPADCLTKTLSHSQFAYLR

Query:  CKLGVAELPSSLREDIKDKRPKGPDKQESKLDQP
         KLGV  +P SLR ++++   +   + +S  D P
Subjt:  CKLGVAELPSSLREDIKDKRPKGPDKQESKLDQP

A0A803QCY3 Uncharacterized protein7.9e-22538.62Show/hide
Query:  TSTGNPAYVTGTTSFN------SPPLN-----QLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGA
        T+ GNP  VT   S N      +P L+       L Q  ++KLD  N+ LWK +   I+R ++L+G L+G + CP ++V              G+T  G 
Subjt:  TSTGNPAYVTGTTSFN------SPPLN-----QLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGA

Query:  VASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNL
               S   + LNP +E+W+  DQLL+GWLY+SMT  +AT+VMG  +A  LW A+++L+G  S+++ D  R + Q ++KG + M +YLR  K+ AD+L
Subjt:  VASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNL

Query:  GQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQDRAGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQ
          AG P     L + VL  LD  Y  +V  I+ R   SW E+Q  LL FE ++E                                 G   NN  GR   
Subjt:  GQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQDRAGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQ

Query:  RGAGGRGRGRARGYGSFSNKPVCQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGAS
        RG GGR RGR R   + ++KP CQVCGK  H+A++CY  F+  ++G              +NQN  GQ      N N ++ +A PE +    W+ADSGAS
Subjt:  RGAGGRGRGRARGYGSFSNKPVCQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGAS

Query:  NHVTADYNNLANPVEYEGKESVTIGDGNKLKIAFIGNSCLAAEHKNF-KLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVL
        N++TAD + +    EY GKE VT+G+G+KL I+  GN  L  +   + KL ++L VP IAKN +SVSKL  DN+V +EFH   C VKD  T +VLL+G+L
Subjt:  NHVTADYNNLANPVEYEGKESVTIGDGNKLKIAFIGNSCLAAEHKNF-KLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVL

Query:  DEGLYRFDGVKAVTTDTNELPICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPN
         +GLY+      + T  N+                S ++  ST                      K +V  CN P +V+    FC+ACQYGKSH+LPF +
Subjt:  DEGLYRFDGVKAVTTDTNELPICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPN

Query:  STSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLS
        S S AL   DLVHTDLWGP+P+ S   FKYY+ F+DD +RF WIYPLK K+EA +AF+ F +L +NQF   IKAL+TD GGEY  +       G+    S
Subjt:  STSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLS

Query:  CPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKC
        CPHTS QNGRAERKHRH+VE GLTLLAQ+ MPLK+W DAF  AV LIN LP+ +L  K+P E+L+ K  D++ L+ FG ACFPCLR YQAHKFQ+H+ KC
Subjt:  CPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKC

Query:  VYLGPSPLHKGHKCLSSSGRLFISRHVRFNEEEFPFASGF------QQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPS
        V LG S  HKG+KCLS +GR++I R V FNE EFPF   F      + +V   +S+ +V PS+                     P+TG   +      PS
Subjt:  VYLGPSPLHKGHKCLSSSGRLFISRHVRFNEEEFPFASGF------QQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPS

Query:  NSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVA-----ENHCSASESSASTQSSN---LPPTPSTVQAGHPMVTRGKAGIFKPKLWLTHACTDWSVTEP
        +SS + +P     +PQS          P+ V  A      N     E +A   +     +PP PS    G+  V R K  +            D ++   
Subjt:  NSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVA-----ENHCSASESSASTQSSN---LPPTPSTVQAGHPMVTRGKAGIFKPKLWLTHACTDWSVTEP

Query:  TRIADALATPQWREAMDID------PVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAIYGLKQAPRAWN
         R+   L    + + + ID      PV+KAST+R+IL+IAV+KGW +RQLD NNAFLNG L+EDV+M QP G+E+    ++VCKL+K++YGLKQA RAW+
Subjt:  TRIADALATPQWREAMDID------PVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAIYGLKQAPRAWN

Query:  NT---------------------------------------------------------------------------------------------LKATL
         T                                                                                             L  TL
Subjt:  NT---------------------------------------------------------------------------------------------LKATL

Query:  LSWGYSIIFKYRDSSYRSSNISALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQVLQGKLEVRYIPSNEQ
        +SW          SS   S   AL   S ++ W+  L+ EIGFP     V WCDN+ A ALA+NPVFHAR KHIEID+HFVRD+VL+ +LEVRYIPS++Q
Subjt:  LSWGYSIIFKYRDSSYRSSNISALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQVLQGKLEVRYIPSNEQ

Query:  PADCLTKTLSHSQFAYLRCKLGVAELPSSLREDIKD
         ADCLTK L+ S+F +L  K+G    P  LR D+++
Subjt:  PADCLTKTLSHSQFAYLRCKLGVAELPSSLREDIKD

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.2e-4121.98Show/hide
Query:  NPLYESWVAVDQLLLGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLR-QVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLI
        N + +SW   ++     +   ++            A+++   +  ++  +S A +  LR ++          +  +  I       L  AG+ +     I
Subjt:  NPLYESWVAVDQLLLGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLR-QVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLI

Query:  SQVLLGLDEEYNPVVAMIQDRAGISWSEMQAELLVFEKR-LELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRAR
        S +L+ L   Y+ ++  I+     + SE    L   + R L+ +  +K+  + +    +N     ++   +N  +  R                 + +  
Subjt:  SQVLLGLDEEYNPVVAMIQDRAGISWSEMQAELLVFEKR-LELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRAR

Query:  GYGSFSNKPVCQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLAN
          G+   K  C  CG+ GH    C+      +   + N+N+    N +  Q       AF+  +  N+ V     + +  +  DSGAS+H+  D +   +
Subjt:  GYGSFSNKPVCQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLAN

Query:  PVEYEGKESVTIGDGNKLKIAFIGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAV
         VE      + +    +   A               L+ VL     A NL+SV +L ++  + +EF      +       V   G+L             
Subjt:  PVEYEGKESVTIGDGNKLKIAFIGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAV

Query:  TTDTNELPICNL----VNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVL-----KSVVKSCNLPLSVNESFKFCEACQYGKSHALPFP--NST
            N +P+ N     +N+K  NN                  +WH+R GH S+  L     K++    +L  ++  S + CE C  GK   LPF      
Subjt:  TTDTNELPICNL----VNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVL-----KSVVKSCNLPLSVNESFKFCEACQYGKSHALPFP--NST

Query:  SHALDKFDLVHTDLWGPAPLDSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYV--KIHQLCNEMGVKIRLS
        +H      +VH+D+ GP    +++   Y+++F+D F+ +   Y +K K++    F  F A  +  FN  +  L  DNG EY+  ++ Q C + G+   L+
Subjt:  SHALDKFDLVHTDLWGPAPLDSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYV--KIHQLCNEMGVKIRLS

Query:  CPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVL--GGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTE
         PHT Q NG +ER  R + E   T+++ A +   FW +A L A  LIN +PS+ L    K+P E+ + KK   + LRVFG+  +  ++  Q  KF   + 
Subjt:  CPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVL--GGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTE

Query:  KCVYLGPSPLHKGHKCLSSSGRLFI-SRHVRFNEEEFPFASGFQ-QVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGV-----SPNECPPT
        K +++G  P   G K   +    FI +R V  +E     +   + + V    S  + + + P    N S   I+T   N       +     S       
Subjt:  KCVYLGPSPLHKGHKCLSSSGRLFI-SRHVRFNEEEFPFASGFQ-QVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGV-----SPNECPPT

Query:  TPSNSSTDVQPLPAELSPQSSLMPTHEDSP--------PSQVEVAENHCSASESSASTQSSNLPPTPS----------TVQAGHPMVTRGKAGI-FKPKL
         P++S   +Q      S +   +   +DS          S+    ++H + S+ S +   S    T            T   G  ++ R    +  KP++
Subjt:  TPSNSSTDVQPLPAELSPQSSLMPTHEDSP--------PSQVEVAENHCSASESSASTQSSNLPPTPS----------TVQAGHPMVTRGKAGI-FKPKL

Query:  --------------------------------------WLTHACTD---------WSVTE-----------------------PTRIADALATPQWREAM
                                              W     T+         W++T+                       P R    L    + +  
Subjt:  --------------------------------------WLTHACTD---------WSVTE-----------------------PTRIADALATPQWREAM

Query:  DID------PVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKC-SNYVCKLDKAIYGLKQAPRAWNNTLKATL
         ID      PV + S+ R ILS+ +    ++ Q+D   AFLNG L E++YMR P G     C S+ VCKL+KAIYGLKQA R W    +  L
Subjt:  DID------PVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKC-SNYVCKLDKAIYGLKQAPRAWNNTLKATL

P04146 Copia protein1.0e-1136.61Show/hide
Query:  SSYRSSNISALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQVLQGKLEVRYIPSNEQPADCLTKTLSHSQ
        +S   +   AL  A  E +WL+ L+T I     +   ++ DN    ++A NP  H R KHI+I  HF R+QV    + + YIP+  Q AD  TK L  ++
Subjt:  SSYRSSNISALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQVLQGKLEVRYIPSNEQPADCLTKTLSHSQ

Query:  FAYLRCKLGVAE
        F  LR KLG+ +
Subjt:  FAYLRCKLGVAE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.7e-7326.92Show/hide
Query:  ESWVAVDQLLLGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYL-RQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVL
        E W  +D+     +   ++ +V   ++  + A+ +W  ++ L+  ++   + YL +Q++       +    +L +       L   G  +        +L
Subjt:  ESWVAVDQLLLGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYL-RQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVL

Query:  LGLDEEY-NPVVAMIQDRAGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGS
          L   Y N    ++  +  I   ++ + LL+ EK  +   N   +L ++ G              R +SY  R++N +GR     +G RG+ + R    
Subjt:  LGLDEEY-NPVVAMIQDRAGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGS

Query:  FSNKPVCQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVA-NQNVNSCVAAPETVI-----DPNWYADSGASNHVTADYNNL
          N   C  C + GH    C          P   + +GE + ++N+ N     AA V  N NV   +   E  +     +  W  D+ AS+H T   +  
Subjt:  FSNKPVCQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVA-NQNVNSCVAAPETVI-----DPNWYADSGASNHVTADYNNL

Query:  ANPVEYEGKESVTIGDGNKLKIAFIGNSCLAAE-HKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGV
           V  +   +V +G+ +  KIA IG+ C+         LK V  VP +  NL+S   L +D       +  + L K      V+ KGV    LYR    
Subjt:  ANPVEYEGKESVTIGDGNKLKIAFIGNSCLAAE-HKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGV

Query:  KAVTTDTNELPICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFD
                        N++    EL      +     IS  +WHKR+GH SE+ L+ + K   +  +   + K C+ C +GK H + F  S+   L+  D
Subjt:  KAVTTDTNELPICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFD

Query:  LVHTDLWGPAPLDSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYV--KIHQLCNEMGVKIRLSCPHTSQQN
        LV++D+ GP  ++S+ G KY++ FIDD SR +W+Y LK K++  + F  F ALV+ +    +K L++DNGGEY   +  + C+  G++   + P T Q N
Subjt:  LVHTDLWGPAPLDSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYV--KIHQLCNEMGVKIRLSCPHTSQQN

Query:  GRAERKHRHVVETGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPL
        G AER +R +VE   ++L  A +P  FW +A   A  LIN  PS  L  + P  +   K++ +  L+VFG   F  + K Q  K    +  C+++G    
Subjt:  GRAERKHRHVVETGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPL

Query:  HKGHKCLSSSGRLFI-SRHVRFNEEEFPFASGFQQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAEL
          G++      +  I SR V F E E         V T A  S  V   I         P+  T      +PST  +P     TT   S    Q  P E+
Subjt:  HKGHKCLSSSGRLFI-SRHVRFNEEEFPFASGFQQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNECPPTTPSNSSTDVQPLPAEL

Query:  SPQSSLM---------PTHEDSPPSQVEVAENHCSASESSASTQ----SSNLPPTPSTVQAGHP------------MVTRGKAGIFK----PK-------
          Q   +         PT  +     +  +E     S    ST+    S +  P        HP            M +  K G +K    PK       
Subjt:  SPQSSLM---------PTHEDSPPSQVEVAENHCSASESSASTQ----SSNLPPTPSTVQAGHP------------MVTRGKAGIFK----PK-------

Query:  LWLTHACTDWSVTEPTRIADALATPQWREAMDID------PVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKL
         W+     D    +  R    L    + +   ID      PVVK ++IR ILS+A +   ++ QLD   AFL+G L+E++YM QP G+E A   + VCKL
Subjt:  LWLTHACTDWSVTEPTRIADALATPQWREAMDID------PVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKL

Query:  DKAIYGLKQAPRAW
        +K++YGLKQAPR W
Subjt:  DKAIYGLKQAPRAW

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-1233.58Show/hide
Query:  SWGYSIIFKYRDSSYRSSNISALALAST------------EVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQVLQGK
        S GY   F     S++S     +AL++T            E+IWL++ + E+G     + V++CD+ SA  L+ N ++HARTKHI++  H++R+ V    
Subjt:  SWGYSIIFKYRDSSYRSSNISALALAST------------EVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQVLQGK

Query:  LEVRYIPSNEQPADCLTKTLSHSQFAYLRCKLGV
        L+V  I +NE PAD LTK +  ++F   +  +G+
Subjt:  LEVRYIPSNEQPADCLTKTLSHSQFAYLRCKLGV

Q12491 Transposon Ty2-B Gag-Pol polyprotein2.2e-2226.98Show/hide
Query:  DSGASNHV--TADYNNLANPVEYEGKESVTIGDGNK--LKIAFIGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVK---DK
        DSGAS  +  +A Y + A P        + I D  K  + I  IGN     ++      K L  P+IA +L+S+S+LA  N          C  +   ++
Subjt:  DSGASNHV--TADYNNLANPVEYEGKESVTIGDGNK--LKIAFIGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVK---DK

Query:  DTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPS-EEVLKSVVKSCNLPLS------VNESF
          G VL   V     Y       + +  ++L I N+  SK +N                   + H+ LGH +   + KS+ K+    L        N S 
Subjt:  DTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPS-EEVLKSVVKSCNLPLS------VNESF

Query:  KFCEACQYGKS----HALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFIDDFSRFVWIYPL--KRKNEALEAFVHFTALVKNQFNSSIKALQ
          C  C  GKS    H          + + F  +HTD++GP      +   Y+I F D+ +RF W+YPL  +R+   L  F    A +KNQFN+ +  +Q
Subjt:  KFCEACQYGKS----HALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFIDDFSRFVWIYPL--KRKNEALEAFVHFTALVKNQFNSSIKALQ

Query:  TDNGGEYVK--IHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDAFLVAVLLINGLPS
         D G EY    +H+     G+    +    S+ +G AER +R ++    TLL  + +P   W  A   + ++ N L S
Subjt:  TDNGGEYVK--IHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDAFLVAVLLINGLPS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.4e-13827.34Show/hide
Query:  NSPPLNQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQ
        N+  LN  ++ +T  KL   N+L+W      +   Y+L G L G++  P            P T    A               A  +NP Y  W   D+
Subjt:  NSPPLNQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQ

Query:  LLLGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNP
        L+   +  +++  V   V     A ++W  +++++   S      LR   +Q  KG   + DY++ +    D L   G P++    + +VL  L EEY P
Subjt:  LLLGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNP

Query:  VVAMIQDR-AGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGY-GSFSNKPV--
        V+  I  +    + +E+   LL  E ++       SS ++ P  + N  + R++    N + G R N    R N   +    +     +  +  +KP   
Subjt:  VVAMIQDR-AGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGY-GSFSNKPV--

Query:  -CQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQ-NVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKE
         CQ+CG  GH+A  C Q   + F+  +               N Q  P+ F   Q   N  + +P +    NW  DSGA++H+T+D+NNL+    Y G +
Subjt:  -CQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQ-NVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKE

Query:  SVTIGDGNKLKIAFIGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELP
         V + DG+ + I+  G++ L+ + +   L  +L VP+I KNL+SV +L   N V VEF      VKD +TG  LL+G   + LY             E P
Subjt:  SVTIGDGNKLKIAFIGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELP

Query:  ICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKF--CEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGP
        I +       +  +S F   S+     + + WH RLGHP+  +L SV+ + +L + +N S KF  C  C   KS+ +PF  ST ++    + +++D+W  
Subjt:  ICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKF--CEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGP

Query:  APLDSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVV
        +P+ S + ++YY++F+D F+R+ W+YPLK+K++  E F+ F  L++N+F + I    +DNGGE+V + +  ++ G+    S PHT + NG +ERKHRH+V
Subjt:  APLDSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVV

Query:  ETGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLS-SS
        ETGLTLL+ AS+P  +W  AF VAV LIN LP+ +L  +SP + L+G   ++  LRVFG AC+P LR Y  HK    + +CV+LG S     + CL   +
Subjt:  ETGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLS-SS

Query:  GRLFISRHVRFNEEEFPFASGFQQV-------------------------VTTAASSSN--------VSPSIPLWFSNISGPDIKTTQQNREVPSTGVSP
         RL+ISRHVRF+E  FPF++    +                         V  A S S+         SPS P   S +S         N +   +   P
Subjt:  GRLFISRHVRFNEEEFPFASGFQQV-------------------------VTTAASSSN--------VSPSIPLWFSNISGPDIKTTQQNREVPSTGVSP

Query:  NECPPTTPSNSSTDVQPLPAELSPQS-SLMPTHEDSP----PSQV-----EVAENHCSASESSASTQSSNLPPTPSTV----------------QA---G
        +   PT P  +       P +   Q+ S   T +++P    PSQ+       A++  S+   + S  SS+  PTP ++                QA    
Subjt:  NECPPTTPSNSSTDVQPLPAELSPQS-SLMPTHEDSP----PSQV-----EVAENHCSASESSASTQSSNLPPTPSTV----------------QA---G

Query:  HPMVTRGKAGIFKPKLWLTHACTDWSVTEPTRIADALATPQWREAM------------------------------------------------------
        H M TR KAGI KP    + A +  + +EP     AL   +WR AM                                                      
Subjt:  HPMVTRGKAGIFKPKLWLTHACTDWSVTEPTRIADALATPQWREAM------------------------------------------------------

Query:  ----------DIDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGY
                     PV+K+++IR++L +AV + W +RQLD NNAFL G L +DVYM QPPG+ D    NYVCKL KA+YGLKQAPRAW   L+  LL+ G+
Subjt:  ----------DIDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGY

Query:  ----------------SIIF--------------------------------------------------------------------------------
                        SI++                                                                                
Subjt:  ----------------SIIF--------------------------------------------------------------------------------

Query:  -----------------KYR--------------DSSYRSSNIS--------------------------------------------------------
                         +YR              D SY  + +S                                                        
Subjt:  -----------------KYR--------------DSSYRSSNIS--------------------------------------------------------

Query:  ---------------------------------ALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQVLQGK
                                         ++A  S+E+ W+  L+TE+G       V++CDNV A  L ANPVFH+R KHI ID HF+R+QV  G 
Subjt:  ---------------------------------ALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQVLQGK

Query:  LEVRYIPSNEQPADCLTKTLSHSQFAYLRCKLGVAELPSS
        L V ++ +++Q AD LTK LS + F     K+GV  +P S
Subjt:  LEVRYIPSNEQPADCLTKTLSHSQFAYLRCKLGVAELPSS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-13526.71Show/hide
Query:  NSPPLNQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQ
        N+  LN  ++ +T  KL   N+L+W      +   Y+L G L G++P P                          A+   ++ P   +NP Y  W   D+
Subjt:  NSPPLNQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQ

Query:  LLLGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNP
        L+   +  +++  V   V     A ++W  +++++   S      LR +                      D L   G P++    + +VL  L ++Y P
Subjt:  LLLGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNP

Query:  VVAMIQDR-AGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKPV---
        V+  I  +    S +E+   L+  E +L   N    S  + P  + N+   R++   RNQ+  G  N  +   N R    +         +   KP    
Subjt:  VVAMIQDR-AGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKPV---

Query:  CQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESV
        CQ+C   GH+A  C Q             ++ +    Q        P    AN  VNS   A       NW  DSGA++H+T+D+NNL+    Y G + V
Subjt:  CQVCGKTGHTALMCYQRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESV

Query:  TIGDGNKLKIAFIGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPIC
         I DG+ + I   G++ L    ++  L KVL VP+I KNL+SV +L   N V VEF      VKD +TG  LL+G   + LY +    +        P  
Subjt:  TIGDGNKLKIAFIGNSCLAAEHKNFKLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPIC

Query:  NLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKF--CEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAP
           +S                        WH RLGHPS  +L SV+ + +LP+ +N S K   C  C   KSH +PF NST  +    + +++D+W  +P
Subjt:  NLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKF--CEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAP

Query:  LDSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVET
        + S++ ++YY++F+D F+R+ W+YPLK+K++  + F+ F +LV+N+F + I  L +DNGGE+V +    ++ G+    S PHT + NG +ERKHRH+VE 
Subjt:  LDSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQFNSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVET

Query:  GLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLS-SSGR
        GLTLL+ AS+P  +W  AF VAV LIN LP+ +L  +SP + L+G+  +++ L+VFG AC+P LR Y  HK +  +++C ++G S     + CL   +GR
Subjt:  GLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFGSACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLS-SSGR

Query:  LFISRHVRFNEEEFPFASGFQQVVTTAASSSNVSPSIP-----------LWFSNISGPDIKTTQQNREVPS----TGVSPNECPPTTPSNSSTDVQPLPA
        L+ SRHV+F+E  FPF++    V T+    S+ +P+ P           L      GP + T+ +    PS    T VS +  P ++ S+ S+     P+
Subjt:  LFISRHVRFNEEEFPFASGFQQVVTTAASSSNVSPSIP-----------LWFSNISGPDIKTTQQNREVPS----TGVSPNECPPTTPSNSSTDVQPLPA

Query:  ELSPQSSLMP-------------------------THEDSPPSQVEVAENHCSA---------SESSASTQSSNLP---PTPSTVQAG-------HPMVT
           PQ +  P                          +++SP  Q  ++  H            S SS+ST +  LP   P P  +Q         H M T
Subjt:  ELSPQSSLMP-------------------------THEDSPPSQVEVAENHCSA---------SESSASTQSSNLP---PTPSTVQAG-------HPMVT

Query:  RGKAGIFKPKLWLTHACTDWSVTEPTRIADALATPQWREAM-----------------------------------------------------------
        R K GI KP    ++A +  + +EP     A+   +WR+AM                                                           
Subjt:  RGKAGIFKPKLWLTHACTDWSVTEPTRIADALATPQWREAM-----------------------------------------------------------

Query:  -----DIDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGY-----
                PV+K+++IR++L +AV + W +RQLD NNAFL G L ++VYM QPPG+ D    +YVC+L KAIYGLKQAPRAW   L+  LL+ G+     
Subjt:  -----DIDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGY-----

Query:  -----------SIIF-------------------------------------------------------------------------------------
                   SII+                                                                                     
Subjt:  -----------SIIF-------------------------------------------------------------------------------------

Query:  ------------KYR--------------DSSYRSSNIS-------------------------------------------------------------
                    +YR              D SY  + +S                                                             
Subjt:  ------------KYR--------------DSSYRSSNIS-------------------------------------------------------------

Query:  ----------------------------ALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQVLQGKLEVRY
                                    ++A  S+E+ W+  L+TE+G       V++CDNV A  L ANPVFH+R KHI +D HF+R+QV  G L V +
Subjt:  ----------------------------ALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQVLQGKLEVRY

Query:  IPSNEQPADCLTKTLSHSQFAYLRCKLGVAELPSS
        + +++Q AD LTK LS   F     K+GV ++P S
Subjt:  IPSNEQPADCLTKTLSHSQFAYLRCKLGVAELPSS

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.3e-0621.56Show/hide
Query:  PLNIKLSDSNYMLWKNQLLNHIIAFDMESFIDGTPPPVK-------------------YLDPTQTQ-------------------------LQKIRKDS-
        P+ + + +SNY  W+   L H ++FD+   IDGT  P                      L P Q Q                          + +R DS 
Subjt:  PLNIKLSDSNYMLWKNQLLNHIIAFDMESFIDGTPPPVK-------------------YLDPTQTQ-------------------------LQKIRKDS-

Query:  --------LPVSQYLAQIKDISDKFSAIDEPLSYRDHLAYILEGFGPEYNPFVTSIQNRTDRPSLVDVRSLLLAYDAKLEKQSSVDTLNAIQANIANLSS
                + V+ Y  ++K ++D    +D P++ R+ + Y+L G  P+++  +  I++R   PS  D  ++L   + +L++      +     ++ + SS
Subjt:  --------LPVSQYLAQIKDISDKFSAIDEPLSYRDHLAYILEGFGPEYNPFVTSIQNRTDRPSLVDVRSLLLAYDAKLEKQSSVDTLNAIQANIANLSS

Query:  SNTYKRSSRPPNTQCSNS
        S     S  PP T    S
Subjt:  SNTYKRSSRPPNTQCSNS

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.2e-1744.12Show/hide
Query:  PVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYE----DAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGYSIIFKYRDS
        PV K +++++IL+I+    + L QLD +NAFLNG LDE++YM+ PPGY     D+   N VC L K+IYGLKQA R W      TL+ +G+  +  + D 
Subjt:  PVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYE----DAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGYSIIFKYRDS

Query:  SY
        +Y
Subjt:  SY

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.1e-1133.55Show/hide
Query:  DKAIYGLKQAPRAWNN---TLKATLLSWGYSIIFKYRDSSYRSSNISALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIE
        D +    K   R+ N     L  +L+SW  S   +    S   +   AL+ A+ E++WL Q   E+  P    ++L+CDN +A  +A N VFH RTKHIE
Subjt:  DKAIYGLKQAPRAWNN---TLKATLLSWGYSIIFKYRDSSYRSSNISALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIE

Query:  IDVHFVRDQ-VLQGKLEVRYIPSNEQPADCLTKTLS---HSQFAYLRCKLGVAEL
         D H VR++ V Q  L   +   +EQ  D  T+ LS        Y+    G+A L
Subjt:  IDVHFVRDQ-VLQGKLEVRYIPSNEQPADCLTKTLS---HSQFAYLRCKLGVAEL

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)6.6e-0622.14Show/hide
Query:  TIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSE
        T+ L++ N+ +W+ L   +  S+ + GH+ G+S                                    TP  E     + W   D L+  W+Y ++T  
Subjt:  TIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPLYESWVAVDQLLLGWLYNSMTSE

Query:  VATQVMGYE-NAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQDRAGI-
        +   ++     A++LW +++ LF     A         + +   +  + +Y + +K+ +D L    SP++ R L+  +L GL E+Y+ ++ +I+ ++   
Subjt:  VATQVMGYE-NAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPVVAMIQDRAGI-

Query:  SWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVN---MANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGG
        S++E ++ LL+ E R  L N  KSSLS +   S++       R       + +   +N G GR  ++  GG
Subjt:  SWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVN---MANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGG

ATMG00300.1 Gag-Pol-related retrotransposon family protein2.2e-0941.79Show/hide
Query:  IWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWG
        +WH RL H S+  ++ +VK   L  S   S KFCE C YGK+H + F        +  D VH+DLWG
Subjt:  IWHKRLGHPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTAGCGTCACCTCCACCGGAAATCCTGCCTATGTTACCGGAACAACCAGCTTCAACAGTCCTCCGCTCAATCAGCTGTTGAATCAAATCACAACCATCAAATTGGA
TCGAGGTAATTTTTTGCTTTGGAAAAACCTCGCGTTACCTATCCTTCGATCGTATAAATTAGAAGGGCATTTGTCTGGAGCTTCTCCGTGCCCTGAGAAATTTGTTCAAG
CCGCATCTGCAAGTGTCAATCCCACGACAACTGAAGCAGGAGCAACAAGCTCAGGAGCTGTTGCAAGTGAAGCGGCTGAGTCTACTCCAGCGAGAGAATTAAATCCGCTG
TATGAGTCATGGGTAGCAGTAGATCAACTGCTCCTGGGGTGGTTATACAATTCCATGACCTCTGAGGTTGCTACACAAGTCATGGGGTATGAAAATGCTCAAGAGTTATG
GGCAGCGATTCAAGAGTTGTTCGGTGTTCAATCTCGGGCAGAGGAGGACTATCTTCGTCAGGTATTCCAACAATCTCGTAAGGGAAATTCTAAAATGACTGACTATTTGC
GTATAATGAAAAATCACGCAGATAATCTGGGGCAAGCAGGAAGTCCGGTGAATACCAGGTCATTAATTTCTCAAGTTCTTCTTGGCCTTGATGAAGAATACAATCCGGTA
GTAGCTATGATTCAAGACAGAGCTGGAATTTCATGGTCTGAGATGCAAGCTGAATTGCTTGTGTTTGAGAAGAGGTTGGAGCTGCAGAACAATCTGAAAAGTTCCTTATC
CTTGAGTCCTGGAGCATCTGTAAACATGGCAAATAGCAGAGACAGTGGAAATCAAAGAAACCAAAGTTATGGTGGTCGAACCAATAATGGCTTTGGTCGAGGAAATCAAA
GAGGTGCTGGAGGACGAGGTAGAGGAAGAGCTAGAGGATATGGATCATTCAGTAACAAACCAGTTTGCCAGGTATGTGGGAAGACAGGCCATACGGCATTAATGTGTTAT
CAAAGGTTTAACAAAGAGTTTGTAGGACCTATTTACAATCAGAACAGGGGAGAAGGAGCAAATCGACAGAATAATCAAAATGGCCAAGGGCAACCTGCTGCTTTTGTTGC
AAATCAGAATGTTAATTCCTGTGTGGCAGCACCTGAGACTGTCATTGATCCAAATTGGTACGCTGACAGTGGGGCTTCAAATCACGTAACAGCAGACTACAACAACTTGG
CCAACCCAGTTGAATATGAAGGTAAAGAAAGTGTAACCATTGGTGATGGAAACAAACTTAAAATTGCATTCATTGGAAACTCTTGTTTGGCTGCTGAGCATAAGAATTTT
AAGCTTAAGAAAGTCTTATGTGTTCCTAGTATTGCCAAGAATCTTGTTAGTGTATCCAAACTTGCTAAAGATAATGAAGTATTTGTTGAATTTCATGATGGTTTTTGCCT
TGTAAAGGACAAGGATACGGGCAAGGTGCTGCTGAAAGGAGTACTTGATGAAGGTCTGTACCGTTTTGATGGAGTCAAGGCTGTCACGACAGATACAAATGAGTTACCAA
TCTGCAATCTGGTTAATTCCAAGGATATAAATAATGAGCTGTCTGGGTTTGTTTTATCTAGTACGTTAAATGTTGCTATCTCAAAAGCCATATGGCACAAACGTTTAGGT
CACCCATCTGAGGAAGTTCTTAAGTCTGTTGTTAAGAGTTGTAATCTTCCCTTAAGTGTTAATGAAAGTTTTAAGTTCTGTGAAGCATGCCAATATGGAAAGTCACATGC
CCTGCCTTTTCCTAACTCTACCTCTCATGCATTAGACAAATTTGATCTTGTTCATACAGATCTTTGGGGGCCAGCACCATTAGATTCAGTAAATGGTTTTAAATACTACA
TATTGTTCATTGATGATTTCAGTAGATTCGTGTGGATATATCCATTAAAACGAAAGAATGAGGCACTTGAAGCATTTGTTCACTTTACTGCCCTTGTTAAAAATCAATTC
AACAGTTCTATAAAAGCATTGCAGACTGACAATGGAGGAGAGTATGTTAAGATACATCAACTATGTAATGAAATGGGGGTTAAAATCAGATTGTCCTGCCCACATACCTC
TCAGCAAAATGGAAGGGCTGAAAGAAAACATCGACACGTTGTTGAGACAGGATTAACTCTTCTAGCTCAAGCCTCAATGCCCCTAAAATTCTGGGTGGATGCATTTCTTG
TTGCTGTATTATTAATAAATGGTCTTCCTTCTCAGGTTCTTGGAGGAAAATCACCCATGGAGCTGTTGTATGGCAAGAAAATTGACTTTCAAGCTTTACGAGTATTTGGA
TCAGCTTGTTTTCCATGTCTCAGAAAGTATCAAGCACATAAGTTCCAATATCACACTGAGAAGTGTGTTTATTTGGGACCAAGTCCGTTGCACAAAGGACACAAGTGCCT
CAGTTCCAGTGGTAGATTGTTTATTTCCCGACATGTTCGATTCAATGAAGAGGAGTTCCCGTTTGCCTCTGGTTTTCAGCAAGTAGTAACAACAGCTGCCTCAAGTTCTA
ATGTGTCACCTTCAATTCCTCTGTGGTTCTCTAATATCTCCGGCCCAGACATTAAAACCACTCAGCAAAATCGTGAGGTTCCTTCTACTGGAGTGTCTCCAAATGAGTGC
CCACCCACCACTCCCTCAAATTCATCCACGGATGTTCAACCTCTGCCAGCTGAATTATCTCCTCAATCTTCACTAATGCCAACTCATGAGGATTCTCCACCTTCTCAAGT
TGAAGTTGCTGAAAATCATTGTTCAGCATCTGAGTCTTCTGCCTCTACACAGTCATCAAATCTTCCTCCAACACCGTCCACAGTGCAGGCTGGTCATCCCATGGTCACTA
GAGGAAAGGCTGGGATATTCAAACCAAAGTTGTGGTTAACGCACGCCTGCACTGATTGGTCAGTCACTGAACCTACGAGAATTGCTGATGCTCTAGCTACTCCTCAATGG
CGGGAGGCGATGGACATTGACCCGGTTGTCAAGGCATCTACGATTCGTGTAATCCTCAGTATTGCTGTTACCAAAGGATGGCAGCTTCGTCAGTTGGACTTCAACAATGC
GTTTCTCAATGGAAGGCTTGATGAGGATGTGTATATGCGTCAACCCCCGGGATATGAGGATGCAAAGTGTTCAAACTATGTATGCAAACTTGACAAGGCGATTTATGGCC
TTAAACAGGCTCCAAGAGCATGGAATAATACATTGAAGGCTACTCTATTATCATGGGGCTACTCTATTATCTTTAAATACAGGGACTCCTCTTACAGATCCTCTAATATA
TCGGCGTTGGCATTAGCATCTACTGAAGTTATTTGGTTAAGACAGTTGATGACTGAAATTGGCTTCCCATGTTGTTCAAAATCTGTTCTGTGGTGCGACAATGTGAGTGC
TGGTGCCTTGGCTGCGAACCCGGTGTTTCATGCCCGGACCAAACACATCGAAATAGATGTTCACTTTGTTCGTGATCAGGTTCTTCAAGGGAAACTTGAAGTTAGATATA
TTCCCTCAAATGAGCAGCCAGCTGATTGTCTTACCAAGACACTGTCGCATTCCCAGTTTGCTTATCTACGTTGCAAACTTGGAGTAGCTGAGCTACCCTCAAGTTTGAGG
GAGGATATCAAGGATAAAAGGCCTAAAGGCCCAGACAAGCAAGAAAGCAAGTTGGATCAACCAAAGCCCATCAACAGTAGGCGCCCACATCTACCTCTTCATACCCACGG
CCTAAATCCTAATGTCCCCCCCTTTTCAACCCAAATCCCTCAGCATCCAAATACTTCGACCTTCCCTTTTCCACAAAATGCCTCAACTACTACTCCTGTTGGTTTTCCTT
ACCCTCCGACTGCTCAATCGTCACTCCCCTTTTTCCCAAATAACCCTGGTGCTCTTCCACTCTTCCCCCCCACCGCAACTTCTCCTTACCCTACCTTAACTCCTCCCCTC
AACATCAAACTTTCAGACTCAAATTACATGCTATGGAAGAACCAGCTACTGAATCACATAATTGCTTTTGATATGGAAAGTTTCATTGATGGCACTCCTCCTCCGGTAAA
GTACTTGGATCCGACACAAACTCAGTTGCAAAAGATTCGTAAGGATAGTCTTCCAGTCTCCCAATATTTAGCTCAGATTAAGGATATATCAGATAAGTTCTCTGCTATAG
ATGAGCCCCTGTCTTATCGTGATCATCTGGCTTATATACTTGAAGGTTTTGGTCCAGAATATAATCCCTTCGTCACTTCTATACAAAATCGAACCGACAGGCCATCTCTT
GTTGATGTTAGGAGCTTACTGCTTGCTTATGATGCTAAGTTAGAGAAACAGTCATCGGTTGATACTCTGAATGCTATTCAAGCCAATATTGCCAATCTTTCAAGCTCTAA
CACCTATAAACGCTCTTCTCGACCTCCCAATACCCAGTGTTCCAACTCTGCCTCGGTCCCCTGCCCCTTGTTCATTCCCTTCCTCATTTCCAACGCCGCCCACAATTCAC
TCCCAATTCTTTTCAACAAGCTCGACCTCCACGCTGGAATTCCCCTTCCTCTCAATCTCGACCACAATGTCAAATTTGTGGTAAGATGGGTCATATTGCTCAAATTTGTC
TCAATCCAACCGTCATTGGTTGGCCTAATGGAAAAAAAGATTAGGGAAGGAACTCGAGAACTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACTAGCGTCACCTCCACCGGAAATCCTGCCTATGTTACCGGAACAACCAGCTTCAACAGTCCTCCGCTCAATCAGCTGTTGAATCAAATCACAACCATCAAATTGGA
TCGAGGTAATTTTTTGCTTTGGAAAAACCTCGCGTTACCTATCCTTCGATCGTATAAATTAGAAGGGCATTTGTCTGGAGCTTCTCCGTGCCCTGAGAAATTTGTTCAAG
CCGCATCTGCAAGTGTCAATCCCACGACAACTGAAGCAGGAGCAACAAGCTCAGGAGCTGTTGCAAGTGAAGCGGCTGAGTCTACTCCAGCGAGAGAATTAAATCCGCTG
TATGAGTCATGGGTAGCAGTAGATCAACTGCTCCTGGGGTGGTTATACAATTCCATGACCTCTGAGGTTGCTACACAAGTCATGGGGTATGAAAATGCTCAAGAGTTATG
GGCAGCGATTCAAGAGTTGTTCGGTGTTCAATCTCGGGCAGAGGAGGACTATCTTCGTCAGGTATTCCAACAATCTCGTAAGGGAAATTCTAAAATGACTGACTATTTGC
GTATAATGAAAAATCACGCAGATAATCTGGGGCAAGCAGGAAGTCCGGTGAATACCAGGTCATTAATTTCTCAAGTTCTTCTTGGCCTTGATGAAGAATACAATCCGGTA
GTAGCTATGATTCAAGACAGAGCTGGAATTTCATGGTCTGAGATGCAAGCTGAATTGCTTGTGTTTGAGAAGAGGTTGGAGCTGCAGAACAATCTGAAAAGTTCCTTATC
CTTGAGTCCTGGAGCATCTGTAAACATGGCAAATAGCAGAGACAGTGGAAATCAAAGAAACCAAAGTTATGGTGGTCGAACCAATAATGGCTTTGGTCGAGGAAATCAAA
GAGGTGCTGGAGGACGAGGTAGAGGAAGAGCTAGAGGATATGGATCATTCAGTAACAAACCAGTTTGCCAGGTATGTGGGAAGACAGGCCATACGGCATTAATGTGTTAT
CAAAGGTTTAACAAAGAGTTTGTAGGACCTATTTACAATCAGAACAGGGGAGAAGGAGCAAATCGACAGAATAATCAAAATGGCCAAGGGCAACCTGCTGCTTTTGTTGC
AAATCAGAATGTTAATTCCTGTGTGGCAGCACCTGAGACTGTCATTGATCCAAATTGGTACGCTGACAGTGGGGCTTCAAATCACGTAACAGCAGACTACAACAACTTGG
CCAACCCAGTTGAATATGAAGGTAAAGAAAGTGTAACCATTGGTGATGGAAACAAACTTAAAATTGCATTCATTGGAAACTCTTGTTTGGCTGCTGAGCATAAGAATTTT
AAGCTTAAGAAAGTCTTATGTGTTCCTAGTATTGCCAAGAATCTTGTTAGTGTATCCAAACTTGCTAAAGATAATGAAGTATTTGTTGAATTTCATGATGGTTTTTGCCT
TGTAAAGGACAAGGATACGGGCAAGGTGCTGCTGAAAGGAGTACTTGATGAAGGTCTGTACCGTTTTGATGGAGTCAAGGCTGTCACGACAGATACAAATGAGTTACCAA
TCTGCAATCTGGTTAATTCCAAGGATATAAATAATGAGCTGTCTGGGTTTGTTTTATCTAGTACGTTAAATGTTGCTATCTCAAAAGCCATATGGCACAAACGTTTAGGT
CACCCATCTGAGGAAGTTCTTAAGTCTGTTGTTAAGAGTTGTAATCTTCCCTTAAGTGTTAATGAAAGTTTTAAGTTCTGTGAAGCATGCCAATATGGAAAGTCACATGC
CCTGCCTTTTCCTAACTCTACCTCTCATGCATTAGACAAATTTGATCTTGTTCATACAGATCTTTGGGGGCCAGCACCATTAGATTCAGTAAATGGTTTTAAATACTACA
TATTGTTCATTGATGATTTCAGTAGATTCGTGTGGATATATCCATTAAAACGAAAGAATGAGGCACTTGAAGCATTTGTTCACTTTACTGCCCTTGTTAAAAATCAATTC
AACAGTTCTATAAAAGCATTGCAGACTGACAATGGAGGAGAGTATGTTAAGATACATCAACTATGTAATGAAATGGGGGTTAAAATCAGATTGTCCTGCCCACATACCTC
TCAGCAAAATGGAAGGGCTGAAAGAAAACATCGACACGTTGTTGAGACAGGATTAACTCTTCTAGCTCAAGCCTCAATGCCCCTAAAATTCTGGGTGGATGCATTTCTTG
TTGCTGTATTATTAATAAATGGTCTTCCTTCTCAGGTTCTTGGAGGAAAATCACCCATGGAGCTGTTGTATGGCAAGAAAATTGACTTTCAAGCTTTACGAGTATTTGGA
TCAGCTTGTTTTCCATGTCTCAGAAAGTATCAAGCACATAAGTTCCAATATCACACTGAGAAGTGTGTTTATTTGGGACCAAGTCCGTTGCACAAAGGACACAAGTGCCT
CAGTTCCAGTGGTAGATTGTTTATTTCCCGACATGTTCGATTCAATGAAGAGGAGTTCCCGTTTGCCTCTGGTTTTCAGCAAGTAGTAACAACAGCTGCCTCAAGTTCTA
ATGTGTCACCTTCAATTCCTCTGTGGTTCTCTAATATCTCCGGCCCAGACATTAAAACCACTCAGCAAAATCGTGAGGTTCCTTCTACTGGAGTGTCTCCAAATGAGTGC
CCACCCACCACTCCCTCAAATTCATCCACGGATGTTCAACCTCTGCCAGCTGAATTATCTCCTCAATCTTCACTAATGCCAACTCATGAGGATTCTCCACCTTCTCAAGT
TGAAGTTGCTGAAAATCATTGTTCAGCATCTGAGTCTTCTGCCTCTACACAGTCATCAAATCTTCCTCCAACACCGTCCACAGTGCAGGCTGGTCATCCCATGGTCACTA
GAGGAAAGGCTGGGATATTCAAACCAAAGTTGTGGTTAACGCACGCCTGCACTGATTGGTCAGTCACTGAACCTACGAGAATTGCTGATGCTCTAGCTACTCCTCAATGG
CGGGAGGCGATGGACATTGACCCGGTTGTCAAGGCATCTACGATTCGTGTAATCCTCAGTATTGCTGTTACCAAAGGATGGCAGCTTCGTCAGTTGGACTTCAACAATGC
GTTTCTCAATGGAAGGCTTGATGAGGATGTGTATATGCGTCAACCCCCGGGATATGAGGATGCAAAGTGTTCAAACTATGTATGCAAACTTGACAAGGCGATTTATGGCC
TTAAACAGGCTCCAAGAGCATGGAATAATACATTGAAGGCTACTCTATTATCATGGGGCTACTCTATTATCTTTAAATACAGGGACTCCTCTTACAGATCCTCTAATATA
TCGGCGTTGGCATTAGCATCTACTGAAGTTATTTGGTTAAGACAGTTGATGACTGAAATTGGCTTCCCATGTTGTTCAAAATCTGTTCTGTGGTGCGACAATGTGAGTGC
TGGTGCCTTGGCTGCGAACCCGGTGTTTCATGCCCGGACCAAACACATCGAAATAGATGTTCACTTTGTTCGTGATCAGGTTCTTCAAGGGAAACTTGAAGTTAGATATA
TTCCCTCAAATGAGCAGCCAGCTGATTGTCTTACCAAGACACTGTCGCATTCCCAGTTTGCTTATCTACGTTGCAAACTTGGAGTAGCTGAGCTACCCTCAAGTTTGAGG
GAGGATATCAAGGATAAAAGGCCTAAAGGCCCAGACAAGCAAGAAAGCAAGTTGGATCAACCAAAGCCCATCAACAGTAGGCGCCCACATCTACCTCTTCATACCCACGG
CCTAAATCCTAATGTCCCCCCCTTTTCAACCCAAATCCCTCAGCATCCAAATACTTCGACCTTCCCTTTTCCACAAAATGCCTCAACTACTACTCCTGTTGGTTTTCCTT
ACCCTCCGACTGCTCAATCGTCACTCCCCTTTTTCCCAAATAACCCTGGTGCTCTTCCACTCTTCCCCCCCACCGCAACTTCTCCTTACCCTACCTTAACTCCTCCCCTC
AACATCAAACTTTCAGACTCAAATTACATGCTATGGAAGAACCAGCTACTGAATCACATAATTGCTTTTGATATGGAAAGTTTCATTGATGGCACTCCTCCTCCGGTAAA
GTACTTGGATCCGACACAAACTCAGTTGCAAAAGATTCGTAAGGATAGTCTTCCAGTCTCCCAATATTTAGCTCAGATTAAGGATATATCAGATAAGTTCTCTGCTATAG
ATGAGCCCCTGTCTTATCGTGATCATCTGGCTTATATACTTGAAGGTTTTGGTCCAGAATATAATCCCTTCGTCACTTCTATACAAAATCGAACCGACAGGCCATCTCTT
GTTGATGTTAGGAGCTTACTGCTTGCTTATGATGCTAAGTTAGAGAAACAGTCATCGGTTGATACTCTGAATGCTATTCAAGCCAATATTGCCAATCTTTCAAGCTCTAA
CACCTATAAACGCTCTTCTCGACCTCCCAATACCCAGTGTTCCAACTCTGCCTCGGTCCCCTGCCCCTTGTTCATTCCCTTCCTCATTTCCAACGCCGCCCACAATTCAC
TCCCAATTCTTTTCAACAAGCTCGACCTCCACGCTGGAATTCCCCTTCCTCTCAATCTCGACCACAATGTCAAATTTGTGGTAAGATGGGTCATATTGCTCAAATTTGTC
TCAATCCAACCGTCATTGGTTGGCCTAATGGAAAAAAAGATTAGGGAAGGAACTCGAGAACTTTAG
Protein sequenceShow/hide protein sequence
MTSVTSTGNPAYVTGTTSFNSPPLNQLLNQITTIKLDRGNFLLWKNLALPILRSYKLEGHLSGASPCPEKFVQAASASVNPTTTEAGATSSGAVASEAAESTPARELNPL
YESWVAVDQLLLGWLYNSMTSEVATQVMGYENAQELWAAIQELFGVQSRAEEDYLRQVFQQSRKGNSKMTDYLRIMKNHADNLGQAGSPVNTRSLISQVLLGLDEEYNPV
VAMIQDRAGISWSEMQAELLVFEKRLELQNNLKSSLSLSPGASVNMANSRDSGNQRNQSYGGRTNNGFGRGNQRGAGGRGRGRARGYGSFSNKPVCQVCGKTGHTALMCY
QRFNKEFVGPIYNQNRGEGANRQNNQNGQGQPAAFVANQNVNSCVAAPETVIDPNWYADSGASNHVTADYNNLANPVEYEGKESVTIGDGNKLKIAFIGNSCLAAEHKNF
KLKKVLCVPSIAKNLVSVSKLAKDNEVFVEFHDGFCLVKDKDTGKVLLKGVLDEGLYRFDGVKAVTTDTNELPICNLVNSKDINNELSGFVLSSTLNVAISKAIWHKRLG
HPSEEVLKSVVKSCNLPLSVNESFKFCEACQYGKSHALPFPNSTSHALDKFDLVHTDLWGPAPLDSVNGFKYYILFIDDFSRFVWIYPLKRKNEALEAFVHFTALVKNQF
NSSIKALQTDNGGEYVKIHQLCNEMGVKIRLSCPHTSQQNGRAERKHRHVVETGLTLLAQASMPLKFWVDAFLVAVLLINGLPSQVLGGKSPMELLYGKKIDFQALRVFG
SACFPCLRKYQAHKFQYHTEKCVYLGPSPLHKGHKCLSSSGRLFISRHVRFNEEEFPFASGFQQVVTTAASSSNVSPSIPLWFSNISGPDIKTTQQNREVPSTGVSPNEC
PPTTPSNSSTDVQPLPAELSPQSSLMPTHEDSPPSQVEVAENHCSASESSASTQSSNLPPTPSTVQAGHPMVTRGKAGIFKPKLWLTHACTDWSVTEPTRIADALATPQW
REAMDIDPVVKASTIRVILSIAVTKGWQLRQLDFNNAFLNGRLDEDVYMRQPPGYEDAKCSNYVCKLDKAIYGLKQAPRAWNNTLKATLLSWGYSIIFKYRDSSYRSSNI
SALALASTEVIWLRQLMTEIGFPCCSKSVLWCDNVSAGALAANPVFHARTKHIEIDVHFVRDQVLQGKLEVRYIPSNEQPADCLTKTLSHSQFAYLRCKLGVAELPSSLR
EDIKDKRPKGPDKQESKLDQPKPINSRRPHLPLHTHGLNPNVPPFSTQIPQHPNTSTFPFPQNASTTTPVGFPYPPTAQSSLPFFPNNPGALPLFPPTATSPYPTLTPPL
NIKLSDSNYMLWKNQLLNHIIAFDMESFIDGTPPPVKYLDPTQTQLQKIRKDSLPVSQYLAQIKDISDKFSAIDEPLSYRDHLAYILEGFGPEYNPFVTSIQNRTDRPSL
VDVRSLLLAYDAKLEKQSSVDTLNAIQANIANLSSSNTYKRSSRPPNTQCSNSASVPCPLFIPFLISNAAHNSLPILFNKLDLHAGIPLPLNLDHNVKFVVRWVILLKFV
SIQPSLVGLMEKKIREGTREL