; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039383 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039383
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr2:42706121..42710421
RNA-Seq ExpressionLag0039383
SyntenyLag0039383
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR029472 - Retrotransposon Copia-like, N-terminal
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBH07404.1 hypothetical protein Prudu_019334 [Prunus dulcis]1.0e-10735.71Show/hide
Query:  SSSSQSSSLTEVLPSGTSQPNS----SIFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHD
        SSS   S  T+   S +S PNS    +   + NI ++V ++L  SNYL W      +L+ + L G++DG++PCP  FLP+        +NPA+  W   D
Subjt:  SSSSQSSSLTEVLPSGTSQPNS----SIFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHD

Query:  SALITLINATLSKVAYSFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEY
          L+   N+TLS+    F +G  +S  +W  LE+R   ++ +HIH+L+S L +I KG ++T+ DYL +IK++ D L A    + D DL+  TL GL  E+
Subjt:  SALITLINATLSKVAYSFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEY

Query:  NSFRTSVRTRGGKITLDELHALLKSEAECIEQQAKSVTPFTP------TAMFSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSG
         SF  S+  R    +LDELH LL ++   + ++ K  +  T       +A  S             +    Q  F+   NRG   QFS  RGNYG     
Subjt:  NSFRTSVRTRGGKITLDELHALLKSEAECIEQQAKSVTPFTP------TAMFSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSG

Query:  PTSSNTGNQNQPRGGYNHYNQ-NQGQGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLSTTNTWLADSGCN
           +N GN N      NH +    G  RG   ++  S +S  R+ CQIC  P H A+DCY+R+N +  G+ PP+KLAAM + Y +    + +WL DSG  
Subjt:  PTSSNTGNQNQPRGGYNHYNQ-NQGQGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLSTTNTWLADSGCN

Query:  SHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQSGFGTLSTSQSPLNLSKIL--------------------C------------DKATGATLYKGRSR
        SH+T D+SNL+  S Y GED + +  G G+ +   G   L T  +   L  +L                    C            D+ TG  L +G  R
Subjt:  SHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQSGFGTLSTSQSPLNLSKIL--------------------C------------DKATGATLYKGRSR

Query:  DGLYPISSTFKTASADSVISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTASAVDFGPKVSCR-DCVSCLKGKITKLPFTSSTTNTTSPLA
        DG YP+ S+  +    S IS S+F  + A V +        WH RLGHPS  + +K ++ + +    K S    C  C   K  KLPF  + ++T+  LA
Subjt:  DGLYPISSTFKTASADSVISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTASAVDFGPKVSCR-DCVSCLKGKITKLPFTSSTTNTTSPLA

Query:  LIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQN
        L+  DVWGP+PV S+SGF+YY+  VDDY++Y+W FPL  K +V +  + F  ++E  +   +K  RSD GGEF + S   + +  G+ HQ SCPHTPEQN
Subjt:  LIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQN

Query:  GVAERKHRSIVETALALLHHASMPLEFW
        G  ERKHR +VETA  LL  +++P  +W
Subjt:  GVAERKHRSIVETALALLHHASMPLEFW

BBN68591.1 hypothetical protein Prudu_489S000400 [Prunus dulcis]1.0e-10735.71Show/hide
Query:  SSSSQSSSLTEVLPSGTSQPNS----SIFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHD
        SSS   S  T+   S +S PNS    +   + NI ++V ++L  SNYL W      +L+ + L G++DG++PCP  FLP+        +NPA+  W   D
Subjt:  SSSSQSSSLTEVLPSGTSQPNS----SIFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHD

Query:  SALITLINATLSKVAYSFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEY
          L+   N+TLS+    F +G  +S  +W  LE+R   ++ +HIH+L+S L +I KG ++T+ DYL +IK++ D L A    + D DL+  TL GL  E+
Subjt:  SALITLINATLSKVAYSFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEY

Query:  NSFRTSVRTRGGKITLDELHALLKSEAECIEQQAKSVTPFTP------TAMFSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSG
         SF  S+  R    +LDELH LL ++   + ++ K  +  T       +A  S             +    Q  F+   NRG   QFS  RGNYG     
Subjt:  NSFRTSVRTRGGKITLDELHALLKSEAECIEQQAKSVTPFTP------TAMFSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSG

Query:  PTSSNTGNQNQPRGGYNHYNQ-NQGQGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLSTTNTWLADSGCN
           +N GN N      NH +    G  RG   ++  S +S  R+ CQIC  P H A+DCY+R+N +  G+ PP+KLAAM + Y +    + +WL DSG  
Subjt:  PTSSNTGNQNQPRGGYNHYNQ-NQGQGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLSTTNTWLADSGCN

Query:  SHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQSGFGTLSTSQSPLNLSKIL--------------------C------------DKATGATLYKGRSR
        SH+T D+SNL+  S Y GED + +  G G+ +   G   L T  +   L  +L                    C            D+ TG  L +G  R
Subjt:  SHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQSGFGTLSTSQSPLNLSKIL--------------------C------------DKATGATLYKGRSR

Query:  DGLYPISSTFKTASADSVISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTASAVDFGPKVSCR-DCVSCLKGKITKLPFTSSTTNTTSPLA
        DG YP+ S+  +    S IS S+F  + A V +        WH RLGHPS  + +K ++ + +    K S    C  C   K  KLPF  + ++T+  LA
Subjt:  DGLYPISSTFKTASADSVISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTASAVDFGPKVSCR-DCVSCLKGKITKLPFTSSTTNTTSPLA

Query:  LIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQN
        L+  DVWGP+PV S+SGF+YY+  VDDY++Y+W FPL  K +V +  + F  ++E  +   +K  RSD GGEF + S   + +  G+ HQ SCPHTPEQN
Subjt:  LIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQN

Query:  GVAERKHRSIVETALALLHHASMPLEFW
        G  ERKHR +VETA  LL  +++P  +W
Subjt:  GVAERKHRSIVETALALLHHASMPLEFW

KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis]9.1e-13339.97Show/hide
Query:  ASSSSSQSSSLTEVLPSGTSQPNSSIFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSA
        A + ++ S++ T   P+ +  P   IFLLSNICNL+  RLDSSNY+ W+FQ+ S+LKAHSL G +DG+ PCP++F+ +  G  + QINP + +W   D A
Subjt:  ASSSSSQSSSLTEVLPSGTSQPNSSIFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSA

Query:  LITLINATLSKVAYSFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNS
        L+TL+NATLS+ A S VIG+ TS + W  LE+R S+ TRS+I +LKS+LH I+KG  +++D Y+ +IK   D LA+VSV+I+DED+L+Y LNGL  EYN+
Subjt:  LITLINATLSKVAYSFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNS

Query:  FRTSVRTRGGKITLDELHALLKSEAECIEQQAK-SVTPFTPTAMFSSSQSQN-GSNRG------RGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSG
        F+TS+RT+   ITL+E++A+LK E + IE   K + +P  P AM +++   N  SNRG       GRGRG             +G+FS  RG   H    
Subjt:  FRTSVRTRGGKITLDELHALLKSEAECIEQQAK-SVTPFTPTAMFSSSQSQN-GSNRG------RGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSG

Query:  PTSSNTGNQNQPRGGYNHYNQNQGQGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLS-TTNTWLADSGCN
          S N G  N P      Y   Q      Q SN  S+NS   + CQICN+ GH ALDCY+R++ SYQG+ P  +L AMS+ Y+TG   + N W  D+G  
Subjt:  PTSSNTGNQNQPRGGYNHYNQNQGQGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLS-TTNTWLADSGCN

Query:  SHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQSGFGTLSTSQSPLNLSKILC--------------------------------DKATGATLYKGRSR
        +H+T DL+NL     Y G+D IT+A GQ + ++ SG  ++  +     L+ +LC                                DKAT   L++G S 
Subjt:  SHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQSGFGTLSTSQSPLNLSKILC--------------------------------DKATGATLYKGRSR

Query:  DGLYPI-SSTFKTASADSV------------------ISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTASAVDFGPKVSCRDCVSCLKGK
         GLYP+ +S+    SA S+                  + +++++    + ++ ++    LWH RLGHPS   LQ  L+++++   P+ S   C  CL GK
Subjt:  DGLYPI-SSTFKTASADSV------------------ISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTASAVDFGPKVSCRDCVSCLKGK

Query:  ITKLPFTSSTTNTTSPLALIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFF
        +TKLPF  STT +T+PL L+  D+WGP+P  S   F YYV FVDD+S                                    RSDGGGE+    L Q  
Subjt:  ITKLPFTSSTTNTTSPLALIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFF

Query:  STKGVVHQRSCPHTPEQNGVAERKHRSIVETALALLHHASMPLEFWKI
        +  G+ H+RSCPHTP+QNG+AERKHR IVET L LL  AS+PL++W +
Subjt:  STKGVVHQRSCPHTPEQNGVAERKHRSIVETALALLHHASMPLEFWKI

RWR76373.1 putative polyprotein [Cinnamomum micranthum f. kanehirae]3.6e-10534.27Show/hide
Query:  LTEVLPSGTSQPNSSIFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSALITLINATLS
        +  ++ S  S  ++  F++SNI NLV ++LD  NYL WR Q E +L +H L G VDGS  CP++F  + +   ++ I PA   W   D  L++ I ATLS
Subjt:  LTEVLPSGTSQPNSSIFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSALITLINATLS

Query:  KVAYSFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNS--FRTSVRTR
        +   S VIG +TS   W  +E+R +SL+R+H  ELK  L  + K                      V  ++D+ D + + L+GL  EY+      +   +
Subjt:  KVAYSFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNS--FRTSVRTR

Query:  GGKITLDELHALL---KSEAECIEQQAKSVTPFTPTAMFSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSGPTSSNTGNQNQPR
           +++  +H LL   +    C    ++S +    TA+F+    QN +N   G GR  +G+ +GQ             G  G PS  P+ ++    N   
Subjt:  GGKITLDELHALL---KSEAECIEQQAKSVTPFTPTAMFSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSGPTSSNTGNQNQPR

Query:  GGYNHYNQNQGQGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLSTTNTWLADSGCNSHVTPDLSNLTLNS
         G                   S++    R+ CQICNR GH ALDCY+R++ +YQG HPP+KLAAM++      S    W  D+G   H+T ++ NL+L S
Subjt:  GGYNHYNQNQGQGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLSTTNTWLADSGCNSHVTPDLSNLTLNS

Query:  NYNGEDAITVATGQGVPVTQSGFGTLSTSQSPLNLSKILC--------------------------------DKATGATLYKGRSRDGLYPISSTFKTAS
        +Y+  D ++V  G G+ ++  G  ++ST  S   L+ +LC                                DKA+G TL++G+S++GLYP     +  +
Subjt:  NYNGEDAITVATGQGVPVTQSGFGTLSTSQSPLNLSKILC--------------------------------DKATGATLYKGRSRDGLYPISSTFKTAS

Query:  ADSVISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTA--SAVDFGPKVSCRDCVSCLKGKITKLPFTSSTTNTTSPLALIQRDVWGPSPVV
          +    ++F        V     A++WH RLGHP+  V Q   +A    VD   K+S   C  C  GK  KLPF+ S++ +++PL LI  D+WG SP +
Subjt:  ADSVISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTA--SAVDFGPKVSCRDCVSCLKGKITKLPFTSSTTNTTSPLALIQRDVWGPSPVV

Query:  SISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNGVAERKHRSIVET
        SISG+ YYV F+DD +KY W +PL  K       ++F  ++EN LS ++K F+SDGGGEF++     F ++ G+ H+ SCPHTPEQNGVAERKH  IVE 
Subjt:  SISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNGVAERKHRSIVET

Query:  ALALLHHASMPLEFW
         L LL  + MPL++W
Subjt:  ALALLHHASMPLEFW

WP_081894301.1 DDE-type integrase/transposase/recombinase [Acetobacter malorum]7.7e-11636.16Show/hide
Query:  LLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFL--PNGDGGQSTQINPAHSLWIAHDSALITLINATLSKVAYSFVIGFKTSNQ
        L+ N+   V V+LD +NYL W +Q+  +L++H + G VDGSK CP  F+  P+ +G ++      + +W  HD AL+ LI  TLS  A S +IG  ++++
Subjt:  LLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFL--PNGDGGQSTQINPAHSLWIAHDSALITLINATLSKVAYSFVIGFKTSNQ

Query:  VWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNSFRTSVRTRGGKITLDELHALLKSEA
        +W  L  R S++T++ I ++K  L  I KG +E++  Y  RIKD+ D L+A  V  DD+D+++  L GL SEYN+FRT +R R   I+L +  A L +E 
Subjt:  VWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNSFRTSVRTRGGKITLDELHALLKSEA

Query:  ECIEQQAKSVTPFTPTAMFSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSGPTSSNTGNQNQPRGGYNHYNQNQGQGRGTQISN
          IE    S + FT TAM     +Q   ++G+G            F+  + G   P+ G+  +  +   S N+     P GG+  ++ N+G+ RG   S+
Subjt:  ECIEQQAKSVTPFTPTAMFSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSGPTSSNTGNQNQPRGGYNHYNQNQGQGRGTQISN

Query:  S----SSSNSQGRI------------------TCQIC-----------------NRP------------GHGALDCYNRLNLSYQGRHPPSKLAAMSSFY
        S    S +NS G +                  TCQIC                 NRP            GH AL CY+R N SYQGR PPS L  M + Y
Subjt:  S----SSSNSQGRI------------------TCQIC-----------------NRP------------GHGALDCYNRLNLSYQGRHPPSKLAAMSSFY

Query:  DTGLSTTNTWLADSGCNSHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQSGFGTLSTSQSPLNLSKIL----------------------------C-
                 W+AD+G  SH+T DL+NLT  + + G D IT A+G G+P++ +G   L   Q    L  IL                            C 
Subjt:  DTGLSTTNTWLADSGCNSHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQSGFGTLSTSQSPLNLSKIL----------------------------C-

Query:  ---DKATGATLYKGRSRDGLYPISSTFKTASADSVISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTASAVDFGPKVSCRDCVSCLKGKIT
           DK TG  L +G  RDGLYPI               S       +  +    + +LWH RLGHPS+ V+   L  S + F    S   C+SCL+GK T
Subjt:  ---DKATGATLYKGRSRDGLYPISSTFKTASADSVISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTASAVDFGPKVSCRDCVSCLKGKIT

Query:  KLPFTSSTTNTTSPLALIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFST
        KLPF+     +  P  ++  DVWGPSP +S+ G+++YV F+D+ +++TW+FPL +K +V  V + F  F+  Q S S+K F+SDGGGE+ +    QF   
Subjt:  KLPFTSSTTNTTSPLALIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFST

Query:  KGVVHQRSCPHTPEQNGVAERKHRSIVETALALLHHASMPLEFW
        KG++H +SCPHTPEQNG+AERKH  IVETAL LL  A +P +FW
Subjt:  KGVVHQRSCPHTPEQNGVAERKHRSIVETALALLHHASMPLEFW

TrEMBL top hitse value%identityAlignment
A0A2N9G2N5 Uncharacterized protein4.1e-12337.7Show/hide
Query:  MASSSSSQSSSLTEVLPSGTSQPNSSIFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDS
        MASS++  +   +      +S   + + +LSN+ NL+ V+LDSSN++ W+ Q+ S+LKA+S+   VDG+ P P  FL + +G  +++ NP   LW   D 
Subjt:  MASSSSSQSSSLTEVLPSGTSQPNSSIFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDS

Query:  ALITLINATLSKVAYSFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYN
        AL+TLIN+TLS    S V+G  ++  VW  LE+R +S +R+++  LK  LH + KG  E++  YL ++K+  D+L AV ++ID+E+LL   L GL  EY 
Subjt:  ALITLINATLSKVAYSFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYN

Query:  SFRTSVRTRGGKITLDELHALLKSEAECIEQQAKSVTPFTPTAMFSSSQSQN-----------GSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGH
         F +++RTR   ++ +E+  LL+++   + + + S       A+F+S+   N            ++  RGRGR N  + +G  N  N  Q+SP +  Y  
Subjt:  SFRTSVRTRGGKITLDELHALLKSEAECIEQQAKSVTPFTPTAMFSSSQSQN-----------GSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGH

Query:  PSSGPTSSNTGNQNQPRGGYNHYNQNQGQGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLSTTNTWLADS
         S+ PT                    QGQ   +Q S + S NS  R  CQIC + GH ALDCY+R++ +YQGRHPP+KLAAM+S    G     +WL D+
Subjt:  PSSGPTSSNTGNQNQPRGGYNHYNQNQGQGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLSTTNTWLADS

Query:  GCNSHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQSGFGTLSTS------QSPLNLSKI---------LC-----------------DKATGATLYKG
        G   H+T ++SNL +++ Y G D + V  GQ +P+   G G L T       QS L+ SKI         LC                 D  +G  LYKG
Subjt:  GCNSHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQSGFGTLSTS------QSPLNLSKI---------LC-----------------DKATGATLYKG

Query:  RSRDGLYPISSTFKTASADSVISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKAL--TASAVDFGPKVSCRDCVSCLKGKITKLPFTSSTTNTT
         S +GLYPI +T  +   ++            S  +S ++   LWH RLGHPS  VL  AL   +S +    K     C  CL GK+ KLPF  S   +T
Subjt:  RSRDGLYPISSTFKTASADSVISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKAL--TASAVDFGPKVSCRDCVSCLKGKITKLPFTSSTTNTT

Query:  SPLALIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHT
         PL LI  DVWGP+P+ S +G+RYY+ FVDDY++++WL+ L +K DV +    F   +ENQLS  +K  R+D GGE+ +   + F  + G+ H  SCPHT
Subjt:  SPLALIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHT

Query:  PEQNGVAERKHRSIVETALALLHHASMPLEFW
        P+QNG  ERKHR I+E AL LL HAS+    W
Subjt:  PEQNGVAERKHRSIVETALALLHHASMPLEFW

A0A2N9G7E3 Integrase catalytic domain-containing protein1.5e-12539.35Show/hide
Query:  TSQPNSS-IFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSALITLINATLSKVAYSFV
        +S PN++ + LLSNI NLV V+LD +NY+ W+FQ+ S LKA+ L  +VDGS PCP+ +  N D   +  +N     W   D ALI++I ATLS  A + V
Subjt:  TSQPNSS-IFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSALITLINATLSKVAYSFV

Query:  IGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNSFRTSVRTRGGKITLDEL
        IG K++  VW  LEKR +SL+RS++  LK  L++I K   E+++ Y+ +IK+  D+L AV V I+ E++L   L+GL +E+  F +++RTR   I+ +EL
Subjt:  IGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNSFRTSVRTRGGKITLDEL

Query:  HALLKSEAECIEQQAKSVTPFTPTAMFSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQ-GQFSPFRGNYGHPSSGPTSSNTGNQNQPRGGYNHYNQNQG
        H L+  E + + +  +S       AM        G     G     Q   Q Q NRG + G+F+ +RG  G         N  N N  RGG+N+ +    
Subjt:  HALLKSEAECIEQQAKSVTPFTPTAMFSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQ-GQFSPFRGNYGHPSSGPTSSNTGNQNQPRGGYNHYNQNQG

Query:  QGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLSTTNTWLADSGCNSHVTPDLSNLTLNSNYNGEDAITVA
                ++ +S    R TCQIC + GH ALDCY+R++ ++QG+HPP+KLAAM+  + +  S++N W++D+G   H TPDL+NL    +YNG DA+TV 
Subjt:  QGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLSTTNTWLADSGCNSHVTPDLSNLTLNSNYNGEDAITVA

Query:  TGQGVPVTQSGFGTLSTSQSPLNLSKIL---------------------C-----------DKATGATLYKGRSRDGLYPISS----TFKTASADSVISK
         GQ +P+T  G   L  S+  L+L + L                     C           D  +G  LYKG +  GLYPI      + K  +     +K
Subjt:  TGQGVPVTQSGFGTLSTSQSPLNLSKIL---------------------C-----------DKATGATLYKGRSRDGLYPISS----TFKTASADSVISK

Query:  SSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQ---KALTASAVDFGPKVSCRDCVSCLKGKITKLPFTSSTTNTTSPLALIQRDVWGPSPVVSISGFR
        S+      S +   +  ++ WH RLGHP+  +LQ   K L  S +D     S   C  C  GK+++LPF+ S T+ T PL L+  DVWGP+P+ SI+G R
Subjt:  SSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQ---KALTASAVDFGPKVSCRDCVSCLKGKITKLPFTSSTTNTTSPLALIQRDVWGPSPVVSISGFR

Query:  YYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNGVAERKHRSIVETALALLH
        YYV F+DD+SK+TW FPL HK  V +  + F   LEN L+  LKV R+D GGE+ + +   + S++G+ HQ SCPHTP+QNGVAERKHR I+ETAL L+ 
Subjt:  YYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNGVAERKHRSIVETALALLH

Query:  HASMPLEFW
         +S+PL +W
Subjt:  HASMPLEFW

A0A2N9GRJ0 Uncharacterized protein2.3e-12637.89Show/hide
Query:  LPSGTSQPNSSIFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSALITLINATLSKVAY
        +P+ T Q ++ I LLSNI NLV V+LD++NY+ W++QV S+L+A+SL   +DGS+PCP++FL +  G  S  +N  ++ W++ D  L+T++NATLS    
Subjt:  LPSGTSQPNSSIFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSALITLINATLSKVAY

Query:  SFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNSFRTSVRTRGGKITL
        S V+G K++  VW  LEKR +S+ RS+I  LK  LH + K   + VD +L R+K+  D+L AV V I DE++L   L GL +E++S R+++RTR   I+ 
Subjt:  SFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNSFRTSVRTRGGKITL

Query:  DELHALLKSEAECIEQQAKSVTPFTPTAMFSSS----------QSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSGPTSSNTGNQNQP
        DEL  LL +E   ++    +       AM S+           Q  N SNRGRGR    +G+                                G +N  
Subjt:  DELHALLKSEAECIEQQAKSVTPFTPTAMFSSS----------QSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSGPTSSNTGNQNQP

Query:  RGGYNHYNQNQGQGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSS---------FYDTGLS----------------
        RGG+    QNQ     T     +S N   R  CQIC + GH ALDCY+R++ SYQGRHPP+KLAA++S           +T +S                
Subjt:  RGGYNHYNQNQGQGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSS---------FYDTGLS----------------

Query:  ------------------TTNTWLADSGCNSHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQSGFGTLSTSQSPLNLSKIL----------------C
                          +T TW++D+G   H TPDL+NL    +Y G D +++  G G+P+T  G   L  S    NL KIL                C
Subjt:  ------------------TTNTWLADSGCNSHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQSGFGTLSTSQSPLNLSKIL----------------C

Query:  DKA----------------TGATLYKGRSRDGLYPISSTFKTASADSVISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTAS---AVDFGP
        D A                +G TLYKG S+DGLYPI     ++   S    SS  P   S  +  +   ++WH RLGHP   VL   L      +V+   
Subjt:  DKA----------------TGATLYKGRSRDGLYPISSTFKTASADSVISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTAS---AVDFGP

Query:  KVSCRDCVSCLKGKITKLPFTSSTTNTTSPLALIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSD
        K S   C  C++GK+ + PF SS+   T+PL L+  DVWGP+PV SI+G R+YV FVD ++++TWLFP+ HK  V      F   +EN L+  +KV R+D
Subjt:  KVSCRDCVSCLKGKITKLPFTSSTTNTTSPLALIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSD

Query:  GGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNGVAERKHRSIVETALALLHHASMPLEFW
         GGE+ N +   F ST+G++HQ SCPHTP+QNGVAERKHR IVETAL L+  +S+PL++W
Subjt:  GGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNGVAERKHRSIVETALALLHHASMPLEFW

A0A2N9I8F3 Uncharacterized protein2.2e-12439.28Show/hide
Query:  NLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSALITLINATLSKVAYSFVIGFKTSNQVWTVLEKR
        NL+ V+LDS+N++ W+ Q+ S+LKA+S+   VDG+ P P  FL N DG  +T +NP   LW   D  L+ LIN+TLS    S V+G  ++ +VW  LE R
Subjt:  NLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSALITLINATLSKVAYSFVIGFKTSNQVWTVLEKR

Query:  LSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNSFRTSVRTRGGKITLDELHALLKSEAECIEQQAK
         +S +R+++  LK  LH + KG +ET+  YL ++K+  D+L AV  +ID+E+LL   L GL  EY  F +++RTR   +T +E+  LL++E +   + + 
Subjt:  LSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNSFRTSVRTRGGKITLDELHALLKSEAECIEQQAK

Query:  SVTPFTPTAMFSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSGPTSSNTGNQNQPRG-GYNHYNQNQGQGRGTQISNSSSSNSQ
        S     P AMF+S+ +   SN  +    GN  QF+G+                            G  N  RG G   YN NQ      Q S S+  NSQ
Subjt:  SVTPFTPTAMFSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSGPTSSNTGNQNQPRG-GYNHYNQNQGQGRGTQISNSSSSNSQ

Query:  -------GRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLSTTNTWLADSGCNSHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQ
                R  CQIC + GH ALDCY+R++ +YQGRHPP+KLAAM+S    G    +TWL D+G   H+T +L+NL   + Y G + ++V  GQ +P+  
Subjt:  -------GRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLSTTNTWLADSGCNSHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQ

Query:  SGFGTLSTSQSPLNLSKIL----------------------C----------DKATGATLYKGRSRDGLYPISSTFKTASADSVISKSSFTPLCASVHVS
         G G LST      L  +L                      C          D  +G  LYKG S++GLYPI     T  + S +S S+      S  +S
Subjt:  SGFGTLSTSQSPLNLSKIL----------------------C----------DKATGATLYKGRSRDGLYPISSTFKTASADSVISKSSFTPLCASVHVS

Query:  RESHAALWHLRLGHPSHVVLQKAL--TASAVDFGPKVSCRDCVSCLKGKITKLPFTSSTTNTTSPLALIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTW
         ++   LWH RLGHPS  VL  A+   +S +    K     C  CL GK+ +LPF  S   +T PL L+  DVWGP+PV S +G++YY+ FVDD+SK++W
Subjt:  RESHAALWHLRLGHPSHVVLQKAL--TASAVDFGPKVSCRDCVSCLKGKITKLPFTSSTTNTTSPLALIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTW

Query:  LFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNGVAERKHRSIVETALALLHHASMPLEFW
        LF L  K +V N    F   +ENQLS S+K  R+D GGE+ + +   F ST+G+ HQ SCPHTP+QNG  ERKHR I+E+AL LL HAS+P+  W
Subjt:  LFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNGVAERKHRSIVETALALLHHASMPLEFW

A0A5J5A1U7 Integrase catalytic domain-containing protein4.4e-13339.97Show/hide
Query:  ASSSSSQSSSLTEVLPSGTSQPNSSIFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSA
        A + ++ S++ T   P+ +  P   IFLLSNICNL+  RLDSSNY+ W+FQ+ S+LKAHSL G +DG+ PCP++F+ +  G  + QINP + +W   D A
Subjt:  ASSSSSQSSSLTEVLPSGTSQPNSSIFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSA

Query:  LITLINATLSKVAYSFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNS
        L+TL+NATLS+ A S VIG+ TS + W  LE+R S+ TRS+I +LKS+LH I+KG  +++D Y+ +IK   D LA+VSV+I+DED+L+Y LNGL  EYN+
Subjt:  LITLINATLSKVAYSFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNS

Query:  FRTSVRTRGGKITLDELHALLKSEAECIEQQAK-SVTPFTPTAMFSSSQSQN-GSNRG------RGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSG
        F+TS+RT+   ITL+E++A+LK E + IE   K + +P  P AM +++   N  SNRG       GRGRG             +G+FS  RG   H    
Subjt:  FRTSVRTRGGKITLDELHALLKSEAECIEQQAK-SVTPFTPTAMFSSSQSQN-GSNRG------RGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSG

Query:  PTSSNTGNQNQPRGGYNHYNQNQGQGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLS-TTNTWLADSGCN
          S N G  N P      Y   Q      Q SN  S+NS   + CQICN+ GH ALDCY+R++ SYQG+ P  +L AMS+ Y+TG   + N W  D+G  
Subjt:  PTSSNTGNQNQPRGGYNHYNQNQGQGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLS-TTNTWLADSGCN

Query:  SHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQSGFGTLSTSQSPLNLSKILC--------------------------------DKATGATLYKGRSR
        +H+T DL+NL     Y G+D IT+A GQ + ++ SG  ++  +     L+ +LC                                DKAT   L++G S 
Subjt:  SHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQSGFGTLSTSQSPLNLSKILC--------------------------------DKATGATLYKGRSR

Query:  DGLYPI-SSTFKTASADSV------------------ISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTASAVDFGPKVSCRDCVSCLKGK
         GLYP+ +S+    SA S+                  + +++++    + ++ ++    LWH RLGHPS   LQ  L+++++   P+ S   C  CL GK
Subjt:  DGLYPI-SSTFKTASADSV------------------ISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTASAVDFGPKVSCRDCVSCLKGK

Query:  ITKLPFTSSTTNTTSPLALIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFF
        +TKLPF  STT +T+PL L+  D+WGP+P  S   F YYV FVDD+S                                    RSDGGGE+    L Q  
Subjt:  ITKLPFTSSTTNTTSPLALIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFF

Query:  STKGVVHQRSCPHTPEQNGVAERKHRSIVETALALLHHASMPLEFWKI
        +  G+ H+RSCPHTP+QNG+AERKHR IVET L LL  AS+PL++W +
Subjt:  STKGVVHQRSCPHTPEQNGVAERKHRSIVETALALLHHASMPLEFWKI

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.2e-2424.82Show/hide
Query:  NHYNQNQGQGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYN-RLNLSYQGRHPPSKLAAMSS---------FYDTGLSTTNTWLADSGCNSHVTPDL
        N Y  N  + R T+       NS+ ++ C  C R GH   DC++ +  L+ + +    ++   +S           +T +     ++ DSG + H+  D 
Subjt:  NHYNQNQGQGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYN-RLNLSYQGRHPPSKLAAMSS---------FYDTGLSTTNTWLADSGCNSHVTPDL

Query:  SNLTLNSNYNGEDAITVA-TGQGVPVTQSGFGTLSTSQSPLNLSKILCDKATGATLYKGRSRDGLYPI----SSTFKTASADSVISKSSF---TPL----
        S  T +        I VA  G+ +  T+ G   L           + C +A G  +   R ++    I    S    + +   V+  S      P+    
Subjt:  SNLTLNSNYNGEDAITVA-TGQGVPVTQSGFGTLSTSQSPLNLSKILCDKATGATLYKGRSRDGLYPI----SSTFKTASADSVISKSSF---TPL----

Query:  CASVHVSRESHAALWHLRLGHPS-----HVVLQKALTASAVDFGPKVSCRDCVSCLKGKITKLPF--TSSTTNTTSPLALIQRDVWGPSPVVSISGFRYY
          S++   +++  LWH R GH S      +  +   +  ++    ++SC  C  CL GK  +LPF      T+   PL ++  DV GP   V++    Y+
Subjt:  CASVHVSRESHAALWHLRLGHPS-----HVVLQKALTASAVDFGPKVSCRDCVSCLKGKITKLPF--TSSTTNTTSPLALIQRDVWGPSPVVSISGFRYY

Query:  VCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNGVAERKHRSIVETALALLHHA
        V FVD ++ Y   + + +K DV ++   FV   E   +  +     D G E+++  + QF   KG+ +  + PHTP+ NGV+ER  R+I E A  ++  A
Subjt:  VCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNGVAERKHRSIVETALALLHHA

Query:  SMPLEFW
         +   FW
Subjt:  SMPLEFW

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-2925.08Show/hide
Query:  WIAHDSALITLINATLSKVAYSFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNG
        W   D    + I   LS    + +I   T+  +WT LE    S T ++   LK  L+ +          +L     L+ QLA + V I++ED  +  LN 
Subjt:  WIAHDSALITLINATLSKVAYSFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNG

Query:  LSSEYNSFRTSVRTRGGKITL---DELHALLKSEAECIEQQAKSVTPFTPTAMFSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPS
        L S Y++  T++    GK T+   D   ALL +E                  M    ++Q  +    GRGR  Q   +   N G  G     RG   + S
Subjt:  LSSEYNSFRTSVRTRGGKITL---DELHALLKSEAECIEQQAKSVTPFTPTAMFSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPS

Query:  SGPTSSNTGNQNQPRGGYNHYNQNQGQGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLSTTNTWLADSGC
              N  N NQP G +     N  +G+G      +  N+   +                + + L          L+   S           W+ D+  
Subjt:  SGPTSSNTGNQNQPRGGYNHYNQNQGQGRGTQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLSTTNTWLADSGC

Query:  NSHVTP--DLSNLTLNSNYN----GEDAITVATGQGVPVTQSGFGTLSTSQSPLNLSKILCDKATGATLYKGRSRDGL--YPISSTFKTASADSVISKS-
        + H TP  DL    +  ++     G  + +   G G    ++  G     +   ++  +  +  +G  L     RDG   Y  +  ++      VI+K  
Subjt:  NSHVTP--DLSNLTLNSNYN----GEDAITVATGQGVPVTQSGFGTLSTSQSPLNLSKILCDKATGATLYKGRSRDGL--YPISSTFKTASADSVISKS-

Query:  -------SFTPLCASV--HVSRESHAALWHLRLGHPSHVVLQKALTASAVDFGPKVSCRDCVSCLKGKITKLPFTSSTTNTTSPLALIQRDVWGPSPVVS
               +   +C         E    LWH R+GH S   LQ     S + +    + + C  CL GK  ++ F +S+    + L L+  DV GP  + S
Subjt:  -------SFTPLCASV--HVSRESHAALWHLRLGHPSHVVLQKALTASAVDFGPKVSCRDCVSCLKGKITKLPFTSSTTNTTSPLALIQRDVWGPSPVVS

Query:  ISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNGVAERKHRSIVETA
        + G +Y+V F+DD S+  W++ L  K  V  V  +F   +E +    LK  RSD GGE+ ++   ++ S+ G+ H+++ P TP+ NGVAER +R+IVE  
Subjt:  ISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNGVAERKHRSIVETA

Query:  LALLHHASMPLEFW
         ++L  A +P  FW
Subjt:  LALLHHASMPLEFW

Q12491 Transposon Ty2-B Gag-Pol polyprotein3.7e-1220.8Show/hide
Query:  VWTVLEKRLSSLTRSHIHELKS--SLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNSFRTSVRTRGGKITLDELHALLKS
        + TVL K +S + +++  ELK   +L  +    + + D + + +  ++ +L   ++ + D       L GLS ++   R   RT+   + L +L A    
Subjt:  VWTVLEKRLSSLTRSHIHELKS--SLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNSFRTSVRTRGGKITLDELHALLKS

Query:  EAECIEQQAKSVTPFTPTAMFSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSGPTSSNTGNQNQPRGGYNHYNQNQGQGRGTQI
        E + I  + K +    P+     S+ +N S       R +      +    N       R N   P +    +   +    R   +H N++    +    
Subjt:  EAECIEQQAKSVTPFTPTAMFSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSGPTSSNTGNQNQPRGGYNHYNQNQGQGRGTQI

Query:  SNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFY-DTGLSTTNTWLADSGCNSHVTPDLSNLTLNSNYNGEDAITVATGQGVP
         N  S   Q     Q  ++P H  +D  + L          S+    S+ Y       +   + D+         + NL  N     + +I       + 
Subjt:  SNSSSSNSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFY-DTGLSTTNTWLADSGCNSHVTPDLSNLTLNSNYNGEDAITVATGQGVP

Query:  VTQSGFGTLSTSQSPLNLSKILCDKATGATLYKGRSRDGLYPISSTFKTASADSVISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTASAV
                L+        ++   +++ G  L         Y +S  +   S    ISK +   +  S  V++  +  L H  LGH +   +QK+L  +AV
Subjt:  VTQSGFGTLSTSQSPLNLSKILCDKATGATLYKGRSRDGLYPISSTFKTASADSVISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTASAV

Query:  DF-------GPKVSCRDCVSCLKGKITKLPFTSST----TNTTSPLALIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLTHKYD--VQNVIMRFV
         +           S   C  CL GK TK      +      +  P   +  D++GP   +  S   Y++ F D+ +++ W++PL  + +  + NV    +
Subjt:  DF-------GPKVSCRDCVSCLKGKITKLPFTSST----TNTTSPLALIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLTHKYD--VQNVIMRFV

Query:  PFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNGVAERKHRSIVETALALLHHASMPLEFW
         F++NQ +  + V + D G E+ NK+LH+FF+ +G+    +       +GVAER +R+++     LLH + +P   W
Subjt:  PFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNGVAERKHRSIVETALALLHHASMPLEFW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.3e-7830.74Show/hide
Query:  NSSIFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSALITLINATLSKVAYSFVIGFKT
        N++  L  N+ N+   +L S+NYL W  QV ++   + L G +DGS   P   +       + ++NP ++ W   D  + + +   +S      V    T
Subjt:  NSSIFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSALITLINATLSKVAYSFVIGFKT

Query:  SNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNSFRTSVRTRGGKITLDELH-ALL
        + Q+W  L K  ++ +  H+ +L++ L   TKG T+T+DDY+  +    DQLA +   +D ++ +   L  L  EY      +  +    TL E+H  LL
Subjt:  SNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNSFRTSVRTRGGKITLDELH-ALL

Query:  KSEAECIEQQAKSVTPFTPTAM-FSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSGPTSSNTGNQNQPRGGYNHYNQNQGQGRG
          E++ +   + +V P T  A+   ++ + N +N G    R +        NR N     P++       S        NQ++P  G             
Subjt:  KSEAECIEQQAKSVTPFTPTAM-FSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSGPTSSNTGNQNQPRGGYNHYNQNQGQGRG

Query:  TQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNL--SYQGRHPPSKLAAMSSFYDTGLS---TTNTWLADSGCNSHVTPDLSNLTLNSNYNGEDAITV
                        CQIC   GH A  C    +   S   + PPS         +  L    ++N WL DSG   H+T D +NL+L+  Y G D + V
Subjt:  TQISNSSSSNSQGRITCQICNRPGHGALDCYNRLNL--SYQGRHPPSKLAAMSSFYDTGLS---TTNTWLADSGCNSHVTPDLSNLTLNSNYNGEDAITV

Query:  ATGQGVPVTQSGFGTLSTSQSPLNLSKI---------------LC-----------------DKATGATLYKGRSRDGLY--PISSTFKTASADSVISKS
        A G  +P++ +G  +LST   PLNL  I               LC                 D  TG  L +G+++D LY  PI+S+   +   S  SK+
Subjt:  ATGQGVPVTQSGFGTLSTSQSPLNLSKI---------------LC-----------------DKATGATLYKGRSRDGLY--PISSTFKTASADSVISKS

Query:  SFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTASAVD-FGPKVSCRDCVSCLKGKITKLPFTSSTTNTTSPLALIQRDVWGPSPVVSISGFRYYV
                      +H++ WH RLGHP+  +L   ++  ++    P      C  CL  K  K+PF+ ST N+T PL  I  DVW  SP++S   +RYYV
Subjt:  SFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTASAVD-FGPKVSCRDCVSCLKGKITKLPFTSSTTNTTSPLALIQRDVWGPSPVVSISGFRYYV

Query:  CFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNGVAERKHRSIVETALALLHHAS
         FVD +++YTWL+PL  K  V+   + F   LEN+    +  F SD GGEFV  +L ++FS  G+ H  S PHTPE NG++ERKHR IVET L LL HAS
Subjt:  CFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNGVAERKHRSIVETALALLHHAS

Query:  MPLEFW
        +P  +W
Subjt:  MPLEFW

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.4e-6629.13Show/hide
Query:  RLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQST-QINPAHSLWIAHDSALITLINATLSKVAYSFVIGFKTSNQVWTVLEKRLSSL
        +L S+NYL W  QV ++   + L G +DGS P P    P   G  +  ++NP ++ W   D  + + I   +S      V    T+ Q+W  L K  ++ 
Subjt:  RLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQST-QINPAHSLWIAHDSALITLINATLSKVAYSFVIGFKTSNQVWTVLEKRLSSL

Query:  TRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNSFRTSVRTRGGKITLDELH-ALLKSEAECIEQQAKSVT
        +  H+ +L+                ++ R     DQLA +   +D ++ +   L  L  +Y      +  +    +L E+H  L+  E++ +   +  V 
Subjt:  TRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNSFRTSVRTRGGKITLDELH-ALLKSEAECIEQQAKSVT

Query:  PFTPTAMF--SSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSGPTSSNTGNQNQPRGGYNHYNQNQGQGRGTQISNSSSSNSQGR
        P T   +   +++ ++N +NRG  R   N        NR N  Q S         SSG  S N   Q +P  G                           
Subjt:  PFTPTAMF--SSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSGPTSSNTGNQNQPRGGYNHYNQNQGQGRGTQISNSSSSNSQGR

Query:  ITCQICNRPGHGALDC-----YNRLNLSYQGRHPPSKLAAMSSFYDTGLSTTNTWLADSGCNSHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQSGFG
          CQIC+  GH A  C     +       Q   P +     ++         N WL DSG   H+T D +NL+ +  Y G D + +A G  +P+T +G  
Subjt:  ITCQICNRPGHGALDC-----YNRLNLSYQGRHPPSKLAAMSSFYDTGLSTTNTWLADSGCNSHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQSGFG

Query:  TLSTSQSPLNLSKI---------------LC-----------------DKATGATLYKGRSRDGLYPISSTFKTASADSVISKSSFTPLCASVHVSRESH
        +L TS   L+L+K+               LC                 D  TG  L +G+++D LY     +  AS+ +V   S F   C     S+ +H
Subjt:  TLSTSQSPLNLSKI---------------LC-----------------DKATGATLYKGRSRDGLYPISSTFKTASADSVISKSSFTPLCASVHVSRESH

Query:  AALWHLRLGHPSHVVLQKALTASAVD-FGPKVSCRDCVSCLKGKITKLPFTSSTTNTTSPLALIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLT
        ++ WH RLGHPS  +L   ++  ++    P      C  C   K  K+PF++ST  ++ PL  I  DVW  SP++SI  +RYYV FVD +++YTWL+PL 
Subjt:  AALWHLRLGHPSHVVLQKALTASAVD-FGPKVSCRDCVSCLKGKITKLPFTSSTTNTTSPLALIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLT

Query:  HKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNGVAERKHRSIVETALALLHHASMPLEFW
         K  V++  + F   +EN+    +    SD GGEFV   L  + S  G+ H  S PHTPE NG++ERKHR IVE  L LL HAS+P  +W
Subjt:  HKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNGVAERKHRSIVETALALLHHASMPLEFW

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).2.6e-0523.78Show/hide
Query:  SGTSQPNSSIFLLSNI-----CNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSALITLINATLSK
        S TS P+S  +L  +I      ++  +  D  NY+ W+ +  S L+    FG +DG+ P PD F P     +  Q N     W+ +     ++ +  L  
Subjt:  SGTSQPNSSIFLLSNI-----CNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSALITLINATLSK

Query:  VAYSFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLA
        V Y+     +T++++W  L +         I++L+  L T+ +G  ++V++Y  ++  +  +L+
Subjt:  VAYSFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLA

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)4.1e-0626.18Show/hide
Query:  IFLLSNICNLVHVRLD--SSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSALITLINATLSKVAY--SFVIGFK
        I+ +SNI + + V LD   SNY  WR    +   +  + G +DG+       LP       T  N  +  W   D  +   +  TL+   +  SFV    
Subjt:  IFLLSNICNLVHVRLD--SSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSALITLINATLSKVAY--SFVIGFK

Query:  TSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNSFRTSVRTRGGKITLDELHALL
        TS  +W  ++ +  +   +    L S L T   G    V DY  ++K L D L  V V + D +L++Y LNGL+ ++++    ++ R    + D+   +L
Subjt:  TSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNSFRTSVRTRGGKITLDELHALL

Query:  KSEAECIEQQAK------------SVTPFTPTAMFSSSQSQNGSNRG-RGRGRGNQGQFQGQFNRGNQGQFS----PFRGNYGHPSSGPTSSNTGNQ-NQ
        + E + +++  K            +V   +     ++ Q   G+  G RGRGRGN         RG  G+FS    P   ++  P   P   N+    N 
Subjt:  KSEAECIEQQAK------------SVTPFTPTAMFSSSQSQNGSNRG-RGRGRGNQGQFQGQFNRGNQGQFS----PFRGNYGHPSSGPTSSNTGNQ-NQ

Query:  PRGGYNHYNQNQGQGRG
        P G   + N N G G G
Subjt:  PRGGYNHYNQNQGQGRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTTCTTCTTCCCAGTCCTCAAGTCTTACTGAAGTTCTACCTAGTGGTACCTCTCAGCCGAATTCATCCATTTTTCTTTTATCAAACATCTGTAACCTCGT
TCATGTGCGTCTTGATTCATCTAATTACTTGTTCTGGCGATTTCAAGTGGAATCAATGCTCAAAGCTCACTCGTTGTTCGGAATTGTTGATGGATCAAAACCTTGTCCTG
ATGAATTTCTTCCGAACGGTGATGGAGGACAGTCTACTCAGATCAATCCCGCTCATAGTCTATGGATTGCTCACGACAGTGCCCTGATTACTCTCATAAACGCAACTCTT
TCGAAAGTTGCCTACTCTTTCGTGATCGGCTTCAAAACCTCAAATCAGGTATGGACTGTGCTTGAGAAACGCCTCTCCTCCTTGACTCGTTCACATATACATGAGTTAAA
ATCGTCTTTGCATACGATTACTAAAGGACCTACTGAAACTGTTGATGATTATCTTGTTCGCATCAAAGATCTAGTTGATCAATTAGCTGCTGTATCTGTAGTCATAGATG
ACGAGGATTTATTGTTGTATACCTTGAATGGATTATCTTCAGAGTATAATTCTTTTCGAACCTCTGTACGAACTCGTGGTGGAAAGATTACTCTGGATGAATTACATGCG
TTATTGAAGTCTGAAGCTGAATGTATCGAACAGCAAGCAAAATCTGTAACTCCATTCACCCCAACCGCAATGTTCTCCTCCTCTCAATCTCAGAATGGCTCCAATCGTGG
TCGTGGAAGGGGAAGGGGAAATCAAGGTCAGTTTCAAGGGCAATTCAATCGTGGAAATCAGGGTCAATTTTCCCCATTCAGAGGCAATTATGGTCATCCTAGCAGTGGTC
CAACCTCGAGTAATACTGGCAACCAAAATCAACCTCGAGGAGGTTATAATCACTATAATCAGAATCAAGGACAAGGCCGTGGGACTCAGATTTCGAATTCCTCTTCTAGC
AACAGTCAAGGTCGCATTACATGTCAGATCTGTAACAGACCTGGTCACGGTGCATTGGACTGTTACAATCGCCTCAACTTGTCCTATCAAGGGCGGCATCCACCCTCTAA
GCTAGCAGCTATGTCCTCGTTCTATGACACTGGTTTGTCTACTACCAACACTTGGTTAGCCGACAGTGGGTGTAACTCCCATGTTACACCGGATCTCTCCAACTTAACAC
TCAATTCCAATTACAATGGTGAAGATGCTATCACTGTTGCCACTGGCCAGGGTGTACCCGTTACTCAATCGGGTTTTGGTACACTTTCTACTTCTCAAAGTCCCCTGAAT
CTATCCAAAATTCTTTGTGACAAGGCCACGGGTGCAACTTTATACAAGGGCAGGAGTAGAGATGGTTTATATCCTATATCATCCACTTTCAAGACTGCTTCTGCTGATTC
GGTGATTTCTAAATCGTCCTTCACTCCTCTCTGTGCATCCGTACATGTTTCCCGTGAATCTCATGCTGCTCTATGGCACTTGCGACTCGGGCATCCCTCTCATGTTGTTT
TACAAAAGGCTTTGACTGCTAGTGCTGTTGATTTTGGTCCTAAAGTTTCTTGTCGAGACTGTGTTAGTTGCTTAAAAGGGAAGATTACTAAACTTCCTTTTACTTCATCT
ACAACTAATACAACTAGTCCATTAGCTTTAATACAGCGTGATGTATGGGGGCCATCACCCGTTGTGTCCATTTCTGGCTTCCGGTATTATGTTTGTTTTGTTGATGACTA
TAGCAAATACACATGGCTTTTTCCTCTCACGCATAAATATGATGTTCAGAATGTTATCATGCGCTTTGTGCCATTTCTTGAAAATCAGTTGTCCTGTTCTCTGAAAGTTT
TTCGCTCGGACGGGGGAGGTGAGTTTGTTAACAAATCTTTACATCAGTTCTTCTCCACTAAGGGAGTTGTTCATCAACGATCTTGCCCTCATACTCCTGAGCAAAACGGA
GTTGCTGAGCGCAAGCATCGTTCGATAGTTGAGACTGCATTAGCTCTGTTGCATCATGCATCTATGCCCTTGGAGTTTTGGAAGATACTAAAGTGGCATAGATTTGGTGC
ACAAGAGGCACACCAAGAAACTGGACCCAAGAAGAAGATTGACCAAAGGAATTGTTTTCTCTTGTTTTGCAAGCCACGTCTTTTTCCATCAAACAAATTTACAGTAGCTG
TCACGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTCTTCTTCTTCCCAGTCCTCAAGTCTTACTGAAGTTCTACCTAGTGGTACCTCTCAGCCGAATTCATCCATTTTTCTTTTATCAAACATCTGTAACCTCGT
TCATGTGCGTCTTGATTCATCTAATTACTTGTTCTGGCGATTTCAAGTGGAATCAATGCTCAAAGCTCACTCGTTGTTCGGAATTGTTGATGGATCAAAACCTTGTCCTG
ATGAATTTCTTCCGAACGGTGATGGAGGACAGTCTACTCAGATCAATCCCGCTCATAGTCTATGGATTGCTCACGACAGTGCCCTGATTACTCTCATAAACGCAACTCTT
TCGAAAGTTGCCTACTCTTTCGTGATCGGCTTCAAAACCTCAAATCAGGTATGGACTGTGCTTGAGAAACGCCTCTCCTCCTTGACTCGTTCACATATACATGAGTTAAA
ATCGTCTTTGCATACGATTACTAAAGGACCTACTGAAACTGTTGATGATTATCTTGTTCGCATCAAAGATCTAGTTGATCAATTAGCTGCTGTATCTGTAGTCATAGATG
ACGAGGATTTATTGTTGTATACCTTGAATGGATTATCTTCAGAGTATAATTCTTTTCGAACCTCTGTACGAACTCGTGGTGGAAAGATTACTCTGGATGAATTACATGCG
TTATTGAAGTCTGAAGCTGAATGTATCGAACAGCAAGCAAAATCTGTAACTCCATTCACCCCAACCGCAATGTTCTCCTCCTCTCAATCTCAGAATGGCTCCAATCGTGG
TCGTGGAAGGGGAAGGGGAAATCAAGGTCAGTTTCAAGGGCAATTCAATCGTGGAAATCAGGGTCAATTTTCCCCATTCAGAGGCAATTATGGTCATCCTAGCAGTGGTC
CAACCTCGAGTAATACTGGCAACCAAAATCAACCTCGAGGAGGTTATAATCACTATAATCAGAATCAAGGACAAGGCCGTGGGACTCAGATTTCGAATTCCTCTTCTAGC
AACAGTCAAGGTCGCATTACATGTCAGATCTGTAACAGACCTGGTCACGGTGCATTGGACTGTTACAATCGCCTCAACTTGTCCTATCAAGGGCGGCATCCACCCTCTAA
GCTAGCAGCTATGTCCTCGTTCTATGACACTGGTTTGTCTACTACCAACACTTGGTTAGCCGACAGTGGGTGTAACTCCCATGTTACACCGGATCTCTCCAACTTAACAC
TCAATTCCAATTACAATGGTGAAGATGCTATCACTGTTGCCACTGGCCAGGGTGTACCCGTTACTCAATCGGGTTTTGGTACACTTTCTACTTCTCAAAGTCCCCTGAAT
CTATCCAAAATTCTTTGTGACAAGGCCACGGGTGCAACTTTATACAAGGGCAGGAGTAGAGATGGTTTATATCCTATATCATCCACTTTCAAGACTGCTTCTGCTGATTC
GGTGATTTCTAAATCGTCCTTCACTCCTCTCTGTGCATCCGTACATGTTTCCCGTGAATCTCATGCTGCTCTATGGCACTTGCGACTCGGGCATCCCTCTCATGTTGTTT
TACAAAAGGCTTTGACTGCTAGTGCTGTTGATTTTGGTCCTAAAGTTTCTTGTCGAGACTGTGTTAGTTGCTTAAAAGGGAAGATTACTAAACTTCCTTTTACTTCATCT
ACAACTAATACAACTAGTCCATTAGCTTTAATACAGCGTGATGTATGGGGGCCATCACCCGTTGTGTCCATTTCTGGCTTCCGGTATTATGTTTGTTTTGTTGATGACTA
TAGCAAATACACATGGCTTTTTCCTCTCACGCATAAATATGATGTTCAGAATGTTATCATGCGCTTTGTGCCATTTCTTGAAAATCAGTTGTCCTGTTCTCTGAAAGTTT
TTCGCTCGGACGGGGGAGGTGAGTTTGTTAACAAATCTTTACATCAGTTCTTCTCCACTAAGGGAGTTGTTCATCAACGATCTTGCCCTCATACTCCTGAGCAAAACGGA
GTTGCTGAGCGCAAGCATCGTTCGATAGTTGAGACTGCATTAGCTCTGTTGCATCATGCATCTATGCCCTTGGAGTTTTGGAAGATACTAAAGTGGCATAGATTTGGTGC
ACAAGAGGCACACCAAGAAACTGGACCCAAGAAGAAGATTGACCAAAGGAATTGTTTTCTCTTGTTTTGCAAGCCACGTCTTTTTCCATCAAACAAATTTACAGTAGCTG
TCACGTGA
Protein sequenceShow/hide protein sequence
MASSSSSQSSSLTEVLPSGTSQPNSSIFLLSNICNLVHVRLDSSNYLFWRFQVESMLKAHSLFGIVDGSKPCPDEFLPNGDGGQSTQINPAHSLWIAHDSALITLINATL
SKVAYSFVIGFKTSNQVWTVLEKRLSSLTRSHIHELKSSLHTITKGPTETVDDYLVRIKDLVDQLAAVSVVIDDEDLLLYTLNGLSSEYNSFRTSVRTRGGKITLDELHA
LLKSEAECIEQQAKSVTPFTPTAMFSSSQSQNGSNRGRGRGRGNQGQFQGQFNRGNQGQFSPFRGNYGHPSSGPTSSNTGNQNQPRGGYNHYNQNQGQGRGTQISNSSSS
NSQGRITCQICNRPGHGALDCYNRLNLSYQGRHPPSKLAAMSSFYDTGLSTTNTWLADSGCNSHVTPDLSNLTLNSNYNGEDAITVATGQGVPVTQSGFGTLSTSQSPLN
LSKILCDKATGATLYKGRSRDGLYPISSTFKTASADSVISKSSFTPLCASVHVSRESHAALWHLRLGHPSHVVLQKALTASAVDFGPKVSCRDCVSCLKGKITKLPFTSS
TTNTTSPLALIQRDVWGPSPVVSISGFRYYVCFVDDYSKYTWLFPLTHKYDVQNVIMRFVPFLENQLSCSLKVFRSDGGGEFVNKSLHQFFSTKGVVHQRSCPHTPEQNG
VAERKHRSIVETALALLHHASMPLEFWKILKWHRFGAQEAHQETGPKKKIDQRNCFLLFCKPRLFPSNKFTVAVT