; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016097 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016097
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr12:33100973..33104062
RNA-Seq ExpressionLag0016097
SyntenyLag0016097
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]6.4e-13245.42Show/hide
Query:  ESSLVVNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVT
        +SS   N  +  W A DQ LLGW+ NSMT E+ATQ++  E +K LW   Q L G  +R++  +L+  F   RKG +KM +YL  MKN  D L LAG+PV+
Subjt:  ESSLVVNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVT

Query:  NRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGS
          +L+ Q L GLD E+N VV  L  + ++SW +LQA+LL FE R+E  N   +  + + NATAN+A NR  +  K SN          +   N+RG  G 
Subjt:  NRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGS

Query:  RGRGKGRGYNYSNNRPPCQVCGKGN--------------VNGNYHPGRGNGQSPNAFMATQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGN
        RGRGK       + + PCQVCG  N                 N+  G     S NAF+A+Q      +V D  WY DSGASNHVT   E   + TE+ G 
Subjt:  RGRGKGRGYNYSNNRPPCQVCGKGN--------------VNGNYHPGRGNGQSPNAFMATQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGN

Query:  ERVTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSS
          + VGNG+KL+I   GSS L      LNL ++L VP I KNL+S+SKLA DNN+ VEF  + C VKDK +G+V+LKG LKDGLYQL           S 
Subjt:  ERVTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSS

Query:  NRLNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPA
         + N S                    V  K  WHRRLGHP+ KVLD +++ C ++V  ++  SFCE+CQ+GK H L F  S++ A    +LVHTD+WGPA
Subjt:  NRLNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPA

Query:  PVPSVEGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVE
        P+ +  GF+YYV F+DD SRFTW+YPLKQK++T+ AF  F+ + + QF   IK +Q D GGE+  V KL  + GI  R S PYTS QNGRAERKHRH+ E
Subjt:  PVPSVEGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVE

Query:  TGLTLLAQASMP
         GLTLLAQA MP
Subjt:  TGLTLLAQASMP

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]1.1e-12344.12Show/hide
Query:  ESSLVVNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVT
        + S  VNP +  W+A DQ LLGWL NSM  ++ATQ++  E +K LW   Q L G  +++   +L+  F  TRKG +KM EYL  MKN +D L LAGSP++
Subjt:  ESSLVVNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVT

Query:  NRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGS
        N +L+ Q L GLD E+N VV  L  + ++SW ++QA+LL FE RL+  N      +FS   T N + N  + +    N  N  GN   +   N RG  G 
Subjt:  NRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGS

Query:  RGRGKGRGYNYSNNRPPCQVC-GKGNV-------------NGNYHPGRGNGQSPNAFMATQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGN
         GRGKGR    SN +  CQVC G G++               NY        S +AF+     A+P    D  WY DSGA+NHVT   +      E+ G 
Subjt:  RGRGKGRGYNYSNNRPPCQVC-GKGNV-------------NGNYHPGRGNGQSPNAFMATQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGN

Query:  ERVTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSS
          + VGNG+KL I   GS+ L +    LNL +VL VP+I KNL+S+SKL  DNN+ VEF  + C VKDK +GQ +LKG LKDGLYQL +    V  SV  
Subjt:  ERVTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSS

Query:  NRLNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPA
                                     K  WHR+LGHP+ KVLD ++K CN+++  ++  SFCE+CQFGK H L F  S++       L+H+D+WGPA
Subjt:  NRLNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPA

Query:  PVPSVEGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVE
        P+ S  GF+YYV F+DD SRFTW++PLKQK+DT+ AF  F+ + + QF   IK +Q D GGE+  V K+  + GI  R S PYTS QNGRAERKHRH+ E
Subjt:  PVPSVEGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVE

Query:  TGLTLLAQASMP
         GLTLLAQA MP
Subjt:  TGLTLLAQASMP

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]7.1e-13144.12Show/hide
Query:  ESSLVVNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVT
        +SS   NP++E W A DQ LLGWL NSMT  +ATQ++  E +  LW   Q L G  +R++  +L+  F  TRKG +KM +YL  MKN AD L LAG+P++
Subjt:  ESSLVVNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVT

Query:  NRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGS
          +L+ Q L GLD E+N VV  L  + ++SW +LQA+LL FE R+E  N   S  + + NATAN+A        + ++  N  G+   +   N RG  G 
Subjt:  NRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGS

Query:  RGRGKGRGYNYSNNRPPCQVCGKGN--------------VNGNYHPGRGNGQSPNAFMATQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGN
        RGRG+       + +  CQVCG  N                 N+        S NAF+A+Q      ++ D  WY DSGASNHVT   +   N +E+ G 
Subjt:  RGRGKGRGYNYSNNRPPCQVCGKGN--------------VNGNYHPGRGNGQSPNAFMATQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGN

Query:  ERVTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSS
          + VGNG+KL I   GSS L      LNL ++L VP+I KNL+S+SKLA DNN+ VEF  + C VKDK +G+ +L+G LKDGLYQL + ++    S+  
Subjt:  ERVTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSS

Query:  NRLNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPA
                                     K  WHR+LGHP+ KVLD+++K CN+++  ++  SFCE+CQ+GK H L F  S + A    +LVHTD+WGPA
Subjt:  NRLNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPA

Query:  PVPSVEGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVE
        P+ S  GF+YYV F+DD +RFTW+YPLKQK+DT  AF  F+ MV+ QF   IK +Q D GGE+  V K   + GI  R S PYTS QNGRAERKHRH+ E
Subjt:  PVPSVEGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVE

Query:  TGLTLLAQASMP
         GLTLLAQA MP
Subjt:  TGLTLLAQASMP

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]3.4e-12544.39Show/hide
Query:  VNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVTNRNLV
        +NP Y+ W A DQ LLGWL NSMT ++ATQV+  E +K LW   Q L G  +R+   +L+  F  T K  +KM +YL  MKN AD L LAGSP+++ +L+
Subjt:  VNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVTNRNLV

Query:  SQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGSRGRGK
         Q L GLD E+N VV  L  + ++SW + QA+LL FE RL+  N+     + + NA+AN A        K  +  N  G+R G+   N+RG  G RGR +
Subjt:  SQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGSRGRGK

Query:  GRGYNYSNNRPPCQVCGK-------------GNVNGNYHPGRGNGQSPNAFMATQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGNERVTVG
                 RP CQ+CGK              +     H   G G S +AF+     A+P    D  WY DSGASNHVT     L +  E  G   + VG
Subjt:  GRGYNYSNNRPPCQVCGK-------------GNVNGNYHPGRGNGQSPNAFMATQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGNERVTVG

Query:  NGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSSNRLNTS
        NG+KL I   GS+ L D    +NL NVL VPEI KNL+S+SKL  DNN  VEF  ++C VKDK +G+ +LKG LKDGLYQL           S+N+    
Subjt:  NGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSSNRLNTS

Query:  VNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPAPVPSVE
          N    A+           +  K +WHR+LGHP+ KVL+ ++K  N+++  ++  +FCE+CQFGK H L F  S++ A    DL+HTD+WGPAP+ S  
Subjt:  VNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPAPVPSVE

Query:  GFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVETGLTLL
         F+YYV FLDD SRFTW++PLKQK++T+ AF  F+ +V+ QF   IK ++ D GGE+  V K     GI  + S PYTS QNGRAERKHRH+ E GLTLL
Subjt:  GFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVETGLTLL

Query:  AQASMP
        AQA MP
Subjt:  AQASMP

PNY01489.1 copia-like polyprotein, partial [Trifolium pratense]7.6e-12545.14Show/hide
Query:  ESSLVVNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVT
        + S  VNP ++ W+A DQ LLGWL NSM  ++ATQ++  E +K LW   Q L G  +++   +L+  F  TRKG +KM EYL  MKN +D L L+GSP++
Subjt:  ESSLVVNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVT

Query:  NRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGS
        N +L+ Q L GLD E+N VV  L  + ++SW ++QA+LL FE RL+  N      +FS   T N + N  + +    N  +  GN   +   N RG  G 
Subjt:  NRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGS

Query:  RGRGKGRGYNYSNNRPPCQVC---GKGNVNGNYHPGRGNGQSPNAFMATQQPATPETVADP------SWYADSGASNHVTSNYENLSNPTEYGGNERVTV
         GRGKGR    SN +  CQVC   G   V+ +Y   R       +  A +Q +    VA P       WY DSGASNHVT   +      E+ G   + V
Subjt:  RGRGKGRGYNYSNNRPPCQVC---GKGNVNGNYHPGRGNGQSPNAFMATQQPATPETVADP------SWYADSGASNHVTSNYENLSNPTEYGGNERVTV

Query:  GNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSSNRLNT
        GNG+KL I   GS+ L    + LNL +VL VP+I KNL+S+SKL  DNN+FVEF  + C VKDK +GQ +LKG LKDGLYQL DV      S  SN+   
Subjt:  GNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSSNRLNT

Query:  SVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPAPVPSV
                        P V M   K  WHR+LGHP+ KVL+ ++K CN+++  ++  SFCE+CQFGK H L F  S++       L+H+D+WGPAP+ S 
Subjt:  SVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPAPVPSV

Query:  EGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVETGLTL
         GF+YYV F+DD SRFTW++PLKQK+DT+ AF  F+ + + QF   IK +Q D GGE+  V K+  + GI  R S PYTS QNGRAERKHRH+VE GLTL
Subjt:  EGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVETGLTL

Query:  LAQASMP
        LAQA MP
Subjt:  LAQASMP

TrEMBL top hitse value%identityAlignment
A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment)3.4e-13144.12Show/hide
Query:  ESSLVVNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVT
        +SS   NP++E W A DQ LLGWL NSMT  +ATQ++  E +  LW   Q L G  +R++  +L+  F  TRKG +KM +YL  MKN AD L LAG+P++
Subjt:  ESSLVVNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVT

Query:  NRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGS
          +L+ Q L GLD E+N VV  L  + ++SW +LQA+LL FE R+E  N   S  + + NATAN+A        + ++  N  G+   +   N RG  G 
Subjt:  NRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGS

Query:  RGRGKGRGYNYSNNRPPCQVCGKGN--------------VNGNYHPGRGNGQSPNAFMATQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGN
        RGRG+       + +  CQVCG  N                 N+        S NAF+A+Q      ++ D  WY DSGASNHVT   +   N +E+ G 
Subjt:  RGRGKGRGYNYSNNRPPCQVCGKGN--------------VNGNYHPGRGNGQSPNAFMATQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGN

Query:  ERVTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSS
          + VGNG+KL I   GSS L      LNL ++L VP+I KNL+S+SKLA DNN+ VEF  + C VKDK +G+ +L+G LKDGLYQL + ++    S+  
Subjt:  ERVTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSS

Query:  NRLNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPA
                                     K  WHR+LGHP+ KVLD+++K CN+++  ++  SFCE+CQ+GK H L F  S + A    +LVHTD+WGPA
Subjt:  NRLNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPA

Query:  PVPSVEGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVE
        P+ S  GF+YYV F+DD +RFTW+YPLKQK+DT  AF  F+ MV+ QF   IK +Q D GGE+  V K   + GI  R S PYTS QNGRAERKHRH+ E
Subjt:  PVPSVEGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVE

Query:  TGLTLLAQASMP
         GLTLLAQA MP
Subjt:  TGLTLLAQASMP

A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)1.7e-12544.39Show/hide
Query:  VNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVTNRNLV
        +NP Y+ W A DQ LLGWL NSMT ++ATQV+  E +K LW   Q L G  +R+   +L+  F  T K  +KM +YL  MKN AD L LAGSP+++ +L+
Subjt:  VNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVTNRNLV

Query:  SQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGSRGRGK
         Q L GLD E+N VV  L  + ++SW + QA+LL FE RL+  N+     + + NA+AN A        K  +  N  G+R G+   N+RG  G RGR +
Subjt:  SQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGSRGRGK

Query:  GRGYNYSNNRPPCQVCGK-------------GNVNGNYHPGRGNGQSPNAFMATQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGNERVTVG
                 RP CQ+CGK              +     H   G G S +AF+     A+P    D  WY DSGASNHVT     L +  E  G   + VG
Subjt:  GRGYNYSNNRPPCQVCGK-------------GNVNGNYHPGRGNGQSPNAFMATQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGNERVTVG

Query:  NGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSSNRLNTS
        NG+KL I   GS+ L D    +NL NVL VPEI KNL+S+SKL  DNN  VEF  ++C VKDK +G+ +LKG LKDGLYQL           S+N+    
Subjt:  NGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSSNRLNTS

Query:  VNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPAPVPSVE
          N    A+           +  K +WHR+LGHP+ KVL+ ++K  N+++  ++  +FCE+CQFGK H L F  S++ A    DL+HTD+WGPAP+ S  
Subjt:  VNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPAPVPSVE

Query:  GFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVETGLTLL
         F+YYV FLDD SRFTW++PLKQK++T+ AF  F+ +V+ QF   IK ++ D GGE+  V K     GI  + S PYTS QNGRAERKHRH+ E GLTLL
Subjt:  GFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVETGLTLL

Query:  AQASMP
        AQA MP
Subjt:  AQASMP

A0A2K3NEN7 Copia-like polyprotein (Fragment)3.7e-12545.14Show/hide
Query:  ESSLVVNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVT
        + S  VNP ++ W+A DQ LLGWL NSM  ++ATQ++  E +K LW   Q L G  +++   +L+  F  TRKG +KM EYL  MKN +D L L+GSP++
Subjt:  ESSLVVNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVT

Query:  NRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGS
        N +L+ Q L GLD E+N VV  L  + ++SW ++QA+LL FE RL+  N      +FS   T N + N  + +    N  +  GN   +   N RG  G 
Subjt:  NRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGS

Query:  RGRGKGRGYNYSNNRPPCQVC---GKGNVNGNYHPGRGNGQSPNAFMATQQPATPETVADP------SWYADSGASNHVTSNYENLSNPTEYGGNERVTV
         GRGKGR    SN +  CQVC   G   V+ +Y   R       +  A +Q +    VA P       WY DSGASNHVT   +      E+ G   + V
Subjt:  RGRGKGRGYNYSNNRPPCQVC---GKGNVNGNYHPGRGNGQSPNAFMATQQPATPETVADP------SWYADSGASNHVTSNYENLSNPTEYGGNERVTV

Query:  GNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSSNRLNT
        GNG+KL I   GS+ L    + LNL +VL VP+I KNL+S+SKL  DNN+FVEF  + C VKDK +GQ +LKG LKDGLYQL DV      S  SN+   
Subjt:  GNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSSNRLNT

Query:  SVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPAPVPSV
                        P V M   K  WHR+LGHP+ KVL+ ++K CN+++  ++  SFCE+CQFGK H L F  S++       L+H+D+WGPAP+ S 
Subjt:  SVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPAPVPSV

Query:  EGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVETGLTL
         GF+YYV F+DD SRFTW++PLKQK+DT+ AF  F+ + + QF   IK +Q D GGE+  V K+  + GI  R S PYTS QNGRAERKHRH+VE GLTL
Subjt:  EGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVETGLTL

Query:  LAQASMP
        LAQA MP
Subjt:  LAQASMP

A0A2Z6MBG6 Integrase catalytic domain-containing protein3.1e-13245.42Show/hide
Query:  ESSLVVNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVT
        +SS   N  +  W A DQ LLGW+ NSMT E+ATQ++  E +K LW   Q L G  +R++  +L+  F   RKG +KM +YL  MKN  D L LAG+PV+
Subjt:  ESSLVVNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVT

Query:  NRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGS
          +L+ Q L GLD E+N VV  L  + ++SW +LQA+LL FE R+E  N   +  + + NATAN+A NR  +  K SN          +   N+RG  G 
Subjt:  NRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGS

Query:  RGRGKGRGYNYSNNRPPCQVCGKGN--------------VNGNYHPGRGNGQSPNAFMATQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGN
        RGRGK       + + PCQVCG  N                 N+  G     S NAF+A+Q      +V D  WY DSGASNHVT   E   + TE+ G 
Subjt:  RGRGKGRGYNYSNNRPPCQVCGKGN--------------VNGNYHPGRGNGQSPNAFMATQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGN

Query:  ERVTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSS
          + VGNG+KL+I   GSS L      LNL ++L VP I KNL+S+SKLA DNN+ VEF  + C VKDK +G+V+LKG LKDGLYQL           S 
Subjt:  ERVTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSS

Query:  NRLNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPA
         + N S                    V  K  WHRRLGHP+ KVLD +++ C ++V  ++  SFCE+CQ+GK H L F  S++ A    +LVHTD+WGPA
Subjt:  NRLNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPA

Query:  PVPSVEGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVE
        P+ +  GF+YYV F+DD SRFTW+YPLKQK++T+ AF  F+ + + QF   IK +Q D GGE+  V KL  + GI  R S PYTS QNGRAERKHRH+ E
Subjt:  PVPSVEGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVE

Query:  TGLTLLAQASMP
         GLTLLAQA MP
Subjt:  TGLTLLAQASMP

A0A2Z6P4D5 Integrase catalytic domain-containing protein5.3e-12444.12Show/hide
Query:  ESSLVVNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVT
        + S  VNP +  W+A DQ LLGWL NSM  ++ATQ++  E +K LW   Q L G  +++   +L+  F  TRKG +KM EYL  MKN +D L LAGSP++
Subjt:  ESSLVVNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVT

Query:  NRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGS
        N +L+ Q L GLD E+N VV  L  + ++SW ++QA+LL FE RL+  N      +FS   T N + N  + +    N  N  GN   +   N RG  G 
Subjt:  NRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGS

Query:  RGRGKGRGYNYSNNRPPCQVC-GKGNV-------------NGNYHPGRGNGQSPNAFMATQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGN
         GRGKGR    SN +  CQVC G G++               NY        S +AF+     A+P    D  WY DSGA+NHVT   +      E+ G 
Subjt:  RGRGKGRGYNYSNNRPPCQVC-GKGNV-------------NGNYHPGRGNGQSPNAFMATQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGN

Query:  ERVTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSS
          + VGNG+KL I   GS+ L +    LNL +VL VP+I KNL+S+SKL  DNN+ VEF  + C VKDK +GQ +LKG LKDGLYQL +    V  SV  
Subjt:  ERVTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSS

Query:  NRLNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPA
                                     K  WHR+LGHP+ KVLD ++K CN+++  ++  SFCE+CQFGK H L F  S++       L+H+D+WGPA
Subjt:  NRLNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPA

Query:  PVPSVEGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVE
        P+ S  GF+YYV F+DD SRFTW++PLKQK+DT+ AF  F+ + + QF   IK +Q D GGE+  V K+  + GI  R S PYTS QNGRAERKHRH+ E
Subjt:  PVPSVEGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVE

Query:  TGLTLLAQASMP
         GLTLLAQA MP
Subjt:  TGLTLLAQASMP

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.9e-1621.03Show/hide
Query:  AKDLWSTIQELFGVQSRAEEDFLRQTFQQTR-KGNLKMAEYLRVMKNHADNLGLAGSPVTNRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLV
        A+ +   +  ++  +S A +  LR+     +    + +  +  +       L  AG+ +   + +S +L+ L   ++ ++  ++  +  + +    +  +
Subjt:  AKDLWSTIQELFGVQSRAEEDFLRQTFQQTR-KGNLKMAEYLRVMKNHADNLGLAGSPVTNRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLV

Query:  FEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGSRGRGKGRGYNYSNNRPPCQVCGK-GNVNGN-YHPGR--GN
         ++ ++I+N H  T         N  V+  +N+ K +   N     +  + GN                  S  +  C  CG+ G++  + +H  R   N
Subjt:  FEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGSRGRGKGRGYNYSNNRPPCQVCGK-GNVNGN-YHPGR--GN

Query:  GQSPN------------AFMATQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGNERVTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPE
            N            AFM  +   T   + +  +  DSGAS+H+ ++    ++  E     ++ V    +         +     H + LE+VL   E
Subjt:  GQSPN------------AFMATQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGNERVTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPE

Query:  IAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSSNRLNTSVNNG-VKSAFVVSYVMPQVNMVESKN--VWHR
         A NL+S+ +L Q+  + +EF        DKS                          ++S N L    N+G + +  V+++    +N     N  +WH 
Subjt:  IAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSSNRLNTSVNNG-VKSAFVVSYVMPQVNMVESKN--VWHR

Query:  RLGHPSP-KVLDMIVKG--CNLQVKSNEVLS--FCESCQFGKSHALAF-PLSNNRAVHR-FDLVHTDLWGPAPVPSVEGFRYYVLFLDDNSRFTWLYPLK
        R GH S  K+L++  K    +  + +N  LS   CE C  GK   L F  L +   + R   +VH+D+ GP    +++   Y+V+F+D  + +   Y +K
Subjt:  RLGHPSP-KVLDMIVKG--CNLQVKSNEVLS--FCESCQFGKSHALAF-PLSNNRAVHR-FDLVHTDLWGPAPVPSVEGFRYYVLFLDDNSRFTWLYPLK

Query:  QKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEH--SRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVETGLTLLAQASM
         K+D  + F+ F    +  F   +  L  DNG E+  + + + C + GI    + P+T   NG +ER  R + E   T+++ A +
Subjt:  QKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEH--SRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVETGLTLLAQASM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.3e-3623.79Show/hide
Query:  DVSSRPESSLVVNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFL-RQTFQQTRKGNLKMAEYLRVMKNHADNLG
        DV S+   ++    + E W  +D+     +   ++ +V   +I  + A+ +W+ ++ L+  ++   + +L +Q +            +L V       L 
Subjt:  DVSSRPESSLVVNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFL-RQTFQQTRKGNLKMAEYLRVMKNHADNLG

Query:  LAGSPVTNRNLVSQVLLGLDEEF-NAVVAMLQGRASVSWSELQAELLVFEK-RLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYN
          G  +   +    +L  L   + N    +L G+ ++   ++ + LL+ EK R + +N  ++ ++            RG +  + SN          Y  
Subjt:  LAGSPVTNRNLVSQVLLGLDEEF-NAVVAMLQGRASVSWSELQAELLVFEK-RLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYN

Query:  GNARGGNGSRGRGKGRGYNYSNNRP-------PCQVCGKGNVNGNYHPGR-----GNGQSPNAFMATQQPATPETVADPSWYADSGASNHVTSNYENLSN
          ARG + +R + + R   Y+ N+P       P    GKG  +G  +         N  +   F+  ++     +  +  W  D+ AS+H T    +L  
Subjt:  GNARGGNGSRGRGKGRGYNYSNNRP-------PCQVCGKGNVNGNYHPGR-----GNGQSPNAFMATQQPATPETVADPSWYADSGASNHVTSNYENLSN

Query:  PTEYGGNERVTVGNGDKLSIKCVGSSILTDGIH-ILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNA
            G    V +GN     I  +G   +   +   L L++V  VP++  NL+S   L +D       +  + L K      V+ KG  +  LY       
Subjt:  PTEYGGNERVTVGNGDKLSIKCVGSSILTDGIH-ILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNA

Query:  RVVSSVSSNRLNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLV
        R  + +    LN + +                    S ++WH+R+GH S K L ++ K   +       +  C+ C FGK H ++F  S+ R ++  DLV
Subjt:  RVVSSVSSNRLNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLV

Query:  HTDLWGPAPVPSVEGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHS--RVHKLCQQLGIHSRFSYPYTSSQNGR
        ++D+ GP  + S+ G +Y+V F+DD SR  W+Y LK K+     F+ F  +V+ + G  +K L+SDNGGE++     + C   GI    + P T   NG 
Subjt:  HTDLWGPAPVPSVEGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHS--RVHKLCQQLGIHSRFSYPYTSSQNGR

Query:  AERKHRHLVETGLTLLAQASMP
        AER +R +VE   ++L  A +P
Subjt:  AERKHRHLVETGLTLLAQASMP

Q03494 Transposon Ty2-DR2 Gag-Pol polyprotein1.6e-1323.84Show/hide
Query:  VTNRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGN
        V++R     +L GL  +F  +    + + ++  S+L AE+ +     +I N +K +    H+   N++      SP   N  N     + Y+  N+    
Subjt:  VTNRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGN

Query:  GSRGRGKGRGYNYSNNRPPCQVCGKGNVNGNYHPGRGN---GQSPNAFMATQQPATPETVADPSWYADSGAS----------NHVTSNYE-NLSN-----
         ++         +S  R       +  V+  Y         GQ       T+   + + + D     DSGAS          +H T N E N+ +     
Subjt:  GSRGRGKGRGYNYSNNRPPCQVCGKGNVNGNYHPGRGN---GQSPNAFMATQQPATPETVADPSWYADSGAS----------NHVTSNYE-NLSN-----

Query:  -PTEYGGNERVTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKD--KSSGQVVLKGTLKDGLYQLQDV
         P    GN      NG K SIK                   L  P IA +L+S+S+LA  N          C  ++  + S   VL   +K G +     
Subjt:  -PTEYGGNERVTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKD--KSSGQVVLKGTLKDGLYQLQDV

Query:  NARVVSSVSSNRLNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPS-PKVLDMIVKGCNLQVK------SNEVLSFCESCQFGKS----HALAFP
           + S +S      ++NN  KS  V  Y  P +         HR LGH +   +   + K     +K      SN     C  C  GKS    H     
Subjt:  NARVVSSVSSNRLNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPS-PKVLDMIVKGCNLQVK------SNEVLSFCESCQFGKS----HALAFP

Query:  LSNNRAVHRFDLVHTDLWGPAPVPSVEGFRYYVLFLDDNSRFTWLYPL--KQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSR--VHKLCQQLGI
        L    +   F  +HTD++GP          Y++ F D+ +RF W+YPL  +++   L  F      +K QF   +  +Q D G E++   +HK     GI
Subjt:  LSNNRAVHRFDLVHTDLWGPAPVPSVEGFRYYVLFLDDNSRFTWLYPL--KQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSR--VHKLCQQLGI

Query:  HSRFSYPYTSSQNGRAERKHRHLVETGLTLLAQASMP
         + ++    S  +G AER +R L+    TLL  + +P
Subjt:  HSRFSYPYTSSQNGRAERKHRHLVETGLTLLAQASMP

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.2e-6929.76Show/hide
Query:  DQLVAHSAS---SNVSKIQKLPSTMTNVFSNTTNAMAAGATMFSSPPLNQLLNQITSIKLDPAEASSSRATANGDVSSRPESSLVVNPQYEVWVAVDQLL
        ++LV ++ S    N+S + KL ST   ++S   +A      +F    L   L+  T++           AT   D + R      VNP Y  W   D+L+
Subjt:  DQLVAHSAS---SNVSKIQKLPSTMTNVFSNTTNAMAAGATMFSSPPLNQLLNQITSIKLDPAEASSSRATANGDVSSRPESSLVVNPQYEVWVAVDQLL

Query:  LGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVTNRNLVSQVLLGLDEEFNAVV
           +  +++  V   V     A  +W T+++++   S      LR   +Q  KG   + +Y++ +    D L L G P+ +   V +VL  L EE+  V+
Subjt:  LGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVTNRNLVSQVLLGLDEEFNAVV

Query:  AMLQGR-ASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGSRGRGKGRGYNYSNNRP---
          +  +    + +E+   LL  E ++ +  S  + +  + NA ++      +N        N NGNR   Y+      N    +     ++ +NN+    
Subjt:  AMLQGR-ASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGSRGRGKGRGYNYSNNRP---

Query:  --PCQVCGKGNVNGNYHPGRGNGQSPNAFMATQQPATPETVADP-------------SWYADSGASNHVTSNYENLSNPTEYGGNERVTVGNGDKLSIKC
           CQ+CG   V G+        Q   + + +QQP +P T   P             +W  DSGA++H+TS++ NLS    Y G + V V +G  + I  
Subjt:  --PCQVCGKGNVNGNYHPGRGNGQSPNAFMATQQPATPETVADP-------------SWYADSGASNHVTSNYENLSNPTEYGGNERVTVGNGDKLSIKC

Query:  VGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSSNRLNTSVNNGVKSAF
         GS+ L+     LNL N+L VP I KNL+S+ +L   N V VEF      VKD ++G  +L+G  KD LY+    +++ VS  +S               
Subjt:  VGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSSNRLNTSVNNGVKSAF

Query:  VVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQV--KSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPAPVPSVEGFRYYVL
              P      S   WH RLGHP+P +L+ ++   +L V   S++ LS C  C   KS+ + F  S   +    + +++D+W  +P+ S + +RYYV+
Subjt:  VVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQV--KSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPAPVPSVEGFRYYVL

Query:  FLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVETGLTLLAQASMP
        F+D  +R+TWLYPLKQK+     F  F+ +++ +F   I    SDNGGE   + +   Q GI    S P+T   NG +ERKHRH+VETGLTLL+ AS+P
Subjt:  FLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVETGLTLLAQASMP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.5e-6230.72Show/hide
Query:  VNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVTNRNLV
        VNP Y  W   D+L+   +  +++  V   V     A  +W T+++++   S      LR                        D L L G P+ +   V
Subjt:  VNPQYEVWVAVDQLLLGWLYNSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVTNRNLV

Query:  SQVLLGLDEEFNAVVAMLQGR-ASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGSRGRG
         +VL  L +++  V+  +  +    S +E+   L+  E +L   NS +         TAN+  +R +N+ +  N    N N    YN N    N  +   
Subjt:  SQVLLGLDEEFNAVVAMLQGR-ASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGSRGRG

Query:  KGRGYNYSNNRPP------CQVCGKGNVNGNYHPGRGNGQSPNAFMATQQPATP------ETVADP----SWYADSGASNHVTSNYENLSNPTEYGGNER
         G   + S+NR P      CQ+C     +    P     QS      +  P TP        V  P    +W  DSGA++H+TS++ NLS    Y G + 
Subjt:  KGRGYNYSNNRPP------CQVCGKGNVNGNYHPGRGNGQSPNAFMATQQPATP------ETVADP----SWYADSGASNHVTSNYENLSNPTEYGGNER

Query:  VTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSSNR
        V + +G  + I   GS+ L      L+L  VL VP I KNL+S+ +L   N V VEF      VKD ++G  +L+G  KD LY+    +++ VS  +S  
Subjt:  VTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDKSSGQVVLKGTLKDGLYQLQDVNARVVSSVSSNR

Query:  LNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQV--KSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPA
                           P      S   WH RLGHPS  +L+ ++   +L V   S+++LS C  C   KSH + F  S   +    + +++D+W  +
Subjt:  LNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQV--KSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPA

Query:  PVPSVEGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVE
        P+ S++ +RYYV+F+D  +R+TWLYPLKQK+     F  F+++V+ +F   I  L SDNGGE   +     Q GI    S P+T   NG +ERKHRH+VE
Subjt:  PVPSVEGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNGRAERKHRHLVE

Query:  TGLTLLAQASMP
         GLTLL+ AS+P
Subjt:  TGLTLLAQASMP

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)4.0e-0728.45Show/hide
Query:  DVSSRPESSLVVNPQYEV-WVAVDQLLLGWLYNSMTP-EVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNL
        DV    + +L+     +V W   D ++   LY ++TP +     +    ++D+W  I+  F     A    L    +    G++++A+Y R MK  AD+L
Subjt:  DVSSRPESSLVVNPQYEV-WVAVDQLLLGWLYNSMTP-EVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNL

Query:  GLAGSPVTNRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNG
             PVT+RNLV  VL GL+ +F+ ++ +++ R      +  A +L  E+    +    +     H++++   V   S +P  +N     GN+ G Y G
Subjt:  GLAGSPVTNRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNG

Query:  NARGGNGSRGRGKGRGYNYSN-------NRPP
          RG N  RGRG GR ++Y N       NRPP
Subjt:  NARGGNGSRGRGKGRGYNYSN-------NRPP

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.1e-0727.21Show/hide
Query:  WVAVDQLLLGWLYNSMTPEVATQVIGVE-NAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVTNRNLVSQVLLG
        W   D L+  W+Y ++T  +   +I V   A+DLW +++ LF     A         + T   +L + EY + +K+ +D L    SP+++R LV  +L G
Subjt:  WVAVDQLLLGWLYNSMTPEVATQVIGVE-NAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVTNRNLVSQVLLG

Query:  LDEEFNAVVAMLQGRASV-SWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSN-----PANGNGNRQGYYNGNARGGNG-----S
        L E+++ ++ +++ ++   S++E ++ LL+ E RL    S+KS  S SH           +N P  SN     P       Q Y+N N+  G G     +
Subjt:  LDEEFNAVVAMLQGRASV-SWSELQAELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSN-----PANGNGNRQGYYNGNARGGNG-----S

Query:  RGRGKGRG-YNYSN----NRPPCQVCGKGNVNGNY--------HPGRGNGQSPNAFMA-TQQPATPETVADP
        RG G   G YN +N    N+PP  + G       Y        H      Q P  +M+ T    +P ++ +P
Subjt:  RGRGKGRG-YNYSN----NRPPCQVCGKGNVNGNY--------HPGRGNGQSPNAFMA-TQQPATPETVADP

ATMG00300.1 Gag-Pol-related retrotransposon family protein2.3e-1035.53Show/hide
Query:  ESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPAPVP
        +   +WH RL H S + ++++VK   L       L FCE C +GK+H + F    +   +  D VH+DLWG   VP
Subjt:  ESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFPLSNNRAVHRFDLVHTDLWGPAPVP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCAATTGGGATTGGGGAACAACGAAGAGAGGGAATGGGTTTGCGCTATTAAAGCCCTTGAGTCTCAGAAAGTCTGTTCTATTGTTGCTGGTAGCCGAAACTCCCTC
GCCATATGCGAAGATGGCAAGCTGTTTACGTGGGGTTGGAATCAGAGAGGAACTCTTGGGCACCCAGCAGAGACCAAAGCTGAGAACATTCCGAGCCAGGTCAAAGCTCT
TGCTAATGTTAAGATCGTACAGCTATTGGTGGGTGGCACTTATTAAGACTAGCAGTCAAGAACTAGTCAAAGGTCAAGGTGCCAAATCAGAAGGAAAGGTTCAGTCAACA
GCCCAATTGGCAGATGAGTCAACAGACCAGTTAGTAGCCCATTCAGCATCCTCGAACGTCAGCAAAATCCAGAAGCTTCCTAGCACTATGACCAACGTCTTCTCAAACAC
AACAAATGCCATGGCCGCCGGAGCTACAATGTTCAGTAGCCCTCCTCTAAACCAGCTACTCAATCAAATAACCTCCATCAAGCTTGATCCGGCAGAAGCTTCGAGCTCAA
GAGCGACTGCAAACGGCGATGTGTCCTCTCGACCCGAGTCCAGTTTGGTGGTAAATCCCCAATATGAAGTCTGGGTAGCTGTTGATCAGCTACTACTAGGCTGGTTATAC
AACTCAATGACTCCAGAGGTTGCAACCCAAGTGATAGGAGTCGAGAATGCCAAGGATCTTTGGTCAACCATTCAAGAATTATTTGGGGTTCAATCACGAGCCGAGGAAGA
CTTCTTACGCCAAACCTTTCAACAAACCAGAAAAGGTAACTTAAAAATGGCTGAGTATTTAAGAGTTATGAAAAATCATGCTGATAACCTGGGATTAGCTGGTAGTCCTG
TTACTAATAGAAATTTAGTGTCTCAAGTGTTATTGGGACTTGATGAAGAATTTAATGCTGTTGTTGCAATGTTACAAGGAAGAGCTAGTGTTTCATGGTCTGAGTTGCAG
GCAGAACTTTTAGTCTTTGAAAAGAGGTTGGAAATTCAAAACTCTCACAAGAGTACAGTGAGTTTTAGCCACAATGCCACAGCAAATATGGCAGTCAACAGAGGCAGTAA
CTCTCCAAAGCCGTCGAACCCTGCAAATGGGAATGGAAATCGCCAAGGGTATTACAATGGAAACGCGCGTGGTGGAAATGGAAGTCGTGGTCGAGGCAAGGGACGCGGTT
ACAACTATTCTAACAACCGTCCCCCTTGCCAAGTCTGTGGAAAGGGCAATGTGAATGGAAATTATCATCCAGGGAGAGGAAATGGGCAGTCACCAAACGCCTTTATGGCT
ACACAGCAGCCTGCAACACCTGAAACCGTTGCTGATCCAAGTTGGTATGCCGACAGTGGGGCATCTAATCATGTGACCAGCAACTATGAAAATCTCTCCAATCCTACAGA
ATATGGAGGTAATGAGCGTGTCACAGTAGGTAATGGTGATAAATTGTCCATTAAGTGTGTTGGTTCATCTATTCTAACTGATGGGATTCATATTCTAAATCTTGAAAATG
TCTTGTGTGTACCCGAAATAGCCAAGAACCTAGTGAGTATGTCAAAACTAGCTCAAGATAATAATGTGTTCGTTGAATTTCATGGAGATTTTTGTCTTGTTAAGGACAAG
AGTTCGGGCCAAGTGGTGCTGAAAGGAACACTTAAGGACGGGCTTTATCAACTACAAGATGTGAATGCAAGAGTTGTGTCTTCTGTTTCCAGCAATCGTCTGAACACTTC
TGTCAATAATGGTGTTAAATCTGCGTTTGTTGTTTCTTATGTCATGCCTCAAGTTAATATGGTTGAGTCTAAAAACGTTTGGCATAGGAGGTTGGGACATCCCTCTCCTA
AAGTTTTGGATATGATAGTTAAAGGTTGTAATCTTCAAGTGAAGTCTAATGAAGTGTTATCATTTTGTGAGTCATGTCAATTTGGAAAGTCCCATGCTCTTGCCTTTCCA
TTGTCTAACAATCGTGCTGTTCATCGCTTTGATCTTGTTCACACTGACTTGTGGGGTCCTGCACCAGTCCCATCAGTTGAAGGCTTTCGATATTATGTGTTATTTCTTGA
TGATAATAGTAGGTTCACGTGGCTCTATCCTCTAAAACAAAAGAATGACACACTGGCTGCATTTGAACATTTTCGCACGATGGTAAAAACTCAGTTTGGATGTGTAATTA
AAGCTCTACAATCAGATAATGGTGGAGAGCACTCTCGTGTACACAAACTATGTCAGCAGCTTGGAATACATTCACGATTTTCCTACCCCTATACTTCGTCTCAGAATGGA
AGAGCTGAGCGAAAACACCGCCATTTGGTTGAAACAGGGCTTACCTTACTAGCCCAAGCCTCTATGCCCTTTGTTACTGGTGGGACGCATTTGTTACAGCAACTCACTTG
A
mRNA sequenceShow/hide mRNA sequence
ATGGGCAATTGGGATTGGGGAACAACGAAGAGAGGGAATGGGTTTGCGCTATTAAAGCCCTTGAGTCTCAGAAAGTCTGTTCTATTGTTGCTGGTAGCCGAAACTCCCTC
GCCATATGCGAAGATGGCAAGCTGTTTACGTGGGGTTGGAATCAGAGAGGAACTCTTGGGCACCCAGCAGAGACCAAAGCTGAGAACATTCCGAGCCAGGTCAAAGCTCT
TGCTAATGTTAAGATCGTACAGCTATTGGTGGGTGGCACTTATTAAGACTAGCAGTCAAGAACTAGTCAAAGGTCAAGGTGCCAAATCAGAAGGAAAGGTTCAGTCAACA
GCCCAATTGGCAGATGAGTCAACAGACCAGTTAGTAGCCCATTCAGCATCCTCGAACGTCAGCAAAATCCAGAAGCTTCCTAGCACTATGACCAACGTCTTCTCAAACAC
AACAAATGCCATGGCCGCCGGAGCTACAATGTTCAGTAGCCCTCCTCTAAACCAGCTACTCAATCAAATAACCTCCATCAAGCTTGATCCGGCAGAAGCTTCGAGCTCAA
GAGCGACTGCAAACGGCGATGTGTCCTCTCGACCCGAGTCCAGTTTGGTGGTAAATCCCCAATATGAAGTCTGGGTAGCTGTTGATCAGCTACTACTAGGCTGGTTATAC
AACTCAATGACTCCAGAGGTTGCAACCCAAGTGATAGGAGTCGAGAATGCCAAGGATCTTTGGTCAACCATTCAAGAATTATTTGGGGTTCAATCACGAGCCGAGGAAGA
CTTCTTACGCCAAACCTTTCAACAAACCAGAAAAGGTAACTTAAAAATGGCTGAGTATTTAAGAGTTATGAAAAATCATGCTGATAACCTGGGATTAGCTGGTAGTCCTG
TTACTAATAGAAATTTAGTGTCTCAAGTGTTATTGGGACTTGATGAAGAATTTAATGCTGTTGTTGCAATGTTACAAGGAAGAGCTAGTGTTTCATGGTCTGAGTTGCAG
GCAGAACTTTTAGTCTTTGAAAAGAGGTTGGAAATTCAAAACTCTCACAAGAGTACAGTGAGTTTTAGCCACAATGCCACAGCAAATATGGCAGTCAACAGAGGCAGTAA
CTCTCCAAAGCCGTCGAACCCTGCAAATGGGAATGGAAATCGCCAAGGGTATTACAATGGAAACGCGCGTGGTGGAAATGGAAGTCGTGGTCGAGGCAAGGGACGCGGTT
ACAACTATTCTAACAACCGTCCCCCTTGCCAAGTCTGTGGAAAGGGCAATGTGAATGGAAATTATCATCCAGGGAGAGGAAATGGGCAGTCACCAAACGCCTTTATGGCT
ACACAGCAGCCTGCAACACCTGAAACCGTTGCTGATCCAAGTTGGTATGCCGACAGTGGGGCATCTAATCATGTGACCAGCAACTATGAAAATCTCTCCAATCCTACAGA
ATATGGAGGTAATGAGCGTGTCACAGTAGGTAATGGTGATAAATTGTCCATTAAGTGTGTTGGTTCATCTATTCTAACTGATGGGATTCATATTCTAAATCTTGAAAATG
TCTTGTGTGTACCCGAAATAGCCAAGAACCTAGTGAGTATGTCAAAACTAGCTCAAGATAATAATGTGTTCGTTGAATTTCATGGAGATTTTTGTCTTGTTAAGGACAAG
AGTTCGGGCCAAGTGGTGCTGAAAGGAACACTTAAGGACGGGCTTTATCAACTACAAGATGTGAATGCAAGAGTTGTGTCTTCTGTTTCCAGCAATCGTCTGAACACTTC
TGTCAATAATGGTGTTAAATCTGCGTTTGTTGTTTCTTATGTCATGCCTCAAGTTAATATGGTTGAGTCTAAAAACGTTTGGCATAGGAGGTTGGGACATCCCTCTCCTA
AAGTTTTGGATATGATAGTTAAAGGTTGTAATCTTCAAGTGAAGTCTAATGAAGTGTTATCATTTTGTGAGTCATGTCAATTTGGAAAGTCCCATGCTCTTGCCTTTCCA
TTGTCTAACAATCGTGCTGTTCATCGCTTTGATCTTGTTCACACTGACTTGTGGGGTCCTGCACCAGTCCCATCAGTTGAAGGCTTTCGATATTATGTGTTATTTCTTGA
TGATAATAGTAGGTTCACGTGGCTCTATCCTCTAAAACAAAAGAATGACACACTGGCTGCATTTGAACATTTTCGCACGATGGTAAAAACTCAGTTTGGATGTGTAATTA
AAGCTCTACAATCAGATAATGGTGGAGAGCACTCTCGTGTACACAAACTATGTCAGCAGCTTGGAATACATTCACGATTTTCCTACCCCTATACTTCGTCTCAGAATGGA
AGAGCTGAGCGAAAACACCGCCATTTGGTTGAAACAGGGCTTACCTTACTAGCCCAAGCCTCTATGCCCTTTGTTACTGGTGGGACGCATTTGTTACAGCAACTCACTTG
A
Protein sequenceShow/hide protein sequence
MGNWDWGTTKRGNGFALLKPLSLRKSVLLLLVAETPSPYAKMASCLRGVGIREELLGTQQRPKLRTFRARSKLLLMLRSYSYWWVALIKTSSQELVKGQGAKSEGKVQST
AQLADESTDQLVAHSASSNVSKIQKLPSTMTNVFSNTTNAMAAGATMFSSPPLNQLLNQITSIKLDPAEASSSRATANGDVSSRPESSLVVNPQYEVWVAVDQLLLGWLY
NSMTPEVATQVIGVENAKDLWSTIQELFGVQSRAEEDFLRQTFQQTRKGNLKMAEYLRVMKNHADNLGLAGSPVTNRNLVSQVLLGLDEEFNAVVAMLQGRASVSWSELQ
AELLVFEKRLEIQNSHKSTVSFSHNATANMAVNRGSNSPKPSNPANGNGNRQGYYNGNARGGNGSRGRGKGRGYNYSNNRPPCQVCGKGNVNGNYHPGRGNGQSPNAFMA
TQQPATPETVADPSWYADSGASNHVTSNYENLSNPTEYGGNERVTVGNGDKLSIKCVGSSILTDGIHILNLENVLCVPEIAKNLVSMSKLAQDNNVFVEFHGDFCLVKDK
SSGQVVLKGTLKDGLYQLQDVNARVVSSVSSNRLNTSVNNGVKSAFVVSYVMPQVNMVESKNVWHRRLGHPSPKVLDMIVKGCNLQVKSNEVLSFCESCQFGKSHALAFP
LSNNRAVHRFDLVHTDLWGPAPVPSVEGFRYYVLFLDDNSRFTWLYPLKQKNDTLAAFEHFRTMVKTQFGCVIKALQSDNGGEHSRVHKLCQQLGIHSRFSYPYTSSQNG
RAERKHRHLVETGLTLLAQASMPFVTGGTHLLQQLT