; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019224 (gene) of Snake gourd v1 genome

Gene IDTan0019224
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG09:30591624..30593094
RNA-Seq ExpressionTan0019224
SyntenyTan0019224
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-16061.51Show/hide
Query:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN
        M+S+ + +L  ++L G N+ +WK+ ++ +L++DDLRF+L EECPQ+ A +  ++V++ ++RWAKANEKA+AYI+A +S+VLAK+ E M++AREIM  LQ 
Subjt:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN

Query:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG
        +  Q   QI+H+ +KY+YNARM EG SVREHVL+++VHFNVA+MNGAVIDE  QV FILESL +SFL FRSN  +NK  Y LT+LLNELQTF+SLMK KG
Subjt:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG

Query:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET
        Q  GEAN+   +R+F +GS+SGTKS   SSG KK +KKK G   KA LAA K  KKAK A KG CFHCN +GHWKRN PKYLAE K+ K+GK DL +LET
Subjt:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET

Query:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA
        CLVENDD  WI+DS ATNHVCSSFQ  SS+++LE GEMT+RV TG VVSA AVG  +L     FLLL+N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ 
Subjt:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA

Query:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC
        FI K GV ICSAKL N  Y+L+   SKA+LN EM K A TQNKR +ISP  N +LWHLRLGHINLN+IERL KN LL++LE++SLP CESC
Subjt:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]8.9e-16161.51Show/hide
Query:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN
        M+S+ + +L  ++L G N+ +WK+ ++ +L++DDLRF+L EECPQ+ A +  ++V++ ++RWAKANEKA+AYI+A +S+VLAK+ E M++AREIM  LQ 
Subjt:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN

Query:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG
        +  Q   QI+H+ +KY+YNARM EG SVREHVL+++VHFNVA+MNGAVIDE  QV FILESL +SFL FRSN  +NK  Y LT+LLNELQTF+SLMK KG
Subjt:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG

Query:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET
        Q  GEAN+   +R+F +GS+SGTKS   SSG KK +KKK G   KA LAA K  KKAK A KG CFHCN +GHWKRN PKYLAE K+ K+GK DL +LET
Subjt:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET

Query:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA
        CLVENDD  WI+DS ATNHVCSSFQ  SS+++LE GEMT+RV TG VVSA AVG  +L+    FLLL+N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ 
Subjt:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA

Query:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC
        FI K GV ICSAKL N  Y+L+   SKA+LN EM K A TQNKR +ISP  N +LWHLRLGHINLN+IERL KN LL++LE++SLP CESC
Subjt:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-16061.51Show/hide
Query:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN
        M+S+ + +L  ++L G N+ +WK+ ++ +L++DDLRF+L EECPQ+ A +  ++V++ ++RWAKANEKA+AYI+A +S+VLAK+ E M++AREIM  LQ 
Subjt:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN

Query:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG
        +  Q   QI+H+ +KY+YNARM EG SVREHVL+++VHFNVA+MNGAVIDE  QV FILESL +SFL FRSN  +NK  Y LT+LLNELQTF+SLMK KG
Subjt:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG

Query:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET
        Q  GEAN+   +R+F +GS+SGTKS   SSG KK +KKK G   KA LAA K  KKAK A KG CFHCN +GHWKRN PKYLAE K+ K+GK DL +LET
Subjt:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET

Query:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA
        CLVENDD  WI+DS ATNHVCSSFQ  SS+++LE GEMT+RV TG VVSA AVG  +L     FLLL+N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ 
Subjt:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA

Query:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC
        FI K GV ICSAKL N  Y+L+   SKA+LN EM K A TQNKR +ISP  N +LWHLRLGHINLN+IERL KN LL++LE++SLP CESC
Subjt:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-16061.51Show/hide
Query:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN
        M+S+ + +L  ++L G N+ +WK+ ++ +L++DDLRF+L EECPQ+ A +  ++V++ ++RWAKANEKA+AYI+A +S+VLAK+ E M++AREIM  LQ 
Subjt:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN

Query:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG
        +  Q   QI+H+ +KY+YNARM EG SVREHVL+++VHFNVA+MNGAVIDE  QV FILESL +SFL FRSN  +NK  Y LT+LLNELQTF+SLMK KG
Subjt:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG

Query:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET
        Q  GEAN+   +R+F +GS+SGTKS   SSG KK +KKK G   KA LAA K  KKAK A KG CFHCN +GHWKRN PKYLAE K+ K+GK DL +LET
Subjt:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET

Query:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA
        CLVENDD  WI+DS ATNHVCSSFQ  SS+++LE GEMT+RV TG VVSA AVG  +L     FLLL+N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ 
Subjt:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA

Query:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC
        FI K GV ICSAKL N  Y+L+   SKA+LN EM K A TQNKR +ISP  N +LWHLRLGHINLN+IERL KN LL++LE++SLP CESC
Subjt:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]5.8e-16061.3Show/hide
Query:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN
        M+S+ + +L  ++L G N+ +WK+ ++ +L++DDLRF+L EECPQ+ A +  ++V++ ++RWAKANEKA+AYI+A +S+VLAK+ E M++AREIM  LQ 
Subjt:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN

Query:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG
        +  Q   QI+H+ +KY+YNARM EG SVREHVL+++VHFNVA+MNGAVIDE  QV FILESL +SFL FRSN  +NK  Y LT+LLNELQTF+SLMK KG
Subjt:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG

Query:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET
        Q  GEAN+   +R+F +GS+SGTKS   SSG KK +KKK G   KA LAA K  KK K A KG CFHCN +GHWKRN PKYLAE K+ K+GK DL +LET
Subjt:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET

Query:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA
        CLVENDD  WI+DS ATNHVCSSFQ  SS+++LE GEMT+RV TG VVSA AVG  +L     FLLL+N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ 
Subjt:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA

Query:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC
        FI K GV ICSAKL N  Y+L+   SKA+LN EM K A TQNKR +ISP  N +LWHLRLGHINLN+IERL KN LL++LE++SLP CESC
Subjt:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein9.6e-16161.51Show/hide
Query:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN
        M+S+ + +L  ++L G N+ +WK+ ++ +L++DDLRF+L EECPQ+ A +  ++V++ ++RWAKANEKA+AYI+A +S+VLAK+ E M++AREIM  LQ 
Subjt:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN

Query:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG
        +  Q   QI+H+ +KY+YNARM EG SVREHVL+++VHFNVA+MNGAVIDE  QV FILESL +SFL FRSN  +NK  Y LT+LLNELQTF+SLMK KG
Subjt:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG

Query:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET
        Q  GEAN+   +R+F +GS+SGTKS   SSG KK +KKK G   KA LAA K  KKAK A KG CFHCN +GHWKRN PKYLAE K+ K+GK DL +LET
Subjt:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET

Query:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA
        CLVENDD  WI+DS ATNHVCSSFQ  SS+++LE GEMT+RV TG VVSA AVG  +L     FLLL+N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ 
Subjt:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA

Query:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC
        FI K GV ICSAKL N  Y+L+   SKA+LN EM K A TQNKR +ISP  N +LWHLRLGHINLN+IERL KN LL++LE++SLP CESC
Subjt:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC

A0A5A7TU93 Gag/pol protein4.3e-16161.51Show/hide
Query:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN
        M+S+ + +L  ++L G N+ +WK+ ++ +L++DDLRF+L EECPQ+ A +  ++V++ ++RWAKANEKA+AYI+A +S+VLAK+ E M++AREIM  LQ 
Subjt:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN

Query:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG
        +  Q   QI+H+ +KY+YNARM EG SVREHVL+++VHFNVA+MNGAVIDE  QV FILESL +SFL FRSN  +NK  Y LT+LLNELQTF+SLMK KG
Subjt:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG

Query:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET
        Q  GEAN+   +R+F +GS+SGTKS   SSG KK +KKK G   KA LAA K  KKAK A KG CFHCN +GHWKRN PKYLAE K+ K+GK DL +LET
Subjt:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET

Query:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA
        CLVENDD  WI+DS ATNHVCSSFQ  SS+++LE GEMT+RV TG VVSA AVG  +L+    FLLL+N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ 
Subjt:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA

Query:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC
        FI K GV ICSAKL N  Y+L+   SKA+LN EM K A TQNKR +ISP  N +LWHLRLGHINLN+IERL KN LL++LE++SLP CESC
Subjt:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC

A0A5A7TWB9 Gag/pol protein9.6e-16161.51Show/hide
Query:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN
        M+S+ + +L  ++L G N+ +WK+ ++ +L++DDLRF+L EECPQ+ A +  ++V++ ++RWAKANEKA+AYI+A +S+VLAK+ E M++AREIM  LQ 
Subjt:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN

Query:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG
        +  Q   QI+H+ +KY+YNARM EG SVREHVL+++VHFNVA+MNGAVIDE  QV FILESL +SFL FRSN  +NK  Y LT+LLNELQTF+SLMK KG
Subjt:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG

Query:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET
        Q  GEAN+   +R+F +GS+SGTKS   SSG KK +KKK G   KA LAA K  KKAK A KG CFHCN +GHWKRN PKYLAE K+ K+GK DL +LET
Subjt:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET

Query:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA
        CLVENDD  WI+DS ATNHVCSSFQ  SS+++LE GEMT+RV TG VVSA AVG  +L     FLLL+N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ 
Subjt:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA

Query:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC
        FI K GV ICSAKL N  Y+L+   SKA+LN EM K A TQNKR +ISP  N +LWHLRLGHINLN+IERL KN LL++LE++SLP CESC
Subjt:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC

A0A5A7UGV2 Gag/pol protein9.6e-16161.51Show/hide
Query:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN
        M+S+ + +L  ++L G N+ +WK+ ++ +L++DDLRF+L EECPQ+ A +  ++V++ ++RWAKANEKA+AYI+A +S+VLAK+ E M++AREIM  LQ 
Subjt:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN

Query:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG
        +  Q   QI+H+ +KY+YNARM EG SVREHVL+++VHFNVA+MNGAVIDE  QV FILESL +SFL FRSN  +NK  Y LT+LLNELQTF+SLMK KG
Subjt:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG

Query:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET
        Q  GEAN+   +R+F +GS+SGTKS   SSG KK +KKK G   KA LAA K  KKAK A KG CFHCN +GHWKRN PKYLAE K+ K+GK DL +LET
Subjt:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET

Query:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA
        CLVENDD  WI+DS ATNHVCSSFQ  SS+++LE GEMT+RV TG VVSA AVG  +L     FLLL+N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ 
Subjt:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA

Query:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC
        FI K GV ICSAKL N  Y+L+   SKA+LN EM K A TQNKR +ISP  N +LWHLRLGHINLN+IERL KN LL++LE++SLP CESC
Subjt:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC

A0A5D3CPJ6 Gag/pol protein2.8e-16061.3Show/hide
Query:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN
        M+S+ + +L  ++L G N+ +WK+ ++ +L++DDLRF+L EECPQ+ A +  ++V++ ++RWAKANEKA+AYI+A +S+VLAK+ E M++AREIM  LQ 
Subjt:  MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQN

Query:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG
        +  Q   QI+H+ +KY+YNARM EG SVREHVL+++VHFNVA+MNGAVIDE  QV FILESL +SFL FRSN  +NK  Y LT+LLNELQTF+SLMK KG
Subjt:  LSRQPFEQIRHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKG

Query:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET
        Q  GEAN+   +R+F +GS+SGTKS   SSG KK +KKK G   KA LAA K  KK K A KG CFHCN +GHWKRN PKYLAE K+ K+GK DL +LET
Subjt:  QADGEANLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKA-LAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKE-KKGKLDLHILET

Query:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA
        CLVENDD  WI+DS ATNHVCSSFQ  SS+++LE GEMT+RV TG VVSA AVG  +L     FLLL+N+Y+VP LKRNLIS+ CL+E  Y ++F++N+ 
Subjt:  CLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEA

Query:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC
        FI K GV ICSAKL N  Y+L+   SKA+LN EM K A TQNKR +ISP  N +LWHLRLGHINLN+IERL KN LL++LE++SLP CESC
Subjt:  FILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC

SwissProt top hitse value%identityAlignment
Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.5e-0520.52Show/hide
Query:  RLTGENFTTWKSNLDAILVVDDLRFIL---TEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQNLSRQPFEQI
        +LT  N+  W   + A+    +L   L   T   P  +  D    V   + RW + ++   + I+  IS  +   +    +A +I   L+ +   P    
Subjt:  RLTGENFTTWKSNLDAILVVDDLRFIL---TEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQNLSRQPFEQI

Query:  RHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKGQADG---EA
              Y +  +++            I  F+   + G  +D   QV  +LE+L   + P    +       +LT +   L   +S +     A+     A
Subjt:  RHECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKGQADG---EA

Query:  NLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKALAADKGKKKAKVADKGKCFHCNVDGHWKRNFP---KYLAELKEKKG---------KLDLH
        N+  H R      +   +    +      +            +D  + K  +   G+C  C+V GH  +  P   ++ +   +++          + +L 
Subjt:  NLFAHSRRFQKGSSSGTKSCGLSSGLKKTQKKKIGGKGKALAADKGKKKAKVADKGKCFHCNVDGHWKRNFP---KYLAELKEKKG---------KLDLH

Query:  ILETCLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELG------
        +       N    W+LDS AT+H+ S F    SF +   G   + +  G  +     G A L    R L L  +  VP + +NLIS+  L          
Subjt:  ILETCLVENDDLTWILDSKATNHVCSSFQETSSFKELEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELG------

Query:  YCISFSINEAFILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLE-DDSLPPCE
        +  SF + +   L  GV +   K  +  Y      S+A+        AS  +K    S       WH RLGH +L  +  +  N  L  L     L  C 
Subjt:  YCISFSINEAFILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQNKRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLE-DDSLPPCE

Query:  SC
         C
Subjt:  SC

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCATATATAGCCTTACTAAAAAAGGAACGTTTAACTGGTGAAAACTTTACTACGTGGAAGTCAAACCTGGATGCGATCCTTGTTGTTGACGACCTTCGGTT
CATACTAACTGAGGAATGTCCTCAGATTCTTGCTCAAGACACACCTCAATCAGTTAAGGATGCGTTTGACCGTTGGGCCAAGGCCAATGAAAAAGCCAAGGCCTATATCA
TGGCTGGGATATCTGATGTTTTAGCCAAGAGATTGGAGGGTATGGTCTCAGCTCGTGAGATCATGGGCTATTTGCAAAATTTGTCCAGACAACCGTTTGAACAAATTCGA
CACGAATGCATCAAGTACGTTTATAACGCGCGCATGAAGGAGGGAACTTCAGTGAGAGAACATGTTCTCGATCTTATAGTTCACTTCAACGTGGCCAAAATGAACGGAGC
GGTCATAGATGAGCAAATTCAGGTGTTCTTCATCTTGGAATCTCTTTCGAAGAGTTTCCTGCCATTCCGTAGCAATGTGGAGTTGAACAAAAAGGTGTATAACCTGACTT
CCCTCCTAAATGAACTACAGACTTTCCAGTCTCTTATGAAGAATAAGGGACAGGCTGATGGAGAGGCAAATCTGTTTGCCCATTCCAGAAGGTTCCAGAAGGGTTCATCC
TCTGGGACTAAGTCATGTGGTTTATCTTCTGGGCTTAAGAAGACCCAAAAGAAGAAAATAGGAGGGAAAGGGAAGGCACTTGCTGCTGACAAAGGCAAGAAAAAAGCCAA
GGTTGCAGATAAAGGAAAATGTTTCCACTGCAACGTGGATGGGCATTGGAAGCGAAACTTCCCAAAATACCTTGCTGAGCTCAAAGAGAAGAAAGGTAAATTAGATTTAC
ATATTCTTGAAACTTGTTTAGTGGAAAATGATGATTTGACCTGGATACTTGATTCAAAAGCCACTAATCACGTTTGCTCTTCATTTCAGGAAACTAGTTCCTTCAAGGAG
CTCGAAGTGGGTGAGATGACGCTCAGGGTCAGAACGGGAGACGTCGTCTCAGCTCGTGCAGTGGGAGATGCCAAGCTGTTTTTTGGAGATAGATTTCTTTTATTAAAAAA
TCTGTATATAGTTCCTAAGCTTAAAAGGAACTTAATTTCTATCTCTTGTTTAATAGAACTTGGATATTGTATTTCTTTTTCAATTAATGAAGCGTTCATTTTGAAAGAAG
GTGTCAATATTTGTTCTGCTAAATTGAATAATGGCTTCTACATACTGAAACCGAATAAATCCAAAGCAATTTTAAATCATGAGATGCTTAAAAATGCTAGTACTCAAAAC
AAAAGGCAAAGAATTTCTCCAAACAATAATACCTATCTATGGCATCTTAGACTTGGTCATATTAATCTCAACCAGATTGAGAGACTCTCTAAGAATTTACTTCTAAATAA
GTTAGAAGATGATTCTTTACCTCCTTGTGAATCATGCTAG
mRNA sequenceShow/hide mRNA sequence
CATGTCGTCCTCATATATAGCCTTACTAAAAAAGGAACGTTTAACTGGTGAAAACTTTACTACGTGGAAGTCAAACCTGGATGCGATCCTTGTTGTTGACGACCTTCGGT
TCATACTAACTGAGGAATGTCCTCAGATTCTTGCTCAAGACACACCTCAATCAGTTAAGGATGCGTTTGACCGTTGGGCCAAGGCCAATGAAAAAGCCAAGGCCTATATC
ATGGCTGGGATATCTGATGTTTTAGCCAAGAGATTGGAGGGTATGGTCTCAGCTCGTGAGATCATGGGCTATTTGCAAAATTTGTCCAGACAACCGTTTGAACAAATTCG
ACACGAATGCATCAAGTACGTTTATAACGCGCGCATGAAGGAGGGAACTTCAGTGAGAGAACATGTTCTCGATCTTATAGTTCACTTCAACGTGGCCAAAATGAACGGAG
CGGTCATAGATGAGCAAATTCAGGTGTTCTTCATCTTGGAATCTCTTTCGAAGAGTTTCCTGCCATTCCGTAGCAATGTGGAGTTGAACAAAAAGGTGTATAACCTGACT
TCCCTCCTAAATGAACTACAGACTTTCCAGTCTCTTATGAAGAATAAGGGACAGGCTGATGGAGAGGCAAATCTGTTTGCCCATTCCAGAAGGTTCCAGAAGGGTTCATC
CTCTGGGACTAAGTCATGTGGTTTATCTTCTGGGCTTAAGAAGACCCAAAAGAAGAAAATAGGAGGGAAAGGGAAGGCACTTGCTGCTGACAAAGGCAAGAAAAAAGCCA
AGGTTGCAGATAAAGGAAAATGTTTCCACTGCAACGTGGATGGGCATTGGAAGCGAAACTTCCCAAAATACCTTGCTGAGCTCAAAGAGAAGAAAGGTAAATTAGATTTA
CATATTCTTGAAACTTGTTTAGTGGAAAATGATGATTTGACCTGGATACTTGATTCAAAAGCCACTAATCACGTTTGCTCTTCATTTCAGGAAACTAGTTCCTTCAAGGA
GCTCGAAGTGGGTGAGATGACGCTCAGGGTCAGAACGGGAGACGTCGTCTCAGCTCGTGCAGTGGGAGATGCCAAGCTGTTTTTTGGAGATAGATTTCTTTTATTAAAAA
ATCTGTATATAGTTCCTAAGCTTAAAAGGAACTTAATTTCTATCTCTTGTTTAATAGAACTTGGATATTGTATTTCTTTTTCAATTAATGAAGCGTTCATTTTGAAAGAA
GGTGTCAATATTTGTTCTGCTAAATTGAATAATGGCTTCTACATACTGAAACCGAATAAATCCAAAGCAATTTTAAATCATGAGATGCTTAAAAATGCTAGTACTCAAAA
CAAAAGGCAAAGAATTTCTCCAAACAATAATACCTATCTATGGCATCTTAGACTTGGTCATATTAATCTCAACCAGATTGAGAGACTCTCTAAGAATTTACTTCTAAATA
AGTTAGAAGATGATTCTTTACCTCCTTGTGAATCATGCTAG
Protein sequenceShow/hide protein sequence
MSSSYIALLKKERLTGENFTTWKSNLDAILVVDDLRFILTEECPQILAQDTPQSVKDAFDRWAKANEKAKAYIMAGISDVLAKRLEGMVSAREIMGYLQNLSRQPFEQIR
HECIKYVYNARMKEGTSVREHVLDLIVHFNVAKMNGAVIDEQIQVFFILESLSKSFLPFRSNVELNKKVYNLTSLLNELQTFQSLMKNKGQADGEANLFAHSRRFQKGSS
SGTKSCGLSSGLKKTQKKKIGGKGKALAADKGKKKAKVADKGKCFHCNVDGHWKRNFPKYLAELKEKKGKLDLHILETCLVENDDLTWILDSKATNHVCSSFQETSSFKE
LEVGEMTLRVRTGDVVSARAVGDAKLFFGDRFLLLKNLYIVPKLKRNLISISCLIELGYCISFSINEAFILKEGVNICSAKLNNGFYILKPNKSKAILNHEMLKNASTQN
KRQRISPNNNTYLWHLRLGHINLNQIERLSKNLLLNKLEDDSLPPCESC