; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004960 (gene) of Snake gourd v1 genome

Gene IDTan0004960
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG02:12354523..12355986
RNA-Seq ExpressionTan0004960
SyntenyTan0004960
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]4.0e-14358.56Show/hide
Query:  ETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCFI
        ETQ K+ KVS   NA+LWHLRL HINLNRIERLVKS ILN+LE+NSLP CESCLEGKMTKRSF+GKG RAK P EL+HSDLCG M+VKARGGYEYF+ FI
Subjt:  ETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCFI

Query:  DDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLPD
        DD+SRYG++YL+H KSES +KFKEYK EVEN +GKTIKTL+SDRGGEYMD++FQDY+I+ GI      P++   NGVSERRNRTLLDMVRSM SYA+LPD
Subjt:  DDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLPD

Query:  SFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK--------------------------
        SF GYA+ETA+ ILNNVPSKSV ETP+ELW GRK                                 GYPKE++                          
Subjt:  SFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK--------------------------

Query:  --------------YVNKCVDPSTSSQV-----------RSQELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEWIKGMDQE
                      + N    PS+S++V            SQELR+PRRSGRVV Q  RY+ L ETQ+I PDD  EDPLTY QAM D D+D+WIK M+ E
Subjt:  --------------YVNKCVDPSTSSQV-----------RSQELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEWIKGMDQE

Query:  MESMHFNSAESLW-----INQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV
        MESM+FNS  +L      +   GCKWIYKRKR   GKV TFKARLV K +TQ EGVDYEETFSPVAM+KSIRILL+I  +Y+YE+
Subjt:  MESMHFNSAESLW-----INQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]2.3e-14358.81Show/hide
Query:  AETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCF
        A TQ KR ++SP  N +LWHLRL HINL+RI RLVK+ +LN+L++ SLP CESCLEGKMTKR F+GKGYRAKEP ELIHSDLCG M+VKARGG+EYF+ F
Subjt:  AETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCF

Query:  IDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLP
        IDDYSRYGY+YLM  KSE+L+KFKEYKTEVENLL K IK L+SDRGGEYMD  FQDYMI+HGI      P +   NGVSERRNRTLLDMVRSM SYA+LP
Subjt:  IDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLP

Query:  DSFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK-------------------------
         SF GYAVETAV ILNNVPSKSV ETPFELW GRK                                 GYPKET+                         
Subjt:  DSFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK-------------------------

Query:  ----------YVNKCVDPST--------SSQV----------RSQELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEWIKGM
                   +++  D ST        SS+V           SQ LRMPRRSGRVV Q  RY+ L ETQV+ PDD  EDPL+Y QAM D DKD+W+K M
Subjt:  ----------YVNKCVDPST--------SSQV----------RSQELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEWIKGM

Query:  DQEMESMHFNSAESL-----WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV
        D EMESM+FNS   L      +   GCKWIYKRKR   GKV TFKARLV K +TQ EGVDYEETFSPVAM+KSIRILL+I  +YDYE+
Subjt:  DQEMESMHFNSAESL-----WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]4.8e-14157.32Show/hide
Query:  AETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCF
        A TQ KR+K+SPKENAHLWHLRL HINLNRIERLVK+ +L+ELEENSLP+CESCLEGKMTKR F+GKG+RAKEP EL+HSDLCG M+VKARGG+EYF+ F
Subjt:  AETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCF

Query:  IDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLP
         DDYSRYGY+YLM  KSE+L+KFKEYK EVEN L KTIKT +SDRGGEYMD +FQ+Y+++ GI      P +   NGVSERRNRTLLDMVRSM SYA LP
Subjt:  IDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLP

Query:  DSFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK-------------------------
        +SF GYAV+TAV ILN VPSKSV ETP +LWNGRK                                 GYPK T+                         
Subjt:  DSFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK-------------------------

Query:  --------------YVNKCVDPST---------------SSQVRS---QELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEW
                         +  +PST                S  R+   Q LR PRRSGRV     RYMSL ET  +  D D EDPLT+ +AM D DKDEW
Subjt:  --------------YVNKCVDPST---------------SSQVRS---QELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEW

Query:  IKGMDQEMESMHFNSAESL-----WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV
        IK M+ E+ESM+FNS   L      +   GCKWIYKRKRG DGKV TFKARLV K +TQVEGVDYEETFSPVAM+KSIRILL+I AY+DYE+
Subjt:  IKGMDQEMESMHFNSAESL-----WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]4.8e-14157.32Show/hide
Query:  AETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCF
        A TQ KR+K+SPKENAHLWHLRL HINLNRIERLVK+ +L+ELEENSLP+CESCLEGKMTKR F+GKG+RAKEP EL+HSDLCG M+VKARGG+EYF+ F
Subjt:  AETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCF

Query:  IDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLP
         DDYSRYGY+YLM  KSE+L+KFKEYK EVEN L KTIKT +SDRGGEYMD +FQ+Y+++ GI      P +   NGVSERRNRTLLDMVRSM SYA LP
Subjt:  IDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLP

Query:  DSFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK-------------------------
        +SF GYAV+TAV ILN VPSKSV ETP +LWNGRK                                 GYPK T+                         
Subjt:  DSFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK-------------------------

Query:  --------------YVNKCVDPST---------------SSQVRS---QELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEW
                         +  +PST                S  R+   Q LR PRRSGRV     RYMSL ET  +  D D EDPLT+ +AM D DKDEW
Subjt:  --------------YVNKCVDPST---------------SSQVRS---QELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEW

Query:  IKGMDQEMESMHFNSAESL-----WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV
        IK M+ E+ESM+FNS   L      +   GCKWIYKRKRG DGKV TFKARLV K +TQVEGVDYEETFSPVAM+KSIRILL+I AY+DYE+
Subjt:  IKGMDQEMESMHFNSAESL-----WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]4.8e-14157.32Show/hide
Query:  AETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCF
        A TQ KR+K+SPKENAHLWHLRL HINLNRIERLVK+ +L+ELEENSLP+CESCLEGKMTKR F+GKG+RAKEP EL+HSDLCG M+VKARGG+EYF+ F
Subjt:  AETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCF

Query:  IDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLP
         DDYSRYGY+YLM  KSE+L+KFKEYK EVEN L KTIKT +SDRGGEYMD +FQ+Y+++ GI      P +   NGVSERRNRTLLDMVRSM SYA LP
Subjt:  IDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLP

Query:  DSFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK-------------------------
        +SF GYAV+TAV ILN VPSKSV ETP +LWNGRK                                 GYPK T+                         
Subjt:  DSFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK-------------------------

Query:  --------------YVNKCVDPST---------------SSQVRS---QELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEW
                         +  +PST                S  R+   Q LR PRRSGRV     RYMSL ET  +  D D EDPLT+ +AM D DKDEW
Subjt:  --------------YVNKCVDPST---------------SSQVRS---QELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEW

Query:  IKGMDQEMESMHFNSAESL-----WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV
        IK M+ E+ESM+FNS   L      +   GCKWIYKRKRG DGKV TFKARLV K +TQVEGVDYEETFSPVAM+KSIRILL+I AY+DYE+
Subjt:  IKGMDQEMESMHFNSAESL-----WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein2.3e-14157.32Show/hide
Query:  AETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCF
        A TQ KR+K+SPKENAHLWHLRL HINLNRIERLVK+ +L+ELEENSLP+CESCLEGKMTKR F+GKG+RAKEP EL+HSDLCG M+VKARGG+EYF+ F
Subjt:  AETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCF

Query:  IDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLP
         DDYSRYGY+YLM  KSE+L+KFKEYK EVEN L KTIKT +SDRGGEYMD +FQ+Y+++ GI      P +   NGVSERRNRTLLDMVRSM SYA LP
Subjt:  IDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLP

Query:  DSFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK-------------------------
        +SF GYAV+TAV ILN VPSKSV ETP +LWNGRK                                 GYPK T+                         
Subjt:  DSFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK-------------------------

Query:  --------------YVNKCVDPST---------------SSQVRS---QELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEW
                         +  +PST                S  R+   Q LR PRRSGRV     RYMSL ET  +  D D EDPLT+ +AM D DKDEW
Subjt:  --------------YVNKCVDPST---------------SSQVRS---QELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEW

Query:  IKGMDQEMESMHFNSAESL-----WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV
        IK M+ E+ESM+FNS   L      +   GCKWIYKRKRG DGKV TFKARLV K +TQVEGVDYEETFSPVAM+KSIRILL+I AY+DYE+
Subjt:  IKGMDQEMESMHFNSAESL-----WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV

A0A5A7TZD0 Gag/pol protein1.1e-14358.81Show/hide
Query:  AETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCF
        A TQ KR ++SP  N +LWHLRL HINL+RI RLVK+ +LN+L++ SLP CESCLEGKMTKR F+GKGYRAKEP ELIHSDLCG M+VKARGG+EYF+ F
Subjt:  AETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCF

Query:  IDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLP
        IDDYSRYGY+YLM  KSE+L+KFKEYKTEVENLL K IK L+SDRGGEYMD  FQDYMI+HGI      P +   NGVSERRNRTLLDMVRSM SYA+LP
Subjt:  IDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLP

Query:  DSFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK-------------------------
         SF GYAVETAV ILNNVPSKSV ETPFELW GRK                                 GYPKET+                         
Subjt:  DSFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK-------------------------

Query:  ----------YVNKCVDPST--------SSQV----------RSQELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEWIKGM
                   +++  D ST        SS+V           SQ LRMPRRSGRVV Q  RY+ L ETQV+ PDD  EDPL+Y QAM D DKD+W+K M
Subjt:  ----------YVNKCVDPST--------SSQV----------RSQELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEWIKGM

Query:  DQEMESMHFNSAESL-----WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV
        D EMESM+FNS   L      +   GCKWIYKRKR   GKV TFKARLV K +TQ EGVDYEETFSPVAM+KSIRILL+I  +YDYE+
Subjt:  DQEMESMHFNSAESL-----WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV

A0A5A7V4M1 Gag/pol protein2.3e-14157.32Show/hide
Query:  AETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCF
        A TQ KR+K+SPKENAHLWHLRL HINLNRIERLVK+ +L+ELEENSLP+CESCLEGKMTKR F+GKG+RAKEP EL+HSDLCG M+VKARGG+EYF+ F
Subjt:  AETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCF

Query:  IDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLP
         DDYSRYGY+YLM  KSE+L+KFKEYK EVEN L KTIKT +SDRGGEYMD +FQ+Y+++ GI      P +   NGVSERRNRTLLDMVRSM SYA LP
Subjt:  IDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLP

Query:  DSFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK-------------------------
        +SF GYAV+TAV ILN VPSKSV ETP +LWNGRK                                 GYPK T+                         
Subjt:  DSFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK-------------------------

Query:  --------------YVNKCVDPST---------------SSQVRS---QELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEW
                         +  +PST                S  R+   Q LR PRRSGRV     RYMSL ET  +  D D EDPLT+ +AM D DKDEW
Subjt:  --------------YVNKCVDPST---------------SSQVRS---QELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEW

Query:  IKGMDQEMESMHFNSAESL-----WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV
        IK M+ E+ESM+FNS   L      +   GCKWIYKRKRG DGKV TFKARLV K +TQVEGVDYEETFSPVAM+KSIRILL+I AY+DYE+
Subjt:  IKGMDQEMESMHFNSAESL-----WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV

A0A5D3CPJ6 Gag/pol protein2.3e-14157.32Show/hide
Query:  AETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCF
        A TQ KR+K+SPKENAHLWHLRL HINLNRIERLVK+ +L+ELEENSLP+CESCLEGKMTKR F+GKG+RAKEP EL+HSDLCG M+VKARGG+EYF+ F
Subjt:  AETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCF

Query:  IDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLP
         DDYSRYGY+YLM  KSE+L+KFKEYK EVEN L KTIKT +SDRGGEYMD +FQ+Y+++ GI      P +   NGVSERRNRTLLDMVRSM SYA LP
Subjt:  IDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLP

Query:  DSFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK-------------------------
        +SF GYAV+TAV ILN VPSKSV ETP +LWNGRK                                 GYPK T+                         
Subjt:  DSFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK-------------------------

Query:  --------------YVNKCVDPST---------------SSQVRS---QELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEW
                         +  +PST                S  R+   Q LR PRRSGRV     RYMSL ET  +  D D EDPLT+ +AM D DKDEW
Subjt:  --------------YVNKCVDPST---------------SSQVRS---QELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEW

Query:  IKGMDQEMESMHFNSAESL-----WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV
        IK M+ E+ESM+FNS   L      +   GCKWIYKRKRG DGKV TFKARLV K +TQVEGVDYEETFSPVAM+KSIRILL+I AY+DYE+
Subjt:  IKGMDQEMESMHFNSAESL-----WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV

E2GK51 Gag/pol protein (Fragment)1.9e-14358.56Show/hide
Query:  ETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCFI
        ETQ K+ KVS   NA+LWHLRL HINLNRIERLVKS ILN+LE+NSLP CESCLEGKMTKRSF+GKG RAK P EL+HSDLCG M+VKARGGYEYF+ FI
Subjt:  ETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCFI

Query:  DDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLPD
        DD+SRYG++YL+H KSES +KFKEYK EVEN +GKTIKTL+SDRGGEYMD++FQDY+I+ GI      P++   NGVSERRNRTLLDMVRSM SYA+LPD
Subjt:  DDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLPD

Query:  SFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK--------------------------
        SF GYA+ETA+ ILNNVPSKSV ETP+ELW GRK                                 GYPKE++                          
Subjt:  SFLGYAVETAVCILNNVPSKSVCETPFELWNGRK---------------------------------GYPKETK--------------------------

Query:  --------------YVNKCVDPSTSSQV-----------RSQELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEWIKGMDQE
                      + N    PS+S++V            SQELR+PRRSGRVV Q  RY+ L ETQ+I PDD  EDPLTY QAM D D+D+WIK M+ E
Subjt:  --------------YVNKCVDPSTSSQV-----------RSQELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEWIKGMDQE

Query:  MESMHFNSAESLW-----INQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV
        MESM+FNS  +L      +   GCKWIYKRKR   GKV TFKARLV K +TQ EGVDYEETFSPVAM+KSIRILL+I  +Y+YE+
Subjt:  MESMHFNSAESLW-----INQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.5e-3233.2Show/hide
Query:  QTKRVKVSPKENAHLWHLRLDHIN------LNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRA--KEPFELIHSDLCGLMSVKARGGYE
        Q   +    K N  LWH R  HI+      + R        +LN L E S  ICE CL GK  +  F     +   K P  ++HSD+CG ++        
Subjt:  QTKRVKVSPKENAHLWHLRLDHIN------LNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRA--KEPFELIHSDLCGLMSVKARGGYE

Query:  YFVCFIDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGIT-----PNSQHLNGVSERRNRTLLDMVRSMKS
        YFV F+D ++ Y   YL+  KS+    F+++  + E      +  L  D G EY+  E + + +K GI+     P++  LNGVSER  RT+ +  R+M S
Subjt:  YFVCFIDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGIT-----PNSQHLNGVSERRNRTLLDMVRSMKS

Query:  YARLPDSFLGYAVETAVCILNNVPSKSVCE---TPFELWNGRKGYPKETK
         A+L  SF G AV TA  ++N +PS+++ +   TP+E+W+ +K Y K  +
Subjt:  YARLPDSFLGYAVETAVCILNNVPSKSVCE---TPFELWNGRKGYPKETK

P04146 Copia protein3.5e-0931.48Show/hide
Query:  PLTYDQAMVDADKDEWIKGMDQEMESMHFNSAESLW-----INQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAI
        P ++D+     DK  W + ++ E+ +   N+  ++       N    +W++  K    G    +KARLV + FTQ   +DYEETF+PVA + S R +L++
Subjt:  PLTYDQAMVDADKDEWIKGMDQEMESMHFNSAESLW-----INQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAI

Query:  VAYYDYEV
        V  Y+ +V
Subjt:  VAYYDYEV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.8e-4829.26Show/hide
Query:  LWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCFIDDYSRYGYIYLMHRKS
        LWH R+ H++   ++ L K  +++  +  ++  C+ CL GK  + SF     R     +L++SD+CG M +++ GG +YFV FIDD SR  ++Y++  K 
Subjt:  LWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCFIDDYSRYGYIYLMHRKS

Query:  ESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLPDSFLGYAVETAVCILNN
        +  + F+++   VE   G+ +K L+SD GGEY   EF++Y   HGI      P +   NGV+ER NRT+++ VRSM   A+LP SF G AV+TA  ++N 
Subjt:  ESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGI-----TPNSQHLNGVSERRNRTLLDMVRSMKSYARLPDSFLGYAVETAVCILNN

Query:  VPSKSVC-ETPFELWNGRK---------------GYPKE--TKYVNKCVD--------------------------------------------------
         PS  +  E P  +W  ++                 PKE  TK  +K +                                                   
Subjt:  VPSKSVC-ETPFELWNGRK---------------GYPKE--TKYVNKCVD--------------------------------------------------

Query:  -------PSTSSQVRS-----------------------------QELRMP----------RRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMV
               PSTS+   S                             +E+  P          RRS R   +  RY S   T+ +   DD E P +  + + 
Subjt:  -------PSTSSQVRS-----------------------------QELRMP----------RRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMV

Query:  DADKDEWIKGMDQEMESMHFNSAESLWINQTG-----CKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV
          +K++ +K M +EMES+  N    L     G     CKW++K K+  D K+  +KARLV K F Q +G+D++E FSPV  + SIR +L++ A  D EV
Subjt:  DADKDEWIKGMDQEMESMHFNSAESLWINQTG-----CKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV

P93293 Uncharacterized mitochondrial protein AtMg003004.6e-0936.78Show/hide
Query:  ETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSV
        ET    +  + K+   LWH RL H++   +E LVK   L+  + +SL  CE C+ GK  + +FS   +  K P + +HSDL G  SV
Subjt:  ETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.6e-2229.06Show/hide
Query:  MAETQTKRVKVSPKENA--HLWHLRLDHINLNRIERLVKSVILNELE-ENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEY
        +A +Q   +  SP   A    WH RL H   + +  ++ +  L+ L   +    C  CL  K  K  FS     +  P E I+SD+     + +   Y Y
Subjt:  MAETQTKRVKVSPKENA--HLWHLRLDHINLNRIERLVKSVILNELE-ENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEY

Query:  FVCFIDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGIT-----PNSQHLNGVSERRNRTLLDMVRSMKSY
        +V F+D ++RY ++Y + +KS+  + F  +K  +EN     I T  SD GGE++     +Y  +HGI+     P++   NG+SER++R +++   ++ S+
Subjt:  FVCFIDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGIT-----PNSQHLNGVSERRNRTLLDMVRSMKSY

Query:  ARLPDSFLGYAVETAVCILNNVPSKSV-CETPFE
        A +P ++  YA   AV ++N +P+  +  E+PF+
Subjt:  ARLPDSFLGYAVETAVCILNNVPSKSV-CETPFE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.2e-1037.25Show/hide
Query:  DPLTYDQAMVDADKDEWIKGMDQEMESMHFNSAESL------WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILL
        +P T  QA+ D   + W   M  E+ +   N    L       +   GC+WI+ +K   DG ++ +KARLV K + Q  G+DY ETFSPV    SIRI+L
Subjt:  DPLTYDQAMVDADKDEWIKGMDQEMESMHFNSAESL------WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILL

Query:  AI
         +
Subjt:  AI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.0e-2430.77Show/hide
Query:  MAETQTKRVKVSP--KENAHLWHLRLDHINLNRIERLVKSVILNELE-ENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEY
        +A +Q   +  SP  K     WH RL H +L  +  ++ +  L  L   + L  C  C   K  K  FS     + +P E I+SD+     + +   Y Y
Subjt:  MAETQTKRVKVSP--KENAHLWHLRLDHINLNRIERLVKSVILNELE-ENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEY

Query:  FVCFIDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGIT-----PNSQHLNGVSERRNRTLLDMVRSMKSY
        +V F+D ++RY ++Y + +KS+    F  +K+ VEN     I TL SD GGE++    +DY+ +HGI+     P++   NG+SER++R +++M  ++ S+
Subjt:  FVCFIDDYSRYGYIYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGIT-----PNSQHLNGVSERRNRTLLDMVRSMKSY

Query:  ARLPDSFLGYAVETAVCILNNVPSKSV-CETPFE
        A +P ++  YA   AV ++N +P+  +  ++PF+
Subjt:  ARLPDSFLGYAVETAVCILNNVPSKSV-CETPFE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.9e-1039.22Show/hide
Query:  DPLTYDQAMVDADKDEWIKGMDQEMESMHFNSAESL------WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILL
        +P T  QAM D   D W + M  E+ +   N    L       +   GC+WI+ +K   DG ++ +KARLV K + Q  G+DY ETFSPV    SIRI+L
Subjt:  DPLTYDQAMVDADKDEWIKGMDQEMESMHFNSAESL------WINQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILL

Query:  AI
         +
Subjt:  AI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.3e-1441.67Show/hide
Query:  EDPLTYDQAMVDADKDEWIKGMDQE---MESMHFNSAESLWINQ--TGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILL
        ++P TY++A    +   W   MD E   ME+ H     +L  N+   GCKW+YK K   DG +  +KARLV K +TQ EG+D+ ETFSPV  + S++++L
Subjt:  EDPLTYDQAMVDADKDEWIKGMDQE---MESMHFNSAESLWINQ--TGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILL

Query:  AIVAYYDY
        AI A Y++
Subjt:  AIVAYYDY

ATMG00300.1 Gag-Pol-related retrotransposon family protein3.2e-1036.78Show/hide
Query:  ETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSV
        ET    +  + K+   LWH RL H++   +E LVK   L+  + +SL  CE C+ GK  + +FS   +  K P + +HSDL G  SV
Subjt:  ETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSV

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.5e-1036.36Show/hide
Query:  QAMVDADKDE-WIKGMDQEMESMHFNSAESLWI--------NQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAI
        ++++ A KD  W + M +E++++   S    WI        N  GCKW++K K   DG +   KARLV K F Q EG+ + ET+SPV    +IR +L +
Subjt:  QAMVDADKDE-WIKGMDQEMESMHFNSAESLWI--------NQTGCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAAACACAAACTAAGAGAGTGAAAGTTTCTCCTAAAGAAAATGCCCATCTTTGGCATCTAAGGTTAGACCACATTAATCTAAATAGGATTGAGAGACTAGTGAA
GAGTGTAATTCTAAACGAGTTGGAAGAAAACTCTTTACCGATATGTGAGTCATGCCTTGAAGGCAAAATGACCAAACGTTCTTTTAGTGGAAAAGGATATAGAGCCAAAG
AGCCCTTTGAGCTTATACATTCTGACCTATGTGGTCTGATGAGTGTTAAAGCACGAGGAGGTTACGAATACTTTGTATGTTTTATAGATGACTATTCAAGGTATGGGTAT
ATTTACCTAATGCATAGGAAGTCTGAAAGTCTTAAAAAGTTCAAGGAGTACAAGACTGAGGTTGAGAACCTGTTAGGTAAAACAATAAAAACACTTCAATCGGATCGAGG
TGGAGAGTATATGGACACTGAATTCCAGGACTATATGATAAAACATGGAATTACTCCCAACTCTCAGCACCTGAATGGTGTATCGGAGAGGAGAAACAGAACCCTGTTGG
ACATGGTTCGGTCGATGAAGAGCTATGCTCGTCTCCCTGATTCTTTTTTAGGTTACGCAGTTGAGACCGCGGTTTGTATTTTGAACAACGTTCCATCGAAGAGTGTTTGT
GAAACACCTTTCGAACTCTGGAATGGACGTAAAGGTTACCCAAAAGAGACTAAGTACGTCAACAAGTGTGTTGATCCTAGCACGTCTAGTCAAGTCCGTTCTCAAGAGTT
GAGAATGCCTCGACGTAGTGGGAGGGTTGTGAGACAGCTTGAACGTTACATGAGTTTAGATGAAACCCAAGTCATCACCCCTGATGATGACTACGAGGATCCATTGACCT
ATGATCAGGCAATGGTAGACGCTGACAAAGACGAATGGATTAAAGGTATGGACCAGGAAATGGAGTCGATGCACTTCAATTCTGCTGAGAGCTTGTGGATCAACCAGACG
GGTTGCAAATGGATCTACAAGCGTAAACGTGGCGTAGATGGGAAGGTGCATACCTTCAAAGCACGACTAGTGACAAAGGATTTTACCCAGGTAGAAGGGGTTGACTATGA
GGAAACCTTTTCACCTGTTGCCATGGTAAAGTCGATCAGGATCCTTTTGGCCATTGTCGCATATTATGACTATGAGGTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGAAACACAAACTAAGAGAGTGAAAGTTTCTCCTAAAGAAAATGCCCATCTTTGGCATCTAAGGTTAGACCACATTAATCTAAATAGGATTGAGAGACTAGTGAA
GAGTGTAATTCTAAACGAGTTGGAAGAAAACTCTTTACCGATATGTGAGTCATGCCTTGAAGGCAAAATGACCAAACGTTCTTTTAGTGGAAAAGGATATAGAGCCAAAG
AGCCCTTTGAGCTTATACATTCTGACCTATGTGGTCTGATGAGTGTTAAAGCACGAGGAGGTTACGAATACTTTGTATGTTTTATAGATGACTATTCAAGGTATGGGTAT
ATTTACCTAATGCATAGGAAGTCTGAAAGTCTTAAAAAGTTCAAGGAGTACAAGACTGAGGTTGAGAACCTGTTAGGTAAAACAATAAAAACACTTCAATCGGATCGAGG
TGGAGAGTATATGGACACTGAATTCCAGGACTATATGATAAAACATGGAATTACTCCCAACTCTCAGCACCTGAATGGTGTATCGGAGAGGAGAAACAGAACCCTGTTGG
ACATGGTTCGGTCGATGAAGAGCTATGCTCGTCTCCCTGATTCTTTTTTAGGTTACGCAGTTGAGACCGCGGTTTGTATTTTGAACAACGTTCCATCGAAGAGTGTTTGT
GAAACACCTTTCGAACTCTGGAATGGACGTAAAGGTTACCCAAAAGAGACTAAGTACGTCAACAAGTGTGTTGATCCTAGCACGTCTAGTCAAGTCCGTTCTCAAGAGTT
GAGAATGCCTCGACGTAGTGGGAGGGTTGTGAGACAGCTTGAACGTTACATGAGTTTAGATGAAACCCAAGTCATCACCCCTGATGATGACTACGAGGATCCATTGACCT
ATGATCAGGCAATGGTAGACGCTGACAAAGACGAATGGATTAAAGGTATGGACCAGGAAATGGAGTCGATGCACTTCAATTCTGCTGAGAGCTTGTGGATCAACCAGACG
GGTTGCAAATGGATCTACAAGCGTAAACGTGGCGTAGATGGGAAGGTGCATACCTTCAAAGCACGACTAGTGACAAAGGATTTTACCCAGGTAGAAGGGGTTGACTATGA
GGAAACCTTTTCACCTGTTGCCATGGTAAAGTCGATCAGGATCCTTTTGGCCATTGTCGCATATTATGACTATGAGGTATGA
Protein sequenceShow/hide protein sequence
MAETQTKRVKVSPKENAHLWHLRLDHINLNRIERLVKSVILNELEENSLPICESCLEGKMTKRSFSGKGYRAKEPFELIHSDLCGLMSVKARGGYEYFVCFIDDYSRYGY
IYLMHRKSESLKKFKEYKTEVENLLGKTIKTLQSDRGGEYMDTEFQDYMIKHGITPNSQHLNGVSERRNRTLLDMVRSMKSYARLPDSFLGYAVETAVCILNNVPSKSVC
ETPFELWNGRKGYPKETKYVNKCVDPSTSSQVRSQELRMPRRSGRVVRQLERYMSLDETQVITPDDDYEDPLTYDQAMVDADKDEWIKGMDQEMESMHFNSAESLWINQT
GCKWIYKRKRGVDGKVHTFKARLVTKDFTQVEGVDYEETFSPVAMVKSIRILLAIVAYYDYEV