; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0000200 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0000200
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr10:5944671..5945906
RNA-Seq ExpressionPay0000200
SyntenyPay0000200
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036692.1 general transcription factor 3C polypeptide 3 isoform X1 [Cucumis melo var. makuwa]2.8e-13274.64Show/hide
Query:  ENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVILGSGAA
        +NGEE EIIEEDGEEEV DENVIEVGAVKNLNIELSINSVVGLTNPGTM    KVKDE VVVLI  GATHNFISEKLVTNLNLPLKATTNYGVILGSGA 
Subjt:  ENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVILGSGAA

Query:  IKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLVECRSI
        IK KGICGKVEVLLGDWKVVDSFLPLELGGVD+IL MQWLHSLG TEVDWKHL MSFQHGGRKV I GDPSLTKKGVSLKSMMKTWEG+D GFLVECR+I
Subjt:  IKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLVECRSI

Query:  EGKVPVATFYEEEYETTVDNSIPPLL----KKFLDVPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQ
        EGK       +EE E  VD  +   +          PVLLVRKKDGSW                                ANMFSKIDLKAGY+QIRMHQ
Subjt:  EGKVPVATFYEEEYETTVDNSIPPLL----KKFLDVPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQ

Query:  EDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNAIFRPYMRRFV
        EDVEKT F T +GHYEFLVMPFGLTNAPSTFQALMNA+ RPYMRRFV
Subjt:  EDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNAIFRPYMRRFV

KAA0038753.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.0e-13161.95Show/hide
Query:  MLVMWENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVIL
        MLV+ E GEE EI+EE+  +   +    EV  V+NLNIELS+NSVVGL NPGTMKVKGKV  E+VV+LIDCGATHNFI+E LVT L L L+ T NYGVIL
Subjt:  MLVMWENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVIL

Query:  GSGAAIKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLV
        GSG A+KGKG+C  VEV L  WKV DSFLPL+LGGVD+ILGMQWLHSLGVTEVDWK L ++F H G+KV+IRGDPSL K  VSLK++MKTW  DDQGFLV
Subjt:  GSGAAIKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLV

Query:  ECRSIE------------GKV---PVATF-------------------------------------------YEEEYETTVDNSIPPLL----KKFLDVP
        ECR++E            GKV   P+AT                                             +EE E  VD  +   +    K     P
Subjt:  ECRSIE------------GKV---PVATF-------------------------------------------YEEEYETTVDNSIPPLL----KKFLDVP

Query:  VLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNA
        VLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDEL GA++FSK+DLKAGY+QIRM  ED+EKTAFRTH+GHYEFLVMPFGLTNAPSTFQALMN 
Subjt:  VLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNA

Query:  IFRPYMRRFV
        +F+PY+RRFV
Subjt:  IFRPYMRRFV

KAA0063375.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]8.1e-13262.2Show/hide
Query:  MLVMWENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVIL
        MLV+ E GEE EI+EE+  +   +    EV  V+NLNIELS+NSVVGL NPGTMKVKGKV +E+VV+LIDCGATHNFI+E LVT L L L+ T NYGVIL
Subjt:  MLVMWENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVIL

Query:  GSGAAIKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLV
        GSG A+KGKG+C  VEV L  WKV DSFLPL+LGGVD+ILGMQWLHSLGVTEVDWK L ++FQH G+KV+IRGDPSL K  VSLK++MKTW  DDQGFLV
Subjt:  GSGAAIKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLV

Query:  ECRSIE------------GKV---PVATF-------------------------------------------YEEEYETTVDNSIPPLL----KKFLDVP
        ECR++E            GKV   P+AT                                             +EE E  VD  +   +    K     P
Subjt:  ECRSIE------------GKV---PVATF-------------------------------------------YEEEYETTVDNSIPPLL----KKFLDVP

Query:  VLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNA
        VLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDEL GA++FSK+DLKAGY+QIRM  ED+EKTAFRTH+GHYEFLVMPFGLTNAPSTFQALMN 
Subjt:  VLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNA

Query:  IFRPYMRRFV
        +F+PY+RRFV
Subjt:  IFRPYMRRFV

TYK03666.1 general transcription factor 3C polypeptide 3 isoform X1 [Cucumis melo var. makuwa]8.1e-13274.35Show/hide
Query:  ENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVILGSGAA
        +NGEE EIIEEDGEEEV DENVIEVGAVKNLNIELSINSVVGLTNPGTM    KVKDE VVVLI  GATHNFISEKLVTNLNLPLKATTNYGVILGSGA 
Subjt:  ENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVILGSGAA

Query:  IKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLVECRSI
        IK KGICGKVEVLLGDWKV DSFLPLELGGVD+IL MQWLHSLG TEVDWKHL MSFQHGGRKV I GDPSLTKKGVSLKSMMKTWEG+D GFLVECR+I
Subjt:  IKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLVECRSI

Query:  EGKVPVATFYEEEYETTVDNSIPPLL----KKFLDVPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQ
        EGK       +EE E  VD  +   +          PVLLVRKKDGSW                                ANMFSKIDLKAGY+QIRMHQ
Subjt:  EGKVPVATFYEEEYETTVDNSIPPLL----KKFLDVPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQ

Query:  EDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNAIFRPYMRRFV
        EDVEKT F T +GHYEFLVMPFGLTNAPSTFQALMNA+ RPYMRRFV
Subjt:  EDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNAIFRPYMRRFV

TYK14806.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]8.1e-13261.8Show/hide
Query:  MLVMWENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVIL
        MLV+ E GEE EI+EE+  +  A+   +EV  V+NLNIELS+NSVVGLTNPGTMKVKG+V +E+VV+LIDCGATHNFI+EKLVT L L L+ T NYGVIL
Subjt:  MLVMWENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVIL

Query:  GSGAAIKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLV
        GSG A+KGKG+C  VEV L  WKV DSFLPL+LGGVD+ILGMQWLHSLGVTEVDWK L ++F H GRKV+I+GDPSLTK  VSLK++MK+W  DDQGFLV
Subjt:  GSGAAIKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLV

Query:  ECRSIE----------------------------------------------------GKVPV-------ATFYEEEYETTVDNSIPPLL----KKFLDV
        ECR+IE                                                    G  PV       A   +EE E  VD  +   +    K     
Subjt:  ECRSIE----------------------------------------------------GKVPV-------ATFYEEEYETTVDNSIPPLL----KKFLDV

Query:  PVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMN
        PVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIP+IEELFDEL GA++FSKIDLKAGY+QIRM  ED++KTAFRTH+GHYEFLVMPFGLTNAPSTFQALMN
Subjt:  PVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMN

Query:  AIFRPYMRRFV
         +F+PY+RRFV
Subjt:  AIFRPYMRRFV

TrEMBL top hitse value%identityAlignment
A0A5A7SZU3 General transcription factor 3C polypeptide 3 isoform X11.3e-13274.64Show/hide
Query:  ENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVILGSGAA
        +NGEE EIIEEDGEEEV DENVIEVGAVKNLNIELSINSVVGLTNPGTM    KVKDE VVVLI  GATHNFISEKLVTNLNLPLKATTNYGVILGSGA 
Subjt:  ENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVILGSGAA

Query:  IKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLVECRSI
        IK KGICGKVEVLLGDWKVVDSFLPLELGGVD+IL MQWLHSLG TEVDWKHL MSFQHGGRKV I GDPSLTKKGVSLKSMMKTWEG+D GFLVECR+I
Subjt:  IKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLVECRSI

Query:  EGKVPVATFYEEEYETTVDNSIPPLL----KKFLDVPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQ
        EGK       +EE E  VD  +   +          PVLLVRKKDGSW                                ANMFSKIDLKAGY+QIRMHQ
Subjt:  EGKVPVATFYEEEYETTVDNSIPPLL----KKFLDVPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQ

Query:  EDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNAIFRPYMRRFV
        EDVEKT F T +GHYEFLVMPFGLTNAPSTFQALMNA+ RPYMRRFV
Subjt:  EDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNAIFRPYMRRFV

A0A5A7VBU7 Ty3/gypsy retrotransposon protein3.9e-13262.2Show/hide
Query:  MLVMWENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVIL
        MLV+ E GEE EI+EE+  +   +    EV  V+NLNIELS+NSVVGL NPGTMKVKGKV +E+VV+LIDCGATHNFI+E LVT L L L+ T NYGVIL
Subjt:  MLVMWENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVIL

Query:  GSGAAIKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLV
        GSG A+KGKG+C  VEV L  WKV DSFLPL+LGGVD+ILGMQWLHSLGVTEVDWK L ++FQH G+KV+IRGDPSL K  VSLK++MKTW  DDQGFLV
Subjt:  GSGAAIKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLV

Query:  ECRSIE------------GKV---PVATF-------------------------------------------YEEEYETTVDNSIPPLL----KKFLDVP
        ECR++E            GKV   P+AT                                             +EE E  VD  +   +    K     P
Subjt:  ECRSIE------------GKV---PVATF-------------------------------------------YEEEYETTVDNSIPPLL----KKFLDVP

Query:  VLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNA
        VLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDEL GA++FSK+DLKAGY+QIRM  ED+EKTAFRTH+GHYEFLVMPFGLTNAPSTFQALMN 
Subjt:  VLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNA

Query:  IFRPYMRRFV
        +F+PY+RRFV
Subjt:  IFRPYMRRFV

A0A5D3BZP9 General transcription factor 3C polypeptide 3 isoform X13.9e-13274.35Show/hide
Query:  ENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVILGSGAA
        +NGEE EIIEEDGEEEV DENVIEVGAVKNLNIELSINSVVGLTNPGTM    KVKDE VVVLI  GATHNFISEKLVTNLNLPLKATTNYGVILGSGA 
Subjt:  ENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVILGSGAA

Query:  IKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLVECRSI
        IK KGICGKVEVLLGDWKV DSFLPLELGGVD+IL MQWLHSLG TEVDWKHL MSFQHGGRKV I GDPSLTKKGVSLKSMMKTWEG+D GFLVECR+I
Subjt:  IKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLVECRSI

Query:  EGKVPVATFYEEEYETTVDNSIPPLL----KKFLDVPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQ
        EGK       +EE E  VD  +   +          PVLLVRKKDGSW                                ANMFSKIDLKAGY+QIRMHQ
Subjt:  EGKVPVATFYEEEYETTVDNSIPPLL----KKFLDVPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQ

Query:  EDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNAIFRPYMRRFV
        EDVEKT F T +GHYEFLVMPFGLTNAPSTFQALMNA+ RPYMRRFV
Subjt:  EDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNAIFRPYMRRFV

A0A5D3CU05 Ty3/gypsy retrotransposon protein1.9e-13161.31Show/hide
Query:  MLVMWENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVIL
        MLV+ E GEE EI+EE+  +  A+   ++V +V+NLNIELS+NSVVGL NPGTMKVKG+V +E+VV+LIDCGATHNFI+E LVT L + L+ T NYGVIL
Subjt:  MLVMWENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVIL

Query:  GSGAAIKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLV
        GSG A+KGKG+C  VEV L  WKV DSFLPL+LGGVD+ILGMQWLHSLGVTEVDWK L ++F H G+KV+IRGDPSLTK  VSLK++MK+W  DDQGFLV
Subjt:  GSGAAIKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLV

Query:  ECRSIE----------------------------------------------------GKVPV-------ATFYEEEYETTVDNSIPPLL----KKFLDV
        ECR+IE                                                    G  PV       A   +EE E  VD  +   +    K     
Subjt:  ECRSIE----------------------------------------------------GKVPV-------ATFYEEEYETTVDNSIPPLL----KKFLDV

Query:  PVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMN
        PVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDEL GA++FSKIDLKAGY+QIRM  ED+EKTAFRTH+GHYEFLVMPFGLTNAPSTFQALMN
Subjt:  PVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMN

Query:  AIFRPYMRRFV
         +F+PY+RRFV
Subjt:  AIFRPYMRRFV

A0A5D3CWK0 Ty3/gypsy retrotransposon protein3.9e-13261.8Show/hide
Query:  MLVMWENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVIL
        MLV+ E GEE EI+EE+  +  A+   +EV  V+NLNIELS+NSVVGLTNPGTMKVKG+V +E+VV+LIDCGATHNFI+EKLVT L L L+ T NYGVIL
Subjt:  MLVMWENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVIL

Query:  GSGAAIKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLV
        GSG A+KGKG+C  VEV L  WKV DSFLPL+LGGVD+ILGMQWLHSLGVTEVDWK L ++F H GRKV+I+GDPSLTK  VSLK++MK+W  DDQGFLV
Subjt:  GSGAAIKGKGICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLV

Query:  ECRSIE----------------------------------------------------GKVPV-------ATFYEEEYETTVDNSIPPLL----KKFLDV
        ECR+IE                                                    G  PV       A   +EE E  VD  +   +    K     
Subjt:  ECRSIE----------------------------------------------------GKVPV-------ATFYEEEYETTVDNSIPPLL----KKFLDV

Query:  PVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMN
        PVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIP+IEELFDEL GA++FSKIDLKAGY+QIRM  ED++KTAFRTH+GHYEFLVMPFGLTNAPSTFQALMN
Subjt:  PVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMN

Query:  AIFRPYMRRFV
         +F+PY+RRFV
Subjt:  AIFRPYMRRFV

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.66.2e-2641.38Show/hide
Query:  YEEEYETTVDNSIPPLLKKFL--------DVPVLLVRKKDGS-----WRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQ
        Y + YE  V++ I  +L + +        + P+ +V KK  +     +R  +DYR LN +T+ D+ PIP ++E+  +L   N F+ IDL  G++QI M  
Subjt:  YEEEYETTVDNSIPPLLKKFL--------DVPVLLVRKKDGS-----WRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQ

Query:  EDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNAIFRPYMRR
        E V KTAF T  GHYE+L MPFGL NAP+TFQ  MN I RP + +
Subjt:  EDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNAIFRPYMRR

P20825 Retrovirus-related Pol polyprotein from transposon 2973.6e-2641.26Show/hide
Query:  EEYETTVDNSIPPLLKKFL--------DVPVLLVRKKD-----GSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQED
        + +E  V+N +  +L + L        + P  +V KK        +R  +DYR LN +TIPD++PIP ++E+  +L     F+ IDL  G++QI M +E 
Subjt:  EEYETTVDNSIPPLLKKFL--------DVPVLLVRKKD-----GSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQED

Query:  VEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNAIFRPYMRR
        + KTAF T  GHYE+L MPFGL NAP+TFQ  MN I RP + +
Subjt:  VEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNAIFRPYMRR

P31843 RNA-directed DNA polymerase homolog2.2e-2351.49Show/hide
Query:  SWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNAIFRPYMRRF
        S R C+DYRAL  VTI +K+PIP +++LFD L  A  F+K+DL++GY+Q+R+ + D  KT   T  G +EF VMPFGLTNA +TF  LMN +   Y+  F
Subjt:  SWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMPFGLTNAPSTFQALMNAIFRPYMRRF

Query:  V
        V
Subjt:  V

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.2e-2650.42Show/hide
Query:  VDNSIPPLLKKFLDVPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMP
        +DN      K     PV+LV KKDG++R CVDYR LN  TI D FP+P I+ L   +  A +F+ +DL +GY+QI M  +D  KTAF T  G YE+ VMP
Subjt:  VDNSIPPLLKKFLDVPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMP

Query:  FGLTNAPSTFQALMNAIFR
        FGL NAPSTF   M   FR
Subjt:  FGLTNAPSTFQALMNAIFR

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.2e-2650.42Show/hide
Query:  VDNSIPPLLKKFLDVPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMP
        +DN      K     PV+LV KKDG++R CVDYR LN  TI D FP+P I+ L   +  A +F+ +DL +GY+QI M  +D  KTAF T  G YE+ VMP
Subjt:  VDNSIPPLLKKFLDVPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMP

Query:  FGLTNAPSTFQALMNAIFR
        FGL NAPSTF   M   FR
Subjt:  FGLTNAPSTFQALMNAIFR

Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein8.9e-2032.95Show/hide
Query:  VVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVILGSGAAIKGKGICGKVEVLLGDWKVVDSFLPLELG--GVDVILGM
        V+ LT    M+  G + D +VVV ID GAT NFI  +L  +L LP   T    V+LG    I+  G C  + + + + ++ ++FL L+L    VDVILG 
Subjt:  VVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVILGSGAAIKGKGICGKVEVLLGDWKVVDSFLPLELG--GVDVILGM

Query:  QWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLVECRSIEGKVPVATFYEEE
        +WL  LG T V+W++ + SF H  + + +  +    ++ V+ K  MK+   ++Q  + E R+ +G++ V ++ E++
Subjt:  QWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLVECRSIEGKVPVATFYEEE

AT3G30770.1 Eukaryotic aspartyl protease family protein2.0e-1133.85Show/hide
Query:  SVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVILGSGAAIKGKGICGKVEVLLGDWKVVDSFLPLEL--GGVDVILG
        S    T    M+  G +   +VVV+ID GAT+NFIS++L   L LP   T    V+LG    I+  G C  + +L+ + ++ ++FL L+L    VDVILG
Subjt:  SVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVILGSGAAIKGKGICGKVEVLLGDWKVVDSFLPLEL--GGVDVILG

Query:  MQWLHSLGVTEVDWKHLEMSFQHGGRKVII
             +L    + W + + SF H  + V +
Subjt:  MQWLHSLGVTEVDWKHLEMSFQHGGRKVII

AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding1.1e-0437.93Show/hide
Query:  KGICGKVEVLLGDWKVVDSFL--PLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQH
        K  C ++ + + D  +V+ +    L+   VDVILG +WL  LG TEV+W++   SF H
Subjt:  KGICGKVEVLLGDWKVVDSFL--PLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGGTAATGTGGGAGAATGGAGAAGAGTTTGAGATCATTGAGGAAGATGGAGAAGAAGAAGTGGCGGATGAAAATGTTATTGAAGTAGGAGCAGTGAAGAATTTGAA
CATAGAGCTATCCATTAATTCGGTGGTGGGGTTGACCAATCCGGGAACGATGAAGGTGAAAGGAAAGGTGAAGGATGAGCAAGTGGTGGTGCTGATTGACTGTGGGGCTA
CCCACAATTTTATATCCGAAAAATTGGTAACCAACCTGAATCTACCGTTGAAAGCTACAACCAATTATGGGGTGATTTTGGGTTCAGGAGCAGCCATTAAAGGGAAAGGA
ATTTGTGGGAAAGTAGAGGTACTATTGGGCGATTGGAAGGTGGTGGACAGTTTCTTGCCACTTGAGCTGGGTGGTGTTGACGTCATACTTGGTATGCAATGGTTGCATTC
TCTTGGAGTGACTGAAGTGGATTGGAAGCATTTAGAGATGTCCTTTCAGCACGGAGGAAGAAAGGTCATAATACGTGGAGACCCAAGCCTTACTAAGAAGGGAGTCAGTT
TGAAGAGCATGATGAAAACTTGGGAAGGAGACGACCAAGGTTTTTTGGTGGAATGCCGATCTATTGAGGGGAAGGTACCAGTGGCAACATTTTACGAAGAAGAGTATGAG
ACAACCGTAGATAATTCCATTCCTCCATTGTTAAAGAAATTTTTAGATGTTCCTGTGCTGTTAGTAAGGAAAAAGGATGGAAGCTGGAGGTTTTGTGTTGACTACCGAGC
GCTCAATAATGTCACCATACCAGATAAATTTCCAATTCCTGTCATTGAAGAGCTGTTTGATGAGTTGAATGGTGCAAACATGTTCTCCAAGATTGATCTTAAAGCCGGCT
ACTACCAAATACGAATGCACCAAGAGGATGTGGAAAAGACAGCATTTCGCACTCATAAAGGCCATTATGAATTTCTGGTCATGCCCTTTGGTTTGACTAATGCACCTTCT
ACTTTCCAAGCTTTGATGAATGCCATTTTCAGGCCGTATATGAGGAGGTTTGTATAG
mRNA sequenceShow/hide mRNA sequence
ATGCTGGTAATGTGGGAGAATGGAGAAGAGTTTGAGATCATTGAGGAAGATGGAGAAGAAGAAGTGGCGGATGAAAATGTTATTGAAGTAGGAGCAGTGAAGAATTTGAA
CATAGAGCTATCCATTAATTCGGTGGTGGGGTTGACCAATCCGGGAACGATGAAGGTGAAAGGAAAGGTGAAGGATGAGCAAGTGGTGGTGCTGATTGACTGTGGGGCTA
CCCACAATTTTATATCCGAAAAATTGGTAACCAACCTGAATCTACCGTTGAAAGCTACAACCAATTATGGGGTGATTTTGGGTTCAGGAGCAGCCATTAAAGGGAAAGGA
ATTTGTGGGAAAGTAGAGGTACTATTGGGCGATTGGAAGGTGGTGGACAGTTTCTTGCCACTTGAGCTGGGTGGTGTTGACGTCATACTTGGTATGCAATGGTTGCATTC
TCTTGGAGTGACTGAAGTGGATTGGAAGCATTTAGAGATGTCCTTTCAGCACGGAGGAAGAAAGGTCATAATACGTGGAGACCCAAGCCTTACTAAGAAGGGAGTCAGTT
TGAAGAGCATGATGAAAACTTGGGAAGGAGACGACCAAGGTTTTTTGGTGGAATGCCGATCTATTGAGGGGAAGGTACCAGTGGCAACATTTTACGAAGAAGAGTATGAG
ACAACCGTAGATAATTCCATTCCTCCATTGTTAAAGAAATTTTTAGATGTTCCTGTGCTGTTAGTAAGGAAAAAGGATGGAAGCTGGAGGTTTTGTGTTGACTACCGAGC
GCTCAATAATGTCACCATACCAGATAAATTTCCAATTCCTGTCATTGAAGAGCTGTTTGATGAGTTGAATGGTGCAAACATGTTCTCCAAGATTGATCTTAAAGCCGGCT
ACTACCAAATACGAATGCACCAAGAGGATGTGGAAAAGACAGCATTTCGCACTCATAAAGGCCATTATGAATTTCTGGTCATGCCCTTTGGTTTGACTAATGCACCTTCT
ACTTTCCAAGCTTTGATGAATGCCATTTTCAGGCCGTATATGAGGAGGTTTGTATAG
Protein sequenceShow/hide protein sequence
MLVMWENGEEFEIIEEDGEEEVADENVIEVGAVKNLNIELSINSVVGLTNPGTMKVKGKVKDEQVVVLIDCGATHNFISEKLVTNLNLPLKATTNYGVILGSGAAIKGKG
ICGKVEVLLGDWKVVDSFLPLELGGVDVILGMQWLHSLGVTEVDWKHLEMSFQHGGRKVIIRGDPSLTKKGVSLKSMMKTWEGDDQGFLVECRSIEGKVPVATFYEEEYE
TTVDNSIPPLLKKFLDVPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPVIEELFDELNGANMFSKIDLKAGYYQIRMHQEDVEKTAFRTHKGHYEFLVMPFGLTNAPS
TFQALMNAIFRPYMRRFV