; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035217 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035217
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPol protein
Genome locationchr3:16744828..16747634
RNA-Seq ExpressionLag0035217
SyntenyLag0035217
Gene Ontology termsGO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047631.1 pol protein [Cucumis melo var. makuwa]2.6e-2576.92Show/hide
Query:  ERIGPVAYRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLVQILARDRKTLRNREITLVKVL
        ERIGPVAYRLALPP+ S+VHDVFHVSMLR+Y+ADP+HV+DFEPL++NENL YE + V+ILAR+ K LRNREI LVKVL
Subjt:  ERIGPVAYRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLVQILARDRKTLRNREITLVKVL

KAA0050376.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.2e-2579.49Show/hide
Query:  ERIGPVAYRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLVQILARDRKTLRNREITLVKVL
        ERIGPVAYRLALPPS S+VHDVFHVSMLRRY+ADP+HV+DFEPLR+NENL YE + V+ILAR+ K LRNR I+LVKVL
Subjt:  ERIGPVAYRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLVQILARDRKTLRNREITLVKVL

TYK03589.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.2e-2579.49Show/hide
Query:  ERIGPVAYRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLVQILARDRKTLRNREITLVKVL
        ERIGPVAYRLALPPS S+VHDVFHVSMLRRY+ADP+HV+DFEPLR+NENL YE + V+ILAR+ K LRNR I+LVKVL
Subjt:  ERIGPVAYRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLVQILARDRKTLRNREITLVKVL

XP_038877573.1 zinc finger BED domain-containing protein RICESLEEPER 1-like [Benincasa hispida]3.0e-2958.12Show/hide
Query:  FRFQHVDMEDDEDYFDVDTEVDVYLLERRAKGDDNFYILEWWKVNSSRYKVFSKVARDVLVIQVATVASESAFSTGGRVIDLFHSTLAPSIVEALICAQN
        F F   D+ D++D   VD+EVD+YLLE R K +DNF +L+WWK NSSRYK+  K+ARD+L + ++TVA ESAF+TGG+VID F ++L+P+IVEAL+CAQN
Subjt:  FRFQHVDMEDDEDYFDVDTEVDVYLLERRAKGDDNFYILEWWKVNSSRYKVFSKVARDVLVIQVATVASESAFSTGGRVIDLFHSTLAPSIVEALICAQN

Query:  WLRSNAININLQSHLEE
         LRS  I+I L+ +LEE
Subjt:  WLRSNAININLQSHLEE

XP_038890289.1 zinc finger BED domain-containing protein RICESLEEPER 1-like [Benincasa hispida]1.5e-2860.19Show/hide
Query:  DDEDYFDVDTEVDVYLLERRAKGDDNFYILEWWKVNSSRYKVFSKVARDVLVIQVATVASESAFSTGGRVIDLFHSTLAPSIVEALICAQNWLRSNAINI
        D++D   VD+EVD+YLLE R K +DNF IL+WWK NSSRYK+FSK+ARD+L + V+ VASES F+T  RVID F ++L+P+ +E+L+CAQNWLRS  I+I
Subjt:  DDEDYFDVDTEVDVYLLERRAKGDDNFYILEWWKVNSSRYKVFSKVARDVLVIQVATVASESAFSTGGRVIDLFHSTLAPSIVEALICAQNWLRSNAINI

Query:  NLQSHLEE
         L+ +LEE
Subjt:  NLQSHLEE

TrEMBL top hitse value%identityAlignment
A0A5A7TJV2 Reverse transcriptase1.3e-2576.92Show/hide
Query:  ERIGPVAYRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLVQILARDRKTLRNREITLVKVL
        E+IGPVAYRLALPPSLS+VHDVFHVSMLR+Y+ADP+HV+DFEPL+L+ENL +E +L++ILAR+ K LRNREI+LVKVL
Subjt:  ERIGPVAYRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLVQILARDRKTLRNREITLVKVL

A0A5A7U220 Pol protein1.3e-2576.92Show/hide
Query:  ERIGPVAYRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLVQILARDRKTLRNREITLVKVL
        ERIGPVAYRLALPP+ S+VHDVFHVSMLR+Y+ADP+HV+DFEPL++NENL YE + V+ILAR+ K LRNREI LVKVL
Subjt:  ERIGPVAYRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLVQILARDRKTLRNREITLVKVL

A0A5A7U866 Ty3-gypsy retrotransposon protein5.7e-2679.49Show/hide
Query:  ERIGPVAYRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLVQILARDRKTLRNREITLVKVL
        ERIGPVAYRLALPPS S+VHDVFHVSMLRRY+ADP+HV+DFEPLR+NENL YE + V+ILAR+ K LRNR I+LVKVL
Subjt:  ERIGPVAYRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLVQILARDRKTLRNREITLVKVL

A0A5D3BDY6 Reverse transcriptase1.3e-2576.92Show/hide
Query:  ERIGPVAYRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLVQILARDRKTLRNREITLVKVL
        E+IGPVAYRLALPPSLS+VHDVFHVSMLR+Y+ADP+HV+DFEPL+L+ENL +E +L++ILAR+ K LRNREI+LVKVL
Subjt:  ERIGPVAYRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLVQILARDRKTLRNREITLVKVL

A0A5D3BZT2 Ty3-gypsy retrotransposon protein5.7e-2679.49Show/hide
Query:  ERIGPVAYRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLVQILARDRKTLRNREITLVKVL
        ERIGPVAYRLALPPS S+VHDVFHVSMLRRY+ADP+HV+DFEPLR+NENL YE + V+ILAR+ K LRNR I+LVKVL
Subjt:  ERIGPVAYRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLVQILARDRKTLRNREITLVKVL

SwissProt top hitse value%identityAlignment
B9FJG3 Zinc finger BED domain-containing protein RICESLEEPER 13.5e-1244.32Show/hide
Query:  TEVDVYLLERRAKGDDNFYILEWWKVNSSRYKVFSKVARDVLVIQVATVAS-ESAFS--TGGRVIDLFHSTLAPSIVEALICAQNWLR
        +E++ YL E        F IL WWK+N+ +Y   SK+ARD+L I ++ V+S  S FS  TG R++D + S+  P IVEAL+CA++WL+
Subjt:  TEVDVYLLERRAKGDDNFYILEWWKVNSSRYKVFSKVARDVLVIQVATVAS-ESAFS--TGGRVIDLFHSTLAPSIVEALICAQNWLR

P03010 Putative AC9 transposase1.2e-1742.74Show/hide
Query:  STDSTARTSNPFRFQHVDMEDDEDYFDVDT-EVDVYLLERRAKGDDNFYILEWWKVNSSRYKVFSKVARDVLVIQVATVASESAFSTGGRVIDLFHSTLA
        S D T   +    FQ+  + + +DY  V++ E+D Y+ E   K    F IL WW+   + Y + +++ARDVL IQV+TVASESAFS GGRV+D + + L 
Subjt:  STDSTARTSNPFRFQHVDMEDDEDYFDVDT-EVDVYLLERRAKGDDNFYILEWWKVNSSRYKVFSKVARDVLVIQVATVASESAFSTGGRVIDLFHSTLA

Query:  PSIVEALICAQNWLRSN
          IVEALIC ++W+ ++
Subjt:  PSIVEALICAQNWLRSN

P08770 Putative AC transposase1.2e-1742.74Show/hide
Query:  STDSTARTSNPFRFQHVDMEDDEDYFDVDT-EVDVYLLERRAKGDDNFYILEWWKVNSSRYKVFSKVARDVLVIQVATVASESAFSTGGRVIDLFHSTLA
        S D T   +    FQ+  + + +DY  V++ E+D Y+ E   K    F IL WW+   + Y + +++ARDVL IQV+TVASESAFS GGRV+D + + L 
Subjt:  STDSTARTSNPFRFQHVDMEDDEDYFDVDT-EVDVYLLERRAKGDDNFYILEWWKVNSSRYKVFSKVARDVLVIQVATVASESAFSTGGRVIDLFHSTLA

Query:  PSIVEALICAQNWLRSN
          IVEALIC ++W+ ++
Subjt:  PSIVEALICAQNWLRSN

Q6AVI0 Zinc finger BED domain-containing protein RICESLEEPER 24.5e-1243.18Show/hide
Query:  TEVDVYLLERRAKGDDNFYILEWWKVNSSRYKVFSKVARDVLVIQVATVAS-ESAFS--TGGRVIDLFHSTLAPSIVEALICAQNWLR
        +E++ YL E        F IL WWK+N+ ++   S++ARD+L I ++ V+S  S FS  TG R++D + S+L P IVEAL+CA++WL+
Subjt:  TEVDVYLLERRAKGDDNFYILEWWKVNSSRYKVFSKVARDVLVIQVATVAS-ESAFS--TGGRVIDLFHSTLAPSIVEALICAQNWLR

Q75HY5 Zinc finger BED domain-containing protein RICESLEEPER 32.0e-1242.7Show/hide
Query:  TEVDVYLLERRAKGDDNFYILEWWKVNSSRYKVFSKVARDVLVIQVATVASE----SAFSTGGRVIDLFHSTLAPSIVEALICAQNWLR
        +E++ YL E       +F ILEWWK+N+ ++   SK+ARDVL I ++ V+S     SA +TG +++D + S+L P  VEAL CA++WL+
Subjt:  TEVDVYLLERRAKGDDNFYILEWWKVNSSRYKVFSKVARDVLVIQVATVASE----SAFSTGGRVIDLFHSTLAPSIVEALICAQNWLR

Arabidopsis top hitse value%identityAlignment
AT1G18560.1 BED zinc finger ;hAT family dimerisation domain6.3e-0929.52Show/hide
Query:  ILEWWKVNSSRYKVFSKVARDVLVIQVATVASESAFSTGGRVIDLFHSTLAPSIVEALICAQNWLRSNAININLQSHLEERGFERIGPVAYRLALPPSLS
        +L+WWKVNS RY   S +ARD L +Q  + A E  F   G  ID     +     +++IC ++W+ +    + L+    E  +ER+  +A  +A   S  
Subjt:  ILEWWKVNSSRYKVFSKVARDVLVIQVATVASESAFSTGGRVIDLFHSTLAPSIVEALICAQNWLRSNAININLQSHLEERGFERIGPVAYRLALPPSLS

Query:  SVHDV
         +  +
Subjt:  SVHDV

AT1G42190.1 GAG/POL/ENV polyprotein7.9e-0442Show/hide
Query:  YRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLV
        Y+L LP  + + H VFHVSMLR+ +    +VI   P  L EN+   G L+
Subjt:  YRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLV

AT3G42170.1 BED zinc finger ;hAT family dimerisation domain1.2e-1241.3Show/hide
Query:  DVDTEVDVYLLERRAKGDDNFYILEWWKVNSSRYKVFSKVARDVLVIQVATVASESAFSTGGRVIDLFHSTLAPSIVEALICAQNW-LRSNA
        ++ +E+D YL E        F +L+WWK N  +Y   SK+ARD+L I V+  A +  F    R +D + ++L P  VEALICA+ W L SNA
Subjt:  DVDTEVDVYLLERRAKGDDNFYILEWWKVNSSRYKVFSKVARDVLVIQVATVASESAFSTGGRVIDLFHSTLAPSIVEALICAQNW-LRSNA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCAAGAATCCACTTCTACTGATTCTACTGCTAGAACAAGTAATCCTTTTCGCTTTCAACATGTTGATATGGAAGATGATGAGGATTATTTTGATGTGGACACAGA
GGTGGATGTTTACTTATTGGAACGTCGTGCAAAAGGGGATGATAATTTTTACATCTTAGAATGGTGGAAAGTGAACTCTTCTAGGTACAAGGTTTTTAGTAAGGTTGCAC
GAGATGTTTTGGTAATTCAAGTGGCTACAGTTGCTTCTGAGTCTGCATTCAGTACTGGTGGAAGAGTGATAGATCTTTTTCACTCCACATTGGCTCCATCCATAGTTGAG
GCACTTATTTGTGCACAAAATTGGCTACGTTCTAATGCTATCAATATTAATCTTCAATCACATTTGGAGGAAAGGGGTTTTGAGAGGATTGGCCCTGTGGCTTACCGTTT
GGCATTGCCACCATCTCTTTCTTCGGTGCATGACGTGTTCCATGTGTCTATGCTGAGAAGATACATGGCAGACCCGTCTCATGTGATTGACTTTGAACCTTTGAGGCTAA
ATGAGAACTTGTGTTATGAAGGAAAGCTAGTTCAAATCCTTGCCAGAGATCGCAAAACCCTTCGTAATAGGGAGATAACACTGGTTAAAGTGCTTTGCAGTGAGCAGCAC
GCGTTCCGGCCGGTTTTGCAGTGCGGTGGTGGTCAAACATTGGTATTTCTCCAACAAACATTCTGTGAGGCGCGACCAGCATCATTTTCGGTGTATTTCTGGCGTGTTGT
TCAATGGCAAGCGTTCCTCTTCATATCTAGCGGTTCAAGTAGCGAGTCTTCAAGTTTCAGTGTCATTGAGCAATATCGGCGGTGGAGCCCAGTTACAGGCACGTTATCGT
GCGCTCTCTCAGTTCAGGACGGTGTTGAGCCCATGTTCAGACACGTTTTCGTGCACCAGTCGCTCCATCGAGTTTTCGATGTTGAGCTCATATTCAAGCACGTTTTTGTG
CACCAGTCGCTCCATCGAGTTTACGATGTTGAGCTCATATTCAAGCACGTTTTCGTGCACCAGCAGCTCCATCGAGTTTTCGATGTTGAGCTTATGCTTAGACATGTTTT
CATGCACCAGATTGCTCCACTGTATTTTCATGATTCAGGGGTGTTGTGTTGGGATAGGTGTAGTTCTGCTAGAGGTTGCCAGCTAGTCTCTAGTATACTTGCCTTACCCG
ACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTCAAGAATCCACTTCTACTGATTCTACTGCTAGAACAAGTAATCCTTTTCGCTTTCAACATGTTGATATGGAAGATGATGAGGATTATTTTGATGTGGACACAGA
GGTGGATGTTTACTTATTGGAACGTCGTGCAAAAGGGGATGATAATTTTTACATCTTAGAATGGTGGAAAGTGAACTCTTCTAGGTACAAGGTTTTTAGTAAGGTTGCAC
GAGATGTTTTGGTAATTCAAGTGGCTACAGTTGCTTCTGAGTCTGCATTCAGTACTGGTGGAAGAGTGATAGATCTTTTTCACTCCACATTGGCTCCATCCATAGTTGAG
GCACTTATTTGTGCACAAAATTGGCTACGTTCTAATGCTATCAATATTAATCTTCAATCACATTTGGAGGAAAGGGGTTTTGAGAGGATTGGCCCTGTGGCTTACCGTTT
GGCATTGCCACCATCTCTTTCTTCGGTGCATGACGTGTTCCATGTGTCTATGCTGAGAAGATACATGGCAGACCCGTCTCATGTGATTGACTTTGAACCTTTGAGGCTAA
ATGAGAACTTGTGTTATGAAGGAAAGCTAGTTCAAATCCTTGCCAGAGATCGCAAAACCCTTCGTAATAGGGAGATAACACTGGTTAAAGTGCTTTGCAGTGAGCAGCAC
GCGTTCCGGCCGGTTTTGCAGTGCGGTGGTGGTCAAACATTGGTATTTCTCCAACAAACATTCTGTGAGGCGCGACCAGCATCATTTTCGGTGTATTTCTGGCGTGTTGT
TCAATGGCAAGCGTTCCTCTTCATATCTAGCGGTTCAAGTAGCGAGTCTTCAAGTTTCAGTGTCATTGAGCAATATCGGCGGTGGAGCCCAGTTACAGGCACGTTATCGT
GCGCTCTCTCAGTTCAGGACGGTGTTGAGCCCATGTTCAGACACGTTTTCGTGCACCAGTCGCTCCATCGAGTTTTCGATGTTGAGCTCATATTCAAGCACGTTTTTGTG
CACCAGTCGCTCCATCGAGTTTACGATGTTGAGCTCATATTCAAGCACGTTTTCGTGCACCAGCAGCTCCATCGAGTTTTCGATGTTGAGCTTATGCTTAGACATGTTTT
CATGCACCAGATTGCTCCACTGTATTTTCATGATTCAGGGGTGTTGTGTTGGGATAGGTGTAGTTCTGCTAGAGGTTGCCAGCTAGTCTCTAGTATACTTGCCTTACCCG
ACTGA
Protein sequenceShow/hide protein sequence
MSQESTSTDSTARTSNPFRFQHVDMEDDEDYFDVDTEVDVYLLERRAKGDDNFYILEWWKVNSSRYKVFSKVARDVLVIQVATVASESAFSTGGRVIDLFHSTLAPSIVE
ALICAQNWLRSNAININLQSHLEERGFERIGPVAYRLALPPSLSSVHDVFHVSMLRRYMADPSHVIDFEPLRLNENLCYEGKLVQILARDRKTLRNREITLVKVLCSEQH
AFRPVLQCGGGQTLVFLQQTFCEARPASFSVYFWRVVQWQAFLFISSGSSSESSSFSVIEQYRRWSPVTGTLSCALSVQDGVEPMFRHVFVHQSLHRVFDVELIFKHVFV
HQSLHRVYDVELIFKHVFVHQQLHRVFDVELMLRHVFMHQIAPLYFHDSGVLCWDRCSSARGCQLVSSILALPD