; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021510 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021510
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon Ty3-G Gag-Pol polyprotein
Genome locationchr7:8528985..8530877
RNA-Seq ExpressionLag0021510
SyntenyLag0021510
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042859.1 hypothetical protein E6C27_scaffold44G003720 [Cucumis melo var. makuwa]4.4e-1561.54Show/hide
Query:  KTKHKSFLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNI
        +  H  F  P  L +++  GWRFCVDYR LN+ TI DKF IPVIEELLDELN +T++  SKL LKSGYH IRM E NI
Subjt:  KTKHKSFLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNI

KAA0043267.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa]4.4e-1560.26Show/hide
Query:  KTKHKSFLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNI
        +  H  +  P  L +++  GWRFCVDYR LNQAT+ DKF IPVIEELLDEL+ +TI+  SKL LKSGYH+IRM E +I
Subjt:  KTKHKSFLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNI

KAA0064293.1 Retrovirus-related Pol polyprotein from transposon 297 family [Cucumis melo var. makuwa]7.5e-1560.26Show/hide
Query:  KTKHKSFLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNI
        K  H  F  P  L +++  GWRFCVDY+ LN+ TI DKFLIPVIEELLDEL+ +TI+  SKL LKSGYH+IRM + +I
Subjt:  KTKHKSFLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNI

XP_015964281.2 uncharacterized protein LOC107488099 [Arachis duranensis]4.4e-1560.81Show/hide
Query:  FLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNIYQ
        F  P  L +++  GWRFCVDYRALN+ T+PDKF IP+IEELLDEL+ +T++  SKL LKSGYH+IRM E +I++
Subjt:  FLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNIYQ

XP_025611868.1 uncharacterized protein LOC112705246 [Arachis hypogaea]5.7e-1560.81Show/hide
Query:  FLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNIYQ
        F  P  L +++  GWRFCVDYRALN+ T+PDKF IP+IEELLDEL  +T++  SKL LKSGYH+IRM E +I++
Subjt:  FLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNIYQ

TrEMBL top hitse value%identityAlignment
A0A5D3BSG2 Retrovirus-related Pol polyprotein from transposon 297 family3.6e-1560.26Show/hide
Query:  KTKHKSFLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNI
        K  H  F  P  L +++  GWRFCVDY+ LN+ TI DKFLIPVIEELLDEL+ +TI+  SKL LKSGYH+IRM + +I
Subjt:  KTKHKSFLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNI

A0A5D3C1V9 Reverse transcriptase domain-containing protein2.1e-1561.54Show/hide
Query:  KTKHKSFLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNI
        +  H  F  P  L +++  GWRFCVDYR LN+ TI DKF IPVIEELLDELN +T++  SKL LKSGYH IRM E NI
Subjt:  KTKHKSFLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNI

A0A5D3E123 Transposon Ty3-I Gag-Pol polyprotein2.1e-1560.26Show/hide
Query:  KTKHKSFLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNI
        +  H  +  P  L +++  GWRFCVDYR LNQAT+ DKF IPVIEELLDEL+ +TI+  SKL LKSGYH+IRM E +I
Subjt:  KTKHKSFLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNI

A0A6D2L909 Reverse transcriptase domain-containing protein3.6e-1558.54Show/hide
Query:  FLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNIYQ---KTLSG
        F  P  L  ++   WRFC+DYRALN+ATIPDKF IPVI++LLDEL+ +TI+  SKL L+SGYH+IRMLE +I++   +TL G
Subjt:  FLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNIYQ---KTLSG

A0A6P4DBC2 uncharacterized protein LOC1074880992.1e-1560.81Show/hide
Query:  FLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNIYQ
        F  P  L +++  GWRFCVDYRALN+ T+PDKF IP+IEELLDEL+ +T++  SKL LKSGYH+IRM E +I++
Subjt:  FLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRMLELNIYQ

SwissProt top hitse value%identityAlignment
P10394 Retrovirus-related Pol polyprotein from transposon 4121.6e-0425.29Show/hide
Query:  EVEE-ASKAQSLEVDDLVELDLKSVIGFSTPGAMQLKRKTKHKSFLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKL
        +VEE  ++ Q L  D +VE    SV  +++P  +  K+ + +            ++  WR  +DYR +N+  + DKF +P I+++LD+L  +  +  S L
Subjt:  EVEE-ASKAQSLEVDDLVELDLKSVIGFSTPGAMQLKRKTKHKSFLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKL

Query:  YLKSGYHRIRMLE----LNIYQKTLSGLMKTTLSFWWNIKIKGYDGVACALLS--EGTKGISWIDWLLRMFCDE
         L SG+H+I + E    +  +  +      T L F   I    +  +     S  E ++   ++D L+ + C E
Subjt:  YLKSGYHRIRMLE----LNIYQKTLSGLMKTTLSFWWNIKIKGYDGVACALLS--EGTKGISWIDWLLRMFCDE

P31843 RNA-directed DNA polymerase homolog9.5e-0544.23Show/hide
Query:  RFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRM
        R C+DYRAL + TI +K+ IP +++L D L  +T +  +KL L+SGY ++R+
Subjt:  RFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRM

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.9e-0543.75Show/hide
Query:  PYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRM
        P  L  ++   +R CVDYR LN+ATI D F +P I+ LL  +  + I+  + L L SGYH+I M
Subjt:  PYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRM

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.9e-0543.75Show/hide
Query:  PYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRM
        P  L  ++   +R CVDYR LN+ATI D F +P I+ LL  +  + I+  + L L SGYH+I M
Subjt:  PYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRM

Q9UR07 Transposon Tf2-11 polyprotein6.2e-0446.15Show/hide
Query:  RFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRM
        R  VDY+ LN+   P+ + +P+IE+LL ++  STI+  +KL LKS YH IR+
Subjt:  RFCVDYRALNQATIPDKFLIPVIEELLDELNWSTIYSKSKLYLKSGYHRIRM

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAAAATTACCTTCCTAAACAGGTTGAATCCAACCATTAGGGCTGAAGTGATCAGTCGACAACATCTCGGGCTAGATGATATTATGAAGCAAGCCCATGTAGTCAA
AGATAAGGATATGGTAGTGAAAATGGCCATAGAGATGATGGACTCGGGCACGTCAAATGATGTGGGAAAGGCAACCCAACCACTTAGCTCCAAAAGAAATGGGTCAAGTA
TGGTCAAGTTGAGTATCCGAAGATTTGATTCTGTAGCTACTTGTTCGATAAGCCTACTTGAGATAGGAAGAGCAAATTGTCGAGAATTACATTCGAAGAGGTTGACTGAT
GTGGAGTTTCAAGCGAAGCTAGAAAAGGAAGAAATTGAGTTATTGTCGTGTGACGAAGTAGAGGAAGCTTCGAAGGCACAAAGTTTGGAAGTTGACGATCTAGTAGAGTT
GGATTTGAAATCAGTGATTGGTTTTTCAACACCGGGTGCTATGCAGCTGAAAAGGAAGACCAAACACAAGTCTTTTCTCGAGCCCTATACTCTTGGTCAAGAAGAAAGAT
GGGGGTGGAGGTTTTGTGTTGATTATCGAGCACTGAACCAAGCTACAATTCCTGATAAATTCTTGATACCAGTTATAGAAGAGTTACTTGATGAGTTGAATTGGTCAACA
ATTTACTCGAAGTCGAAGCTTTACCTTAAGTCGGGGTACCATCGGATTCGAATGCTGGAGTTGAACATATACCAAAAAACGCTTTCAGGACTCATGAAGACCACTTTGAG
TTTTTGGTGGAATATTAAGATAAAAGGCTATGATGGAGTGGCCTGTGCCTTGTTAAGTGAAGGAACTAAGGGGATTTCTTGGATTGATTGGTTATTACGGATGTTTTGTG
ATGAATTATGGAATGTTTGTTGTCCTATTGACTCAGTTGTTGAAGAAATACTCCTAAACATGGGTGGTGGAAACTACTCAAGCTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGAAAATTACCTTCCTAAACAGGTTGAATCCAACCATTAGGGCTGAAGTGATCAGTCGACAACATCTCGGGCTAGATGATATTATGAAGCAAGCCCATGTAGTCAA
AGATAAGGATATGGTAGTGAAAATGGCCATAGAGATGATGGACTCGGGCACGTCAAATGATGTGGGAAAGGCAACCCAACCACTTAGCTCCAAAAGAAATGGGTCAAGTA
TGGTCAAGTTGAGTATCCGAAGATTTGATTCTGTAGCTACTTGTTCGATAAGCCTACTTGAGATAGGAAGAGCAAATTGTCGAGAATTACATTCGAAGAGGTTGACTGAT
GTGGAGTTTCAAGCGAAGCTAGAAAAGGAAGAAATTGAGTTATTGTCGTGTGACGAAGTAGAGGAAGCTTCGAAGGCACAAAGTTTGGAAGTTGACGATCTAGTAGAGTT
GGATTTGAAATCAGTGATTGGTTTTTCAACACCGGGTGCTATGCAGCTGAAAAGGAAGACCAAACACAAGTCTTTTCTCGAGCCCTATACTCTTGGTCAAGAAGAAAGAT
GGGGGTGGAGGTTTTGTGTTGATTATCGAGCACTGAACCAAGCTACAATTCCTGATAAATTCTTGATACCAGTTATAGAAGAGTTACTTGATGAGTTGAATTGGTCAACA
ATTTACTCGAAGTCGAAGCTTTACCTTAAGTCGGGGTACCATCGGATTCGAATGCTGGAGTTGAACATATACCAAAAAACGCTTTCAGGACTCATGAAGACCACTTTGAG
TTTTTGGTGGAATATTAAGATAAAAGGCTATGATGGAGTGGCCTGTGCCTTGTTAAGTGAAGGAACTAAGGGGATTTCTTGGATTGATTGGTTATTACGGATGTTTTGTG
ATGAATTATGGAATGTTTGTTGTCCTATTGACTCAGTTGTTGAAGAAATACTCCTAAACATGGGTGGTGGAAACTACTCAAGCTTTTGA
Protein sequenceShow/hide protein sequence
MMKITFLNRLNPTIRAEVISRQHLGLDDIMKQAHVVKDKDMVVKMAIEMMDSGTSNDVGKATQPLSSKRNGSSMVKLSIRRFDSVATCSISLLEIGRANCRELHSKRLTD
VEFQAKLEKEEIELLSCDEVEEASKAQSLEVDDLVELDLKSVIGFSTPGAMQLKRKTKHKSFLEPYTLGQEERWGWRFCVDYRALNQATIPDKFLIPVIEELLDELNWST
IYSKSKLYLKSGYHRIRMLELNIYQKTLSGLMKTTLSFWWNIKIKGYDGVACALLSEGTKGISWIDWLLRMFCDELWNVCCPIDSVVEEILLNMGGGNYSSF