; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0024689 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0024689
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr10:4982961..4985544
RNA-Seq ExpressionLag0024689
SyntenyLag0024689
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.5e-3035.51Show/hide
Query:  DSLQWIPDSQGRFSVASARCLYWSLASVDRPQPHSI------IFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDH
        DS  W  +S G ++VAS +       ++ +P+ + +       F NLWK  IPKK  FFIWT++Y  +NT ++L +   +    PS C +C    +   H
Subjt:  DSLQWIPDSQGRFSVASARCLYWSLASVDRPQPHSI------IFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDH

Query:  IFVHCGVASDIWHSF--HLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFS
        +F+ C +A  IW S   HL+S ++   P +   +C+   + K   ++ +++ +  A+ LW IW ERN+ IF    +T   IWEDI +LA  W S +  FS
Subjt:  IFVHCGVASDIWHSF--HLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFS

Query:  NYQASTIALNWKVF
        NYQAS+IALN   F
Subjt:  NYQASTIALNWKVF

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.8e-2934Show/hide
Query:  WIPDSQGRFSVASARCLYWSLASVDRPQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDHIFVHCGVASD
        W P    +++VASA+ + +  +S+ +         +LW++ IP+K KFFIWT+++++LNT D +Q+   S +LNPS C  C + ++ ++H+F+ C  A  
Subjt:  WIPDSQGRFSVASARCLYWSLASVDRPQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDHIFVHCGVASD

Query:  IWHSFHLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFSNYQASTIALNWK
        +W+ +   +G  M + + V  +CL+         + I+  +   A LW IW  RN+LIF +   + +N WEDI +L   W S +K   NY  +TIALN K
Subjt:  IWHSFHLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFSNYQASTIALNWK

TYK10356.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]4.6e-3037.02Show/hide
Query:  DSLQWIPDSQGRFSVASARCLYWSLASVDRPQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDHIFVHCG
        DS  WI +S G +SVAS +              +   F NLWK+ IPKK KFFIWT++Y  +NT D+L +   +    PS C +C    +   H+F+ C 
Subjt:  DSLQWIPDSQGRFSVASARCLYWSLASVDRPQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDHIFVHCG

Query:  VASDIWH--SFHLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFSNYQAST
        +A  IW+  S HL S ++   P     +C+   + K   ++ I++ +  A+ LW IW ERN+ IF    +T  ++WEDI +LA  W S    F+NYQA++
Subjt:  VASDIWH--SFHLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFSNYQAST

Query:  IALNWKVF
        IALN   F
Subjt:  IALNWKVF

TYK13741.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.5e-3035.51Show/hide
Query:  DSLQWIPDSQGRFSVASARCLYWSLASVDRPQPHSI------IFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDH
        DS  W  +S G ++VAS +       ++ +P+ + +       F NLWK  IPKK  FFIWT++Y  +NT ++L +   +    PS C +C    +   H
Subjt:  DSLQWIPDSQGRFSVASARCLYWSLASVDRPQPHSI------IFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDH

Query:  IFVHCGVASDIWHSF--HLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFS
        +F+ C +A  IW S   HL+S ++   P +   +C+   + K   ++ +++ +  A+ LW IW ERN+ IF    +T   IWEDI +LA  W S +  FS
Subjt:  IFVHCGVASDIWHSF--HLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFS

Query:  NYQASTIALNWKVF
        NYQAS+IALN   F
Subjt:  NYQASTIALNWKVF

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo]3.5e-3035.51Show/hide
Query:  DSLQWIPDSQGRFSVASARCLYWSLASVDRPQPHSI------IFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDH
        DS  W  +S G ++VAS +       ++ +P+ + +       F NLWK  IPKK  FFIWT++Y  +NT ++L +   +    PS C +C    +   H
Subjt:  DSLQWIPDSQGRFSVASARCLYWSLASVDRPQPHSI------IFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDH

Query:  IFVHCGVASDIWHSF--HLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFS
        +F+ C +A  IW S   HL+S ++   P +   +C+   + K   ++ +++ +  A+ LW IW ERN+ IF    +T   IWEDI +LA  W S +  FS
Subjt:  IFVHCGVASDIWHSF--HLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFS

Query:  NYQASTIALNWKVF
        NYQAS+IALN   F
Subjt:  NYQASTIALNWKVF

TrEMBL top hitse value%identityAlignment
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein1.7e-3035.51Show/hide
Query:  DSLQWIPDSQGRFSVASARCLYWSLASVDRPQPHSI------IFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDH
        DS  W  +S G ++VAS +       ++ +P+ + +       F NLWK  IPKK  FFIWT++Y  +NT ++L +   +    PS C +C    +   H
Subjt:  DSLQWIPDSQGRFSVASARCLYWSLASVDRPQPHSI------IFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDH

Query:  IFVHCGVASDIWHSF--HLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFS
        +F+ C +A  IW S   HL+S ++   P +   +C+   + K   ++ +++ +  A+ LW IW ERN+ IF    +T   IWEDI +LA  W S +  FS
Subjt:  IFVHCGVASDIWHSF--HLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFS

Query:  NYQASTIALNWKVF
        NYQAS+IALN   F
Subjt:  NYQASTIALNWKVF

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein8.5e-3034Show/hide
Query:  WIPDSQGRFSVASARCLYWSLASVDRPQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDHIFVHCGVASD
        W P    +++VASA+ + +  +S+ +         +LW++ IP+K KFFIWT+++++LNT D +Q+   S +LNPS C  C + ++ ++H+F+ C  A  
Subjt:  WIPDSQGRFSVASARCLYWSLASVDRPQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDHIFVHCGVASD

Query:  IWHSFHLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFSNYQASTIALNWK
        +W+ +   +G  M + + V  +CL+         + I+  +   A LW IW  RN+LIF +   + +N WEDI +L   W S +K   NY  +TIALN K
Subjt:  IWHSFHLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFSNYQASTIALNWK

A0A5D3CJ08 LINE-1 retrotransposable element ORF2 protein2.2e-3037.02Show/hide
Query:  DSLQWIPDSQGRFSVASARCLYWSLASVDRPQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDHIFVHCG
        DS  WI +S G +SVAS +              +   F NLWK+ IPKK KFFIWT++Y  +NT D+L +   +    PS C +C    +   H+F+ C 
Subjt:  DSLQWIPDSQGRFSVASARCLYWSLASVDRPQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDHIFVHCG

Query:  VASDIWH--SFHLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFSNYQAST
        +A  IW+  S HL S ++   P     +C+   + K   ++ I++ +  A+ LW IW ERN+ IF    +T  ++WEDI +LA  W S    F+NYQA++
Subjt:  VASDIWH--SFHLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFSNYQAST

Query:  IALNWKVF
        IALN   F
Subjt:  IALNWKVF

A0A5D3CPL6 LINE-1 retrotransposable element ORF2 protein1.7e-3035.51Show/hide
Query:  DSLQWIPDSQGRFSVASARCLYWSLASVDRPQPHSI------IFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDH
        DS  W  +S G ++VAS +       ++ +P+ + +       F NLWK  IPKK  FFIWT++Y  +NT ++L +   +    PS C +C    +   H
Subjt:  DSLQWIPDSQGRFSVASARCLYWSLASVDRPQPHSI------IFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDH

Query:  IFVHCGVASDIWHSF--HLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFS
        +F+ C +A  IW S   HL+S ++   P +   +C+   + K   ++ +++ +  A+ LW IW ERN+ IF    +T   IWEDI +LA  W S +  FS
Subjt:  IFVHCGVASDIWHSF--HLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFS

Query:  NYQASTIALNWKVF
        NYQAS+IALN   F
Subjt:  NYQASTIALNWKVF

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein1.7e-3035.51Show/hide
Query:  DSLQWIPDSQGRFSVASARCLYWSLASVDRPQPHSI------IFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDH
        DS  W  +S G ++VAS +       ++ +P+ + +       F NLWK  IPKK  FFIWT++Y  +NT ++L +   +    PS C +C    +   H
Subjt:  DSLQWIPDSQGRFSVASARCLYWSLASVDRPQPHSI------IFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDH

Query:  IFVHCGVASDIWHSF--HLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFS
        +F+ C +A  IW S   HL+S ++   P +   +C+   + K   ++ +++ +  A+ LW IW ERN+ IF    +T   IWEDI +LA  W S +  FS
Subjt:  IFVHCGVASDIWHSF--HLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFS

Query:  NYQASTIALNWKVF
        NYQAS+IALN   F
Subjt:  NYQASTIALNWKVF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G33710.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.0e-0623.91Show/hide
Query:  SFPTPVLSRRVDSLQWIPDSQGRFSVASARCLYWSLASVDRPQPHSIIFS-NLWKALIPKKIKFFIWTVIYRRLNTTDRL----QQIFTSTALNPSCCPL
        S   P LS  +D   W+ + +     +SA+   W+     RP+   + ++  +W      K  F +W     RL T  RL     Q+ T+       C L
Subjt:  SFPTPVLSRRVDSLQWIPDSQGRFSVASARCLYWSLASVDRPQPHSIIFS-NLWKALIPKKIKFFIWTVIYRRLNTTDRL----QQIFTSTALNPSCCPL

Query:  CHAGSDALDHIFVHCGVASDIWHSFHLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIA-AILWVIWGERNSLIFQNNS
        C    +  DH+F+ C  A  +WH+  ++  + +P  S V W  L  + ++ N +    ++ +I  ++L+ IW +RN+ +  + +
Subjt:  CHAGSDALDHIFVHCGVASDIWHSFHLASGIHMPIPSQVNWICLEAFAVKANLQREILIQSMIA-AILWVIWGERNSLIFQNNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCGAGGTCGAGCATTCTCCTCGACCGAGGCTGAGGAGGAAAAAATGGGGTCGGGGAGGGCCCCGACCCTTCTACCGTTTGCCTTGGGTCGCTCTTCACCAATCTG
GGTCCGACAATTCCAGACCATCCGAACCGAAAGCTTGATAGTTATTTCTATTCACAACCGGTGTTCCACACCTTATGTTTATTTTTTGGGATTCCTAAACTCTGAAGGTT
CTCTGATCGGAGCTGGTGGGAAGAAAACATCAAAGAAAATCGAGATCAATTTGTTGGAATCTCTCGCTATTGTCGAAGGGCTCAATCAAATTTCAACAAAATTTCAAGCG
TACCCAGAGATTCGAGATCACGAGGTGGTAGTCGAGTCGGGCACGACTGAGATAGTGAAGTTGCTTAATCGGGAAATGATCGACCTCTCAGAGGTCTTTGTCGTCATCGA
TGAGATCCGTGGGCTAGCGTTGCAAGCAAATGTAGCGGCGTTCAGCTTCAGTCCAAGATCATCAAATTTTTTGGCGCACTCCCTTTGCGCGCGCAGTGAAGAACGTGTTT
CCTCCTTTCCAACTCCTGTTTTATCTAGACGGGTGGATTCCTTGCAATGGATCCCAGATTCTCAGGGTCGTTTCTCTGTTGCTTCTGCGAGGTGTTTATATTGGAGTTTG
GCATCGGTAGATCGACCTCAACCTCATTCGATTATTTTCTCAAATCTATGGAAGGCTCTGATTCCAAAAAAGATCAAGTTTTTCATTTGGACTGTTATCTATAGAAGATT
AAATACAACAGACAGGCTTCAACAAATTTTCACATCCACTGCTTTGAATCCGAGTTGTTGTCCCCTTTGCCATGCTGGCTCGGATGCGTTGGATCATATTTTTGTCCATT
GTGGAGTTGCTTCTGACATTTGGCACTCTTTTCATCTAGCTTCTGGTATTCATATGCCAATTCCTAGCCAAGTTAATTGGATATGTTTAGAGGCTTTTGCAGTCAAGGCT
AATTTGCAGAGGGAGATTCTTATTCAGTCCATGATTGCAGCAATACTATGGGTTATTTGGGGCGAGCGCAATAGTCTGATTTTTCAGAACAATTCTCGGACCAACATCAA
TATTTGGGAAGATATTATTTCGCTCGCGTCCTTTTGGGTATCATCCACAAAAGCTTTCTCCAATTACCAGGCTTCAACTATCGCCTTGAATTGGAAAGTTTTTCTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTCGAGGTCGAGCATTCTCCTCGACCGAGGCTGAGGAGGAAAAAATGGGGTCGGGGAGGGCCCCGACCCTTCTACCGTTTGCCTTGGGTCGCTCTTCACCAATCTG
GGTCCGACAATTCCAGACCATCCGAACCGAAAGCTTGATAGTTATTTCTATTCACAACCGGTGTTCCACACCTTATGTTTATTTTTTGGGATTCCTAAACTCTGAAGGTT
CTCTGATCGGAGCTGGTGGGAAGAAAACATCAAAGAAAATCGAGATCAATTTGTTGGAATCTCTCGCTATTGTCGAAGGGCTCAATCAAATTTCAACAAAATTTCAAGCG
TACCCAGAGATTCGAGATCACGAGGTGGTAGTCGAGTCGGGCACGACTGAGATAGTGAAGTTGCTTAATCGGGAAATGATCGACCTCTCAGAGGTCTTTGTCGTCATCGA
TGAGATCCGTGGGCTAGCGTTGCAAGCAAATGTAGCGGCGTTCAGCTTCAGTCCAAGATCATCAAATTTTTTGGCGCACTCCCTTTGCGCGCGCAGTGAAGAACGTGTTT
CCTCCTTTCCAACTCCTGTTTTATCTAGACGGGTGGATTCCTTGCAATGGATCCCAGATTCTCAGGGTCGTTTCTCTGTTGCTTCTGCGAGGTGTTTATATTGGAGTTTG
GCATCGGTAGATCGACCTCAACCTCATTCGATTATTTTCTCAAATCTATGGAAGGCTCTGATTCCAAAAAAGATCAAGTTTTTCATTTGGACTGTTATCTATAGAAGATT
AAATACAACAGACAGGCTTCAACAAATTTTCACATCCACTGCTTTGAATCCGAGTTGTTGTCCCCTTTGCCATGCTGGCTCGGATGCGTTGGATCATATTTTTGTCCATT
GTGGAGTTGCTTCTGACATTTGGCACTCTTTTCATCTAGCTTCTGGTATTCATATGCCAATTCCTAGCCAAGTTAATTGGATATGTTTAGAGGCTTTTGCAGTCAAGGCT
AATTTGCAGAGGGAGATTCTTATTCAGTCCATGATTGCAGCAATACTATGGGTTATTTGGGGCGAGCGCAATAGTCTGATTTTTCAGAACAATTCTCGGACCAACATCAA
TATTTGGGAAGATATTATTTCGCTCGCGTCCTTTTGGGTATCATCCACAAAAGCTTTCTCCAATTACCAGGCTTCAACTATCGCCTTGAATTGGAAAGTTTTTCTGTAA
Protein sequenceShow/hide protein sequence
MGRGRAFSSTEAEEEKMGSGRAPTLLPFALGRSSPIWVRQFQTIRTESLIVISIHNRCSTPYVYFLGFLNSEGSLIGAGGKKTSKKIEINLLESLAIVEGLNQISTKFQA
YPEIRDHEVVVESGTTEIVKLLNREMIDLSEVFVVIDEIRGLALQANVAAFSFSPRSSNFLAHSLCARSEERVSSFPTPVLSRRVDSLQWIPDSQGRFSVASARCLYWSL
ASVDRPQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQQIFTSTALNPSCCPLCHAGSDALDHIFVHCGVASDIWHSFHLASGIHMPIPSQVNWICLEAFAVKA
NLQREILIQSMIAAILWVIWGERNSLIFQNNSRTNINIWEDIISLASFWVSSTKAFSNYQASTIALNWKVFL