CuGenDBv2

Lag0029299 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0029299
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Transposon Ty3-I Gag-Pol polyprotein
Genome location: chr8:37375707..37381983
RNA-Seq Expression: Lag0029299
Synteny: Lag0029299
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits | e value | % identity | Alignment
KAA0051909.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa] | 3.4e-27 | 81.71
Query:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKKSL
        +DS+NN LHVPEGPIT+ KAKKIQEAFTLHVQKLANAQRE  NFE KFLYNVSSASQEE+GVKMAREKLC+ EDGT ++KS+
Subjt:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKKSL

KAA0067206.1 hypothetical protein E6C27_scaffold418G00080 [Cucumis melo var. makuwa] | 2.6e-27 | 82.93
Query:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKKSL
        +DSINN LHVPEGPIT+ KAKKIQEAFTLHVQKL NAQRE  NFEPKFLYNVSSASQEE+ +KMA EKLCSLEDGT D+KS+
Subjt:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKKSL

TYK02449.1 F15O4.13 [Cucumis melo var. makuwa] | 6.8e-28 | 82.93
Query:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKKSL
        +DS+NN LHVPEGPIT+ KAKKIQEAFTLHVQKLANAQRE  NFE KFLYNVSSASQEE+GVKMAREKLC+LEDGT ++KS+
Subjt:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKKSL

TYK08102.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa] | 4.4e-27 | 83.75
Query:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKK
        +DS+NN LHVPEGPIT+ KAKKIQEAFTLHVQKLANAQRE  NFE KFLYNVSSASQEE+GVKMAREKLC+LEDGT ++K
Subjt:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKK

TYK11649.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa] | 2.0e-27 | 78.82
Query:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKKSLMFN
        +DS+NN LHVPEGPIT+ KAKKIQEAFTLHVQKLANAQRE  NFE KFLYNVSSASQEE+GVKMAREKLC+LEDGT ++K  +++
Subjt:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKKSLMFN

TrEMBL top hits | e value | % identity | Alignment
A0A5A7UEI2 Transposon Ty3-I Gag-Pol polyprotein | 1.6e-27 | 81.71
Query:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKKSL
        +DS+NN LHVPEGPIT+ KAKKIQEAFTLHVQKLANAQRE  NFE KFLYNVSSASQEE+GVKMAREKLC+ EDGT ++KS+
Subjt:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKKSL

A0A5A7VL78 RT_RNaseH_2 domain-containing protein | 1.3e-27 | 82.93
Query:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKKSL
        +DSINN LHVPEGPIT+ KAKKIQEAFTLHVQKL NAQRE  NFEPKFLYNVSSASQEE+ +KMA EKLCSLEDGT D+KS+
Subjt:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKKSL

A0A5D3BWE8 F15O4.13 | 3.3e-28 | 82.93
Query:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKKSL
        +DS+NN LHVPEGPIT+ KAKKIQEAFTLHVQKLANAQRE  NFE KFLYNVSSASQEE+GVKMAREKLC+LEDGT ++KS+
Subjt:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKKSL

A0A5D3C9X5 Transposon Ty3-I Gag-Pol polyprotein | 2.1e-27 | 83.75
Query:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKK
        +DS+NN LHVPEGPIT+ KAKKIQEAFTLHVQKLANAQRE  NFE KFLYNVSSASQEE+GVKMAREKLC+LEDGT ++K
Subjt:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKK

A0A5D3CN32 Transposon Ty3-I Gag-Pol polyprotein | 9.6e-28 | 78.82
Query:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKKSLMFN
        +DS+NN LHVPEGPIT+ KAKKIQEAFTLHVQKLANAQRE  NFE KFLYNVSSASQEE+GVKMAREKLC+LEDGT ++K  +++
Subjt:  MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKKSLMFN

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGGATTCAATCAATAATCCATTACATGTACCTGAAGGACCAATCACAAGGAGCAAGGCAAAGAAGATACAAGAGGCTTTCACACTGCATGTTCAAAAGCTAGCAAATGC
ACAACGAGAGGCCGGGAATTTTGAACCCAAATTTTTGTATAATGTTAGTTCAGCAAGTCAAGAAGAGAATGGAGTCAAGATGGCACGGGAAAAGTTGTGTAGTTTGGAAG
ATGGCACGGAGGACAAAAAAAGTTTGATGTTCAATCGATTCATTGGATTAGTTTTTGATCAATCTAATTCTGGAGTGATTTTGGACCACACAGATGGACAAGGAGCTGAC
GAGGACAATCGGGCAGAGGTAGGATCAAGAGACCGACCCAGAGGAAGACCGGACCAAAGGGTCGGGCCAAAATGGCCCGACCATTCGGCCCGTTTGCACGGGCCGAGCCC
GGTGACCTCTTTTCGGTCCCTAATGTCCCGAATCGCCCCGGTTTCGCCTGGTTCGCCCCGAAACGCCACCGAATTCCTAAAAAACCCTAGGAGGACAAACAGGCATCGGA
GGCGGTGTGGCCTACACCACGCCGGTGTGCAGCGGTTTTTGCTGGTCTTGCAGTTGAGGGGGGAAGACGTGGCCTGCAAGACAGTAAACCTGCACACCGGTGTGGTGCTC
GCCACACCGGCTCCGATGCTTAAGTCAGAAAGCAGAACGGAGGGGCGTGGAAAAGTAAGAGCAAGAGCAGAAAAGAGAAAATCGATAGTAGAGTTTAGGGATGGAATTCG
GAATCCCCTCTTCAGTGGCGAAGAGGGATATAAATACCTGTTCATCCTCCTAGCCTTTTTAGGGTTTCGGAGGCGTTTAAGGTCAAACCAGGCGAAACCGGGGCATCCAG
AGGCGGTGGGGACCAGACGGGACCAAACGAGCTCGGCCCGCACGAGCGGGCCGAGGGTCGGCCTCGGCCATGGCCGAGGCCGACCATGTGCTCGGCTCGGCTATGGGCCG
AGGCCGAGCATGGGGTCGGGTCAAAAACCCGACCCCTTCGGCCTTGGCCCGTCCCGCTTACCGGCTCGCCTCCTTGGCCCGATTTCCAGCCCGATTTCTCCCCGATTGTC
CTCGTCAGCTCCTCGTACATCGGGGTGGTCCAAAATTGCCTATAACATTAAGCCCCCACTCATGAATTGGGATTCGGGGGGGCGTTGCCCCGAACCCCACCGGGCGCTTC
GCCCTGGACCCCGACTCGCTGACCGGCGAGACCCCCGTACTGGGTCATTCGGAGTGCTTCATTTCAGGAGAATACCTTTAAAGTTTGTGCTCGGCCTCGGCAAGAGGTGC
TCGGCCTTGGCAAGAGGTGCTCGGTCCTGGCTTCTGCAGGGTGAGCTTGACTTAAATGAAGTGTATGGACTCATGTCAGGTGAGTCCAGGCTCGCCCAGTGTAAGGATGC
TCAAGCCAGAGGAGGTGAGGTAGACTCGTTGGTCCAGGCTCACATGGTCTCACTCTGGATGAGGCGAGGTAGACTTCGACAGTCCAGGCTCGCCCGGTGTAAAGATGCTC
AGTCCAGAGGAGGTGAGGTAGACTCGTTGGTCCCGACTCACATGGTCTCACTCTGGATGAGGCGAGGTAGACTTCGACAGTCCAGGCTCGCCCGGTGTAAAGATGCTCAA
GCCAGAGGATGTGAGGTAGACTCGTCGGTCCAGGCTCACATGGTCTCACTCTGGATGAGGCGAGGTAGACTTCGGCAGTCCAGGCTCGCCCGGTGTAAAGATGCTCAAGC
CAGAGGATGTGAGGTAGACTCGTCGGTCCAGGCTCACATGGTCTCACTCTGGATGAGGCGAGGTAGACTCGGTAGTCCAGGCTCGCATGTCTTGATGCTTGAGTCGGTTG
CCTTGTGCACACTCCTGAATACTAAACTTTAA
mRNA sequence
ATGGATTCAATCAATAATCCATTACATGTACCTGAAGGACCAATCACAAGGAGCAAGGCAAAGAAGATACAAGAGGCTTTCACACTGCATGTTCAAAAGCTAGCAAATGC
ACAACGAGAGGCCGGGAATTTTGAACCCAAATTTTTGTATAATGTTAGTTCAGCAAGTCAAGAAGAGAATGGAGTCAAGATGGCACGGGAAAAGTTGTGTAGTTTGGAAG
ATGGCACGGAGGACAAAAAAAGTTTGATGTTCAATCGATTCATTGGATTAGTTTTTGATCAATCTAATTCTGGAGTGATTTTGGACCACACAGATGGACAAGGAGCTGAC
GAGGACAATCGGGCAGAGGTAGGATCAAGAGACCGACCCAGAGGAAGACCGGACCAAAGGGTCGGGCCAAAATGGCCCGACCATTCGGCCCGTTTGCACGGGCCGAGCCC
GGTGACCTCTTTTCGGTCCCTAATGTCCCGAATCGCCCCGGTTTCGCCTGGTTCGCCCCGAAACGCCACCGAATTCCTAAAAAACCCTAGGAGGACAAACAGGCATCGGA
GGCGGTGTGGCCTACACCACGCCGGTGTGCAGCGGTTTTTGCTGGTCTTGCAGTTGAGGGGGGAAGACGTGGCCTGCAAGACAGTAAACCTGCACACCGGTGTGGTGCTC
GCCACACCGGCTCCGATGCTTAAGTCAGAAAGCAGAACGGAGGGGCGTGGAAAAGTAAGAGCAAGAGCAGAAAAGAGAAAATCGATAGTAGAGTTTAGGGATGGAATTCG
GAATCCCCTCTTCAGTGGCGAAGAGGGATATAAATACCTGTTCATCCTCCTAGCCTTTTTAGGGTTTCGGAGGCGTTTAAGGTCAAACCAGGCGAAACCGGGGCATCCAG
AGGCGGTGGGGACCAGACGGGACCAAACGAGCTCGGCCCGCACGAGCGGGCCGAGGGTCGGCCTCGGCCATGGCCGAGGCCGACCATGTGCTCGGCTCGGCTATGGGCCG
AGGCCGAGCATGGGGTCGGGTCAAAAACCCGACCCCTTCGGCCTTGGCCCGTCCCGCTTACCGGCTCGCCTCCTTGGCCCGATTTCCAGCCCGATTTCTCCCCGATTGTC
CTCGTCAGCTCCTCGTACATCGGGGTGGTCCAAAATTGCCTATAACATTAAGCCCCCACTCATGAATTGGGATTCGGGGGGGCGTTGCCCCGAACCCCACCGGGCGCTTC
GCCCTGGACCCCGACTCGCTGACCGGCGAGACCCCCGTACTGGGTCATTCGGAGTGCTTCATTTCAGGAGAATACCTTTAAAGTTTGTGCTCGGCCTCGGCAAGAGGTGC
TCGGCCTTGGCAAGAGGTGCTCGGTCCTGGCTTCTGCAGGGTGAGCTTGACTTAAATGAAGTGTATGGACTCATGTCAGGTGAGTCCAGGCTCGCCCAGTGTAAGGATGC
TCAAGCCAGAGGAGGTGAGGTAGACTCGTTGGTCCAGGCTCACATGGTCTCACTCTGGATGAGGCGAGGTAGACTTCGACAGTCCAGGCTCGCCCGGTGTAAAGATGCTC
AGTCCAGAGGAGGTGAGGTAGACTCGTTGGTCCCGACTCACATGGTCTCACTCTGGATGAGGCGAGGTAGACTTCGACAGTCCAGGCTCGCCCGGTGTAAAGATGCTCAA
GCCAGAGGATGTGAGGTAGACTCGTCGGTCCAGGCTCACATGGTCTCACTCTGGATGAGGCGAGGTAGACTTCGGCAGTCCAGGCTCGCCCGGTGTAAAGATGCTCAAGC
CAGAGGATGTGAGGTAGACTCGTCGGTCCAGGCTCACATGGTCTCACTCTGGATGAGGCGAGGTAGACTCGGTAGTCCAGGCTCGCATGTCTTGATGCTTGAGTCGGTTG
CCTTGTGCACACTCCTGAATACTAAACTTTAA
Protein sequence
MDSINNPLHVPEGPITRSKAKKIQEAFTLHVQKLANAQREAGNFEPKFLYNVSSASQEENGVKMAREKLCSLEDGTEDKKSLMFNRFIGLVFDQSNSGVILDHTDGQGAD
EDNRAEVGSRDRPRGRPDQRVGPKWPDHSARLHGPSPVTSFRSLMSRIAPVSPGSPRNATEFLKNPRRTNRHRRRCGLHHAGVQRFLLVLQLRGEDVACKTVNLHTGVVL
ATPAPMLKSESRTEGRGKVRARAEKRKSIVEFRDGIRNPLFSGEEGYKYLFILLAFLGFRRRLRSNQAKPGHPEAVGTRRDQTSSARTSGPRVGLGHGRGRPCARLGYGP
RPSMGSGQKPDPFGLGPSRLPARLLGPISSPISPRLSSSAPRTSGWSKIAYNIKPPLMNWDSGGRCPEPHRALRPGPRLADRRDPRTGSFGVLHFRRIPLKFVLGLGKRC
SALARGARSWLLQGELDLNEVYGLMSGESRLAQCKDAQARGGEVDSLVQAHMVSLWMRRGRLRQSRLARCKDAQSRGGEVDSLVPTHMVSLWMRRGRLRQSRLARCKDAQ
ARGCEVDSSVQAHMVSLWMRRGRLRQSRLARCKDAQARGCEVDSSVQAHMVSLWMRRGRLGSPGSHVLMLESVALCTLLNTKL
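
Consistency check (illustrative sketch, not part of the CuGenDB record): the protein sequence above should be the translation of the CDS sequence under the standard genetic code. The Python snippet below is a minimal way to check that, assuming the standard code (NCBI translation table 1) applies to this gene. The function name `translate` and the variable `cds_start` are hypothetical names chosen for this example, and only the first 42 nucleotides of the CDS are used for brevity.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines from the record can be pasted in directly.
    Codons containing non-ACGT characters are translated as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# First 42 nucleotides of the CDS sequence listed in this record.
cds_start = "ATGGATTCAATCAATAATCCATTACATGTACCTGAAGGACCA"
print(translate(cds_start))  # MDSINNPLHVPEGP
```

Passing the complete CDS (all wrapped lines joined) to `translate` and comparing the result with the full protein sequence extends the check to the whole gene.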