; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005457 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005457
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr6:18171601..18175732
RNA-Seq ExpressionLag0005457
SyntenyLag0005457
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBN69746.1 VIRB2-interacting protein 2 [Prunus dulcis]3.1e-3348.37Show/hide
Query:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD
        INA  NET+I LIPKK E+  V+DFR ISL T LYK++++VL+  L++              +GRQILD + +ANE+++E +R N  G   K+++EK +D
Subjt:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD

Query:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ
         V++ F D +L  KG G +WRSWI+GC+  ANFS++INGRPRG+  A+RGL Q
Subjt:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ

VVA13439.1 Hypothetical predicted protein, partial [Prunus dulcis]3.1e-3348.37Show/hide
Query:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD
        INA  NET+I LIPKK E+  V+DFR ISL T LYK++++VL+  L++              +GRQILD + +ANE+++E +R N  G   K+++EK +D
Subjt:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD

Query:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ
         V++ F D +L  KG G +WRSWI+GC+  ANFS++INGRPRG+  A+RGL Q
Subjt:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ

VVA21938.1 Hypothetical predicted protein, partial [Prunus dulcis]3.1e-3348.37Show/hide
Query:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD
        INA  NET+I LIPKK E+  V+DFR ISL T LYK++++VL+  L++              +GRQILD + +ANE+++E +R N  G   K+++EK +D
Subjt:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD

Query:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ
         V++ F D +L  KG G +WRSWI+GC+  ANFS++INGRPRG+  A+RGL Q
Subjt:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ

VVA25489.1 Hypothetical predicted protein, partial [Prunus dulcis]3.1e-3348.37Show/hide
Query:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD
        INA  NET+I LIPKK E+  V+DFR ISL T LYK++++VL+  L++              +GRQILD + +ANE+++E +R N  G   K+++EK +D
Subjt:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD

Query:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ
         V++ F D +L  KG G +WRSWI+GC+  ANFS++INGRPRG+  A+RGL Q
Subjt:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ

VVA39726.1 Hypothetical predicted protein, partial [Prunus dulcis]3.1e-3348.37Show/hide
Query:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD
        INA  NET+I LIPKK E+  V+DFR ISL T LYK++++VL+  L++              +GRQILD + +ANE+++E +R N  G   K+++EK +D
Subjt:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD

Query:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ
         V++ F D +L  KG G +WRSWI+GC+  ANFS++INGRPRG+  A+RGL Q
Subjt:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ

TrEMBL top hitse value%identityAlignment
A0A4Y1R3V4 VIRB2-interacting protein 21.5e-3348.37Show/hide
Query:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD
        INA  NET+I LIPKK E+  V+DFR ISL T LYK++++VL+  L++              +GRQILD + +ANE+++E +R N  G   K+++EK +D
Subjt:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD

Query:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ
         V++ F D +L  KG G +WRSWI+GC+  ANFS++INGRPRG+  A+RGL Q
Subjt:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ

A0A5E4FED8 Reverse transcriptase domain-containing protein (Fragment)1.5e-3348.37Show/hide
Query:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD
        INA  NET+I LIPKK E+  V+DFR ISL T LYK++++VL+  L++              +GRQILD + +ANE+++E +R N  G   K+++EK +D
Subjt:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD

Query:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ
         V++ F D +L  KG G +WRSWI+GC+  ANFS++INGRPRG+  A+RGL Q
Subjt:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ

A0A5E4GJ11 Reverse transcriptase domain-containing protein (Fragment)1.5e-3348.37Show/hide
Query:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD
        INA  NET+I LIPKK E+  V+DFR ISL T LYK++++VL+  L++              +GRQILD + +ANE+++E +R N  G   K+++EK +D
Subjt:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD

Query:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ
         V++ F D +L  KG G +WRSWI+GC+  ANFS++INGRPRG+  A+RGL Q
Subjt:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ

A0A5H2XKI7 VIRB2-interacting protein 21.5e-3348.37Show/hide
Query:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD
        INA  NET+I LIPKK E+  V+DFR ISL T LYK++++VL+  L++              +GRQILD + +ANE+++E +R N  G   K+++EK +D
Subjt:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD

Query:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ
         V++ F D +L  KG G +WRSWI+GC+  ANFS++INGRPRG+  A+RGL Q
Subjt:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ

M5XUF8 Reverse transcriptase domain-containing protein (Fragment)5.2e-3449.02Show/hide
Query:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD
        INA  NET+I LIPKK E+  V+DFR ISL T LYK++++VL+  L++              +GRQILD + +ANE+++E +R N  G   K+++EK +D
Subjt:  INANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFD

Query:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ
         V++ F D +L  KG G +WRSWI+GC+  ANFS++INGRPRG+I A+RGL Q
Subjt:  KVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ

SwissProt top hitse value%identityAlignment
P14381 Transposon TX1 uncharacterized 149 kDa protein3.1e-0725.41Show/hide
Query:  YWLLVGQDKGREMIEEFLLHLPFRDKGRELWNINANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQIL
        +W  +G D  R + E F        KG     +  +     + L+PKK + R + ++R +SL +  YKI+A+ +S  LK                GR I 
Subjt:  YWLLVGQDKGREMIEEFLLHLPFRDKGRELWNINANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIARVLSEHLKK--------------EGRQIL

Query:  DVSPMANELIDEWKRKNDKGAALKLNIEKTFDKVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ
        D   +  +L+   +R     A L L+ EK FD+VD+ +    L     G ++  ++K   + A   + IN      +   RG+ Q
Subjt:  DVSPMANELIDEWKRKNDKGAALKLNIEKTFDKVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQ

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCTAGCATCAAGTTCAGCGGAAATGACACTGGTTTCATCCCAGGTGAAATAGCATTCCTTTCATCATCGAGATCACAGTATGGTTGTGGGATTTCTCCTCCTTG
GGACAACACAAAAGAAGATGACATGGAAGAAAACAAGGCAACAGACCCTCCTTCACCTTCAGCGACCATTTGTATTCTTGCTCCATGGCTTCAAAAATACCATATGTGTA
TTATGCCTATCCCATCTACTAGCAAAAAGAGCAAAATTGGGCAAAGACCGAACAAATTGGCTAGAGAGCTTGCAGAAGTTACCAAAGGTCTGTACTCTCTTTCCACTCTT
CTCACTCTTGCTGACGACTTCAAATTTTGGATTTCAAGGATCTATGGCCCTTCTTCTTATAAAAACAGGTCTCAGTTTTGGCAAGAATTATACGATCTCTCCTACCTGCG
CTCCGATTACTGGTTACTTGTTGGGCAGGATAAGGGACGGGAGATGATCGAGGAGTTCCTCCTCCATCTGCCATTCAGAGATAAGGGACGGGAGTTATGGAATATCAATG
CTAATGTCAACGAAACTTATATTTTTTTGATACCTAAGAAGCTTGAAGCTCGTTCGGTGGCTGATTTTCGTCTCATAAGTCTCACTACATGCCTCTACAAGATCATTGCT
AGGGTTCTATCAGAACATCTTAAAAAGGAAGGAAGACAAATTCTTGATGTCTCTCCTATGGCCAATGAGCTCATTGATGAATGGAAAAGGAAGAATGATAAAGGAGCGGC
CCTTAAACTGAACATCGAAAAAACTTTCGATAAGGTGGACTACAGTTTCCGAGACAATATCCTCTCTGTTAAAGGGTTGGGGCAAAAATGGAGATCATGGATTAAAGGTT
GCATCTCCCCTGCAAATTTCTCTATCATTATTAACGGTCGGCCTCGAGGTGAAATTATTGCTACTCGGGGTCTTTGTCAAGTGATGGGTGCTTTCAAAAATATGTCTGGA
CACAACAGCAATCTCCAAAAATCAGAAAACATCGGCATAAATGCAAAGTTCTACTGGGGAAGAGATGCACATCTAGATCCAAAGATGTTGGATAGGAAATTTGATGAGAA
CATTGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGCTAGCATCAAGTTCAGCGGAAATGACACTGGTTTCATCCCAGGTGAAATAGCATTCCTTTCATCATCGAGATCACAGTATGGTTGTGGGATTTCTCCTCCTTG
GGACAACACAAAAGAAGATGACATGGAAGAAAACAAGGCAACAGACCCTCCTTCACCTTCAGCGACCATTTGTATTCTTGCTCCATGGCTTCAAAAATACCATATGTGTA
TTATGCCTATCCCATCTACTAGCAAAAAGAGCAAAATTGGGCAAAGACCGAACAAATTGGCTAGAGAGCTTGCAGAAGTTACCAAAGGTCTGTACTCTCTTTCCACTCTT
CTCACTCTTGCTGACGACTTCAAATTTTGGATTTCAAGGATCTATGGCCCTTCTTCTTATAAAAACAGGTCTCAGTTTTGGCAAGAATTATACGATCTCTCCTACCTGCG
CTCCGATTACTGGTTACTTGTTGGGCAGGATAAGGGACGGGAGATGATCGAGGAGTTCCTCCTCCATCTGCCATTCAGAGATAAGGGACGGGAGTTATGGAATATCAATG
CTAATGTCAACGAAACTTATATTTTTTTGATACCTAAGAAGCTTGAAGCTCGTTCGGTGGCTGATTTTCGTCTCATAAGTCTCACTACATGCCTCTACAAGATCATTGCT
AGGGTTCTATCAGAACATCTTAAAAAGGAAGGAAGACAAATTCTTGATGTCTCTCCTATGGCCAATGAGCTCATTGATGAATGGAAAAGGAAGAATGATAAAGGAGCGGC
CCTTAAACTGAACATCGAAAAAACTTTCGATAAGGTGGACTACAGTTTCCGAGACAATATCCTCTCTGTTAAAGGGTTGGGGCAAAAATGGAGATCATGGATTAAAGGTT
GCATCTCCCCTGCAAATTTCTCTATCATTATTAACGGTCGGCCTCGAGGTGAAATTATTGCTACTCGGGGTCTTTGTCAAGTGATGGGTGCTTTCAAAAATATGTCTGGA
CACAACAGCAATCTCCAAAAATCAGAAAACATCGGCATAAATGCAAAGTTCTACTGGGGAAGAGATGCACATCTAGATCCAAAGATGTTGGATAGGAAATTTGATGAGAA
CATTGGATAG
Protein sequenceShow/hide protein sequence
MEASIKFSGNDTGFIPGEIAFLSSSRSQYGCGISPPWDNTKEDDMEENKATDPPSPSATICILAPWLQKYHMCIMPIPSTSKKSKIGQRPNKLARELAEVTKGLYSLSTL
LTLADDFKFWISRIYGPSSYKNRSQFWQELYDLSYLRSDYWLLVGQDKGREMIEEFLLHLPFRDKGRELWNINANVNETYIFLIPKKLEARSVADFRLISLTTCLYKIIA
RVLSEHLKKEGRQILDVSPMANELIDEWKRKNDKGAALKLNIEKTFDKVDYSFRDNILSVKGLGQKWRSWIKGCISPANFSIIINGRPRGEIIATRGLCQVMGAFKNMSG
HNSNLQKSENIGINAKFYWGRDAHLDPKMLDRKFDENIG