CuGenDBv2

Lcy07g009680 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy07g009680
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Chr07:38537175..38541917
RNA-Seq Expression: Lcy07g009680
Synteny: Lcy07g009680
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
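The genome location string follows a `chromosome:start..end` pattern. As a minimal sketch of how it could be split into fields, the snippet below parses it and reports the span length. The names `Locus` and `parse_locus` are hypothetical, and the length assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a 'Chr07:38537175..38541917' style location string."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(match.group(1), int(match.group(2)), int(match.group(3)))


locus = parse_locus("Chr07:38537175..38541917")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption the locus spans 4743 bp.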


Homology
GenBank top hits (e-value, % identity, alignment)
ERN08425.1 hypothetical protein AMTR_s03239p00005680, partial [Amborella trichopoda] (e-value: 4.2e-49; identity: 76.98%)
Query:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY
        MDL LALRM++P SL+  S+ ++R+++EKWDRSNRM LMIIKRGIPEAFRGAVSE +T+A  FL EIEKRFAK+DKAETS LL++LISMK+KG ENIREY
Subjt:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY

Query:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK
        IMEMSHL SKLKALKLELSD LLVHLVLISL  QF+QFK
Subjt:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK

RYE19068.1 hypothetical protein EOP45_13205, partial [Sphingobacteriaceae bacterium] (e-value: 1.9e-49; identity: 76.26%)
Query:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY
        MDL L+LRM++P SL+  SS D+R+++EKWDRSNRM LMIIKRGIPEAFRGAVSE ITNAKDFL EIEKRF K+DKAETS LL++LISMK+KGNENIREY
Subjt:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY

Query:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK
        IMEMSH+ SKLK LKLELSD LLVHLVLIS+  + +QF+
Subjt:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK

XP_022874595.1 uncharacterized protein LOC111393333 [Olea europaea var. sylvestris] (e-value: 4.9e-50; identity: 79.14%)
Query:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY
        MDL LAL +E+P  L  ESSLDE++ FE+WDRSNRM LMIIK GI EAFRGAVSEGITNAK+FL+EIEKRF KNDKAETS LLQ LISMKYKG  N+REY
Subjt:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY

Query:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK
        IMEMSH+ SKLK LKLELSD LLVHLVLISL AQF+QFK
Subjt:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK

XP_022880462.1 uncharacterized protein LOC111397696 [Olea europaea var. sylvestris] (e-value: 3.2e-49; identity: 77.7%)
Query:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY
        MDL LALR+E+P  L+ ESS DE++ FE+W RSNRM LMIIKRGIPEAFRGAVSEGITNAK+FL+EIEKRF KNDKAETS LLQ LISMKYKG  N+REY
Subjt:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY

Query:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK
        IMEMSH+ SKLK LKL+ SD LLVHLVLISL AQ +QFK
Subjt:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK

XP_022889207.1 uncharacterized protein LOC111404665 [Olea europaea var. sylvestris] (e-value: 8.4e-50; identity: 78.42%)
Query:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY
        MDL LAL +E+P  L  ESSLDE++ FE+W RSNRM LMIIKRGIPEAFRGAV EGITNAK+FL+EIEKRF KNDK+ETS LLQ LISMKYKG  N+REY
Subjt:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY

Query:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK
        IMEMSH+ SKLK LKLELSD LLVHLVLISL AQF+QFK
Subjt:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK

TrEMBL top hits (e-value, % identity, alignment)
A0A438CR00 Uncharacterized protein (e-value: 4.2e-47; identity: 73.38%)
Query:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY
        MD+ LALRM +P  L+ ES+ ++   + KW+RSNR+ LMI+KRGIPEAFRGAV++ +TNA DFL EI+KRFAKNDKAETSMLL  LISMKYKG  N+REY
Subjt:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY

Query:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK
        IMEMSHL SKLKALKLELSD LL+HLVLISL AQFNQFK
Subjt:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK

A0A443PLH5 Uncharacterized protein (e-value: 6.5e-48; identity: 77.7%)
Query:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY
        MDL LALR+EQP SL+  SS D++K FEKWDRSNRM LMIIKRGIPEAFRGAVSE +T AK+FL EIEKRF KNDKAETS LLQ LISMKY G  NIREY
Subjt:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY

Query:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK
        IM MSH+ SKLKAL LELSD LLVHLVLISL A ++QFK
Subjt:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK

A0A4Q3EHJ0 gag_pre-integrs domain-containing protein (Fragment) (e-value: 9.1e-50; identity: 76.26%)
Query:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY
        MDL L+LRM++P SL+  SS D+R+++EKWDRSNRM LMIIKRGIPEAFRGAVSE ITNAKDFL EIEKRF K+DKAETS LL++LISMK+KGNENIREY
Subjt:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY

Query:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK
        IMEMSH+ SKLK LKLELSD LLVHLVLIS+  + +QF+
Subjt:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK

A0A5P1E579 CCHC-type domain-containing protein (e-value: 3.8e-48; identity: 76.26%)
Query:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY
        MDL  ALR EQP  L+ +S+ D +K  EKW+RSNRM LMIIKRG+PEAFRG  SEGIT AKDFL EIEKRFAKNDK ETS LL RLISMKYKG  NIREY
Subjt:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY

Query:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK
        IMEMSH+ SKLKALKL+L D LLVHLVL+SL AQFNQFK
Subjt:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK

U5CY72 CCHC-type domain-containing protein (Fragment) (e-value: 2.0e-49; identity: 76.98%)
Query:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY
        MDL LALRM++P SL+  S+ ++R+++EKWDRSNRM LMIIKRGIPEAFRGAVSE +T+A  FL EIEKRFAK+DKAETS LL++LISMK+KG ENIREY
Subjt:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY

Query:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK
        IMEMSHL SKLKALKLELSD LLVHLVLISL  QF+QFK
Subjt:  IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT5G53670.1 unknown protein (e-value: 1.3e-19; identity: 40.97%)
Query:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY
        MDL L+L  E+P S          K  + WDRSNR+ +MI+K  IP+ FRG V + +T AKDFL  +E  FAKN++AE S +     SM Y  NEN+RE 
Subjt:  MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREY

Query:  IMEMSHLTSKLKALKLE---LSDGLLVHLVLISLSAQFNQFKQL
        IM M  L +K K L +     +D +L H  +  L  Q+   K +
Subjt:  IMEMSHLTSKLKALKLE---LSDGLLVHLVLISLSAQFNQFKQL
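In BLAST-style pairwise output, the unlabeled middle line of each block marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. As a rough sketch of how that line can be read, the snippet below counts identities and positives for the second block of the first GenBank hit (ERN08425.1). The helper name `midline_stats` is hypothetical. The percent identity listed for each hit presumably covers the whole alignment, so a single block will not reproduce it.

```python
from typing import Tuple


def midline_stats(midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment midline."""
    identities = sum(ch.isalpha() for ch in midline)  # letters mark exact matches
    positives = identities + midline.count("+")       # '+' marks a conservative substitution
    return identities, positives, len(midline)


# Second alignment block of the ERN08425.1 hit, copied from the listing above.
query = "IMEMSHLTSKLKALKLELSDGLLVHLVLISLSAQFNQFK"
midline = "IMEMSHL SKLKALKLELSD LLVHLVLISL  QF+QFK"
assert len(query) == len(midline)

identities, positives, columns = midline_stats(midline)
print(f"identities {identities}/{columns}, positives {positives}/{columns}")
```

For this block the count is 34 identities and 35 positives over 39 columns.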


Sequences
CDS sequence
ATGGATCTAGGCCTTGCACTAAGAATGGAGCAACCCCCTTCTCTTTCAGTTGAAAGTTCTCTTGATGAAAGGAAACTTTTTGAGAAGTGGGACCGCTCAAACCGCATGTG
TCTAATGATCATAAAGCGTGGCATTCCCGAGGCATTTAGAGGTGCGGTATCTGAAGGGATAACCAATGCCAAAGATTTCCTTATTGAAATAGAAAAACGTTTTGCTAAAA
ACGATAAGGCTGAAACAAGCATGCTATTACAACGCTTGATTTCAATGAAATATAAGGGAAATGAAAATATTAGGGAGTACATTATGGAAATGTCCCATCTGACATCAAAA
CTAAAGGCACTTAAGCTTGAGCTATCTGATGGCTTGCTTGTGCACTTGGTATTGATCTCTCTTTCTGCACAGTTTAACCAGTTTAAGCAGCTCACCGTCGCCGACGCTTC
TCTGCCGTCCACTGCTGCCACTGTCACCGTCGTCTTTCTTCCGTTCGCGTCGCAGCAGCCTTGCTCCTACCGATTTTGCACTGATTTTCTACACCAGCTCACCGTTGTCG
ACGCTTCTCTGCCGTCCACTGCTGCCGCTGTCACCGTTGTCTTTCTTCCGTTCGCGTCGCGACAGCCTTCTCCTATCGATTTTGCACTGATTTTCTACACGTTTGTTAAG
CTCGATTGA
mRNA sequence
ATGGATCTAGGCCTTGCACTAAGAATGGAGCAACCCCCTTCTCTTTCAGTTGAAAGTTCTCTTGATGAAAGGAAACTTTTTGAGAAGTGGGACCGCTCAAACCGCATGTG
TCTAATGATCATAAAGCGTGGCATTCCCGAGGCATTTAGAGGTGCGGTATCTGAAGGGATAACCAATGCCAAAGATTTCCTTATTGAAATAGAAAAACGTTTTGCTAAAA
ACGATAAGGCTGAAACAAGCATGCTATTACAACGCTTGATTTCAATGAAATATAAGGGAAATGAAAATATTAGGGAGTACATTATGGAAATGTCCCATCTGACATCAAAA
CTAAAGGCACTTAAGCTTGAGCTATCTGATGGCTTGCTTGTGCACTTGGTATTGATCTCTCTTTCTGCACAGTTTAACCAGTTTAAGCAGCTCACCGTCGCCGACGCTTC
TCTGCCGTCCACTGCTGCCACTGTCACCGTCGTCTTTCTTCCGTTCGCGTCGCAGCAGCCTTGCTCCTACCGATTTTGCACTGATTTTCTACACCAGCTCACCGTTGTCG
ACGCTTCTCTGCCGTCCACTGCTGCCGCTGTCACCGTTGTCTTTCTTCCGTTCGCGTCGCGACAGCCTTCTCCTATCGATTTTGCACTGATTTTCTACACGTTTGTTAAG
CTCGATTGA
Protein sequence
MDLGLALRMEQPPSLSVESSLDERKLFEKWDRSNRMCLMIIKRGIPEAFRGAVSEGITNAKDFLIEIEKRFAKNDKAETSMLLQRLISMKYKGNENIREYIMEMSHLTSK
LKALKLELSDGLLVHLVLISLSAQFNQFKQLTVADASLPSTAATVTVVFLPFASQQPCSYRFCTDFLHQLTVVDASLPSTAAAVTVVFLPFASRQPSPIDFALIFYTFVK
LD
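
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly. The function name `translate` is hypothetical. It is applied only to the first ten codons and the last five codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1) in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons of the CDS, then the last five codons (ending in the TGA stop).
print(translate("ATGGATCTAGGCCTTGCACTAAGAATGGAG"))
print(translate("GTTAAGCTCGATTGA"))
```

The two fragments translate to MDLGLALRME and VKLD, matching the start and end of the protein sequence listed above.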