; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041122 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041122
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr13:12430142..12431587
RNA-Seq ExpressionLag0041122
SyntenyLag0041122
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7013723.1 hypothetical protein SDJN02_23890, partial [Cucurbita argyrosperma subsp. argyrosperma]9.6e-1531.89Show/hide
Query:  MRLFELDHTIHLMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDH
        M L +L     L++A  LLTQ+SK+AD+  +  M+S +ASH S RF A+L+++ + F  +S   YY  ++ +++ +D M +    +F SM +   +    
Subjt:  MRLFELDHTIHLMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDH

Query:  DRSLLMFESPRFSYPAVS-GLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIYH-DTITVIVTDSQVKFSVGVIEIVLTKE
         + +L +E+P  + P +   L L P +   +G+++Y  FF+++    + I+ EL ++H D + V  T ++VKFS+   EI +TKE
Subjt:  DRSLLMFESPRFSYPAVS-GLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIYH-DTITVIVTDSQVKFSVGVIEIVLTKE

XP_008458682.1 PREDICTED: uncharacterized protein LOC103498010 [Cucumis melo]1.3e-1433.51Show/hide
Query:  MRLFELDHTIHLMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDH
        M L  L+    L++A  LL Q++K+AD+  +P M+  + S+ S +F A+L+L+ + F  FS       ++ +   +D M +    +F SM +   D    
Subjt:  MRLFELDHTIHLMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDH

Query:  DRSLLMFESPRFSYPAV-SGLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIYH-DTITVIVTDSQVKFSVGVIEIVLTKE
        ++ +L FE+P    P +   L L P +   +G+++Y  FF++   E + I+ EL ++H DT++V VT SQVKFS+   EI+LTKE
Subjt:  DRSLLMFESPRFSYPAV-SGLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIYH-DTITVIVTDSQVKFSVGVIEIVLTKE

XP_016903187.1 PREDICTED: uncharacterized protein LOC103502263 [Cucumis melo]1.5e-1233.53Show/hide
Query:  LMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDHDRSLLMFESPR
        L +A   L Q+++EAD+  +P   S +AS+ S RF A L++    F  +     +  RI +++ +D +    +D   S  +      +  + +L FES  
Subjt:  LMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDHDRSLLMFESPR

Query:  FSYPAVSGLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIYH-DTITVIVTDSQVKFSVGVIEIVLTKE
         +      L L PS+  ++GE+DY  FFS+D  + + ++  L I+H D+I V  T SQVKFS+   EIVLTKE
Subjt:  FSYPAVSGLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIYH-DTITVIVTDSQVKFSVGVIEIVLTKE

XP_022145570.1 uncharacterized protein LOC111014988 [Momordica charantia]1.6e-1438.04Show/hide
Query:  MRLFELDHTIHLMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDH
        M L  L     L +AI  LT+++  AD+  SP     + S IS  F A+L+++P+FF  F+    +  RI +D+L+ I+    +D      + F    + 
Subjt:  MRLFELDHTIHLMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDH

Query:  DRSLLMFESPRFSYPAVSGLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIY-HDTITVIVTDSQVKFSVGVIEIVLTKE
        +R LL FE+ R        L L PS+  +VGEIDY    S+  DEF+ IVT+L  Y +  I   +TDSQVKFSV   EI+LTKE
Subjt:  DRSLLMFESPRFSYPAVSGLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIY-HDTITVIVTDSQVKFSVGVIEIVLTKE

XP_022156121.1 uncharacterized protein LOC111023086 [Momordica charantia]3.1e-3750Show/hide
Query:  HTIHLMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDHDRSLLMF
        +TI L+NA+ +L Q + +AD+N +P MVS +++  S RF A+LRLAP FF  +   +Y+ C I+ DA Y I+FNM    +  M+L + + ++ DR LL+F
Subjt:  HTIHLMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDHDRSLLMF

Query:  ES--PRFSYPAVSGLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIYHDTITVIVTDSQVKFSVGVIEIVLTKE
         S       P  SGLV+LP +   V EIDYRYFFS+D ++F HIVTELH ++DTI V +TDSQVKF++G IEIVLTKE
Subjt:  ES--PRFSYPAVSGLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIYHDTITVIVTDSQVKFSVGVIEIVLTKE

TrEMBL top hitse value%identityAlignment
A0A1S3C8J1 uncharacterized protein LOC1034980106.1e-1533.51Show/hide
Query:  MRLFELDHTIHLMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDH
        M L  L+    L++A  LL Q++K+AD+  +P M+  + S+ S +F A+L+L+ + F  FS       ++ +   +D M +    +F SM +   D    
Subjt:  MRLFELDHTIHLMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDH

Query:  DRSLLMFESPRFSYPAV-SGLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIYH-DTITVIVTDSQVKFSVGVIEIVLTKE
        ++ +L FE+P    P +   L L P +   +G+++Y  FF++   E + I+ EL ++H DT++V VT SQVKFS+   EI+LTKE
Subjt:  DRSLLMFESPRFSYPAV-SGLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIYH-DTITVIVTDSQVKFSVGVIEIVLTKE

A0A1S4E4N8 uncharacterized protein LOC1035022637.4e-1333.53Show/hide
Query:  LMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDHDRSLLMFESPR
        L +A   L Q+++EAD+  +P   S +AS+ S RF A L++    F  +     +  RI +++ +D +    +D   S  +      +  + +L FES  
Subjt:  LMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDHDRSLLMFESPR

Query:  FSYPAVSGLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIYH-DTITVIVTDSQVKFSVGVIEIVLTKE
         +      L L PS+  ++GE+DY  FFS+D  + + ++  L I+H D+I V  T SQVKFS+   EIVLTKE
Subjt:  FSYPAVSGLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIYH-DTITVIVTDSQVKFSVGVIEIVLTKE

A0A6J1CUU8 uncharacterized protein LOC1110149888.0e-1538.04Show/hide
Query:  MRLFELDHTIHLMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDH
        M L  L     L +AI  LT+++  AD+  SP     + S IS  F A+L+++P+FF  F+    +  RI +D+L+ I+    +D      + F    + 
Subjt:  MRLFELDHTIHLMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDH

Query:  DRSLLMFESPRFSYPAVSGLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIY-HDTITVIVTDSQVKFSVGVIEIVLTKE
        +R LL FE+ R        L L PS+  +VGEIDY    S+  DEF+ IVT+L  Y +  I   +TDSQVKFSV   EI+LTKE
Subjt:  DRSLLMFESPRFSYPAVSGLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIY-HDTITVIVTDSQVKFSVGVIEIVLTKE

A0A6J1DTW1 uncharacterized protein LOC1110230861.5e-3750Show/hide
Query:  HTIHLMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDHDRSLLMF
        +TI L+NA+ +L Q + +AD+N +P MVS +++  S RF A+LRLAP FF  +   +Y+ C I+ DA Y I+FNM    +  M+L + + ++ DR LL+F
Subjt:  HTIHLMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDHDRSLLMF

Query:  ES--PRFSYPAVSGLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIYHDTITVIVTDSQVKFSVGVIEIVLTKE
         S       P  SGLV+LP +   V EIDYRYFFS+D ++F HIVTELH ++DTI V +TDSQVKF++G IEIVLTKE
Subjt:  ES--PRFSYPAVSGLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIYHDTITVIVTDSQVKFSVGVIEIVLTKE

A0A6J1KZ05 uncharacterized protein LOC1114988874.8e-1234.05Show/hide
Query:  MRLFELDHTIHLMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDH
        M L  L H   L  A  LL Q+S EAD+  S +  S + S+ S RF A+ +++ +FFA +S  + +  R+ + + YD M++     F SM + F +    
Subjt:  MRLFELDHTIHLMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDH

Query:  DRSLLMFESPRFSYPAVSG-LVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIY-HDTITVIVTDSQVKFSVGVIEIVLTKE
         R +L FES   +   +   L L PS+  E+G+I +  FFS+   +F+ I+T L  + +++I V +T S+VKF     E +LTKE
Subjt:  DRSLLMFESPRFSYPAVSG-LVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIY-HDTITVIVTDSQVKFSVGVIEIVLTKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGACTGTTCGAGCTGGACCACACCATTCATCTCATGAACGCAATACAATTACTGACTCAACTTTCAAAGGAAGCCGACATGAACTCCTCGCCGGCGATGGTTTCGTT
CGTAGCCTCCCACATCTCGTCTCGCTTCGCCGCTTCCCTCCGACTTGCCCCGCAGTTCTTCGCCGAGTTTTCCTTCACCCAATATTACGATTGCCGGATCTTCATTGATG
CCTTGTACGATATCATGTTCAATATGTATATCGACACTTTTGAATCCATGATACTCGTTTTCAGCGACGACGTCGATCACGATCGCAGCCTCCTTATGTTCGAAAGTCCG
AGGTTTTCATATCCAGCGGTTTCTGGATTGGTATTGTTACCATCGAAAGGAACAGAGGTTGGCGAAATTGACTACAGATACTTTTTCTCAATGGATATGGATGAGTTCAA
ACACATTGTAACTGAGTTACATATCTACCACGATACAATTACTGTTATTGTGACGGATTCACAAGTCAAATTCTCTGTTGGAGTTATTGAAATTGTTCTCACCAAAGAGG
TATGGTTATAA
mRNA sequenceShow/hide mRNA sequence
ATGAGACTGTTCGAGCTGGACCACACCATTCATCTCATGAACGCAATACAATTACTGACTCAACTTTCAAAGGAAGCCGACATGAACTCCTCGCCGGCGATGGTTTCGTT
CGTAGCCTCCCACATCTCGTCTCGCTTCGCCGCTTCCCTCCGACTTGCCCCGCAGTTCTTCGCCGAGTTTTCCTTCACCCAATATTACGATTGCCGGATCTTCATTGATG
CCTTGTACGATATCATGTTCAATATGTATATCGACACTTTTGAATCCATGATACTCGTTTTCAGCGACGACGTCGATCACGATCGCAGCCTCCTTATGTTCGAAAGTCCG
AGGTTTTCATATCCAGCGGTTTCTGGATTGGTATTGTTACCATCGAAAGGAACAGAGGTTGGCGAAATTGACTACAGATACTTTTTCTCAATGGATATGGATGAGTTCAA
ACACATTGTAACTGAGTTACATATCTACCACGATACAATTACTGTTATTGTGACGGATTCACAAGTCAAATTCTCTGTTGGAGTTATTGAAATTGTTCTCACCAAAGAGG
TATGGTTATAA
Protein sequenceShow/hide protein sequence
MRLFELDHTIHLMNAIQLLTQLSKEADMNSSPAMVSFVASHISSRFAASLRLAPQFFAEFSFTQYYDCRIFIDALYDIMFNMYIDTFESMILVFSDDVDHDRSLLMFESP
RFSYPAVSGLVLLPSKGTEVGEIDYRYFFSMDMDEFKHIVTELHIYHDTITVIVTDSQVKFSVGVIEIVLTKEVWL