CuGenDBv2

Lag0028357 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028357
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr8:19387931..19390761
RNA-Seq Expression: Lag0028357
Synteny: Lag0028357
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits | e value | % identity | Alignment
XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus] | 1.4e-37 | 64.46 | below
Query:  LKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIV
        LK VMF MLQT+GQF G+ +EDPHLHL+ F+ +SDSF  QGVP DALRL LF YS+RD A+ WLNS   GS+ TWN+L EKFLSKYFP N N KL +EI 
Subjt:  LKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIV

Query:  GFRQLEDETFSEAWERFKEFL
         F+Q +DE+  +AWERFKE L
Subjt:  GFRQLEDETFSEAWERFKEFL

XP_022926214.1 uncharacterized protein LOC111433394 [Cucurbita moschata] | 7.8e-41 | 70.25 | below
Query:  LKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIV
        LK VMF MLQT+GQFHGL SEDPHLHLKSFLGVSDSF  Q V  D +RL+LF YS+RDGAK+WLN+ ALG+I +WN L EKFL KYFP  RN +  +EIV
Subjt:  LKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIV

Query:  GFRQLEDETFSEAWERFKEFL
         F+Q ED+T SEAWERFKE L
Subjt:  GFRQLEDETFSEAWERFKEFL

XP_022947838.1 uncharacterized protein LOC111451598 [Cucurbita moschata] | 1.3e-40 | 70.25 | below
Query:  LKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIV
        LK VMF MLQT+GQFHGLSS+DPHLHLKSFLGVSDSF  QGV  D +RL+ FSYS+RDGAK+WLN  ALG I +WN LAEKFL KYFP  R+ +  +EIV
Subjt:  LKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIV

Query:  GFRQLEDETFSEAWERFKEFL
         F++ E+ET SEAWERFKE L
Subjt:  GFRQLEDETFSEAWERFKEFL

XP_022960432.1 uncharacterized protein LOC111461168 [Cucurbita moschata] | 3.5e-41 | 71.07 | below
Query:  LKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIV
        LK VMF MLQT+GQFHGL SEDPHLHLKSFLGVSDSF  QGV  D +RL+LF YS+RDGAK+WLN+ A  +I +WN LAEKFL KYFP  RN +  +EIV
Subjt:  LKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIV

Query:  GFRQLEDETFSEAWERFKEFL
         F+Q EDET SEAWERFKE L
Subjt:  GFRQLEDETFSEAWERFKEFL

XP_023511572.1 uncharacterized protein LOC111776371 [Cucurbita pepo subsp. pepo] | 1.2e-38 | 49.49 | below
Query:  VRMTSALSLMFQGAATERNEKVTERNQTKRNVAERTLMSRLLKRRDCVGKARAQRFVRNLYAVVTSIFSVVLSVLLVLKLVMFHMLQTVGQFHGLSSEDP
        ++  + L+  F+ +A   N+K    N              L   R+   +A A   V  LY  +  I          LK VMF MLQT+G+FHGLSSEDP
Subjt:  VRMTSALSLMFQGAATERNEKVTERNQTKRNVAERTLMSRLLKRRDCVGKARAQRFVRNLYAVVTSIFSVVLSVLLVLKLVMFHMLQTVGQFHGLSSEDP

Query:  HLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIVGFRQLEDETFSEAWERFKEFL
        HLHLKSFLGVSDSF  QGV  D +RL+LF YS+RDGAK+WLN+ A G+I +WN LA+KF  KYF   RN +  +EIV F+Q EDET SEAWERFKE L
Subjt:  HLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIVGFRQLEDETFSEAWERFKEFL

TrEMBL top hits | e value | % identity | Alignment
A0A6J1EEI2 uncharacterized protein LOC111433394 | 3.8e-41 | 70.25 | below
Query:  LKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIV
        LK VMF MLQT+GQFHGL SEDPHLHLKSFLGVSDSF  Q V  D +RL+LF YS+RDGAK+WLN+ ALG+I +WN L EKFL KYFP  RN +  +EIV
Subjt:  LKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIV

Query:  GFRQLEDETFSEAWERFKEFL
         F+Q ED+T SEAWERFKE L
Subjt:  GFRQLEDETFSEAWERFKEFL

A0A6J1EQ90 uncharacterized protein LOC111436411 | 1.6e-36 | 64.84 | below
Query:  LKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGV-------SDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNV
        LK VMF MLQT+GQFHGL  EDPHLHLKSFLGV       SDSF  QGV  D +RL+LF Y +RDGAK+WLN+ A G+I +WN LAE FL KYFP  RN 
Subjt:  LKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGV-------SDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNV

Query:  KLMSEIVGFRQLEDETFSEAWERFKEFL
        +  +EIV F+Q EDET SEA ERFKE L
Subjt:  KLMSEIVGFRQLEDETFSEAWERFKEFL

A0A6J1G7Q6 uncharacterized protein LOC111451598 | 6.4e-41 | 70.25 | below
Query:  LKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIV
        LK VMF MLQT+GQFHGLSS+DPHLHLKSFLGVSDSF  QGV  D +RL+ FSYS+RDGAK+WLN  ALG I +WN LAEKFL KYFP  R+ +  +EIV
Subjt:  LKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIV

Query:  GFRQLEDETFSEAWERFKEFL
         F++ E+ET SEAWERFKE L
Subjt:  GFRQLEDETFSEAWERFKEFL

A0A6J1H7E4 uncharacterized protein LOC111461168 | 1.7e-41 | 71.07 | below
Query:  LKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIV
        LK VMF MLQT+GQFHGL SEDPHLHLKSFLGVSDSF  QGV  D +RL+LF YS+RDGAK+WLN+ A  +I +WN LAEKFL KYFP  RN +  +EIV
Subjt:  LKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIV

Query:  GFRQLEDETFSEAWERFKEFL
         F+Q EDET SEAWERFKE L
Subjt:  GFRQLEDETFSEAWERFKEFL

U5CUI2 Retrotrans_gag domain-containing protein | 5.6e-37 | 65.29 | below
Query:  LKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIV
        LK VMF MLQTVGQF G+ +EDPHLHL+SFL VSDSF IQGV  + LRL LF +S+RD A++WLN+    S+  WN+LAEKFL KYFP  RN K  SEI+
Subjt:  LKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIV

Query:  GFRQLEDETFSEAWERFKEFL
         F+QLEDE+ S+AWERFKE L
Subjt:  GFRQLEDETFSEAWERFKEFL

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGGAAGGGCCCGGTGTTTTGGGCGATTTGTTGCAAATCGAGTCTGTTTCTGGGAAACCCGAGAGCCAGCGTCAGTTCGCAAGAAACACTCGTTGCTGCTTCGATAGTAG
GCGGGAGATGCCTTTTTGGGTGCGGATGACGTCAGCCCTAAGTCTAATGTTTCAAGGAGCGGCAACCGAACGAAACGAGAAAGTAACCGAACGGAACCAGACGAAACGAA
ACGTCGCTGAACGGACGTTGATGTCGCGCCTCTTGAAGAGGCGGGACTGCGTGGGGAAGGCACGAGCTCAAAGATTCGTCAGGAATCTCTACGCCGTCGTCACATCGATT
TTCAGCGTTGTCTTGTCCGTGTTACTAGTTTTGAAACTGGTAATGTTTCATATGTTGCAAACCGTGGGTCAATTCCATGGTTTGTCATCTGAAGACCCTCATTTACACCT
TAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGCGATGCCCTTAGATTAACTTTGTTCTCGTATTCTATTAGAGATGGAGCAAAGGCGTGGT
TAAATTCTTTTGCTCTAGGATCAATTAGGACATGGAATGAGTTAGCAGAAAAATTTCTTAGTAAATATTTCCCACGAAATAGGAATGTTAAATTGATGAGTGAAATAGTA
GGGTTTAGGCAACTTGAAGACGAAACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGTTTTTGTGA
mRNA sequence
ATGGAAGGGCCCGGTGTTTTGGGCGATTTGTTGCAAATCGAGTCTGTTTCTGGGAAACCCGAGAGCCAGCGTCAGTTCGCAAGAAACACTCGTTGCTGCTTCGATAGTAG
GCGGGAGATGCCTTTTTGGGTGCGGATGACGTCAGCCCTAAGTCTAATGTTTCAAGGAGCGGCAACCGAACGAAACGAGAAAGTAACCGAACGGAACCAGACGAAACGAA
ACGTCGCTGAACGGACGTTGATGTCGCGCCTCTTGAAGAGGCGGGACTGCGTGGGGAAGGCACGAGCTCAAAGATTCGTCAGGAATCTCTACGCCGTCGTCACATCGATT
TTCAGCGTTGTCTTGTCCGTGTTACTAGTTTTGAAACTGGTAATGTTTCATATGTTGCAAACCGTGGGTCAATTCCATGGTTTGTCATCTGAAGACCCTCATTTACACCT
TAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGCGATGCCCTTAGATTAACTTTGTTCTCGTATTCTATTAGAGATGGAGCAAAGGCGTGGT
TAAATTCTTTTGCTCTAGGATCAATTAGGACATGGAATGAGTTAGCAGAAAAATTTCTTAGTAAATATTTCCCACGAAATAGGAATGTTAAATTGATGAGTGAAATAGTA
GGGTTTAGGCAACTTGAAGACGAAACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGTTTTTGTGA
Protein sequence
MEGPGVLGDLLQIESVSGKPESQRQFARNTRCCFDSRREMPFWVRMTSALSLMFQGAATERNEKVTERNQTKRNVAERTLMSRLLKRRDCVGKARAQRFVRNLYAVVTSI
FSVVLSVLLVLKLVMFHMLQTVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPSDALRLTLFSYSIRDGAKAWLNSFALGSIRTWNELAEKFLSKYFPRNRNVKLMSEIV
GFRQLEDETFSEAWERFKEFL