CuGenDBv2

Lag0007727 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0007727
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr9:3799511..3800833
RNA-Seq Expression: Lag0007727
Synteny: Lag0007727
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
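
The genome location field packs the chromosome and coordinates into one string. A minimal sketch of splitting it apart is given here; the function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(location):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


if __name__ == "__main__":
    print(parse_location("chr9:3799511..3800833"))
    print(span_length("chr9:3799511..3800833"))
```

Under that assumption the span is the full gene region, which can be longer than the coding sequence listed further down.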


Homology
GenBank top hits (e value, % identity, alignment)
WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002] (e value: 2.7e-32; % identity: 37.05)
Query:  EEPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM------------------------------------------------DKARDWLQSIPP
        E PK IR YFQP     Q GI+  PIN NNFEL  GLIQM                                                D+A+DWL++IPP
Subjt:  EEPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM------------------------------------------------DKARDWLQSIPP

Query:  GSITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWD------------------------------TKTIVDAAAGGTLLSKTIENARTL
         SITTW  L QAFL K+FPP+K+ +L+TEIGTF+Q  +EQL+E W+                              TK+I+DA AGG++ SK  + A T+
Subjt:  GSITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWD------------------------------TKTIVDAAAGGTLLSKTIENARTL

Query:  LEEMATNSYQWPSERSGPK-KIAARVFEIDNFSGTGSAQSIESATALASQT
        LE++AT SY WP ER+ P    AA ++E+D  +    AQ      AL+  T
Subjt:  LEEMATNSYQWPSERSGPK-KIAARVFEIDNFSGTGSAQSIESATALASQT

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber] (e value: 4.8e-29; % identity: 34.36)
Query:  EPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM------------------------------------------------DKARDWLQSIPPG
        +P+ ++ Y +P+  +  SGI    IN NNFEL   LI M                                                DKAR WLQS+ PG
Subjt:  EPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM------------------------------------------------DKARDWLQSIPPG

Query:  SITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWD------------------------------TKTIVDAAAGGTLLSKTIENARTLL
        SIT+W  + + FL KFFPP+KT +L++EIG F+Q   E L+E W+                              T+TIVDAA+GGTL+SKT E A +LL
Subjt:  SITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWD------------------------------TKTIVDAAAGGTLLSKTIENARTLL

Query:  EEMATNSYQWPSERSGPKKIAARVFEIDNFSGTGS--AQSIESATALASQTQEENLEQV
        EEMA+N+YQWP+ER+  KK+A  + E++ F+   +  A      +AL +Q   +  E V
Subjt:  EEMATNSYQWPSERSGPKKIAARVFEIDNFSGTGS--AQSIESATALASQTQEENLEQV

XP_023881727.1 uncharacterized protein LOC111994101 [Quercus suber] (e value: 9.6e-30; % identity: 37.12)
Query:  EPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM------------------------------------------------DKARDWLQSIPPG
        +P+ ++ Y +P+  +  SGI    IN NNFEL   LI M                                                DKAR WLQS+ PG
Subjt:  EPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM------------------------------------------------DKARDWLQSIPPG

Query:  SITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWDTKTIVDAAAGGTLLSKTIENARTLLEEMATNSYQWPSERSGPKKIAARVFEIDNF
        SIT+W  + +  L KFFP +KT +L++EIG F+Q   E L++ W+ +TIVDAA+GGTL+SKT E A +LLEEMA+N+YQWP+ER+  KK+A  + E++ F
Subjt:  SITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWDTKTIVDAAAGGTLLSKTIENARTLLEEMATNSYQWPSERSGPKKIAARVFEIDNF

Query:  SGTGS--AQSIESATALASQTQEENLEQV
        +   +  A      +AL +Q   ++ E V
Subjt:  SGTGS--AQSIESATALASQTQEENLEQV

XP_023903214.1 uncharacterized protein LOC112015077 [Quercus suber] (e value: 2.8e-29; % identity: 34.36)
Query:  EPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM------------------------------------------------DKARDWLQSIPPG
        +P+ ++ Y +P+  +  SGI    IN NNFEL   LI M                                                DKAR WLQS+ PG
Subjt:  EPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM------------------------------------------------DKARDWLQSIPPG

Query:  SITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWD------------------------------TKTIVDAAAGGTLLSKTIENARTLL
        SIT+W  + + FL KFFPP+KT +L++EIG F+Q   E L+E W+                              T+TIVDAA+GGTL+SKT E A +LL
Subjt:  SITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWD------------------------------TKTIVDAAAGGTLLSKTIENARTLL

Query:  EEMATNSYQWPSERSGPKKIAARVFEIDNFSGTGS--AQSIESATALASQTQEENLEQV
        EEMA+N+YQWP+ER+  KK+A  + E++ F+   +  A      +AL++Q   ++ E V
Subjt:  EEMATNSYQWPSERSGPKKIAARVFEIDNFSGTGS--AQSIESATALASQTQEENLEQV

XP_023929660.1 uncharacterized protein LOC112040975 [Quercus suber] (e value: 2.5e-30; % identity: 34.36)
Query:  EPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM------------------------------------------------DKARDWLQSIPPG
        +P+ ++ Y +P+  +  SGI +  IN NNFEL   LI M                                                DKAR WLQS+ PG
Subjt:  EPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM------------------------------------------------DKARDWLQSIPPG

Query:  SITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWD------------------------------TKTIVDAAAGGTLLSKTIENARTLL
        SIT+W  + + FL KFFPP+KT +L++EIG F+Q   E L+E W+                              T+TIVDAA+GGTL+SKT E A +LL
Subjt:  SITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWD------------------------------TKTIVDAAAGGTLLSKTIENARTLL

Query:  EEMATNSYQWPSERSGPKKIAARVFEIDNFSGTGS--AQSIESATALASQTQEENLEQV
        EEMA+N YQWP+ER+  KK+A  + E+++F+   +  A      +AL +Q   +++E V
Subjt:  EEMATNSYQWPSERSGPKKIAARVFEIDNFSGTGS--AQSIESATALASQTQEENLEQV

TrEMBL top hits (e value, % identity, alignment)
A0A2I4F4C8 uncharacterized protein LOC108995373 (e value: 3.4e-25; % identity: 33.71)
Query:  DNPNPEEPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM------------------------------------------------DKARDWL
        D  +   P+ ++ Y +PV  +  SGI    IN NNFEL   LI M                                                DKAR WL
Subjt:  DNPNPEEPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM------------------------------------------------DKARDWL

Query:  QSIPPGSITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWD------------------------------TKTIVDAAAGGTLLSKTIE
        QS+  GSIT+W  + + FL KFFPP+KT +L++EI  F+Q   E L+E W+                              T+TIVDAAAGGTL+SKTIE
Subjt:  QSIPPGSITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWD------------------------------TKTIVDAAAGGTLLSKTIE

Query:  NART-LLEEMATNSYQWPSERSGPKKIAARVFEIDNFSGTGSAQSIESATALASQTQEENLEQV
         A T LLEEM +N+YQWP+E++  KK+      I   + T       +AT++   + E + EQV
Subjt:  NART-LLEEMATNSYQWPSERSGPKKIAARVFEIDNFSGTGSAQSIESATALASQTQEENLEQV

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 (e value: 4.5e-25; % identity: 35.5)
Query:  PEEPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM-------------------------------------------------DKARDWLQSI
        PE  + +R Y  P+ Q     I    IN NNFE+    IQM                                                 DKA+ WL S+
Subjt:  PEEPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM-------------------------------------------------DKARDWLQSI

Query:  PPGSITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWD------------------------------TKTIVDAAAGGTLLSKTIENAR
        P GSITTW  L Q FL KFFPP+KT K++ +I +F Q   E L+E W+                               KTI+DAAAGG L+SK   +A 
Subjt:  PPGSITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWD------------------------------TKTIVDAAAGGTLLSKTIENAR

Query:  TLLEEMATNSYQWPSERSGPKKIAARVFEID
         LLEEMA+N+YQWPSERSG +K A   +EID
Subjt:  TLLEEMATNSYQWPSERSGPKKIAARVFEID

A0A6J0ZYV0 uncharacterized protein LOC110413413 (e value: 5.9e-25; % identity: 35.5)
Query:  PEEPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM-------------------------------------------------DKARDWLQSI
        PE  + +R Y  P+ Q     I    IN NNFE+    IQM                                                 DKA+ WL S+
Subjt:  PEEPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM-------------------------------------------------DKARDWLQSI

Query:  PPGSITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWD------------------------------TKTIVDAAAGGTLLSKTIENAR
        P GSITTW  L Q FL KFFPP+KT K++ +I +F Q   E L+E W+                               KTI+DAAAGG L+SK   +A 
Subjt:  PPGSITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWD------------------------------TKTIVDAAAGGTLLSKTIENAR

Query:  TLLEEMATNSYQWPSERSGPKKIAARVFEID
         LLEEMA+N+YQWPSERSG +K A   +EID
Subjt:  TLLEEMATNSYQWPSERSGPKKIAARVFEID

A0A6P6T1H4 uncharacterized protein LOC113696688 (e value: 8.5e-24; % identity: 35.12)
Query:  NPEEPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM-------------------DKARDWLQSIPPGSITTWVALVQAFLKKFFPPSKTVKLK
        N    + +R +  P  Q+ Q+ I    +N NNFE+   LIQM                   DKA+ WLQS P  + TTW  L + FL KFFPP KT KL+
Subjt:  NPEEPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM-------------------DKARDWLQSIPPGSITTWVALVQAFLKKFFPPSKTVKLK

Query:  TEIGTFQQPFNEQLFETWD------------------------------TKTIVDAAAGGTLLSKTIENARTLLEEMATNSYQWPSERSGPKKIAARVFE
         +I +F Q   E L++ W+                              TK  VDAAAGG L+ KT E A+ L+EEMA N+Y+W +ER   ++ A  + E
Subjt:  TEIGTFQQPFNEQLFETWD------------------------------TKTIVDAAAGGTLLSKTIENARTLLEEMATNSYQWPSERSGPKKIAARVFE

Query:  IDNFS
        ID  +
Subjt:  IDNFS

A0A6P6XAQ1 Reverse transcriptase (e value: 4.2e-23; % identity: 32.91)
Query:  NPEEPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM------------------------------------------------DKARDWLQSI
        N    + +R +  P  Q  Q+ IV   +N NNFE+   LIQM                                                DKA+ WLQS 
Subjt:  NPEEPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQM------------------------------------------------DKARDWLQSI

Query:  PPGSITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWD------------------------------TKTIVDAAAGGTLLSKTIENAR
        PP + TTW  L +AFL KFFPP KT KL+ +I +F Q   E L+E W+                              TKT VDAAAGG L+ KT E A+
Subjt:  PPGSITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWD------------------------------TKTIVDAAAGGTLLSKTIENAR

Query:  TLLEEMATNSYQWPSERSGPKKIAARVFEIDNFS
         L+EEMA N+YQW +ER   ++ A  + E+D  +
Subjt:  TLLEEMATNSYQWPSERSGPKKIAARVFEIDNFS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGACAATCCTAACCCAGAGGAGCCTAAGCCTATTAGGAGCTATTTTCAGCCGGTCTTTCAGGAGCAACAGTCGGGGATAGTCTATGCCCCGATCAATACTAATAATTT
TGAGCTGATAACGGGTCTCATTCAGATGGACAAAGCGAGGGACTGGTTACAATCTATTCCACCTGGGAGCATTACCACTTGGGTTGCTTTGGTCCAAGCGTTTTTAAAGA
AATTCTTCCCTCCTTCTAAGACAGTCAAGCTAAAGACCGAGATTGGGACATTCCAGCAGCCGTTTAATGAGCAGTTGTTCGAGACTTGGGATACAAAAACTATTGTTGAT
GCAGCTGCAGGTGGGACTCTGTTGTCCAAAACCATTGAGAATGCTAGGACTTTGCTGGAGGAAATGGCCACCAATAGCTATCAGTGGCCATCTGAGCGGTCGGGACCGAA
AAAGATTGCTGCTAGAGTGTTTGAGATTGACAACTTTTCAGGTACAGGGAGTGCTCAATCGATTGAGTCTGCAACTGCCCTTGCATCCCAAACTCAAGAGGAAAATCTAG
AACAGGTCTTGGTTCCAGTTTCTCCTTTGCTTGTAAGCTTCCATAAAGAGAACATCGTCATTCCAGGACAGTATTGTTCACTCCTCCTACTCTTTTATTGTTCTCATGCT
TGA
mRNA sequence
ATGGACAATCCTAACCCAGAGGAGCCTAAGCCTATTAGGAGCTATTTTCAGCCGGTCTTTCAGGAGCAACAGTCGGGGATAGTCTATGCCCCGATCAATACTAATAATTT
TGAGCTGATAACGGGTCTCATTCAGATGGACAAAGCGAGGGACTGGTTACAATCTATTCCACCTGGGAGCATTACCACTTGGGTTGCTTTGGTCCAAGCGTTTTTAAAGA
AATTCTTCCCTCCTTCTAAGACAGTCAAGCTAAAGACCGAGATTGGGACATTCCAGCAGCCGTTTAATGAGCAGTTGTTCGAGACTTGGGATACAAAAACTATTGTTGAT
GCAGCTGCAGGTGGGACTCTGTTGTCCAAAACCATTGAGAATGCTAGGACTTTGCTGGAGGAAATGGCCACCAATAGCTATCAGTGGCCATCTGAGCGGTCGGGACCGAA
AAAGATTGCTGCTAGAGTGTTTGAGATTGACAACTTTTCAGGTACAGGGAGTGCTCAATCGATTGAGTCTGCAACTGCCCTTGCATCCCAAACTCAAGAGGAAAATCTAG
AACAGGTCTTGGTTCCAGTTTCTCCTTTGCTTGTAAGCTTCCATAAAGAGAACATCGTCATTCCAGGACAGTATTGTTCACTCCTCCTACTCTTTTATTGTTCTCATGCT
TGA
Protein sequence
MDNPNPEEPKPIRSYFQPVFQEQQSGIVYAPINTNNFELITGLIQMDKARDWLQSIPPGSITTWVALVQAFLKKFFPPSKTVKLKTEIGTFQQPFNEQLFETWDTKTIVD
AAAGGTLLSKTIENARTLLEEMATNSYQWPSERSGPKKIAARVFEIDNFSGTGSAQSIESATALASQTQEENLEQVLVPVSPLLVSFHKENIVIPGQYCSLLLLFYCSHA
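
The protein sequence above is the translation of the listed CDS. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1); the record does not state which table was used, and the names `CODON_TABLE` and `translate` are hypothetical. Only the first eleven codons and the final stop codon of the CDS are used in the example call.

```python
from itertools import product

# Standard genetic code (ASSUMED: NCBI translation table 1), codons ordered
# with bases T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases (e.g. N) are reported as 'X'.
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First eleven codons of the CDS above, followed by its stop codon.
    print(translate("ATGGACAATCCTAACCCAGAGGAGCCTAAGCCTTGA"))
```

Passing the full wrapped CDS to `translate` and comparing the result with the protein sequence is a quick consistency check on a record like this one.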