CuGenDBv2

Lag0036074 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036074
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr3:38516821..38517573
RNA-Seq Expression: Lag0036074
Synteny: Lag0036074
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits | e value | % identity
KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis] | 5.5e-74 | 55.29
Query:  MRKVRELALVPFDPEIKRTFNKLRRESKVNRIQMVNPNPEE--PKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY--KPTEDPNSHLK
        MR+ R   ++P DPEI+RT   LRR    N+I  +     E  P+ ++DY +PV     S I+   INANNFELK  LI M +   +   P +DPN HL 
Subjt:  MRKVRELALVPFDPEIKRTFNKLRRESKVNRIQMVNPNPEE--PKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY--KPTEDPNSHLK

Query:  SFLDICGMVKLNGVSEDTIRLRLFPFSLQNKVRDWLQSIPPGSITTWNALVQTFLRKFFPPAKTVNLRIEIGTFEQ-----LFEAWERFKELLRKCPQHG
         FL+IC  VK+NGV+EDTIRLRLFPFSL++K R WLQS+ PGSI +W  + + FL KFFPPAKT  LR EIG F+Q     L+EAWER+K+L+R+CPQHG
Subjt:  SFLDICGMVKLNGVSEDTIRLRLFPFSLQNKVRDWLQSIPPGSITTWNALVQTFLRKFFPPAKTVNLRIEIGTFEQ-----LFEAWERFKELLRKCPQHG

Query:  YPDWLQVQLFYNGLTPSTKTIVDAAAGETLLSKPVENARTLLEEMATDSYQWPSD
         PDWLQVQ+FYNGL   T+TIVDAA+G TL+SK  E A  LLEEMA+++YQWP++
Subjt:  YPDWLQVQLFYNGLTPSTKTIVDAAAGETLLSKPVENARTLLEEMATDSYQWPSD

WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002] | 7.4e-79 | 59.59
Query:  LVPFDPEIKRTFNKLRRESKVNRIQMVNPNPEEPKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY--KPTEDPNSHLKSFLDICGMVK
        L+P DPEI RT+   RR  +    Q      E PK IRDYFQP     Q GI+   IN NNFELK GLIQMAR+ A+  +  EDP+ HL+SFL+ICG VK
Subjt:  LVPFDPEIKRTFNKLRRESKVNRIQMVNPNPEEPKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY--KPTEDPNSHLKSFLDICGMVK

Query:  LNGVSEDTIRLRLFPFSLQNKVRDWLQSIPPGSITTWNALVQTFLRKFFPPAKTVNLRIEIGTF-----EQLFEAWERFKELLRKCPQHGYPDWLQVQLF
        +NGVS D I+LRLFPFSLQ++ +DWL++IPP SITTW  L Q FL K+FPPAK+  LR EIGTF     EQL+EAWER+K+LLR+CPQHGYPDWLQ+QLF
Subjt:  LNGVSEDTIRLRLFPFSLQNKVRDWLQSIPPGSITTWNALVQTFLRKFFPPAKTVNLRIEIGTF-----EQLFEAWERFKELLRKCPQHGYPDWLQVQLF

Query:  YNGLTPSTKTIVDAAAGETLLSKPVENARTLLEEMATDSYQWPSD
        YNGL  STK+I+DA AG ++ SK  + A T+LE++AT SY WP +
Subjt:  YNGLTPSTKTIVDAAAGETLLSKPVENARTLLEEMATDSYQWPSD

XP_021279280.1 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 [Herrania umbratica] | 4.8e-70 | 52.61
Query:  MRKVRELALVPFDPEIKRTFNKLRRE--------------SKVNRIQMVNPNPEEPKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY-
        M++   L LVPFDP+I+RTF + RRE              +  N    +N  PE  + +RDY  P+ Q     I   +INANNFE+K   IQM +     
Subjt:  MRKVRELALVPFDPEIKRTFNKLRRE--------------SKVNRIQMVNPNPEEPKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY-

Query:  --KPTEDPNSHLKSFLDICGMVKLNGVSEDTIRLRLFPFSLQNKVRDWLQSIPPGSITTWNALVQTFLRKFFPPAKTVNLRIEIGTF-----EQLFEAWE
           P++DPNSHL +FL+IC   K NGV++D IRLRLFPFSL++K + WL S+P GSITTW  L Q FL KFFPPAKT  +R +I +F     E L+EAWE
Subjt:  --KPTEDPNSHLKSFLDICGMVKLNGVSEDTIRLRLFPFSLQNKVRDWLQSIPPGSITTWNALVQTFLRKFFPPAKTVNLRIEIGTF-----EQLFEAWE

Query:  RFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGETLLSKPVENARTLLEEMATDSYQWPSD
        RFKELLR+CP HG PDWLQVQ FYNGL  S KTI+DAAAG  L+SK   +A  LLEEMA+++YQWPS+
Subjt:  RFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGETLLSKPVENARTLLEEMATDSYQWPSD

XP_021279860.1 uncharacterized protein LOC110413413 [Herrania umbratica] | 6.3e-70 | 52.61
Query:  MRKVRELALVPFDPEIKRTFNKLRRE--------------SKVNRIQMVNPNPEEPKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY-
        M++   L LVPFDP+I+RTF + RRE              +  N    +N  PE  + +RDY  P+ Q     I   +INANNFE+K   IQM +     
Subjt:  MRKVRELALVPFDPEIKRTFNKLRRE--------------SKVNRIQMVNPNPEEPKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY-

Query:  --KPTEDPNSHLKSFLDICGMVKLNGVSEDTIRLRLFPFSLQNKVRDWLQSIPPGSITTWNALVQTFLRKFFPPAKTVNLRIEIGTF-----EQLFEAWE
           P++DPNSHL +FL+IC   K NGV++D IRLRLFPFSL++K + WL S+P GSITTW  L Q FL KFFPPAKT  +R +I +F     E L+EAWE
Subjt:  --KPTEDPNSHLKSFLDICGMVKLNGVSEDTIRLRLFPFSLQNKVRDWLQSIPPGSITTWNALVQTFLRKFFPPAKTVNLRIEIGTF-----EQLFEAWE

Query:  RFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGETLLSKPVENARTLLEEMATDSYQWPSD
        RFKELLR+CP HG PDWLQVQ FYNGL  S KTI+DAAAG  L+SK   +A  LLEEMA+++YQWPS+
Subjt:  RFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGETLLSKPVENARTLLEEMATDSYQWPSD

XP_023903214.1 uncharacterized protein LOC112015077 [Quercus suber] | 1.1e-69 | 59.62
Query:  EPKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY--KPTEDPNSHLKSFLDICGMVKLNGVSEDTIRLRLFPFSLQNKVRDWLQSIPPG
        +P+ ++DY +P+  +  SGI   TINANNFELK  LI M +   +   P +DPN HL  FL+IC  VK+NGV+EDTIRLRLFPFSL++K R WLQS+ PG
Subjt:  EPKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY--KPTEDPNSHLKSFLDICGMVKLNGVSEDTIRLRLFPFSLQNKVRDWLQSIPPG

Query:  SITTWNALVQTFLRKFFPPAKTVNLRIEIG-----TFEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGETLLSKPVENARTLL
        SIT+W  + + FL KFFPPAKT  LR EIG      FE L+EAWER+K+L+R CPQHG PDWLQVQ+FYNGL   T+TIVDAA+G TL+SK  E A +LL
Subjt:  SITTWNALVQTFLRKFFPPAKTVNLRIEIG-----TFEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGETLLSKPVENARTLL

Query:  EEMATDSYQWPSD
        EEMA+++YQWP++
Subjt:  EEMATDSYQWPSD

TrEMBL top hits | e value | % identity
A0A2I4F4C8 uncharacterized protein LOC108995373 | 6.6e-65 | 58.69
Query:  PKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY--KPTEDPNSHLKSFLDICGMVKLNGVSEDTIRLRLFPFSLQNKVRDWLQSIPPGS
        P+ ++DY +PV  +  SGI   TINANNFELK  LI M +   +   P +DPN HL  FL IC  VK+NGV+ DTIRLRLFPFSL++K R WLQS+  GS
Subjt:  PKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY--KPTEDPNSHLKSFLDICGMVKLNGVSEDTIRLRLFPFSLQNKVRDWLQSIPPGS

Query:  ITTWNALVQTFLRKFFPPAKTVNLRIEIGTFEQ-----LFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGETLLSKPVENART-LL
        IT+W  + + FL KFFPPAKT  LR EI  F+Q     L+EAWER+K L+R CPQHG P+WLQVQ+FYNGL   T+TIVDAAAG TL+SK +E A T LL
Subjt:  ITTWNALVQTFLRKFFPPAKTVNLRIEIGTFEQ-----LFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGETLLSKPVENART-LL

Query:  EEMATDSYQWPSD
        EEM +++YQWP++
Subjt:  EEMATDSYQWPSD

A0A3S3N117 Retrotrans_gag domain-containing protein | 5.1e-65 | 49.02
Query:  MRKVRELALVPFDPEIKRTFNKLRRESK-VNRIQMVNPNPEEPKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY---KPTEDPNSHLK
        MR+ + L LVP DPEI+RT  +L++E K  +  ++     +  + + DY  P+     S I    I ANNFE+K  +IQM          P +DPN+H+ 
Subjt:  MRKVRELALVPFDPEIKRTFNKLRRESK-VNRIQMVNPNPEEPKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY---KPTEDPNSHLK

Query:  SFLDICGMVKLNGVSEDTIRLRLFPFSLQNKVRDWLQSIPPGSITTWNALVQTFLRKFFPPAKTVNLRIEIGTF-----EQLFEAWERFKELLRKCPQHG
        +FL++C   K NGV++D +RLRL PFSL++K + WL S+P  +ITTW+ L + FL KFFPP KTV +R +I TF     E L+EAWER+KELLRKCP HG
Subjt:  SFLDICGMVKLNGVSEDTIRLRLFPFSLQNKVRDWLQSIPPGSITTWNALVQTFLRKFFPPAKTVNLRIEIGTF-----EQLFEAWERFKELLRKCPQHG

Query:  YPDWLQVQLFYNGLTPSTKTIVDAAAGETLLSKPVENARTLLEEMATDSYQWPSD
         P W+QVQ FYNGL  +T+T +DAA G TL+ K  E A  L+EEMAT++YQWPSD
Subjt:  YPDWLQVQLFYNGLTPSTKTIVDAAAGETLLSKPVENARTLLEEMATDSYQWPSD

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 | 2.3e-70 | 52.61
Query:  MRKVRELALVPFDPEIKRTFNKLRRE--------------SKVNRIQMVNPNPEEPKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY-
        M++   L LVPFDP+I+RTF + RRE              +  N    +N  PE  + +RDY  P+ Q     I   +INANNFE+K   IQM +     
Subjt:  MRKVRELALVPFDPEIKRTFNKLRRE--------------SKVNRIQMVNPNPEEPKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY-

Query:  --KPTEDPNSHLKSFLDICGMVKLNGVSEDTIRLRLFPFSLQNKVRDWLQSIPPGSITTWNALVQTFLRKFFPPAKTVNLRIEIGTF-----EQLFEAWE
           P++DPNSHL +FL+IC   K NGV++D IRLRLFPFSL++K + WL S+P GSITTW  L Q FL KFFPPAKT  +R +I +F     E L+EAWE
Subjt:  --KPTEDPNSHLKSFLDICGMVKLNGVSEDTIRLRLFPFSLQNKVRDWLQSIPPGSITTWNALVQTFLRKFFPPAKTVNLRIEIGTF-----EQLFEAWE

Query:  RFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGETLLSKPVENARTLLEEMATDSYQWPSD
        RFKELLR+CP HG PDWLQVQ FYNGL  S KTI+DAAAG  L+SK   +A  LLEEMA+++YQWPS+
Subjt:  RFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGETLLSKPVENARTLLEEMATDSYQWPSD

A0A6J0ZYV0 uncharacterized protein LOC110413413 | 3.1e-70 | 52.61
Query:  MRKVRELALVPFDPEIKRTFNKLRRE--------------SKVNRIQMVNPNPEEPKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY-
        M++   L LVPFDP+I+RTF + RRE              +  N    +N  PE  + +RDY  P+ Q     I   +INANNFE+K   IQM +     
Subjt:  MRKVRELALVPFDPEIKRTFNKLRRE--------------SKVNRIQMVNPNPEEPKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY-

Query:  --KPTEDPNSHLKSFLDICGMVKLNGVSEDTIRLRLFPFSLQNKVRDWLQSIPPGSITTWNALVQTFLRKFFPPAKTVNLRIEIGTF-----EQLFEAWE
           P++DPNSHL +FL+IC   K NGV++D IRLRLFPFSL++K + WL S+P GSITTW  L Q FL KFFPPAKT  +R +I +F     E L+EAWE
Subjt:  --KPTEDPNSHLKSFLDICGMVKLNGVSEDTIRLRLFPFSLQNKVRDWLQSIPPGSITTWNALVQTFLRKFFPPAKTVNLRIEIGTF-----EQLFEAWE

Query:  RFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGETLLSKPVENARTLLEEMATDSYQWPSD
        RFKELLR+CP HG PDWLQVQ FYNGL  S KTI+DAAAG  L+SK   +A  LLEEMA+++YQWPS+
Subjt:  RFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGETLLSKPVENARTLLEEMATDSYQWPSD

A0A6P6XAQ1 Reverse transcriptase | 2.1e-63 | 55.56
Query:  NPEEPKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY--KPTEDPNSHLKSFLDICGMVKLNGVSEDTIRLRLFPFSLQNKVRDWLQSI
        N    + +RD+  P  Q  Q+ IV  T+NANNFE+K  LIQM +   Y    TEDPNSHL +FL+IC  +K NGVSED I+LRLFPFSL++K + WLQS 
Subjt:  NPEEPKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAY--KPTEDPNSHLKSFLDICGMVKLNGVSEDTIRLRLFPFSLQNKVRDWLQSI

Query:  PPGSITTWNALVQTFLRKFFPPAKTVNLRIEIGTFEQ-----LFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGETLLSKPVENAR
        PP + TTW+ L + FL KFFPP KT  LR++I +F Q     L+EAWER++EL R+CP HG PDWL VQ FYNGLT  TKT VDAAAG  L+ K  E A+
Subjt:  PPGSITTWNALVQTFLRKFFPPAKTVNLRIEIGTFEQ-----LFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGETLLSKPVENAR

Query:  TLLEEMATDSYQWPSD
         L+EEMA ++YQW ++
Subjt:  TLLEEMATDSYQWPSD

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGCGAAAAGTTAGGGAGTTGGCCTTAGTTCCCTTTGACCCGGAGATTAAGAGAACTTTTAATAAACTTCGAAGGGAAAGTAAAGTTAATCGTATCCAAATGGTCAATCC
TAACCCCGAGGAACCTAAGCCTATTAGAGACTACTTTCAACCGGTCTTTCAGGAGCAACAGTCGGGGATTGTCTATGCCACGATCAATGCTAACAATTTTGAGCTGAAAA
CAGGTCTCATACAGATGGCTCGAGATTGCGCTTACAAGCCCACGGAGGATCCAAATTCTCACCTGAAGTCCTTCCTTGACATTTGTGGGATGGTCAAATTAAATGGTGTT
TCTGAAGATACCATTCGCTTACGTTTGTTTCCTTTTTCATTGCAGAACAAAGTGAGGGATTGGTTGCAATCGATTCCACCTGGGAGCATTACTACGTGGAATGCTTTGGT
CCAGACGTTTTTAAGGAAATTCTTCCCTCCTGCTAAGACGGTCAACCTAAGGATCGAGATTGGGACGTTCGAGCAGCTGTTCGAGGCTTGGGAGCGATTCAAAGAGCTGT
TGAGGAAGTGCCCTCAGCATGGCTATCCCGACTGGCTTCAAGTTCAATTATTTTATAATGGCTTAACTCCTAGTACAAAAACTATTGTTGATGCAGCTGCAGGTGAGACT
CTGTTGTCCAAACCCGTTGAGAATGCTAGGACTTTGCTGGAGGAAATGGCCACCGATAGCTATCAGTGGCCATCTGATTGGTTATCAGTATAG
mRNA sequence
ATGCGAAAAGTTAGGGAGTTGGCCTTAGTTCCCTTTGACCCGGAGATTAAGAGAACTTTTAATAAACTTCGAAGGGAAAGTAAAGTTAATCGTATCCAAATGGTCAATCC
TAACCCCGAGGAACCTAAGCCTATTAGAGACTACTTTCAACCGGTCTTTCAGGAGCAACAGTCGGGGATTGTCTATGCCACGATCAATGCTAACAATTTTGAGCTGAAAA
CAGGTCTCATACAGATGGCTCGAGATTGCGCTTACAAGCCCACGGAGGATCCAAATTCTCACCTGAAGTCCTTCCTTGACATTTGTGGGATGGTCAAATTAAATGGTGTT
TCTGAAGATACCATTCGCTTACGTTTGTTTCCTTTTTCATTGCAGAACAAAGTGAGGGATTGGTTGCAATCGATTCCACCTGGGAGCATTACTACGTGGAATGCTTTGGT
CCAGACGTTTTTAAGGAAATTCTTCCCTCCTGCTAAGACGGTCAACCTAAGGATCGAGATTGGGACGTTCGAGCAGCTGTTCGAGGCTTGGGAGCGATTCAAAGAGCTGT
TGAGGAAGTGCCCTCAGCATGGCTATCCCGACTGGCTTCAAGTTCAATTATTTTATAATGGCTTAACTCCTAGTACAAAAACTATTGTTGATGCAGCTGCAGGTGAGACT
CTGTTGTCCAAACCCGTTGAGAATGCTAGGACTTTGCTGGAGGAAATGGCCACCGATAGCTATCAGTGGCCATCTGATTGGTTATCAGTATAG
Protein sequence
MRKVRELALVPFDPEIKRTFNKLRRESKVNRIQMVNPNPEEPKPIRDYFQPVFQEQQSGIVYATINANNFELKTGLIQMARDCAYKPTEDPNSHLKSFLDICGMVKLNGV
SEDTIRLRLFPFSLQNKVRDWLQSIPPGSITTWNALVQTFLRKFFPPAKTVNLRIEIGTFEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGET
LLSKPVENARTLLEEMATDSYQWPSDWLSV
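
The CDS, protein sequence and genome coordinates listed for Lag0036074 can be cross-checked against each other. The following is a minimal Python sketch of such a check, not part of the CuGenDBv2 record: it assumes the standard nuclear genetic code (NCBI translation table 1), and the helper names `translate` and `span_length` are hypothetical ones chosen for illustration. Only the first 30 and last 36 nucleotides of the CDS above are used, so the block stays short.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), built from the
# conventional TCAG ordering of the three codon positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - start + 1


# First 30 nt and last 36 nt of the CDS shown above.
cds_head = "ATGCGAAAAGTTAGGGAGTTGGCCTTAGTT"
cds_tail = "AGCTATCAGTGGCCATCTGATTGGTTATCAGTATAG"

head_protein = translate(cds_head)
tail_protein = translate(cds_tail)
locus_bp = span_length(38516821, 38517573)

print(head_protein, tail_protein, locus_bp)
```

Under these assumptions the two fragments translate to the start (MRKVRELALV) and end (SYQWPSDWLSV) of the listed protein, and the coordinate span is 753 bp, which corresponds to 251 codons: the 250 residues of the protein sequence plus one stop codon.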