CuGenDBv2

Lag0018165 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018165
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr5:17852110..17853179
RNA-Seq Expression: Lag0018165
Synteny: Lag0018165
Gene Ontology terms: NA
InterPro domains: NA
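
The genome location above is given in a `chr:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the length of the annotated span could be computed as follows. `parse_location` is a hypothetical helper written for this illustration, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr5:17852110..17853179")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span_length = end - start + 1
print(chrom, start, end, span_length)  # chr5 17852110 17853179 1070
```

Under that assumption the locus spans 1070 bp on chr5.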


Homology
GenBank top hits | e value | % identity | Alignment
WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002] | 1.3e-26 | 34.13 | shown below
Query:  MTDQEDKAIRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRARDW---
        M ++  KAIRDYF   L  +  GI    I   NFELK  LIQ+ RE AF+   +ED H HLRSFL+ICGTVK+ GV+ D+I ++LFPFSLQDRA+DW   
Subjt:  MTDQEDKAIRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRARDW---

Query:  ---------------------------------------------------------------------------LSRDFKTILDTATGGNFLVKTIKDA
                                                                                   L+   K+ILD   GG+   K  ++A
Subjt:  ---------------------------------------------------------------------------LSRDFKTILDTATGGNFLVKTIKDA

Query:  RALVEELALTSYQWPSEQ--PKSKPKEGPCKAEEVKSLRLQVEALTKALNKL
          ++E+LA TSY WP E+  P      G  + +EV SL+ Q+ +LT AL+KL
Subjt:  RALVEELALTSYQWPSEQ--PKSKPKEGPCKAEEVKSLRLQVEALTKALNKL

XP_022157708.1 uncharacterized protein LOC111024361 [Momordica charantia] | 3.9e-28 | 30.43 | shown below
Query:  IRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRAR-------------
        IRDY       N+ GI    I A N ELK  LIQ+VRE+ F+ + +ED +NHL  FLD+CGTVK+ GV  D+I ++LFP SLQD+               
Subjt:  IRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRAR-------------

Query:  ---------------------------------------DW---------LSRDFKTILDTATGGNFLVKTIKDARALVEELALTSYQWPSEQPKSKPKE
                                               +W         L+   +TILD A GG  L +T ++A  L++++A  S+QWPSE+  +K   
Subjt:  ---------------------------------------DW---------LSRDFKTILDTATGGNFLVKTIKDARALVEELALTSYQWPSEQPKSKPKE

Query:  GPCKAEEVKSLRLQVEALTKALNKLLGSEIPSLPKVEVKME--SQPEEVVHQ--------EKKKPLEDMIGKLIEETKILMTNL-GTMSRMQARVEKRDL
        G  + +E+ SL+ QV+ALT A++KL G       ++    +  S  E  + Q        EKK  LED++G  I E +   + +   +  M+ ++E    
Subjt:  GPCKAEEVKSLRLQVEALTKALNKLLGSEIPSLPKVEVKME--SQPEEVVHQ--------EKKKPLEDMIGKLIEETKILMTNL-GTMSRMQARVEKRDL

Query:  VISKMEAKLDLL---MNQIQQEKLSNVIAMDPQEQCLAISMRNRK
         I  ME ++  +   +N +Q+ K  + I + P+E C A+++R+ K
Subjt:  VISKMEAKLDLL---MNQIQQEKLSNVIAMDPQEQCLAISMRNRK

XP_023881727.1 uncharacterized protein LOC111994101 [Quercus suber] | 3.4e-24 | 26.6 | shown below
Query:  EDKAIRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRARDWL------
        + + ++DY    +  NY GI  +TI A NFELK +LI +V+++ F   P +D + HL  FL+IC TVK+ GV  D+I ++LFPFSL+D+AR WL      
Subjt:  EDKAIRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRARDWL------

Query:  ---------------------------------SRDF--------KTILDTATGGNFLVKTIKDARALVEELALTSYQWPSEQPKSKPKEGPCKAEEVKS
                                           DF        +TI+D A+GG  + KT + A +L+EE+A  +YQWP+E+  +K   G  + E   +
Subjt:  ---------------------------------SRDF--------KTILDTATGGNFLVKTIKDARALVEELALTSYQWPSEQPKSKPKEGPCKAEEVKS

Query:  LRLQVEALTKALNKLLGSEIP------SLPKVEVKMESQPEEVVH-----------------------------------------------QEKKKPLE
        L  QV +L+  ++ L    IP      +   + + M    +E V                                                 EKK  LE
Subjt:  LRLQVEALTKALNKLLGSEIP------SLPKVEVKMESQPEEVVH-----------------------------------------------QEKKKPLE

Query:  DMIGKLIEETKILMTNLGT-MSRMQARVEKRDLVISKMEAKLDLL---MNQIQQEKLSNVIAMDPQEQCLAISMRN
        D +   +EETK       + +  ++         I  +E ++  L   +N  Q+    + I ++P+EQC AI++R+
Subjt:  DMIGKLIEETKILMTNLGT-MSRMQARVEKRDLVISKMEAKLDLL---MNQIQQEKLSNVIAMDPQEQCLAISMRN

XP_023899824.1 LOW QUALITY PROTEIN: uncharacterized protein LOC112011709 [Quercus suber] | 9.0e-25 | 27 | shown below
Query:  EDKAIRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRARDWLSR----
        + + ++DY    +  NY GI ++TI A NFELK +LI +V+++ F   P +D + +L  FL+IC T+K+ GV  D+I ++LFPFSL+D+AR WL      
Subjt:  EDKAIRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRARDWLSR----

Query:  ------------------------------DFKTILDTATGGNFLVKTIKDARALVEELALTSYQWPSEQPKSKPKEGPCKAEEVKSLRLQVEALTKALN
                                       F+TI+D A+GG  + KT + A +L+EE+   +YQWP+E+  +K   G  + E   +L  QV +L+  ++
Subjt:  ------------------------------DFKTILDTATGGNFLVKTIKDARALVEELALTSYQWPSEQPKSKPKEGPCKAEEVKSLRLQVEALTKALN

Query:  KLLGSEIP-----------SLPKVEVKMES---------------------------------------QPE---EVVHQEKKKPLEDMIGKLIEETKIL
         L    IP           ++P  EV  E                                        QP    +    EKK  LED +   +EETK  
Subjt:  KLLGSEIP-----------SLPKVEVKMES---------------------------------------QPE---EVVHQEKKKPLEDMIGKLIEETKIL

Query:  MTNLGT-MSRMQARVEKRDLVISKMEAKLDLL---MNQIQQEKLSNVIAMDPQEQCLAISMRN
             + +  ++         +  +E ++  L   +N  Q+    +   ++P+EQC AI++RN
Subjt:  MTNLGT-MSRMQARVEKRDLVISKMEAKLDLL---MNQIQQEKLSNVIAMDPQEQCLAISMRN

XP_024019991.1 uncharacterized protein LOC112091203 [Morus notabilis] | 2.4e-25 | 37.24 | shown below
Query:  DQEDKAIRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRARDW-----
        +++ +AIRDYF   +  +Y GI  +TI A+N ELK SLI +V+++ F   P+ED + HL  FL+   TVKI GV   +I +KLFPFSL+D+AR W     
Subjt:  DQEDKAIRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRARDW-----

Query:  -----------------LSRDFKTILDTATGGNFLVKTIKDARALVEELALTSYQWPSEQPKSKPKEGPCKAEEVKSLRLQVEALTKALNKLLGSE
                         L+   +TI+D+  GG+ + K+I +A  L++E++  SYQW SE+   K   G  +   + SL  QV AL+  +  L   E
Subjt:  -----------------LSRDFKTILDTATGGNFLVKTIKDARALVEELALTSYQWPSEQPKSKPKEGPCKAEEVKSLRLQVEALTKALNKLLGSE

TrEMBL top hits | e value | % identity | Alignment
A0A2I4E1Q5 uncharacterized protein LOC108985472 | 1.8e-23 | 34.7 | shown below
Query:  DQEDKAIRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRARDWLSR--
        D + + ++DY    +  NY GI  +TI A NFELK +LI +V+++ F   P +D + HL  FL+IC TVKI GV  D+I ++LFPFSL+D+ARD + R  
Subjt:  DQEDKAIRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRARDWLSR--

Query:  -------------------DFKTILDTATGGNFLVKTIKDARALVEELALTSYQWPSEQPKSKPKEGPCKAEEVKSLRLQVEALTKALNKLLGSEI-PSL
                             +TI+D  +GG  + KT + A  L+EE+   +YQWP E+  +K   G  + E + +L  QV  L+  +  L    I  S 
Subjt:  -------------------DFKTILDTATGGNFLVKTIKDARALVEELALTSYQWPSEQPKSKPKEGPCKAEEVKSLRLQVEALTKALNKLLGSEI-PSL

Query:  PKVEVKMESQPEEVVHQEK
          V     + P     QE+
Subjt:  PKVEVKMESQPEEVVHQEK

A0A2I4E8H0 uncharacterized protein LOC108987258 | 1.2e-22 | 39.18 | shown below
Query:  DQEDKAIRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRARDWLSRDF
        D +   ++DY    +  NY GI  +TI A NFELK +LI +V ++ F   P +D + HL  FL+IC TVKI GV  D+I ++LFPFSL+D          
Subjt:  DQEDKAIRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRARDWLSRDF

Query:  KTILDTATGGNFLVKTIKDARALVEELALTSYQWPSEQPKSKP-------KEGPCKAEEVKSLRLQVEALT
        KTI++   GG  + KT + A +L+EE+   +YQWP+E+  +K        K     + +V SL  Q+ ALT
Subjt:  KTILDTATGGNFLVKTIKDARALVEELALTSYQWPSEQPKSKP-------KEGPCKAEEVKSLRLQVEALT

A0A2I4F4G9 uncharacterized protein LOC108995423 | 2.4e-23 | 38.71 | shown below
Query:  DQEDKAIRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRARDW-----
        D + + ++DY    +  NY  I  +TI A NFELK +LI +V+++ F   P +D + HL  FL+IC T+KI  V  ++I ++LF FSL++ ARDW     
Subjt:  DQEDKAIRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRARDW-----

Query:  ----LSRDFKTILDTATGGNFLVKTIKDARALVEELALTSYQWPSEQPKSKPKEG
            L+   +TI+D A GG  + KT++ A +L+EE+ L +YQWP+++  +K   G
Subjt:  ----LSRDFKTILDTATGGNFLVKTIKDARALVEELALTSYQWPSEQPKSKPKEG

A0A6J1B036 uncharacterized protein LOC110422806 | 1.4e-20 | 35.33 | shown below
Query:  YYGIAYETIVAENFELKASLIQLVRESA-----FKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRARDW------------------
        +  I   +I A NFE+K + IQ+++ S      F   PS+D ++HL +FL+IC T K  GV  D+I ++LFPFSL+D+A+ W                  
Subjt:  YYGIAYETIVAENFELKASLIQLVRESA-----FKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRARDW------------------

Query:  ---------LSRDFKTILDTATGGNFLVKTIKDARALVEELALTSYQWPSEQPKSKPKEGPCKAEEVKSLRLQVEALTKALNKL
                 L    KTI+D A GG  + K   DA  L+EE+A  +Y+WPSE+  S+   G  + + + +L  QV AL+K L+ L
Subjt:  ---------LSRDFKTILDTATGGNFLVKTIKDARALVEELALTSYQWPSEQPKSKPKEGPCKAEEVKSLRLQVEALTKALNKL

A0A6J1DU19 uncharacterized protein LOC111024361 | 1.9e-28 | 30.43 | shown below
Query:  IRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRAR-------------
        IRDY       N+ GI    I A N ELK  LIQ+VRE+ F+ + +ED +NHL  FLD+CGTVK+ GV  D+I ++LFP SLQD+               
Subjt:  IRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRAR-------------

Query:  ---------------------------------------DW---------LSRDFKTILDTATGGNFLVKTIKDARALVEELALTSYQWPSEQPKSKPKE
                                               +W         L+   +TILD A GG  L +T ++A  L++++A  S+QWPSE+  +K   
Subjt:  ---------------------------------------DW---------LSRDFKTILDTATGGNFLVKTIKDARALVEELALTSYQWPSEQPKSKPKE

Query:  GPCKAEEVKSLRLQVEALTKALNKLLGSEIPSLPKVEVKME--SQPEEVVHQ--------EKKKPLEDMIGKLIEETKILMTNL-GTMSRMQARVEKRDL
        G  + +E+ SL+ QV+ALT A++KL G       ++    +  S  E  + Q        EKK  LED++G  I E +   + +   +  M+ ++E    
Subjt:  GPCKAEEVKSLRLQVEALTKALNKLLGSEIPSLPKVEVKME--SQPEEVVHQ--------EKKKPLEDMIGKLIEETKILMTNL-GTMSRMQARVEKRDL

Query:  VISKMEAKLDLL---MNQIQQEKLSNVIAMDPQEQCLAISMRNRK
         I  ME ++  +   +N +Q+ K  + I + P+E C A+++R+ K
Subjt:  VISKMEAKLDLL---MNQIQQEKLSNVIAMDPQEQCLAISMRNRK

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGACAGATCAAGAAGACAAAGCAATTAGAGATTATTTCTTGCATGAATTACTGAGGAACTACTATGGGATAGCATATGAGACGATCGTTGCTGAAAATTTTGAGCTCAA
AGCTAGTCTCATTCAATTGGTAAGAGAAAGTGCTTTCAAAGACCATCCATCAGAGGATCTGCATAATCACCTTAGATCATTTTTGGATATCTGTGGAACAGTAAAGATAG
GTGGAGTGAACCCTGATTCTATTCACGTGAAATTATTTCCATTTTCTTTGCAAGATAGGGCTAGAGATTGGTTGAGTCGGGACTTTAAAACAATTCTTGACACAGCGACG
GGAGGAAACTTCTTAGTCAAAACTATCAAAGATGCTCGAGCCCTAGTAGAAGAGTTAGCCTTGACAAGTTATCAATGGCCGTCTGAGCAACCAAAGTCCAAACCAAAGGA
GGGACCATGTAAGGCTGAAGAGGTAAAATCTTTAAGATTGCAAGTTGAAGCTTTAACCAAGGCTCTAAACAAGCTCTTAGGATCAGAGATTCCAAGTTTGCCTAAAGTTG
AAGTAAAAATGGAGTCTCAACCTGAGGAAGTTGTTCATCAAGAGAAAAAGAAACCCTTGGAAGACATGATAGGAAAATTAATTGAGGAGACCAAGATTTTGATGACTAAT
CTAGGAACAATGTCAAGAATGCAAGCCAGAGTAGAGAAACGAGATCTTGTCATTAGTAAAATGGAGGCAAAGCTCGATCTTCTCATGAACCAAATTCAGCAAGAAAAGTT
ATCCAATGTCATCGCCATGGACCCACAAGAACAATGCCTAGCCATATCTATGAGAAATAGAAAGTAG
mRNA sequence
ATGACAGATCAAGAAGACAAAGCAATTAGAGATTATTTCTTGCATGAATTACTGAGGAACTACTATGGGATAGCATATGAGACGATCGTTGCTGAAAATTTTGAGCTCAA
AGCTAGTCTCATTCAATTGGTAAGAGAAAGTGCTTTCAAAGACCATCCATCAGAGGATCTGCATAATCACCTTAGATCATTTTTGGATATCTGTGGAACAGTAAAGATAG
GTGGAGTGAACCCTGATTCTATTCACGTGAAATTATTTCCATTTTCTTTGCAAGATAGGGCTAGAGATTGGTTGAGTCGGGACTTTAAAACAATTCTTGACACAGCGACG
GGAGGAAACTTCTTAGTCAAAACTATCAAAGATGCTCGAGCCCTAGTAGAAGAGTTAGCCTTGACAAGTTATCAATGGCCGTCTGAGCAACCAAAGTCCAAACCAAAGGA
GGGACCATGTAAGGCTGAAGAGGTAAAATCTTTAAGATTGCAAGTTGAAGCTTTAACCAAGGCTCTAAACAAGCTCTTAGGATCAGAGATTCCAAGTTTGCCTAAAGTTG
AAGTAAAAATGGAGTCTCAACCTGAGGAAGTTGTTCATCAAGAGAAAAAGAAACCCTTGGAAGACATGATAGGAAAATTAATTGAGGAGACCAAGATTTTGATGACTAAT
CTAGGAACAATGTCAAGAATGCAAGCCAGAGTAGAGAAACGAGATCTTGTCATTAGTAAAATGGAGGCAAAGCTCGATCTTCTCATGAACCAAATTCAGCAAGAAAAGTT
ATCCAATGTCATCGCCATGGACCCACAAGAACAATGCCTAGCCATATCTATGAGAAATAGAAAGTAG
Protein sequence
MTDQEDKAIRDYFLHELLRNYYGIAYETIVAENFELKASLIQLVRESAFKDHPSEDLHNHLRSFLDICGTVKIGGVNPDSIHVKLFPFSLQDRARDWLSRDFKTILDTAT
GGNFLVKTIKDARALVEELALTSYQWPSEQPKSKPKEGPCKAEEVKSLRLQVEALTKALNKLLGSEIPSLPKVEVKMESQPEEVVHQEKKKPLEDMIGKLIEETKILMTN
LGTMSRMQARVEKRDLVISKMEAKLDLLMNQIQQEKLSNVIAMDPQEQCLAISMRNRK
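
The protein sequence above can be checked against the CDS by translating it. The following is a minimal sketch, assuming the standard genetic code (the page does not say which translation table was used). To keep the example short it translates only the first 30 and last 15 nucleotides of the CDS; `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGACAGATCAAGAAGACAAAGCAATTAGA"  # first 30 nt of the CDS
cds_end = "AGAAATAGAAAGTAG"                  # last 15 nt of the CDS (in frame)

print(translate(cds_start))  # MTDQEDKAIR
print(translate(cds_end))    # RNRK
```

Both fragments agree with the start (`MTDQEDKAIR`) and end (`RNRK`) of the protein sequence listed above. Passing the full CDS to `translate` would extend the same check to the whole protein.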