; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015795 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015795
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr12:24409025..24410643
RNA-Seq ExpressionLag0015795
SyntenyLag0015795
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022146774.1 uncharacterized protein LOC111015901 [Momordica charantia]3.0e-4550.5Show/hide
Query:  GPSGGESGRKRKVAVREAQQEPEGQGMYSLLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPS
        GP   ESGRKRK  VREA+       +Y   + + S K+EF+E EAT + HPHNDALV+ L +ANAKVHRIL+DGGSS D+ S TA+ AM LG + LK S
Subjt:  GPSGGESGRKRKVAVREAQQEPEGQGMYSLLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPS

Query:  HTPSVGFGGEK-VTRGECRITGDVWRRLQTVTRMINFVVVDYVPAYNAILGRPT-YMGSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVDK
         TP +GFGGE+ + +G   +        +++TRM++F+VVDY  +YN IL RPT +M  +  STYHQ +KFPT  GVG +  EQ++SRECY+ +++  DK
Subjt:  HTPSVGFGGEK-VTRGECRITGDVWRRLQTVTRMINFVVVDYVPAYNAILGRPT-YMGSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVDK

XP_022150028.1 uncharacterized protein LOC111018300 [Momordica charantia]1.8e-5041.09Show/hide
Query:  PEKLRSDPDRRNRNKYCMFHGDHDHTTEN-------------------------ASNQGKGGA-NPPLEIRTILGGPSGGESGRKRKVAVREAQQEPEGQ
        PE++ +   +R++ +YC+FH DHDH T++                         A+  G+  + +P  EIRTI+GGP   E GRKRK ++RE +      
Subjt:  PEKLRSDPDRRNRNKYCMFHGDHDHTTEN-------------------------ASNQGKGGA-NPPLEIRTILGGPSGGESGRKRKVAVREAQQEPEGQ

Query:  GMYSLLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPSHTPSVGFGGEKV-TRGECRITGDVW
         +Y   + +   K+EF+E+EAT + HPHND LV+ L +ANAKVHRIL+DGGSSAD++S TA+ AM LG    K S    V F GE+V   G   +T    
Subjt:  GMYSLLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPSHTPSVGFGGEKV-TRGECRITGDVW

Query:  RRLQTVTRMINFVVVDYVPAYNAILGRPT-YMGSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVDK
           +++T +++F+V+DY  +YNAILGRPT +M  +  STYHQ + FPT  G+G + +EQ++SRECY+ ++K  D+
Subjt:  RRLQTVTRMINFVVVDYVPAYNAILGRPT-YMGSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVDK

XP_022158093.1 uncharacterized protein LOC111024662 [Momordica charantia]1.6e-4640.07Show/hide
Query:  LRSDPDRRNRNKYCMFHGDHDHTTEN--------------------------ASNQGKGGANPPLEIRTILGGPSGGESGRKRKVAVREAQQEPEGQGMY
        +++ P++R++ +YC+FH DH H T++                            N      +P  EI+TI GGP+  E G+KRK +++EA+  P    +Y
Subjt:  LRSDPDRRNRNKYCMFHGDHDHTTEN--------------------------ASNQGKGGANPPLEIRTILGGPSGGESGRKRKVAVREAQQEPEGQGMY

Query:  SLLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPSHTPSVGFGGEKVTRGECRITGDV--WRR
                 K++F+E+E T + HPHNDALV+ L + N KVHRIL+DGGSS  ++S TA+ AM LG   LK +  P VGFGGE+V + +CRI   V     
Subjt:  SLLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPSHTPSVGFGGEKVTRGECRITGDV--WRR

Query:  LQTVTRMINFVVVDYVPAYNAILGRPT-YMGSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVD
         + +T+++ F+VVDY  +YNAILGRPT +   +  STYH+ LKFPT+ G+  V  EQ++S ECY+ +L+  D
Subjt:  LQTVTRMINFVVVDYVPAYNAILGRPT-YMGSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVD

XP_023916366.1 uncharacterized protein LOC112027956 [Quercus suber]6.8e-4536.29Show/hide
Query:  ERATTNVRGFMTRAQRYISAEELLKSKRKK--ERVGGCQYQTDAGKTGERGTRPKEEAGADL----STPRPTTRFGRNT--------------GHESAKR
        E+A   +   +  AQ +++AE+ + +K++K  ERV     + +  +  E+G RPK+    D         P+ R  + T                 S K 
Subjt:  ERATTNVRGFMTRAQRYISAEELLKSKRKK--ERVGGCQYQTDAGKTGERGTRPKEEAGADL----STPRPTTRFGRNT--------------GHESAKR

Query:  PEKLRSDPDRRNRNKYCMFHGDHDHTT----------ENASNQGK-------------------GGANPPL-EIRTILGGPSGGESGRKRKVAVREAQQ-
        PEK+R DP++RNR+KYC FH DH H T          EN   QGK                     + PPL EIR I+GG S G+S   +K  ++E Q  
Subjt:  PEKLRSDPDRRNRNKYCMFHGDHDHTT----------ENASNQGK-------------------GGANPPL-EIRTILGGPSGGESGRKRKVAVREAQQ-

Query:  EPEGQGMYSLLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPSHTPSVGFGGEKVTR-GECRI
        +  G+   +   DE +  + FT+ EA  IHHPH+DA+V+AL +A+    R+L+D GSSAD+L   AF  M++G + L+P H+P VGFGG KV   G   +
Subjt:  EPEGQGMYSLLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPSHTPSVGFGGEKVTR-GECRI

Query:  TGDVWRRLQTVTRMINFVVVDYVPAYNAILGRPTY-MGSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVDKRFK
           V    + +T  +NF+VVD   +YNAI+GRPT     +  STYH  +KFPTE+GVG V  +Q  +RECY +A+   D++F+
Subjt:  TGDVWRRLQTVTRMINFVVVDYVPAYNAILGRPTY-MGSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVDKRFK

XP_030950020.1 uncharacterized protein LOC115973918 [Quercus lobata]5.2e-4542.29Show/hide
Query:  SAKRPEKLRSDPDRRNRNKYCMFHGDHDHTT----------ENASNQGK-------------------GGANPPL-EIRTILGGPSGGESGRKRKVAVRE
        S K PEKL+ DP++RNRNKYC FH DH H T          EN   QGK                     + PPL EIR I+GG S G S + +K  ++ 
Subjt:  SAKRPEKLRSDPDRRNRNKYCMFHGDHDHTT----------ENASNQGK-------------------GGANPPL-EIRTILGGPSGGESGRKRKVAVRE

Query:  AQQ-EPEGQGMYSLLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPSHTPSVGFGGEKVTR-G
         Q  +  G+   ++ +DE +  + FT+ +A  IHHPH+DALV++L +AN    R+L+D GSSAD+L   AF  M+LG D L+P ++P VGFGG KV   G
Subjt:  AQQ-EPEGQGMYSLLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPSHTPSVGFGGEKVTR-G

Query:  ECRITGDVWRRLQTVTRMINFVVVDYVPAYNAILGRPTYMG-SSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMAL
           ++  V    Q +T+ +NF+VVD   +YNAI+GRPT     +  STY+  +KFPTE GVG V  +Q  +RECY   L
Subjt:  ECRITGDVWRRLQTVTRMINFVVVDYVPAYNAILGRPTYMG-SSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMAL

TrEMBL top hitse value%identityAlignment
A0A6J1CZ14 uncharacterized protein LOC1110159011.5e-4550.5Show/hide
Query:  GPSGGESGRKRKVAVREAQQEPEGQGMYSLLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPS
        GP   ESGRKRK  VREA+       +Y   + + S K+EF+E EAT + HPHNDALV+ L +ANAKVHRIL+DGGSS D+ S TA+ AM LG + LK S
Subjt:  GPSGGESGRKRKVAVREAQQEPEGQGMYSLLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPS

Query:  HTPSVGFGGEK-VTRGECRITGDVWRRLQTVTRMINFVVVDYVPAYNAILGRPT-YMGSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVDK
         TP +GFGGE+ + +G   +        +++TRM++F+VVDY  +YN IL RPT +M  +  STYHQ +KFPT  GVG +  EQ++SRECY+ +++  DK
Subjt:  HTPSVGFGGEK-VTRGECRITGDVWRRLQTVTRMINFVVVDYVPAYNAILGRPT-YMGSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVDK

A0A6J1D8C9 uncharacterized protein LOC1110183008.9e-5141.09Show/hide
Query:  PEKLRSDPDRRNRNKYCMFHGDHDHTTEN-------------------------ASNQGKGGA-NPPLEIRTILGGPSGGESGRKRKVAVREAQQEPEGQ
        PE++ +   +R++ +YC+FH DHDH T++                         A+  G+  + +P  EIRTI+GGP   E GRKRK ++RE +      
Subjt:  PEKLRSDPDRRNRNKYCMFHGDHDHTTEN-------------------------ASNQGKGGA-NPPLEIRTILGGPSGGESGRKRKVAVREAQQEPEGQ

Query:  GMYSLLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPSHTPSVGFGGEKV-TRGECRITGDVW
         +Y   + +   K+EF+E+EAT + HPHND LV+ L +ANAKVHRIL+DGGSSAD++S TA+ AM LG    K S    V F GE+V   G   +T    
Subjt:  GMYSLLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPSHTPSVGFGGEKV-TRGECRITGDVW

Query:  RRLQTVTRMINFVVVDYVPAYNAILGRPT-YMGSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVDK
           +++T +++F+V+DY  +YNAILGRPT +M  +  STYHQ + FPT  G+G + +EQ++SRECY+ ++K  D+
Subjt:  RRLQTVTRMINFVVVDYVPAYNAILGRPT-YMGSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVDK

A0A6J1DQL9 uncharacterized protein LOC1110229052.8e-4439.63Show/hide
Query:  LRSDPDRRNRNKYCMFHGDHDHTTEN-------------------------ASNQGKGGANPPLEIRTILGGPSGGESGRKRKVAVREAQQEPEGQGMYS
        +++   +R++ +Y +FH DH H T++                          +  G+   +P  EI+TI+GGP   ESGRK KV VREA+       +Y 
Subjt:  LRSDPDRRNRNKYCMFHGDHDHTTEN-------------------------ASNQGKGGANPPLEIRTILGGPSGGESGRKRKVAVREAQQEPEGQGMYS

Query:  LLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPSHTPSVGFGGEKVTRGECRITGDVWRRLQT
          + + S  +EF+++EAT + HPHNDALV+ L +AN KVHRIL+DGG+SAD++S TA+  M +G   LK + TP V      +  G            ++
Subjt:  LLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPSHTPSVGFGGEKVTRGECRITGDVWRRLQT

Query:  VTRMINFVVVDYVPAYNAILGRPT-YMGSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVDK
        VT+M+ F+VVDY  +YNAILGR T +M  +  STYHQ +KFPT  GVG +  EQ++SRECY+ ++K  D+
Subjt:  VTRMINFVVVDYVPAYNAILGRPT-YMGSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVDK

A0A6J1DQY2 uncharacterized protein LOC1110223211.1e-4350Show/hide
Query:  GPSGGESGRKRKVAVREAQQEPEGQGMYSLLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPS
        GP   ES RKRK  VREA+   +   +Y   +      +EF+ENEAT + HPHNDALV+ L +AN KVHRIL+DGGSSAD++S TA+ AM LG   LK S
Subjt:  GPSGGESGRKRKVAVREAQQEPEGQGMYSLLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPS

Query:  HTPSVGFGGEKV-TRGECRITGDVWRRLQTVTRMINFVVVDYVPAYNAILGRPT-YMGSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVDK
          P VGFGGE+V   G   +        ++VT+M++F+VV+Y  +YNAILGRPT +M  +  STYHQ  KFPT  GVG +  EQ++SRECY  ++++ D+
Subjt:  HTPSVGFGGEKV-TRGECRITGDVWRRLQTVTRMINFVVVDYVPAYNAILGRPT-YMGSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVDK

A0A6J1DV51 uncharacterized protein LOC1110246627.8e-4740.07Show/hide
Query:  LRSDPDRRNRNKYCMFHGDHDHTTEN--------------------------ASNQGKGGANPPLEIRTILGGPSGGESGRKRKVAVREAQQEPEGQGMY
        +++ P++R++ +YC+FH DH H T++                            N      +P  EI+TI GGP+  E G+KRK +++EA+  P    +Y
Subjt:  LRSDPDRRNRNKYCMFHGDHDHTTEN--------------------------ASNQGKGGANPPLEIRTILGGPSGGESGRKRKVAVREAQQEPEGQGMY

Query:  SLLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPSHTPSVGFGGEKVTRGECRITGDV--WRR
                 K++F+E+E T + HPHNDALV+ L + N KVHRIL+DGGSS  ++S TA+ AM LG   LK +  P VGFGGE+V + +CRI   V     
Subjt:  SLLLDENSPKLEFTENEATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPSHTPSVGFGGEKVTRGECRITGDV--WRR

Query:  LQTVTRMINFVVVDYVPAYNAILGRPT-YMGSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVD
         + +T+++ F+VVDY  +YNAILGRPT +   +  STYH+ LKFPT+ G+  V  EQ++S ECY+ +L+  D
Subjt:  LQTVTRMINFVVVDYVPAYNAILGRPT-YMGSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATACCAGACTTGGATGGACTTTCATGGGGCAACGAAGCAACAAGGTGCCGAGCCTTTGCGCTAACACTCACAGGTCTAGCAAGACAGTGGTTTGGGAAATCCTGCG
AAGGATGAAAGGCTACTCAATTCAATCGGAGAGAGCCACCACGAACGTACGTGGATTTATGACTCGAGCACAAAGATACATAAGCGCCGAGGAGCTGTTGAAATCCAAGA
GGAAGAAAGAGAGAGTCGGGGGATGTCAATATCAGACAGACGCCGGGAAGACAGGGGAAAGAGGCACCAGGCCGAAGGAAGAGGCCGGAGCCGACCTGAGCACTCCTCGG
CCAACGACCAGGTTTGGCCGCAATACAGGACACGAATCTGCTAAACGCCCAGAAAAGTTGAGATCAGATCCCGACAGGAGAAACCGAAACAAATATTGCATGTTCCATGG
AGATCACGACCATACAACCGAGAATGCATCCAACCAAGGCAAGGGTGGTGCCAACCCACCGCTCGAGATTCGAACCATTTTAGGAGGACCCTCAGGAGGAGAGTCGGGTA
GGAAGCGAAAAGTTGCAGTTCGAGAGGCACAACAAGAGCCCGAGGGACAAGGTATGTACTCACTCCTACTTGATGAAAACTCACCAAAGTTAGAGTTTACAGAAAATGAG
GCTACGGGAATACATCATCCGCACAACGACGCGCTGGTAGTCGCTCTAACGGTTGCCAACGCGAAGGTCCACCGGATCCTCATTGATGGGGGAAGTTCCGCTGATGTGCT
CTCAACTACTGCGTTCGACGCCATGAAGCTGGGGAGTGATCACCTGAAGCCGAGCCACACGCCATCGGTAGGTTTTGGCGGAGAAAAAGTAACCCGAGGGGAGTGTCGAA
TTACCGGTGACGTTTGGAGAAGGTTACAGACAGTAACGAGGATGATCAACTTTGTGGTGGTGGACTACGTCCCGGCATATAATGCCATCTTGGGACGACCCACCTACATG
GGCTCAAGCTGTGGTTCAACCTATCACCAAGTGCTGAAGTTCCCAACTGAAGAAGGTGTAGGAGCAGTGTACGACGAGCAGAAAATGTCAAGGGAATGCTACTTTATGGC
ACTCAAGAACGTTGACAAAAGGTTCAAGCGACGCCAGCCTCGGGATATGGCCGAGGCCGAGAAGCTGAAGGGGCAAGTTTTCCCCTCCCAATGGAACATTATTTACTCTT
CATTATGTTTTCCAGTCCGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCATACCAGACTTGGATGGACTTTCATGGGGCAACGAAGCAACAAGGTGCCGAGCCTTTGCGCTAACACTCACAGGTCTAGCAAGACAGTGGTTTGGGAAATCCTGCG
AAGGATGAAAGGCTACTCAATTCAATCGGAGAGAGCCACCACGAACGTACGTGGATTTATGACTCGAGCACAAAGATACATAAGCGCCGAGGAGCTGTTGAAATCCAAGA
GGAAGAAAGAGAGAGTCGGGGGATGTCAATATCAGACAGACGCCGGGAAGACAGGGGAAAGAGGCACCAGGCCGAAGGAAGAGGCCGGAGCCGACCTGAGCACTCCTCGG
CCAACGACCAGGTTTGGCCGCAATACAGGACACGAATCTGCTAAACGCCCAGAAAAGTTGAGATCAGATCCCGACAGGAGAAACCGAAACAAATATTGCATGTTCCATGG
AGATCACGACCATACAACCGAGAATGCATCCAACCAAGGCAAGGGTGGTGCCAACCCACCGCTCGAGATTCGAACCATTTTAGGAGGACCCTCAGGAGGAGAGTCGGGTA
GGAAGCGAAAAGTTGCAGTTCGAGAGGCACAACAAGAGCCCGAGGGACAAGGTATGTACTCACTCCTACTTGATGAAAACTCACCAAAGTTAGAGTTTACAGAAAATGAG
GCTACGGGAATACATCATCCGCACAACGACGCGCTGGTAGTCGCTCTAACGGTTGCCAACGCGAAGGTCCACCGGATCCTCATTGATGGGGGAAGTTCCGCTGATGTGCT
CTCAACTACTGCGTTCGACGCCATGAAGCTGGGGAGTGATCACCTGAAGCCGAGCCACACGCCATCGGTAGGTTTTGGCGGAGAAAAAGTAACCCGAGGGGAGTGTCGAA
TTACCGGTGACGTTTGGAGAAGGTTACAGACAGTAACGAGGATGATCAACTTTGTGGTGGTGGACTACGTCCCGGCATATAATGCCATCTTGGGACGACCCACCTACATG
GGCTCAAGCTGTGGTTCAACCTATCACCAAGTGCTGAAGTTCCCAACTGAAGAAGGTGTAGGAGCAGTGTACGACGAGCAGAAAATGTCAAGGGAATGCTACTTTATGGC
ACTCAAGAACGTTGACAAAAGGTTCAAGCGACGCCAGCCTCGGGATATGGCCGAGGCCGAGAAGCTGAAGGGGCAAGTTTTCCCCTCCCAATGGAACATTATTTACTCTT
CATTATGTTTTCCAGTCCGCTAG
Protein sequenceShow/hide protein sequence
MHTRLGWTFMGQRSNKVPSLCANTHRSSKTVVWEILRRMKGYSIQSERATTNVRGFMTRAQRYISAEELLKSKRKKERVGGCQYQTDAGKTGERGTRPKEEAGADLSTPR
PTTRFGRNTGHESAKRPEKLRSDPDRRNRNKYCMFHGDHDHTTENASNQGKGGANPPLEIRTILGGPSGGESGRKRKVAVREAQQEPEGQGMYSLLLDENSPKLEFTENE
ATGIHHPHNDALVVALTVANAKVHRILIDGGSSADVLSTTAFDAMKLGSDHLKPSHTPSVGFGGEKVTRGECRITGDVWRRLQTVTRMINFVVVDYVPAYNAILGRPTYM
GSSCGSTYHQVLKFPTEEGVGAVYDEQKMSRECYFMALKNVDKRFKRRQPRDMAEAEKLKGQVFPSQWNIIYSSLCFPVR