CuGenDBv2

Lag0026360 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026360
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr10:35431994..35433261
RNA-Seq Expression: Lag0026360
Synteny: Lag0026360
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits (e value, % identity, alignment)
XP_022150035.1 uncharacterized protein LOC111018307 [Momordica charantia] (e value: 6.9e-23; identity: 39.23%)
Query:  EEVMKVEVPQKFKVP--------------------------------CLLFHP--SRINRHWFERLKRRSISCFKDLARAFLAQFMGARELRKPHINLLT
        EE+MKV+VP KFK+P                                C +F    S   R WF +LKR SIS FK LA+AF+ QF+G R   +P   LLT
Subjt:  EEVMKVEVPQKFKVP--------------------------------CLLFHP--SRINRHWFERLKRRSISCFKDLARAFLAQFMGARELRKPHINLLT

Query:  VKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSRAQKYMSAEELLKSKR------SERKYKRASSSDHDSKK
        +KQ+  ESL DY+ RFN+E LQVEG +   +L+A  + + DE L  S GK    T++E +SRAQKYMSA E   SKR      S++  +R+      S+ 
Subjt:  VKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSRAQKYMSAEELLKSKR------SERKYKRASSSDHDSKK

Query:  DKRQRTDER
        +KR R+  +
Subjt:  DKRQRTDER

XP_022158344.1 uncharacterized protein LOC111024851 [Momordica charantia] (e value: 3.7e-24; identity: 39%)
Query:  LKELENPQGDLQKLKDSGGQDMEELIDQVDPPFTEEVMKVEVPQKFKVPCLLFHPSRINRHWFERLKRRSISCFKDLARAFLAQFMGARELRKPHINLLT
        +KE   P+  L  + +  G    + ID +D  + E     ++ +  +     F  +   R+WF +LKR SIS FK+LA AF+ QF+G R   KP   LLT
Subjt:  LKELENPQGDLQKLKDSGGQDMEELIDQVDPPFTEEVMKVEVPQKFKVPCLLFHPSRINRHWFERLKRRSISCFKDLARAFLAQFMGARELRKPHINLLT

Query:  VKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSRAQKYMSAEELLKSKRSERKYKRASSSDHDSKKDKRQRT
        +KQ+  ESL++Y+ RFN+E LQVEG ++  AL+A  +G++DERL+ S GK    T+ E +SRAQKYMSA EL+   R   + +   S+  + + +KR R+
Subjt:  VKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSRAQKYMSAEELLKSKRSERKYKRASSSDHDSKKDKRQRT

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia] (e value: 3.8e-29; identity: 33.01%)
Query:  SPRQAG---RGRGRAEDADTKIAALEDEVKGMNQSLSKIL---QILDKPSPSTK---------LHEGSLIRDLKKGKNPVEYMDESETESRGKKTNSSTS
        SPR++      + R    D ++   ED     NQ   + L     L  P P  K           E  L+RD KKGK P     ES+TE   + TNS  S
Subjt:  SPRQAG---RGRGRAEDADTKIAALEDEVKGMNQSLSKIL---QILDKPSPSTK---------LHEGSLIRDLKKGKNPVEYMDESETESRGKKTNSSTS

Query:  MVR-GLKHTERTVLRSPESCTSRRTGLRNLVEKKRRVAKTVESEARATEAEAKKNHLPWKTE--LLNTLKELENPQ-GDLQKLKDSGGQDMEELIDQVDP
         +R G    +RT +  P                     KT +          K +H    +E   L+  K  + P+  + +      G D+EEL+DQ D 
Subjt:  MVR-GLKHTERTVLRSPESCTSRRTGLRNLVEKKRRVAKTVESEARATEAEAKKNHLPWKTE--LLNTLKELENPQ-GDLQKLKDSGGQDMEELIDQVDP

Query:  PFTEEVMKVEVPQKFKVP--------------------------------CLLFHPSRIN--RHWFERLKRRSISCFKDLARAFLAQFMGARELRKPHIN
        PFTEE+M+ +VP KFK+P                                C +F  +     R WF +LKR SIS FK LARAF+ QF+G R   +P   
Subjt:  PFTEEVMKVEVPQKFKVP--------------------------------CLLFHPSRIN--RHWFERLKRRSISCFKDLARAFLAQFMGARELRKPHIN

Query:  LLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSRAQKYMSAEELLKS------KRSERKYKRASSSDHD
        LLT+KQ+  ESLRDY+ RFN+E LQVEG ++  +L+A  +G+ DE L  S GK    T++E +SRAQ+YMSA E   S      KR++ K +R+      
Subjt:  LLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSRAQKYMSAEELLKS------KRSERKYKRASSSDHD

Query:  SKKDKRQRTDER
        S+ +KR R+ ++
Subjt:  SKKDKRQRTDER

XP_022159109.1 uncharacterized protein LOC111025548 [Momordica charantia] (e value: 5.4e-28; identity: 37.25%)
Query:  LNTLKELENPQ-GDLQKLKDSGGQDMEELIDQVDPPFTEEVMKVEVPQKFKVP--------------------------------CLLFHPSRIN--RHW
        LN  K ++ P+  + +  +   G D+EEL+ Q D PFTEE+M+ +VP KFK+P                                C +F  +     R W
Subjt:  LNTLKELENPQ-GDLQKLKDSGGQDMEELIDQVDPPFTEEVMKVEVPQKFKVP--------------------------------CLLFHPSRIN--RHW

Query:  FERLKRRSISCFKDLARAFLAQFMGARELRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSR
        F +LKR SIS FK LARAF+ QF+G R   +P   LLT+KQ+  ESL DY+ RFN+E LQ+EG ++  +L+A  +G+ DE L  S  K    T++E +SR
Subjt:  FERLKRRSISCFKDLARAFLAQFMGARELRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSR

Query:  AQKYMSAEELLKS------KRSERKYKRASSSDHDSKKDKRQRTDER
        AQ+YMSA E   S      KR+++K +R+      S+ +KR R  ++
Subjt:  AQKYMSAEELLKS------KRSERKYKRASSSDHDSKKDKRQRTDER

XP_024047974.1 uncharacterized protein LOC112101548 [Citrus clementina] (e value: 9.0e-23; identity: 36.7%)
Query:  LKDSGGQDMEELIDQVDPPFTEEVMKVEVPQKFKVP--------------------------------CLLF--HPSRINRHWFERLKRRSISCFKDLAR
        ++  G QD  +++ + +PPFT+E+M+   P  F++P                                C  F    SR  R WF  L+  SIS F +L R
Subjt:  LKDSGGQDMEELIDQVDPPFTEEVMKVEVPQKFKVP--------------------------------CLLF--HPSRINRHWFERLKRRSISCFKDLAR

Query:  AFLAQFMGARELRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSRAQKYMSAEELLKSKRSE
         F   F  AR+  KP   LLTVKQ  GESLR+YI R+N E  QV+GY +G AL  +  GL+  RL  S+ K+   TY+E +SRA+KY +AEE  +SK+  
Subjt:  AFLAQFMGARELRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSRAQKYMSAEELLKSKRSE

Query:  RKYKRASSSDHDSKKDKR
         K +  SS +  +K+D+R
Subjt:  RKYKRASSSDHDSKKDKR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1D7D2 uncharacterized protein LOC111018307 (e value: 3.3e-23; identity: 39.23%)
Query:  EEVMKVEVPQKFKVP--------------------------------CLLFHP--SRINRHWFERLKRRSISCFKDLARAFLAQFMGARELRKPHINLLT
        EE+MKV+VP KFK+P                                C +F    S   R WF +LKR SIS FK LA+AF+ QF+G R   +P   LLT
Subjt:  EEVMKVEVPQKFKVP--------------------------------CLLFHP--SRINRHWFERLKRRSISCFKDLARAFLAQFMGARELRKPHINLLT

Query:  VKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSRAQKYMSAEELLKSKR------SERKYKRASSSDHDSKK
        +KQ+  ESL DY+ RFN+E LQVEG +   +L+A  + + DE L  S GK    T++E +SRAQKYMSA E   SKR      S++  +R+      S+ 
Subjt:  VKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSRAQKYMSAEELLKSKR------SERKYKRASSSDHDSKK

Query:  DKRQRTDER
        +KR R+  +
Subjt:  DKRQRTDER

A0A6J1DWY0 uncharacterized protein LOC111025293 (e value: 1.8e-29; identity: 33.01%)
Query:  SPRQAG---RGRGRAEDADTKIAALEDEVKGMNQSLSKIL---QILDKPSPSTK---------LHEGSLIRDLKKGKNPVEYMDESETESRGKKTNSSTS
        SPR++      + R    D ++   ED     NQ   + L     L  P P  K           E  L+RD KKGK P     ES+TE   + TNS  S
Subjt:  SPRQAG---RGRGRAEDADTKIAALEDEVKGMNQSLSKIL---QILDKPSPSTK---------LHEGSLIRDLKKGKNPVEYMDESETESRGKKTNSSTS

Query:  MVR-GLKHTERTVLRSPESCTSRRTGLRNLVEKKRRVAKTVESEARATEAEAKKNHLPWKTE--LLNTLKELENPQ-GDLQKLKDSGGQDMEELIDQVDP
         +R G    +RT +  P                     KT +          K +H    +E   L+  K  + P+  + +      G D+EEL+DQ D 
Subjt:  MVR-GLKHTERTVLRSPESCTSRRTGLRNLVEKKRRVAKTVESEARATEAEAKKNHLPWKTE--LLNTLKELENPQ-GDLQKLKDSGGQDMEELIDQVDP

Query:  PFTEEVMKVEVPQKFKVP--------------------------------CLLFHPSRIN--RHWFERLKRRSISCFKDLARAFLAQFMGARELRKPHIN
        PFTEE+M+ +VP KFK+P                                C +F  +     R WF +LKR SIS FK LARAF+ QF+G R   +P   
Subjt:  PFTEEVMKVEVPQKFKVP--------------------------------CLLFHPSRIN--RHWFERLKRRSISCFKDLARAFLAQFMGARELRKPHIN

Query:  LLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSRAQKYMSAEELLKS------KRSERKYKRASSSDHD
        LLT+KQ+  ESLRDY+ RFN+E LQVEG ++  +L+A  +G+ DE L  S GK    T++E +SRAQ+YMSA E   S      KR++ K +R+      
Subjt:  LLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSRAQKYMSAEELLKS------KRSERKYKRASSSDHD

Query:  SKKDKRQRTDER
        S+ +KR R+ ++
Subjt:  SKKDKRQRTDER

A0A6J1DZ49 uncharacterized protein LOC111024851 (e value: 1.8e-24; identity: 39%)
Query:  LKELENPQGDLQKLKDSGGQDMEELIDQVDPPFTEEVMKVEVPQKFKVPCLLFHPSRINRHWFERLKRRSISCFKDLARAFLAQFMGARELRKPHINLLT
        +KE   P+  L  + +  G    + ID +D  + E     ++ +  +     F  +   R+WF +LKR SIS FK+LA AF+ QF+G R   KP   LLT
Subjt:  LKELENPQGDLQKLKDSGGQDMEELIDQVDPPFTEEVMKVEVPQKFKVPCLLFHPSRINRHWFERLKRRSISCFKDLARAFLAQFMGARELRKPHINLLT

Query:  VKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSRAQKYMSAEELLKSKRSERKYKRASSSDHDSKKDKRQRT
        +KQ+  ESL++Y+ RFN+E LQVEG ++  AL+A  +G++DERL+ S GK    T+ E +SRAQKYMSA EL+   R   + +   S+  + + +KR R+
Subjt:  VKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSRAQKYMSAEELLKSKRSERKYKRASSSDHDSKKDKRQRT

A0A6J1E1E7 uncharacterized protein LOC111025548 (e value: 2.6e-28; identity: 37.25%)
Query:  LNTLKELENPQ-GDLQKLKDSGGQDMEELIDQVDPPFTEEVMKVEVPQKFKVP--------------------------------CLLFHPSRIN--RHW
        LN  K ++ P+  + +  +   G D+EEL+ Q D PFTEE+M+ +VP KFK+P                                C +F  +     R W
Subjt:  LNTLKELENPQ-GDLQKLKDSGGQDMEELIDQVDPPFTEEVMKVEVPQKFKVP--------------------------------CLLFHPSRIN--RHW

Query:  FERLKRRSISCFKDLARAFLAQFMGARELRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSR
        F +LKR SIS FK LARAF+ QF+G R   +P   LLT+KQ+  ESL DY+ RFN+E LQ+EG ++  +L+A  +G+ DE L  S  K    T++E +SR
Subjt:  FERLKRRSISCFKDLARAFLAQFMGARELRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSR

Query:  AQKYMSAEELLKS------KRSERKYKRASSSDHDSKKDKRQRTDER
        AQ+YMSA E   S      KR+++K +R+      S+ +KR R  ++
Subjt:  AQKYMSAEELLKS------KRSERKYKRASSSDHDSKKDKRQRTDER

A0A7J0GII1 Retrotrans_gag domain-containing protein (e value: 8.5e-19; identity: 27.91%)
Query:  MEKGSRN-PNPEASKNSRQPQTSRDEDNIQGSPRQAGRGRGRAEDADTKIAALEDEVKGMNQSLSKILQILDKPSPSTKLHEGSLIRDLKKGKNPVEYMD
        M   SRN P P AS     P  +R       +P QA       E    +I  + ++++ MN++  +++QIL   +P   L    LI D+++ ++      
Subjt:  MEKGSRN-PNPEASKNSRQPQTSRDEDNIQGSPRQAGRGRGRAEDADTKIAALEDEVKGMNQSLSKILQILDKPSPSTKLHEGSLIRDLKKGKNPVEYMD

Query:  ESETESRGKKTNSSTSMVRGLKHTERTVLRSPESCTSRRTGLRNLVEKKRRVAKTVESEARATEAEAKKNHLPWKTELLNTLKELENPQGDLQKLKDSGG
         S       +  S+  + RG +    + LR   S  S +T  R+   +  R    V +  ++T    K   L  + + +NT                   
Subjt:  ESETESRGKKTNSSTSMVRGLKHTERTVLRSPESCTSRRTGLRNLVEKKRRVAKTVESEARATEAEAKKNHLPWKTELLNTLKELENPQGDLQKLKDSGG

Query:  QDMEELIDQVDPPFTEEVMKVEVPQKFKVPCLLFHPSRIN---RHWFERLKRRSISCFKDLARAFLAQFMGARELRKPHINLLTVKQQPGESLRDYITRF
          ++ LI Q +PPFTE +++  +  KFK+P  L   + +    R WF +L   +I  F DL+R F+A FM  R   K   +L T+ Q+  ESL+D++ RF
Subjt:  QDMEELIDQVDPPFTEEVMKVEVPQKFKVPCLLFHPSRIN---RHWFERLKRRSISCFKDLARAFLAQFMGARELRKPHINLLTVKQQPGESLRDYITRF

Query:  NDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSRAQKYMSAEELLKSKRSERKYKRASSSDHDSKKDKRQRTDER
        N   L+VE  S+   ++A+  GL    L +S+ K+   T +   S+A KY++AEEL ++KR     +R    DH  K+   +R D R
Subjt:  NDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQHRTYAEFVSRAQKYMSAEELLKSKRSERKYKRASSSDHDSKKDKRQRTDER

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAGTAAGAGGATGGAGAAGGGAAGCCGAAATCCGAACCCAGAAGCTTCGAAAAACAGCCGCCAGCCGCAGACGTCACGAGACGAGGACAACATCCAGGGGTCACCGAG
ACAAGCAGGCCGAGGCCGAGGCCGAGCAGAGGATGCCGACACCAAAATTGCCGCCCTTGAGGATGAGGTGAAGGGAATGAATCAGAGTTTGTCCAAAATACTCCAAATCC
TGGATAAACCCAGCCCTAGCACCAAACTCCATGAGGGGAGCTTGATTAGAGACCTGAAGAAGGGGAAGAATCCAGTCGAATACATGGATGAATCAGAGACAGAATCCAGA
GGAAAGAAGACCAACAGCTCAACCAGCATGGTCAGGGGGCTGAAGCACACAGAGCGCACAGTACTGAGGAGCCCTGAGTCATGTACCAGCCGTAGAACAGGCCTGAGAAA
TCTAGTCGAGAAAAAGCGCAGAGTGGCCAAAACTGTTGAGTCTGAGGCCAGAGCTACCGAGGCCGAGGCTAAGAAAAACCATCTCCCTTGGAAGACTGAGCTTCTAAACA
CACTAAAGGAGCTCGAAAATCCTCAGGGAGACCTGCAGAAGTTGAAGGATTCGGGAGGGCAAGACATGGAAGAACTAATCGACCAAGTCGACCCACCCTTCACAGAAGAA
GTCATGAAAGTTGAGGTGCCCCAGAAGTTCAAGGTGCCGTGCCTTCTTTTTCACCCTAGCAGGATCAATAGGCACTGGTTTGAGAGGCTAAAAAGGAGATCCATCAGCTG
TTTCAAGGATTTAGCCCGAGCATTCCTTGCACAGTTCATGGGAGCCAGAGAACTGCGCAAGCCTCACATCAACCTCTTAACAGTCAAACAGCAGCCAGGTGAGAGCTTGC
GTGATTATATAACACGTTTCAACGATGAAGCACTGCAGGTTGAGGGATACAGCGAGGGAGCAGCCCTAGTAGCCATAACAGCCGGACTGGAAGACGAAAGACTGCTCAAT
TCAATAGGTAAGAGCCAACATCGAACCTATGCGGAGTTTGTCTCCCGGGCACAGAAGTATATGAGCGCAGAGGAGTTACTGAAATCAAAGAGGTCAGAACGAAAGTACAA
GAGGGCTTCTTCATCTGACCACGACAGTAAGAAGGACAAGAGGCAGCGGACAGATGAAAGGGGCTGA
mRNA sequence
ATGAGTAAGAGGATGGAGAAGGGAAGCCGAAATCCGAACCCAGAAGCTTCGAAAAACAGCCGCCAGCCGCAGACGTCACGAGACGAGGACAACATCCAGGGGTCACCGAG
ACAAGCAGGCCGAGGCCGAGGCCGAGCAGAGGATGCCGACACCAAAATTGCCGCCCTTGAGGATGAGGTGAAGGGAATGAATCAGAGTTTGTCCAAAATACTCCAAATCC
TGGATAAACCCAGCCCTAGCACCAAACTCCATGAGGGGAGCTTGATTAGAGACCTGAAGAAGGGGAAGAATCCAGTCGAATACATGGATGAATCAGAGACAGAATCCAGA
GGAAAGAAGACCAACAGCTCAACCAGCATGGTCAGGGGGCTGAAGCACACAGAGCGCACAGTACTGAGGAGCCCTGAGTCATGTACCAGCCGTAGAACAGGCCTGAGAAA
TCTAGTCGAGAAAAAGCGCAGAGTGGCCAAAACTGTTGAGTCTGAGGCCAGAGCTACCGAGGCCGAGGCTAAGAAAAACCATCTCCCTTGGAAGACTGAGCTTCTAAACA
CACTAAAGGAGCTCGAAAATCCTCAGGGAGACCTGCAGAAGTTGAAGGATTCGGGAGGGCAAGACATGGAAGAACTAATCGACCAAGTCGACCCACCCTTCACAGAAGAA
GTCATGAAAGTTGAGGTGCCCCAGAAGTTCAAGGTGCCGTGCCTTCTTTTTCACCCTAGCAGGATCAATAGGCACTGGTTTGAGAGGCTAAAAAGGAGATCCATCAGCTG
TTTCAAGGATTTAGCCCGAGCATTCCTTGCACAGTTCATGGGAGCCAGAGAACTGCGCAAGCCTCACATCAACCTCTTAACAGTCAAACAGCAGCCAGGTGAGAGCTTGC
GTGATTATATAACACGTTTCAACGATGAAGCACTGCAGGTTGAGGGATACAGCGAGGGAGCAGCCCTAGTAGCCATAACAGCCGGACTGGAAGACGAAAGACTGCTCAAT
TCAATAGGTAAGAGCCAACATCGAACCTATGCGGAGTTTGTCTCCCGGGCACAGAAGTATATGAGCGCAGAGGAGTTACTGAAATCAAAGAGGTCAGAACGAAAGTACAA
GAGGGCTTCTTCATCTGACCACGACAGTAAGAAGGACAAGAGGCAGCGGACAGATGAAAGGGGCTGA
Protein sequence
MSKRMEKGSRNPNPEASKNSRQPQTSRDEDNIQGSPRQAGRGRGRAEDADTKIAALEDEVKGMNQSLSKILQILDKPSPSTKLHEGSLIRDLKKGKNPVEYMDESETESR
GKKTNSSTSMVRGLKHTERTVLRSPESCTSRRTGLRNLVEKKRRVAKTVESEARATEAEAKKNHLPWKTELLNTLKELENPQGDLQKLKDSGGQDMEELIDQVDPPFTEE
VMKVEVPQKFKVPCLLFHPSRINRHWFERLKRRSISCFKDLARAFLAQFMGARELRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLN
SIGKSQHRTYAEFVSRAQKYMSAEELLKSKRSERKYKRASSSDHDSKKDKRQRTDERG