CuGenDBv2

Lag0018095 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018095
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: chr5:16002412..16003307
RNA-Seq Expression: Lag0018095
Synteny: Lag0018095
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
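The genome location field above packs the chromosome and both coordinates into one string. A minimal sketch of how it could be split and the gene span computed follows; the function names are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr5:16002412..16003307")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 896 bp on chr5.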


Homology
GenBank top hits    e value    % identity    Alignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]    4.1e-41    40.68
Query:  NKTTLPKQ-RDFLQPVLPTENSGIVYATIQANNFELKTGLIQM-REIILSRDILWRTHFPSPIIPRDLWDNKDERSSSRCNQIEVISIFLQDKAKDWLEH
        ++  LP+  +D+++PV+    S I+   I ANNFELK  LI M ++   S   L   +    +        K    +    ++ +    L+DKA+ WL+ 
Subjt:  NKTTLPKQ-RDFLQPVLPTENSGIVYATIQANNFELKTGLIQM-REIILSRDILWRTHFPSPIIPRDLWDNKDERSSSRCNQIEVISIFLQDKAKDWLEH

Query:  SLPN----------------FPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKTVTEA
          P                 FPPAKT +LR+EIG F+Q D E LYEAWERYK+++R CPQHG PDWLQV +FYNGLN  T+T++D ++GG+ +SKT   A
Subjt:  SLPN----------------FPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKTVTEA

Query:  NDLLEEMAATSYQWPTEKGIISKKAGLYEIDESSSL
          LLEEMA+ +YQWPTE+ +  K AG++E++  ++L
Subjt:  NDLLEEMAATSYQWPTEKGIISKKAGLYEIDESSSL

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]    4.1e-41    40.68
Query:  NKTTLPKQ-RDFLQPVLPTENSGIVYATIQANNFELKTGLIQM-REIILSRDILWRTHFPSPIIPRDLWDNKDERSSSRCNQIEVISIFLQDKAKDWLEH
        ++  LP+  +D+++PV+    S I+   I ANNFELK  LI M ++   S   L   +    +        K    +    ++ +    L+DKA+ WL+ 
Subjt:  NKTTLPKQ-RDFLQPVLPTENSGIVYATIQANNFELKTGLIQM-REIILSRDILWRTHFPSPIIPRDLWDNKDERSSSRCNQIEVISIFLQDKAKDWLEH

Query:  SLPN----------------FPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKTVTEA
          P                 FPPAKT +LR+EIG F+Q D E LYEAWERYK+++R CPQHG PDWLQV +FYNGLN  T+T++D ++GG+ +SKT   A
Subjt:  SLPN----------------FPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKTVTEA

Query:  NDLLEEMAATSYQWPTEKGIISKKAGLYEIDESSSL
          LLEEMA+ +YQWPTE+ +  K AG++E++  ++L
Subjt:  NDLLEEMAATSYQWPTEKGIISKKAGLYEIDESSSL

WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]    2.7e-53    47.57
Query:  NKTT-----LPKQ-RDFLQPVLPTENSGIVYATIQANNFELKTGLIQM-REIILSRDILWRTHFPSPIIPRDLWDNKDERSSSRCNQIEVISIFLQDKAK
        N+TT     +PK  RD+ QP LP    GI+   I  NNFELK GLIQM RE+          H             K    S+   ++ +    LQD+AK
Subjt:  NKTT-----LPKQ-RDFLQPVLPTENSGIVYATIQANNFELKTGLIQM-REIILSRDILWRTHFPSPIIPRDLWDNKDERSSSRCNQIEVISIFLQDKAK

Query:  DWLEHSLPN----------------FPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSK
        DWLE   P+                FPPAK+ +LRTEIGTFRQL++EQLYEAWERYK++LR CPQHGYPDWLQ+ LFYNGL  STK++LD +AGGS  SK
Subjt:  DWLEHSLPN----------------FPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSK

Query:  TVTEANDLLEEMAATSYQWPTEKGI--ISKKAGLYEIDESSSLKHKWLH-DNALNKLTSSRWSSLSP
           EA  +LE++A TSY WP E+    I K AGLYE+DE +SLK +     NAL+KLT+   +  +P
Subjt:  TVTEANDLLEEMAATSYQWPTEKGI--ISKKAGLYEIDESSSLKHKWLH-DNALNKLTSSRWSSLSP

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]    1.7e-42    42.29
Query:  RDFLQPVLPTENSGIVYATIQANNFELKTGLIQM-REIILSRDILWRTHFPSPIIPRDLWDNKDERSSSRCNQIEVISIFLQDKAKDWLEHSLPN-----
        +D+++P++    SGI   TI ANNFELK  LI M ++   S   L   +    +        K    +    ++ +    L+DKA+ WL+   P      
Subjt:  RDFLQPVLPTENSGIVYATIQANNFELKTGLIQM-REIILSRDILWRTHFPSPIIPRDLWDNKDERSSSRCNQIEVISIFLQDKAKDWLEHSLPN-----

Query:  -----------FPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKTVTEANDLLEEMAA
                   FPPAKT +LR+EIG FRQ D E LYEAWERYK+++R CPQHG PDWLQV +FYNGLN  T+T++D ++GG+ +SKT   A  LLEEMA+
Subjt:  -----------FPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKTVTEANDLLEEMAA

Query:  TSYQWPTEKGIISKKAGLYEIDESSSL
         +YQWPTE+ +  K AG++E++  ++L
Subjt:  TSYQWPTEKGIISKKAGLYEIDESSSL

XP_023903214.1 uncharacterized protein LOC112015077 [Quercus suber]    1.7e-42    42.29
Query:  RDFLQPVLPTENSGIVYATIQANNFELKTGLIQM-REIILSRDILWRTHFPSPIIPRDLWDNKDERSSSRCNQIEVISIFLQDKAKDWLEHSLPN-----
        +D+++P++    SGI   TI ANNFELK  LI M ++   S   L   +    +        K    +    ++ +    L+DKA+ WL+   P      
Subjt:  RDFLQPVLPTENSGIVYATIQANNFELKTGLIQM-REIILSRDILWRTHFPSPIIPRDLWDNKDERSSSRCNQIEVISIFLQDKAKDWLEHSLPN-----

Query:  -----------FPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKTVTEANDLLEEMAA
                   FPPAKT +LR+EIG FRQ D E LYEAWERYK+++R CPQHG PDWLQV +FYNGLN  T+T++D ++GG+ +SKT   A  LLEEMA+
Subjt:  -----------FPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKTVTEANDLLEEMAA

Query:  TSYQWPTEKGIISKKAGLYEIDESSSL
         +YQWPTE+ +  K AG++E++  ++L
Subjt:  TSYQWPTEKGIISKKAGLYEIDESSSL

TrEMBL top hits    e value    % identity    Alignment
A0A2I4F4C8 uncharacterized protein LOC108995373    3.5e-38    42.17
Query:  AHNKTTLPKQRDFLQPVLPTENSGIVYATIQANNFELKTGLIQM-REIILSRDILWRTHFPSPIIPRDLWDNKDERSSSRCNQIEVISIFLQDKAKDWLE
        AH +T     +D+++PV+    SGI   TI ANNFELK  LI M ++   SR  L   +    +        K    +    ++ +    L+DKA+ WL+
Subjt:  AHNKTTLPKQRDFLQPVLPTENSGIVYATIQANNFELKTGLIQM-REIILSRDILWRTHFPSPIIPRDLWDNKDERSSSRCNQIEVISIFLQDKAKDWLE

Query:  H----SLPN------------FPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKTVT-
             S+ +            FPPAKTT+LR+EI  F+Q D E LYEAWERYK ++R CPQHG P+WLQV +FYNGLN  T+T++D +AGG+ +SKT+  
Subjt:  H----SLPN------------FPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKTVT-

Query:  EANDLLEEMAATSYQWPTEKGIISKKAGLY
         A  LLEEM + +YQWPTEK +  K  G++
Subjt:  EANDLLEEMAATSYQWPTEKGIISKKAGLY

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC110412945    9.2e-39    41.92
Query:  RDFLQPVLPTENSGIVYATIQANNFELKTGLIQMREIILSRDILWRTHFPSPIIP-RDLWDN-KDERSSSRCNQIEVISIFLQDKAKDWLEHSLPN----
        RD++ P++   +  I   +I ANNFE+K   IQM +  +    L      S ++   ++ D  K    +    ++ +    L+DKAK WL +SLPN    
Subjt:  RDFLQPVLPTENSGIVYATIQANNFELKTGLIQMREIILSRDILWRTHFPSPIIP-RDLWDN-KDERSSSRCNQIEVISIFLQDKAKDWLEHSLPN----

Query:  -------------FPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKTVTEANDLLEEM
                     FPPAKT K+R +I +F Q D E LYEAWER+KE+LR CP HG PDWLQV  FYNGL  S KT++D +AGG+ +SK   +A +LLEEM
Subjt:  -------------FPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKTVTEANDLLEEM

Query:  AATSYQWPTEKGIISKKAGLYEIDESSSL
        A+ +YQWP+E+    K  G YEID   +L
Subjt:  AATSYQWPTEKGIISKKAGLYEIDESSSL

A0A6J0ZYV0 uncharacterized protein LOC110413413    1.6e-38    41.92
Query:  RDFLQPVLPTENSGIVYATIQANNFELKTGLIQMREIILSRDILWRTHFPSPIIP-RDLWDN-KDERSSSRCNQIEVISIFLQDKAKDWLEHSLPN----
        RD+  P++   +  I   +I ANNFE+K   IQM +  +    L      S ++   ++ D  K    +    ++ +    L+DKAK WL +SLPN    
Subjt:  RDFLQPVLPTENSGIVYATIQANNFELKTGLIQMREIILSRDILWRTHFPSPIIP-RDLWDN-KDERSSSRCNQIEVISIFLQDKAKDWLEHSLPN----

Query:  -------------FPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKTVTEANDLLEEM
                     FPPAKT K+R +I +F Q D E LYEAWER+KE+LR CP HG PDWLQV  FYNGL  S KT++D +AGG+ +SK   +A +LLEEM
Subjt:  -------------FPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKTVTEANDLLEEM

Query:  AATSYQWPTEKGIISKKAGLYEIDESSSL
        A+ +YQWP+E+    K  G YEID   +L
Subjt:  AATSYQWPTEKGIISKKAGLYEIDESSSL

A0A6J1DU19 uncharacterized protein LOC111024361    2.0e-41    46.7
Query:  RDFLQPVLPTENSGIVYATIQANNFELKTGLIQM-REIILSRDILWRTHFPSPIIPRDLWDNKDERSSSRCNQIEVISIFLQDKAKDWLEHSLPN-FPPA
        RD+ QP  P  + GI+   I ANN ELK GLIQM RE     +     +    I        K         ++ +  + LQD  K+ ++  L N FPPA
Subjt:  RDFLQPVLPTENSGIVYATIQANNFELKTGLIQM-REIILSRDILWRTHFPSPIIPRDLWDNKDERSSSRCNQIEVISIFLQDKAKDWLEHSLPN-FPPA

Query:  KTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKTVTEANDLLEEMAATSYQWPTEKGIISKK
        KTT+LRTEI +FR+ D EQL+E WERYKE+LR CPQHG  +WLQ+ +FYNGLN  T+T+LD +AGG+ LS+T   A  LL++MA  S+QWP+E+    K 
Subjt:  KTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKTVTEANDLLEEMAATSYQWPTEKGIISKK

Query:  AGLYEIDESSSLKHK-WLHDNALNKLT
        AG+YEIDE SSLK +     NA++KL+
Subjt:  AGLYEIDESSSLKHK-WLHDNALNKLT

A0A6P6XAQ1 Reverse transcriptase    4.6e-38    40
Query:  MAHNKTTLPKQRDFLQPVLPTENSGIVYATIQANNFELKTGLIQMREIILSRDILWRTHFPSPIIPR--DLWDN-KDERSSSRCNQIEVISIFLQDKAKD
        MA N++     RDF  P      + IV  T+ ANNFE+K  LIQM  +  S+     T  P+  +    ++ D  K    S    ++ +    L+DKAK 
Subjt:  MAHNKTTLPKQRDFLQPVLPTENSGIVYATIQANNFELKTGLIQMREIILSRDILWRTHFPSPIIPR--DLWDN-KDERSSSRCNQIEVISIFLQDKAKD

Query:  WLEHSLPN----------------FPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKT
        WL+   PN                FPP KT KLR +I +F Q + E LYEAWERY+E+ R CP HG PDWL V  FYNGL   TKT +D +AGG+ + KT
Subjt:  WLEHSLPN----------------FPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKT

Query:  VTEANDLLEEMAATSYQWPTEKGIISKKAGLYEIDESSSLKHKWLHDNALNKLTSSRWSS
          EA  L+EEMAA +YQW  E+G   + AG+ E+D  + L  K   DN +  L     SS
Subjt:  VTEANDLLEEMAATSYQWPTEKGIISKKAGLYEIDESSSLKHKWLHDNALNKLTSSRWSS

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences
CDS sequence
ATGGCCCACAACAAGACGACGCTCCCAAAGCAAAGAGATTTTCTGCAGCCAGTTCTCCCGACTGAGAATTCCGGCATTGTTTATGCCACGATTCAAGCGAACAACTTCGA
ATTAAAGACGGGACTGATCCAGATGCGAGAGATAATTCTTTCAAGGGACATCCTTTGGAGGACTCACTTCCCATCTCCAATCATTCCTAGAGATTTGTGGGACAATAAAG
ATGAACGGAGTTCCAGCCGATGCAATCAGATTGAGGTTATTTCCATTTTCCTCCAAGACAAAGCGAAAGATTGGCTAGAACATTCCTTACCAAATTTTCCGCCTGCGAAG
ACCACAAAGCTCCGTACAGAGATTGGAACGTTCAGACAGTTGGATGAGGAGCAGTTATATGAGGCATGGGAGAGATATAAAGAGATGCTTAGGACATGCCCACAGCACGG
CTATCCCGATTGGCTTCAAGTGATGTTATTCTACAATGGGTTGAATCCTTCTACCAAGACAGTTCTTGATACATCAGCGGGTGGAAGCTTTCTATCCAAGACGGTGACTG
AGGCAAATGATCTTTTGGAGGAGATGGCGGCAACCAGCTATCAATGGCCTACAGAGAAGGGAATAATATCAAAGAAGGCTGGCTTATATGAGATCGATGAGTCAAGCTCT
CTAAAGCACAAATGGCTTCACGACAACGCGCTAAACAAGCTTACTTCTTCTAGGTGGTCAAGTCTATCTCCACTTTGGCAGAAGGATACTCGAAAAGGAAAGTCAGGATG
TCGAAGAAGTTCAATATGTTGGAAATCGAGCGTTTACCCAAGAATTGTCGAACTTCTATCATCCAAATCTGCACAACTATGA
mRNA sequence
ATGGCCCACAACAAGACGACGCTCCCAAAGCAAAGAGATTTTCTGCAGCCAGTTCTCCCGACTGAGAATTCCGGCATTGTTTATGCCACGATTCAAGCGAACAACTTCGA
ATTAAAGACGGGACTGATCCAGATGCGAGAGATAATTCTTTCAAGGGACATCCTTTGGAGGACTCACTTCCCATCTCCAATCATTCCTAGAGATTTGTGGGACAATAAAG
ATGAACGGAGTTCCAGCCGATGCAATCAGATTGAGGTTATTTCCATTTTCCTCCAAGACAAAGCGAAAGATTGGCTAGAACATTCCTTACCAAATTTTCCGCCTGCGAAG
ACCACAAAGCTCCGTACAGAGATTGGAACGTTCAGACAGTTGGATGAGGAGCAGTTATATGAGGCATGGGAGAGATATAAAGAGATGCTTAGGACATGCCCACAGCACGG
CTATCCCGATTGGCTTCAAGTGATGTTATTCTACAATGGGTTGAATCCTTCTACCAAGACAGTTCTTGATACATCAGCGGGTGGAAGCTTTCTATCCAAGACGGTGACTG
AGGCAAATGATCTTTTGGAGGAGATGGCGGCAACCAGCTATCAATGGCCTACAGAGAAGGGAATAATATCAAAGAAGGCTGGCTTATATGAGATCGATGAGTCAAGCTCT
CTAAAGCACAAATGGCTTCACGACAACGCGCTAAACAAGCTTACTTCTTCTAGGTGGTCAAGTCTATCTCCACTTTGGCAGAAGGATACTCGAAAAGGAAAGTCAGGATG
TCGAAGAAGTTCAATATGTTGGAAATCGAGCGTTTACCCAAGAATTGTCGAACTTCTATCATCCAAATCTGCACAACTATGA
Protein sequence
MAHNKTTLPKQRDFLQPVLPTENSGIVYATIQANNFELKTGLIQMREIILSRDILWRTHFPSPIIPRDLWDNKDERSSSRCNQIEVISIFLQDKAKDWLEHSLPNFPPAK
TTKLRTEIGTFRQLDEEQLYEAWERYKEMLRTCPQHGYPDWLQVMLFYNGLNPSTKTVLDTSAGGSFLSKTVTEANDLLEEMAATSYQWPTEKGIISKKAGLYEIDESSS
LKHKWLHDNALNKLTSSRWSSLSPLWQKDTRKGKSGCRRSSICWKSSVYPRIVELLSSKSAQL
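The protein sequence above should be the translation of the listed CDS. A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1); the record does not say which table was used, and the `translate` helper is a hypothetical name. Only the first 30 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
# ASSUMPTION: the record does not state which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 12 nt of the CDS sequence listed in this record.
cds_start = "ATGGCCCACAACAAGACGACGCTCCCAAAG"
cds_end = "GCACAACTATGA"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

The first fragment translates to MAHNKTTLPK and the last to AQL followed by a stop codon, matching the start and end of the protein sequence.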