CuGenDBv2

Lag0000756 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0000756
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotran_gag_3 domain-containing protein
Genome location: chr4:15285950..15286471
RNA-Seq Expression: Lag0000756
Synteny: Lag0000756
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: IPR029472 - Retrotransposon Copia-like, N-terminal


Homology
GenBank top hits | e value | % identity | Alignment
KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis] | 3.2e-28 | 48.97
Query:  MRSMTTSASDSALVSSSSSSNTSVS---LLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTE--GQPSRVNPEYKSWYERDQ
        M +   + + ++  S+S++ N S S   LL+NICNL++ RLDS+NYV W+FQIS +LK+H L  Y+DG+   P++ V+ E     +++NPEY+ W  +DQ
Subjt:  MRSMTTSASDSALVSSSSSSNTSVS---LLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTE--GQPSRVNPEYKSWYERDQ

Query:  ALITLINATLTQTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQ
        AL+TL+NATL+QTALS+VIG  T++E W  LE+ FS+STR+NI+Q
Subjt:  ALITLINATLTQTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQ

KAA8532888.1 hypothetical protein F0562_032995 [Nyssa sinensis] | 1.2e-30 | 45.99
Query:  SMTTSASDSALVSSSSSSN--TSVSLLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTE-GQ-PSRVNPEYKSWYERDQALI
        S T   S SA+ +S+S+S+  T + LL+NICNL++VRLDSTNYV W+FQ + +LK+H L ++VDGS+  P+  +R E G+  + +NP +KSW  +DQALI
Subjt:  SMTTSASDSALVSSSSSSN--TSVSLLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTE-GQ-PSRVNPEYKSWYERDQALI

Query:  TLINATLTQTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQ---------KGKNK--GWIKKNEGLPSS--SLLCRIELHDPIPCT
        TLIN TL+ TAL+++IG  +A+EVW  LE+ FSSS+R NI+Q         KG N    +I+K + +     S+  +IE  D + CT
Subjt:  TLINATLTQTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQ---------KGKNK--GWIKKNEGLPSS--SLLCRIELHDPIPCT

KAA8537769.1 hypothetical protein F0562_027652 [Nyssa sinensis] | 3.2e-28 | 48.97
Query:  MRSMTTSASDSALVSSSSSSNTSVS---LLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTE--GQPSRVNPEYKSWYERDQ
        M +   + + ++  S+S++ N S S   LL+NICNL++ RLDS+NYV W+FQIS +LK+H L  Y+DG+   P++ V+ E     +++NPEY+ W  +DQ
Subjt:  MRSMTTSASDSALVSSSSSSNTSVS---LLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTE--GQPSRVNPEYKSWYERDQ

Query:  ALITLINATLTQTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQ
        AL+TL+NATL+QTALS+VIG  T++E W  LE+ FS+STR+NI+Q
Subjt:  ALITLINATLTQTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQ

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia] | 1.2e-30 | 47.8
Query:  SALVSSSSSSNTS------VSLLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIV----RTEGQPSR------VNPEYKSWYERD
        S    +SSS+NT       + LL+NICNLVS+RLDST+++LW+FQ++ +LK+HKLF ++DGS+ AP Q +     TE QP+       +NP ++ W  +D
Subjt:  SALVSSSSSSNTS------VSLLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIV----RTEGQPSR------VNPEYKSWYERD

Query:  QALITLINATLTQTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQ-KGKNKGWIKKNE
        QAL+TLINATL+  AL+YV+   T+K+VW+ LEKH+SS++RTN+V  K   +  +KK E
Subjt:  QALITLINATLTQTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQ-KGKNKGWIKKNE

XP_022158689.1 uncharacterized protein LOC111025150 [Momordica charantia] | 2.4e-28 | 42.58
Query:  NTSVSLLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIV-----RTEGQPSRVNPEYKSWYERDQALITLINATLTQTALSYVIG
        ++ + LL+NICNLVS+RLDS+N+VLW+FQ++ +LK+HKL+ ++DGS   P + +      +   P   NP +  W  +D AL+TL+NA L+ +AL+YV+G
Subjt:  NTSVSLLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIV-----RTEGQPSRVNPEYKSWYERDQALITLINATLTQTALSYVIG

Query:  CQTAKEVWDRLEKHFSSSTRTNIVQKGKNKGWIKKNEGLPSSSLLCRI-ELHDPI
        C ++++VW  L KH+SSS+RTN+V    +   I K  G      + RI EL D +
Subjt:  CQTAKEVWDRLEKHFSSSTRTNIVQKGKNKGWIKKNEGLPSSSLLCRI-ELHDPI

TrEMBL top hits | e value | % identity | Alignment
A0A5B7C9B1 Retrotran_gag_3 domain-containing protein | 1.2e-28 | 49.31
Query:  SDSALVSSSSSSNTSVSLLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTE-GQ-PSRVNPEYKSWYERDQALITLINATLT
        S+++  S S+S+ + + LL+NICNL+++ LDSTNYV W+FQIS +L++H L  Y+DGS+  P + +  E G+  +  NP+Y  W   DQAL+TLINATL+
Subjt:  SDSALVSSSSSSNTSVSLLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTE-GQ-PSRVNPEYKSWYERDQALITLINATLT

Query:  QTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQKGKNKGWIKK
         +AL+YVIG  T+KEVW  LE+ FSSS+R NI+Q   N   + K
Subjt:  QTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQKGKNKGWIKK

A0A5J5A0G3 Retrotran_gag_3 domain-containing protein | 1.5e-28 | 48.97
Query:  MRSMTTSASDSALVSSSSSSNTSVS---LLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTE--GQPSRVNPEYKSWYERDQ
        M +   + + ++  S+S++ N S S   LL+NICNL++ RLDS+NYV W+FQIS +LK+H L  Y+DG+   P++ V+ E     +++NPEY+ W  +DQ
Subjt:  MRSMTTSASDSALVSSSSSSNTSVS---LLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTE--GQPSRVNPEYKSWYERDQ

Query:  ALITLINATLTQTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQ
        AL+TL+NATL+QTALS+VIG  T++E W  LE+ FS+STR+NI+Q
Subjt:  ALITLINATLTQTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQ

A0A5J5ARL5 Retrotran_gag_3 domain-containing protein | 5.6e-31 | 45.99
Query:  SMTTSASDSALVSSSSSSN--TSVSLLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTE-GQ-PSRVNPEYKSWYERDQALI
        S T   S SA+ +S+S+S+  T + LL+NICNL++VRLDSTNYV W+FQ + +LK+H L ++VDGS+  P+  +R E G+  + +NP +KSW  +DQALI
Subjt:  SMTTSASDSALVSSSSSSN--TSVSLLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTE-GQ-PSRVNPEYKSWYERDQALI

Query:  TLINATLTQTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQ---------KGKNK--GWIKKNEGLPSS--SLLCRIELHDPIPCT
        TLIN TL+ TAL+++IG  +A+EVW  LE+ FSSS+R NI+Q         KG N    +I+K + +     S+  +IE  D + CT
Subjt:  TLINATLTQTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQ---------KGKNK--GWIKKNEGLPSS--SLLCRIELHDPIPCT

A0A5J5B049 Retrotran_gag_3 domain-containing protein | 1.5e-28 | 48.97
Query:  MRSMTTSASDSALVSSSSSSNTSVS---LLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTE--GQPSRVNPEYKSWYERDQ
        M +   + + ++  S+S++ N S S   LL+NICNL++ RLDS+NYV W+FQIS +LK+H L  Y+DG+   P++ V+ E     +++NPEY+ W  +DQ
Subjt:  MRSMTTSASDSALVSSSSSSNTSVS---LLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTE--GQPSRVNPEYKSWYERDQ

Query:  ALITLINATLTQTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQ
        AL+TL+NATL+QTALS+VIG  T++E W  LE+ FS+STR+NI+Q
Subjt:  ALITLINATLTQTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQ

A0A6J1D9L6 uncharacterized protein LOC111018892 | 5.6e-31 | 47.8
Query:  SALVSSSSSSNTS------VSLLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIV----RTEGQPSR------VNPEYKSWYERD
        S    +SSS+NT       + LL+NICNLVS+RLDST+++LW+FQ++ +LK+HKLF ++DGS+ AP Q +     TE QP+       +NP ++ W  +D
Subjt:  SALVSSSSSSNTS------VSLLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIV----RTEGQPSR------VNPEYKSWYERD

Query:  QALITLINATLTQTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQ-KGKNKGWIKKNE
        QAL+TLINATL+  AL+YV+   T+K+VW+ LEKH+SS++RTN+V  K   +  +KK E
Subjt:  QALITLINATLTQTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQ-KGKNKGWIKKNE

SwissProt top hits | e value | % identity | Alignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.1e-10 | 29.01
Query:  SNTSVSLLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTEGQPSRVNPEYKSWYERDQALITLINATLTQTALSYVIGCQTA
        +NTS+ L  N+ N+   +L STNY++W  Q+  L   ++L  ++DGS   P   + T+  P RVNP+Y  W  +D+ + + +   ++ +    V    TA
Subjt:  SNTSVSLLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTEGQPSRVNPEYKSWYERDQALITLINATLTQTALSYVIGCQTA

Query:  KEVWDRLEKHFSSSTRTNIVQ-KGKNKGWIK
         ++W+ L K +++ +  ++ Q + + K W K
Subjt:  KEVWDRLEKHFSSSTRTNIVQ-KGKNKGWIK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 2.0e-09 | 29.2
Query:  MTTSASDSALVSSSSSSNTSVSLLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTEGQPSRVNPEYKSWYERDQALITLINA
        M T A +  LV      NT++ L  N+ N+   +L STNY++W  Q+  L   ++L  ++DGS   P   + T+  P RVNP+Y  W  +D+ + + I  
Subjt:  MTTSASDSALVSSSSSSNTSVSLLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTEGQPSRVNPEYKSWYERDQALITLINA

Query:  TLTQTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQ
         ++ +    V    TA ++W+ L K +++ +  ++ Q
Subjt:  TLTQTALSYVIGCQTAKEVWDRLEKHFSSSTRTNIVQ

Arabidopsis top hits | e value | % identity | Alignment
No hits found
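
The % identity column in the hit tables above is not defined on this page. A minimal sketch of one common convention follows, assuming identity is the number of identical columns divided by the total number of alignment columns (gaps included). In the middle row of each alignment block, a letter marks an identical residue, "+" marks a similar one, and a space marks a mismatch or gap. The function name `percent_identity` and the toy rows are hypothetical and are not part of the database output.

```python
def percent_identity(query_row: str, midline_row: str) -> float:
    """Percent identity for one alignment block.

    query_row   -- the aligned query residues (gaps as '-'), without the
                   'Query:' label or leading padding
    midline_row -- the match row beneath it, starting at the same column
    Assumption: identity = identical columns / all alignment columns.
    """
    columns = len(query_row)
    if columns == 0:
        raise ValueError("empty alignment row")
    # Trailing spaces in the midline may have been stripped, so only the
    # columns that are present are inspected; missing ones count as mismatches.
    identical = sum(1 for ch in midline_row[:columns] if ch.isalpha())
    return round(100.0 * identical / columns, 2)


# Toy example: 3 identical columns (M, K, L) out of 6.
print(percent_identity("MKT-AL", "MK  +L"))
```

For a hit that spans several blocks, the identical-column counts and the column counts would be summed across the blocks before dividing.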

Sequences
CDS sequence
ATGAGAAGCATGACGACAAGTGCGAGTGATTCTGCCCTAGTTTCTTCGAGTTCTTCGTCAAACACCTCTGTTTCACTCCTCACAAATATCTGTAATTTGGTTTCAGTTCG
TTTGGATTCAACGAACTACGTTCTTTGGCGCTTTCAGATCTCGCCACTGTTGAAATCTCACAAGCTTTTCAAGTATGTCGATGGCTCAATCAAAGCTCCTGATCAAATTG
TTCGCACAGAAGGCCAACCGTCTCGCGTTAATCCAGAGTATAAATCTTGGTATGAACGAGATCAAGCTTTGATCACTCTGATCAATGCGACGCTAACACAGACTGCACTT
TCCTATGTTATCGGTTGTCAAACGGCCAAAGAAGTCTGGGATAGATTAGAAAAGCATTTCTCTTCGTCCACTAGAACGAATATTGTGCAAAAAGGTAAAAACAAAGGATG
GATTAAAAAAAATGAAGGATTACCATCATCCTCTCTATTATGCAGGATTGAGTTGCATGATCCAATTCCTTGCACACCTTAG
mRNA sequence
ATGAGAAGCATGACGACAAGTGCGAGTGATTCTGCCCTAGTTTCTTCGAGTTCTTCGTCAAACACCTCTGTTTCACTCCTCACAAATATCTGTAATTTGGTTTCAGTTCG
TTTGGATTCAACGAACTACGTTCTTTGGCGCTTTCAGATCTCGCCACTGTTGAAATCTCACAAGCTTTTCAAGTATGTCGATGGCTCAATCAAAGCTCCTGATCAAATTG
TTCGCACAGAAGGCCAACCGTCTCGCGTTAATCCAGAGTATAAATCTTGGTATGAACGAGATCAAGCTTTGATCACTCTGATCAATGCGACGCTAACACAGACTGCACTT
TCCTATGTTATCGGTTGTCAAACGGCCAAAGAAGTCTGGGATAGATTAGAAAAGCATTTCTCTTCGTCCACTAGAACGAATATTGTGCAAAAAGGTAAAAACAAAGGATG
GATTAAAAAAAATGAAGGATTACCATCATCCTCTCTATTATGCAGGATTGAGTTGCATGATCCAATTCCTTGCACACCTTAG
Protein sequence
MRSMTTSASDSALVSSSSSSNTSVSLLTNICNLVSVRLDSTNYVLWRFQISPLLKSHKLFKYVDGSIKAPDQIVRTEGQPSRVNPEYKSWYERDQALITLINATLTQTAL
SYVIGCQTAKEVWDRLEKHFSSSTRTNIVQKGKNKGWIKKNEGLPSSSLLCRIELHDPIPCTP
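
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame at its first base; the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 42 bases of the CDS listed above.
print(translate("ATGAGAAGCATGACGACAAGTGCGAGTGATTCTGCCCTAGTT"))  # MRSMTTSASDSALV
```

Passing the full CDS (with its line breaks) to `translate` should give a string that can be compared directly with the protein sequence listed above.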