; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012171 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012171
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase, RNA-dependent DNA polymerase
Genome locationchr1:38254297..38254809
RNA-Seq ExpressionLag0012171
SyntenyLag0012171
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0067279.1 uncharacterized protein E6C27_scaffold418G001000 [Cucumis melo var. makuwa]1.9e-3855.35Show/hide
Query:  SAASTGTTSFNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAE
        S  +T TTSF +P LNQ+LNQ+TTIKLDRGN+LLWK LALPIL+SY+L  HL G +P  P+   + T    PN  +   AG  S       ++ S  +  
Subjt:  SAASTGTTSFNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAE

Query:  VSPLYESWIVVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYL
        V+P YE WI  D LLLGWLYNSMTPEV  Q+MGF NAK+LW   Q+LFGIQSR +ED+L
Subjt:  VSPLYESWIVVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYL

XP_016900937.1 PREDICTED: uncharacterized protein LOC107991116 [Cucumis melo]1.9e-3855.35Show/hide
Query:  SAASTGTTSFNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAE
        S  +T TTSF +P LNQ+LNQ+TTIKLDRGN+LLWK LALPIL+SY+L  HL G +P  P+   + T    PN  +   AG  S       ++ S  +  
Subjt:  SAASTGTTSFNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAE

Query:  VSPLYESWIVVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYL
        V+P YE WI  D LLLGWLYNSMTPEV  Q+MGF NAK+LW   Q+LFGIQSR +ED+L
Subjt:  VSPLYESWIVVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYL

XP_016902203.1 PREDICTED: uncharacterized protein LOC107991581 isoform X3 [Cucumis melo]4.8e-3753.5Show/hide
Query:  STGTTSFNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAEVSP
        S  +  F++PPLNQ+LNQ+TT+KLDR N+LLWK LALPIL+ Y+LEGHLT  TP P  F  V +A  +  +  E  A +T        A+ SI    V+P
Subjt:  STGTTSFNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAEVSP

Query:  LYESWIVVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYLR
        L+E W+  D LLLGWLYNSMTP+VA Q+MGF N ++LW   Q+ FG+QSR EED+LR
Subjt:  LYESWIVVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYLR

XP_016902205.1 PREDICTED: uncharacterized protein LOC107991581 isoform X5 [Cucumis melo]4.8e-3753.5Show/hide
Query:  STGTTSFNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAEVSP
        S  +  F++PPLNQ+LNQ+TT+KLDR N+LLWK LALPIL+ Y+LEGHLT  TP P  F  V +A  +  +  E  A +T        A+ SI    V+P
Subjt:  STGTTSFNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAEVSP

Query:  LYESWIVVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYLR
        L+E W+  D LLLGWLYNSMTP+VA Q+MGF N ++LW   Q+ FG+QSR EED+LR
Subjt:  LYESWIVVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYLR

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]1.6e-3759.6Show/hide
Query:  FNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAEVSPLYESWI
        F SPPLNQLLNQ+T+IK+DRGNFLLW+NLALPILRSY+L  +LTG  P PP      T  V  ++    E GSTSS        +S P   ++P YE+WI
Subjt:  FNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAEVSPLYESWI

Query:  VVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYLR
        VVD+LLLGWLYNSM  +VA QVMGF  ++ELWT +QELFG+QSR E DYL+
Subjt:  VVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYLR

TrEMBL top hitse value%identityAlignment
A0A1S4DY80 uncharacterized protein LOC1079911169.4e-3955.35Show/hide
Query:  SAASTGTTSFNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAE
        S  +T TTSF +P LNQ+LNQ+TTIKLDRGN+LLWK LALPIL+SY+L  HL G +P  P+   + T    PN  +   AG  S       ++ S  +  
Subjt:  SAASTGTTSFNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAE

Query:  VSPLYESWIVVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYL
        V+P YE WI  D LLLGWLYNSMTPEV  Q+MGF NAK+LW   Q+LFGIQSR +ED+L
Subjt:  VSPLYESWIVVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYL

A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X12.3e-3753.5Show/hide
Query:  STGTTSFNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAEVSP
        S  +  F++PPLNQ+LNQ+TT+KLDR N+LLWK LALPIL+ Y+LEGHLT  TP P  F  V +A  +  +  E  A +T        A+ SI    V+P
Subjt:  STGTTSFNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAEVSP

Query:  LYESWIVVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYLR
        L+E W+  D LLLGWLYNSMTP+VA Q+MGF N ++LW   Q+ FG+QSR EED+LR
Subjt:  LYESWIVVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYLR

A0A1S4E1U9 uncharacterized protein LOC107991581 isoform X42.3e-3753.5Show/hide
Query:  STGTTSFNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAEVSP
        S  +  F++PPLNQ+LNQ+TT+KLDR N+LLWK LALPIL+ Y+LEGHLT  TP P  F  V +A  +  +  E  A +T        A+ SI    V+P
Subjt:  STGTTSFNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAEVSP

Query:  LYESWIVVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYLR
        L+E W+  D LLLGWLYNSMTP+VA Q+MGF N ++LW   Q+ FG+QSR EED+LR
Subjt:  LYESWIVVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYLR

A0A5A7VPY0 Uncharacterized protein9.4e-3955.35Show/hide
Query:  SAASTGTTSFNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAE
        S  +T TTSF +P LNQ+LNQ+TTIKLDRGN+LLWK LALPIL+SY+L  HL G +P  P+   + T    PN  +   AG  S       ++ S  +  
Subjt:  SAASTGTTSFNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAE

Query:  VSPLYESWIVVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYL
        V+P YE WI  D LLLGWLYNSMTPEV  Q+MGF NAK+LW   Q+LFGIQSR +ED+L
Subjt:  VSPLYESWIVVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYL

A0A6J1DCW4 uncharacterized protein LOC1110195988.0e-3859.6Show/hide
Query:  FNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAEVSPLYESWI
        F SPPLNQLLNQ+T+IK+DRGNFLLW+NLALPILRSY+L  +LTG  P PP      T  V  ++    E GSTSS        +S P   ++P YE+WI
Subjt:  FNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAEVSPLYESWI

Query:  VVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYLR
        VVD+LLLGWLYNSM  +VA QVMGF  ++ELWT +QELFG+QSR E DYL+
Subjt:  VVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYLR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).7.5e-0421.26Show/hide
Query:  VTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAEVSPLYESWIVVDQLLLGWLYN
        +  +  D  N++ WK      LR  +  G + G  P P  F                                       SPLY+ W   + +++ WL N
Subjt:  VTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAEVSPLYESWIVVDQLLLGWLYN

Query:  SMTPEVATQVMGFDNAKELWTTIQELF
        SMT ++   VM  + A ++W  ++ +F
Subjt:  SMTPEVATQVMGFDNAKELWTTIQELF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCATCGTCAATCATTCTGTCATGAATTCCGCCGCATCCACCGGAACAACAAGCTTCAACAGCCCGCCGCTTAATCAACTGTTGAACCAGGTCACCACCATCAAATT
GGATCGAGGAAATTTCCTTTTATGGAAGAACCTTGCTCTACCGATCCTTCGTAGCTACAGGTTGGAAGGTCATCTAACTGGAGCTACACCCTCTCCACCGCAGTTTCCTC
AAGTAGCAACAGCAAGGGTAAATCCCAATTCTCAAGTTGAAGCTGAAGCTGGTTCTACCAGCTCTGCTGCTATTGCAAGTGTTGCAGCTGAGTCAATTCCGATGGCAGAG
GTAAGTCCGTTGTATGAATCGTGGATTGTAGTCGATCAGCTGTTGTTGGGTTGGTTATACAACTCTATGACCCCTGAGGTCGCAACTCAAGTAATGGGTTTCGACAATGC
CAAGGAACTGTGGACAACAATACAAGAATTGTTTGGCATTCAGTCGCGTGGCGAGGAGGATTATCTCCGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCATCGTCAATCATTCTGTCATGAATTCCGCCGCATCCACCGGAACAACAAGCTTCAACAGCCCGCCGCTTAATCAACTGTTGAACCAGGTCACCACCATCAAATT
GGATCGAGGAAATTTCCTTTTATGGAAGAACCTTGCTCTACCGATCCTTCGTAGCTACAGGTTGGAAGGTCATCTAACTGGAGCTACACCCTCTCCACCGCAGTTTCCTC
AAGTAGCAACAGCAAGGGTAAATCCCAATTCTCAAGTTGAAGCTGAAGCTGGTTCTACCAGCTCTGCTGCTATTGCAAGTGTTGCAGCTGAGTCAATTCCGATGGCAGAG
GTAAGTCCGTTGTATGAATCGTGGATTGTAGTCGATCAGCTGTTGTTGGGTTGGTTATACAACTCTATGACCCCTGAGGTCGCAACTCAAGTAATGGGTTTCGACAATGC
CAAGGAACTGTGGACAACAATACAAGAATTGTTTGGCATTCAGTCGCGTGGCGAGGAGGATTATCTCCGCTAG
Protein sequenceShow/hide protein sequence
MAIVNHSVMNSAASTGTTSFNSPPLNQLLNQVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGATPSPPQFPQVATARVNPNSQVEAEAGSTSSAAIASVAAESIPMAE
VSPLYESWIVVDQLLLGWLYNSMTPEVATQVMGFDNAKELWTTIQELFGIQSRGEEDYLR