; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0037792 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0037792
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr2:9217994..9218992
RNA-Seq ExpressionLag0037792
SyntenyLag0037792
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG68750.1 hypothetical protein EZV62_003685 [Acer yangbiense]4.0e-1753.93Show/hide
Query:  VFALSFKRSAKTTSLSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQS
        + A S   + K    +++QYL Q K+I DKF+AIGEPLSY DHLGY+LEGL  EY+ F TSI+NR D+PS+ D+ SLLL++E +L K++
Subjt:  VFALSFKRSAKTTSLSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQS

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia]1.8e-1759.04Show/hide
Query:  RSAKTTSLSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQST
        +  K   LSVSQYLA+IK+I  K S+IGEP+S  DH+ YI+EGL  EYN F TSIQNR+D  +L D+R+LLLAY+ +LEKQ++
Subjt:  RSAKTTSLSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQST

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]1.6e-2173.33Show/hide
Query:  SVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQST
        SVSQYLA+IK+IADKF+A+GEPLSY DHL ++L+GL SEYN F TSI NR D PSL D+RSLLLAYEA+L+KQ+T
Subjt:  SVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQST

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]6.0e-2150.88Show/hide
Query:  KTTSLSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQSTAPNSSPSPQAFFNQFQQPPS
        K   L+VSQYLAQIKD+ D F+AIGEPLSY DHL YILEGL SEYNPF +SI NR +RPS+ D+R+LL+ Y+++LEKQ+             +  Q   +
Subjt:  KTTSLSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQSTAPNSSPSPQAFFNQFQQPPS

Query:  SVSHSPVSSESIVP
        +V+H  ++S++  P
Subjt:  SVSHSPVSSESIVP

XP_038891713.1 uncharacterized protein LOC120081111 [Benincasa hispida]2.8e-1867.11Show/hide
Query:  SLSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQS
        +LS+SQYL+QIKD+ADKFS +GE +SY DHL +IL+GL SEYN F TSIQN  D  S+ D+ SLLL+YEAQLEKQ+
Subjt:  SLSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQS

TrEMBL top hitse value%identityAlignment
A0A438FTV3 Uncharacterized protein3.4e-1440.48Show/hide
Query:  WVFALSFKRSAKTTSLSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQS-TAPNSSPSP
        W   ++  ++ K   LS+ +Y+ ++K I +  +AIGEP+S  DHL Y+  GLD EYNPF TSIQNR+D+P++  I SLLL+Y+ +LE+Q+  + NS+   
Subjt:  WVFALSFKRSAKTTSLSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQS-TAPNSSPSP

Query:  QAFFNQFQQPPSSVSHSPVSSESIVP
         A  N  ++P  S    P +   I P
Subjt:  QAFFNQFQQPPSSVSHSPVSSESIVP

A0A5C7IHH0 Uncharacterized protein1.9e-1753.93Show/hide
Query:  VFALSFKRSAKTTSLSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQS
        + A S   + K    +++QYL Q K+I DKF+AIGEPLSY DHLGY+LEGL  EY+ F TSI+NR D+PS+ D+ SLLL++E +L K++
Subjt:  VFALSFKRSAKTTSLSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQS

A0A6J1D6N7 uncharacterized protein LOC1110174388.7e-1859.04Show/hide
Query:  RSAKTTSLSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQST
        +  K   LSVSQYLA+IK+I  K S+IGEP+S  DH+ YI+EGL  EYN F TSIQNR+D  +L D+R+LLLAY+ +LEKQ++
Subjt:  RSAKTTSLSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQST

A0A6J1DQX7 uncharacterized protein LOC1110223157.6e-2273.33Show/hide
Query:  SVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQST
        SVSQYLA+IK+IADKF+A+GEPLSY DHL ++L+GL SEYN F TSI NR D PSL D+RSLLLAYEA+L+KQ+T
Subjt:  SVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQST

A5BPS3 Uncharacterized protein9.6e-1738.32Show/hide
Query:  PAPSKFLDATESQL--------------VSWVF-ALSFKRSAKTTSLSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDR
        P P+KFLD  + Q+              +SW++ +L+     K   +++S+YLA+IK++ DK+SA+GEPLSY D L Y L GL  EY+ F TSI NR+D+
Subjt:  PAPSKFLDATESQL--------------VSWVF-ALSFKRSAKTTSLSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDR

Query:  PSLVDIRSLLLAYEAQLEKQSTAPN--------SSPSPQAFFNQFQQPPSSVSHSPVSSESIVPNIP
         SL ++ SLL  Y   LE+++TA          ++ S Q  F++ QQP  +    P +S S  PN P
Subjt:  PSLVDIRSLLLAYEAQLEKQSTAPN--------SSPSPQAFFNQFQQPPSSVSHSPVSSESIVPNIP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)8.3e-0526.5Show/hide
Query:  ALSFKRSAKTTS---LSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQSTAPNSSPSPQ
        AL F+   +TT+   LSV +Y  ++K ++D  + +  P+S    + ++L GL  +Y+     I++++  PS  + RS+LL  E++L  +S +  S  +  
Subjt:  ALSFKRSAKTTS---LSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQSTAPNSSPSPQ

Query:  AFFNQFQQPPSSVSHSP
        +  N     P      P
Subjt:  AFFNQFQQPPSSVSHSP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTGCACCTTCGAAGTTTCTGGACGCTACTGAATCCCAGCTCGTATCATGGGTCTTTGCTCTCAGCTTCAAAAGATCCGCAAAGACAACCTCTCTCTCTGTGTCTCA
ATACCTTGCTCAGATCAAAGATATTGCGGACAAGTTTTCGGCCATAGGCGAACCTCTCTCGTATGGTGATCACCTTGGTTATATTCTTGAAGGCCTCGATTCTGAATATA
ACCCATTCAACACCTCCATTCAAAATCGCAACGATCGTCCTTCTCTAGTAGATATTCGTAGCCTCCTTCTTGCCTATGAGGCACAACTTGAGAAACAATCCACGGCCCCT
AACTCCTCCCCATCTCCTCAAGCCTTTTTCAACCAGTTTCAGCAACCCCCTTCCTCTGTTTCCCATTCCCCTGTTTCATCCGAGTCCATCGTACCCAACATCCCTCACCC
AGATGAAGCTTGGTTCATGGATTCCGGAGCTACTCCCACCACATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTGCACCTTCGAAGTTTCTGGACGCTACTGAATCCCAGCTCGTATCATGGGTCTTTGCTCTCAGCTTCAAAAGATCCGCAAAGACAACCTCTCTCTCTGTGTCTCA
ATACCTTGCTCAGATCAAAGATATTGCGGACAAGTTTTCGGCCATAGGCGAACCTCTCTCGTATGGTGATCACCTTGGTTATATTCTTGAAGGCCTCGATTCTGAATATA
ACCCATTCAACACCTCCATTCAAAATCGCAACGATCGTCCTTCTCTAGTAGATATTCGTAGCCTCCTTCTTGCCTATGAGGCACAACTTGAGAAACAATCCACGGCCCCT
AACTCCTCCCCATCTCCTCAAGCCTTTTTCAACCAGTTTCAGCAACCCCCTTCCTCTGTTTCCCATTCCCCTGTTTCATCCGAGTCCATCGTACCCAACATCCCTCACCC
AGATGAAGCTTGGTTCATGGATTCCGGAGCTACTCCCACCACATGA
Protein sequenceShow/hide protein sequence
MPAPSKFLDATESQLVSWVFALSFKRSAKTTSLSVSQYLAQIKDIADKFSAIGEPLSYGDHLGYILEGLDSEYNPFNTSIQNRNDRPSLVDIRSLLLAYEAQLEKQSTAP
NSSPSPQAFFNQFQQPPSSVSHSPVSSESIVPNIPHPDEAWFMDSGATPTT