; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021122 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021122
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr7:4830985..4832141
RNA-Seq ExpressionLag0021122
SyntenyLag0021122
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG68750.1 hypothetical protein EZV62_003685 [Acer yangbiense]6.4e-2454.1Show/hide
Query:  DSNAVYESSSTTLIMGL----RSQLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYE
        +S    ESSS  L   L    RSQL  ++K+G ++ QYL Q K++VDKFAAIGEPLSYRDHLGY+LEGLG EY+ FVTSI+NR D+PS+ DV +LL+++E
Subjt:  DSNAVYESSSTTLIMGL----RSQLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYE

Query:  AHLEKQS---------SVDTLN
          L K++          VDTLN
Subjt:  AHLEKQS---------SVDTLN

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia]4.6e-3045.45Show/hide
Query:  QLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYEAHLEKQSSVDTLNLVQVNLANLS
        ++Q+++KDG+SV+QYLA+IK++  K ++IGEP+S +DH+ YI+EGLG EYN FVTSIQNR+D  +L DV  LL+AY+  LEKQ+SVD LN+VQ N+ANL 
Subjt:  QLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYEAHLEKQSSVDTLNLVQVNLANLS

Query:  IN--PNQKRFQRSLQNLKPQSFSWPPSPTFPFPQPFSSPSNGLGVLGRPQSSPRPKWPSSNNQSR
        +N      R  R      P+ F++   P               G+LG+P  + +P WP S    R
Subjt:  IN--PNQKRFQRSLQNLKPQSFSWPPSPTFPFPQPFSSPSNGLGVLGRPQSSPRPKWPSSNNQSR

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]4.4e-4153.01Show/hide
Query:  SDSNAVYESSSTTLIMGLRSQLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYEAHL
        S    VY+S +T  IMGL+++LQ +RKDG SV+QYLA+IK++ DKFAA+GEPLSYRDHL ++L+GLGSEYN FVTSI NR D PSL DV +LL+AYEA L
Subjt:  SDSNAVYESSSTTLIMGLRSQLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYEAHL

Query:  EKQSSVDTLNLVQVNLANLSINPNQKRFQRSLQNLKPQSFSWPPSPTFPFPQPFSSPSNGLGVLGRPQSSPRPKWPSSNNQSR
        +KQ++VD LN+ Q NL NLS+  N KR         P  FS+P      FP    S +    +LG+PQS    KWP   + S+
Subjt:  EKQSSVDTLNLVQVNLANLSINPNQKRFQRSLQNLKPQSFSWPPSPTFPFPQPFSSPSNGLGVLGRPQSSPRPKWPSSNNQSR

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]2.8e-4063.33Show/hide
Query:  VYESSSTTLIMGLRSQLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYEAHLEKQSS
        VYESSS   IMG  SQLQKI+KDG++V+QYLAQIKDV+D FAAIGEPLSYRDHL YILEGLGSEYNPFV+SI NRT+RPS+ADV NLLI Y++ LEKQ++
Subjt:  VYESSSTTLIMGLRSQLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYEAHLEKQSS

Query:  VDTLNLVQVNLANLSINPNQKRFQRSLQNLKPQSFSWPPSPTFP--FPQP
         D L L+Q N+A+LSIN   +  Q    N      S P   +FP   P P
Subjt:  VDTLNLVQVNLANLSINPNQKRFQRSLQNLKPQSFSWPPSPTFP--FPQP

XP_038891713.1 uncharacterized protein LOC120081111 [Benincasa hispida]2.7e-3051.5Show/hide
Query:  MGLRSQLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYEAHLEKQSSVDTLNLVQVN
        M L+++LQKIRKD +S++QYL+QIKDV DKF+ +GE +SYRDHL +IL+GLGSEYN FVTSIQN  D  S+ DV +LL++YEA LEKQ+++D LN+ Q  
Subjt:  MGLRSQLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYEAHLEKQSSVDTLNLVQVN

Query:  LANLSINPNQKRFQRSLQNLKPQSFSWPPSPTFPFPQPFSSPSNGLGVLGRPQSSPRPKWPSSNNQS
        L+ LS   N KR   +L+     S    PSP F    P   PS    V  RP  S   KWP S   S
Subjt:  LANLSINPNQKRFQRSLQNLKPQSFSWPPSPTFPFPQPFSSPSNGLGVLGRPQSSPRPKWPSSNNQS

TrEMBL top hitse value%identityAlignment
A0A5C7IHH0 Uncharacterized protein3.1e-2454.1Show/hide
Query:  DSNAVYESSSTTLIMGL----RSQLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYE
        +S    ESSS  L   L    RSQL  ++K+G ++ QYL Q K++VDKFAAIGEPLSYRDHLGY+LEGLG EY+ FVTSI+NR D+PS+ DV +LL+++E
Subjt:  DSNAVYESSSTTLIMGL----RSQLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYE

Query:  AHLEKQS---------SVDTLN
          L K++          VDTLN
Subjt:  AHLEKQS---------SVDTLN

A0A6J1D6N7 uncharacterized protein LOC1110174382.2e-3045.45Show/hide
Query:  QLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYEAHLEKQSSVDTLNLVQVNLANLS
        ++Q+++KDG+SV+QYLA+IK++  K ++IGEP+S +DH+ YI+EGLG EYN FVTSIQNR+D  +L DV  LL+AY+  LEKQ+SVD LN+VQ N+ANL 
Subjt:  QLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYEAHLEKQSSVDTLNLVQVNLANLS

Query:  IN--PNQKRFQRSLQNLKPQSFSWPPSPTFPFPQPFSSPSNGLGVLGRPQSSPRPKWPSSNNQSR
        +N      R  R      P+ F++   P               G+LG+P  + +P WP S    R
Subjt:  IN--PNQKRFQRSLQNLKPQSFSWPPSPTFPFPQPFSSPSNGLGVLGRPQSSPRPKWPSSNNQSR

A0A6J1DQX7 uncharacterized protein LOC1110223152.1e-4153.01Show/hide
Query:  SDSNAVYESSSTTLIMGLRSQLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYEAHL
        S    VY+S +T  IMGL+++LQ +RKDG SV+QYLA+IK++ DKFAA+GEPLSYRDHL ++L+GLGSEYN FVTSI NR D PSL DV +LL+AYEA L
Subjt:  SDSNAVYESSSTTLIMGLRSQLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYEAHL

Query:  EKQSSVDTLNLVQVNLANLSINPNQKRFQRSLQNLKPQSFSWPPSPTFPFPQPFSSPSNGLGVLGRPQSSPRPKWPSSNNQSR
        +KQ++VD LN+ Q NL NLS+  N KR         P  FS+P      FP    S +    +LG+PQS    KWP   + S+
Subjt:  EKQSSVDTLNLVQVNLANLSINPNQKRFQRSLQNLKPQSFSWPPSPTFPFPQPFSSPSNGLGVLGRPQSSPRPKWPSSNNQSR

A0A7J0DER3 Uncharacterized protein4.0e-2441.77Show/hide
Query:  VYESSSTTLIMGLRSQLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYEAHLEKQSS
        +Y ++S   +  LR+ LQ I+KDG++   Y+ + + + +  A+IGEP++Y DHL Y L GLG +YNPFVTSIQ++  RPS+ +V +LL++Y+A LE+QS+
Subjt:  VYESSSTTLIMGLRSQLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYEAHLEKQSS

Query:  VDTLNLVQVNLANLSI------NPNQKRFQRSLQNLKP----QSFSWPPSPTFPFPQP
         DTL+ +Q NLANL+       NP+   F  S     P    ++ S+ P+P+ P P+P
Subjt:  VDTLNLVQVNLANLSI------NPNQKRFQRSLQNLKP----QSFSWPPSPTFPFPQP

A0A7J0E8R3 Uncharacterized protein4.0e-2441.77Show/hide
Query:  VYESSSTTLIMGLRSQLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYEAHLEKQSS
        +Y ++S   +  LR+ LQ I+KDG++   Y+ + + + +  A+IGEP++Y DHL Y L GLG +YNPFVTSIQ++  RPS+ +V +LL++Y+A LE+QS+
Subjt:  VYESSSTTLIMGLRSQLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYEAHLEKQSS

Query:  VDTLNLVQVNLANLSI------NPNQKRFQRSLQNLKP----QSFSWPPSPTFPFPQP
         DTL+ +Q NLANL+       NP+   F  S     P    ++ S+ P+P+ P P+P
Subjt:  VDTLNLVQVNLANLSI------NPNQKRFQRSLQNLKP----QSFSWPPSPTFPFPQP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCATCCTCGAGTACCTCTTCTACTGATAATGACACTGTTCCTGTGGTTTCCTCTACCTCTACTACTCCGGTGACTATCCTTATTACCTCTCATCGCCAAAATCA
AAATCACCCCCCTCTCTCAAATGTCCAAGCTCGACCTCTAAACTCAAATATCCCTCCCTGGCTTTCAACAATTTCCTTCTGCTTATCCCTACCCCACCGCTACCACTGGA
TTCCAGTACCCCCCTCAAACATCGCCTTCCCTTCCCTTCTTCCCTTCTCATTCCTCTCATCCGCCATTTTTTTCCTTCAACTCAGCAACATCCATCGCCCTACCCAACCC
TCACTCCTCCCCTCCGCCGAAGCATTTGGATCCGACTCAAATGCAGTCTATGAATCTTCTTCCACAACTCTCATTATGGGTCTTCGATCACAGCTCCAGAAAATTCGGAA
GGATGGTGTTTCGGTGGCACAGTACCTTGCTCAAATTAAAGACGTTGTCGACAAATTCGCAGCCATCGGTGAGCCTCTCTCTTATCGGGACCACCTTGGTTACATACTTG
AAGGACTGGGTTCTGAGTACAACCCCTTTGTCACATCCATCCAAAATCGAACCGATAGACCCTCTCTCGCCGATGTCTGCAACCTTCTTATTGCTTATGAAGCTCATCTG
GAGAAACAGTCCTCAGTTGATACCTTAAATTTGGTGCAAGTCAACCTTGCAAATCTCTCTATTAATCCCAACCAAAAGCGATTTCAACGTTCCCTTCAAAATCTCAAGCC
GCAATCCTTTTCCTGGCCTCCCTCTCCTACTTTCCCCTTTCCTCAGCCCTTTTCCTCTCCTTCAAATGGTCTTGGTGTGTTGGGTCGCCCTCAATCCTCCCCTCGCCCGA
AATGGCCTTCTTCGAACAATCAGAGTCGCCCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCATCCTCGAGTACCTCTTCTACTGATAATGACACTGTTCCTGTGGTTTCCTCTACCTCTACTACTCCGGTGACTATCCTTATTACCTCTCATCGCCAAAATCA
AAATCACCCCCCTCTCTCAAATGTCCAAGCTCGACCTCTAAACTCAAATATCCCTCCCTGGCTTTCAACAATTTCCTTCTGCTTATCCCTACCCCACCGCTACCACTGGA
TTCCAGTACCCCCCTCAAACATCGCCTTCCCTTCCCTTCTTCCCTTCTCATTCCTCTCATCCGCCATTTTTTTCCTTCAACTCAGCAACATCCATCGCCCTACCCAACCC
TCACTCCTCCCCTCCGCCGAAGCATTTGGATCCGACTCAAATGCAGTCTATGAATCTTCTTCCACAACTCTCATTATGGGTCTTCGATCACAGCTCCAGAAAATTCGGAA
GGATGGTGTTTCGGTGGCACAGTACCTTGCTCAAATTAAAGACGTTGTCGACAAATTCGCAGCCATCGGTGAGCCTCTCTCTTATCGGGACCACCTTGGTTACATACTTG
AAGGACTGGGTTCTGAGTACAACCCCTTTGTCACATCCATCCAAAATCGAACCGATAGACCCTCTCTCGCCGATGTCTGCAACCTTCTTATTGCTTATGAAGCTCATCTG
GAGAAACAGTCCTCAGTTGATACCTTAAATTTGGTGCAAGTCAACCTTGCAAATCTCTCTATTAATCCCAACCAAAAGCGATTTCAACGTTCCCTTCAAAATCTCAAGCC
GCAATCCTTTTCCTGGCCTCCCTCTCCTACTTTCCCCTTTCCTCAGCCCTTTTCCTCTCCTTCAAATGGTCTTGGTGTGTTGGGTCGCCCTCAATCCTCCCCTCGCCCGA
AATGGCCTTCTTCGAACAATCAGAGTCGCCCCTAA
Protein sequenceShow/hide protein sequence
MASSSSTSSTDNDTVPVVSSTSTTPVTILITSHRQNQNHPPLSNVQARPLNSNIPPWLSTISFCLSLPHRYHWIPVPPSNIAFPSLLPFSFLSSAIFFLQLSNIHRPTQP
SLLPSAEAFGSDSNAVYESSSTTLIMGLRSQLQKIRKDGVSVAQYLAQIKDVVDKFAAIGEPLSYRDHLGYILEGLGSEYNPFVTSIQNRTDRPSLADVCNLLIAYEAHL
EKQSSVDTLNLVQVNLANLSINPNQKRFQRSLQNLKPQSFSWPPSPTFPFPQPFSSPSNGLGVLGRPQSSPRPKWPSSNNQSRP