CuGenDBv2

Lag0035052 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035052
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr3:14143409..14149839
RNA-Seq Expression: Lag0035052
Synteny: Lag0035052
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits: e value, % identity, alignment
CAN77126.1 hypothetical protein VITISV_013628 [Vitis vinifera] (e value: 4.2e-13; % identity: 34.94)
Query:  TPSTPST------TLNVVLANRLHGFLDGLISAPPKVLDDHQLQPNPDFISWERKST----------------------------------DKFAAIREP
        T S+P+T       +N    + L  F++G    P K LDD QLQ NP FI WER +                                   DK++A+ EP
Subjt:  TPSTPST------TLNVVLANRLHGFLDGLISAPPKVLDDHQLQPNPDFISWERKST----------------------------------DKFAAIREP

Query:  ISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLVQANLNS
        +SYRD L + L+GL  EY+ FVTSI NR D  +L++V +LL  Y   LE+ +   QL   Q NL +
Subjt:  ISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLVQANLNS

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia] (e value: 8.2e-17; % identity: 62.34)
Query:  RKSTDKFAAIREPISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLVQANL
        ++ T K ++I EPIS +DH+++I++GLG EYN FVTSIQNR D  TLEDV  LLLAY+ RLEK ++VDQLN+VQAN+
Subjt:  RKSTDKFAAIREPISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLVQANL

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia] (e value: 3.8e-30; % identity: 44.92)
Query:  LNVVLANRLHGFLDGLISAPPKVLDDHQLQPNPDFISWER----------------------------------------KST-----------------
        LN V+AN L G+LDG I  PP+ LD HQLQPNP + +WER                                        K+T                 
Subjt:  LNVVLANRLHGFLDGLISAPPKVLDDHQLQPNPDFISWER----------------------------------------KST-----------------

Query:  --------------DKFAAIREPISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLVQANL
                      DKFAA+ EP+SYRDHLAH+LDGLGSEYN FVTSI NR DSP+LEDV +LLLAYEARL+K + VDQLN+ QANL
Subjt:  --------------DKFAAIREPISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLVQANL

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida] (e value: 5.3e-16; % identity: 61.64)
Query:  DKFAAIREPISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLVQANL
        D FAAI EP+SYRDHL++IL+GLGSEYN FV+SI NR + P++ DV NLL+ Y++RLEK +  D L L+QAN+
Subjt:  DKFAAIREPISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLVQANL

XP_038891713.1 uncharacterized protein LOC120081111 [Benincasa hispida] (e value: 5.6e-18; % identity: 56.32)
Query:  FISWERKSTDKFAAIREPISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLVQANLNSVHY
        ++S  +   DKF+ + E ISYRDHL HILDGLGSEYN FVTSIQN  D+ ++EDV +LLL+YEA+LEK + +D LN+ QA L+ + +
Subjt:  FISWERKSTDKFAAIREPISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLVQANLNSVHY

TrEMBL top hits: e value, % identity, alignment
A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 4.5e-13; % identity: 32.28)
Query:  STTLNVVLANRLHGFLDGLISAPPKVLDDHQLQPNPDFISWERKST------------------------------------------------------
        S  LNV++AN L  F+D   S+PPK LD    Q NP+F+ W+R +                                                       
Subjt:  STTLNVVLANRLHGFLDGLISAPPKVLDDHQLQPNPDFISWERKST------------------------------------------------------

Query:  -----------------DKFAAIREPISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLVQAN
                         D+FA I EP+SYRD L  IL+GL  EY+ FVTSI NR D P+L++V +LL  YE RL + S    LN  QAN
Subjt:  -----------------DKFAAIREPISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLVQAN

A0A5C7IHH0 Uncharacterized protein (e value: 3.5e-13; % identity: 61.9)
Query:  RKSTDKFAAIREPISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEK
        ++  DKFAAI EP+SYRDHL ++L+GLG EY+ FVTSI+NR D P++EDV +LLL++E RL K
Subjt:  RKSTDKFAAIREPISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEK

A0A6J1D6N7 uncharacterized protein LOC111017438 (e value: 3.9e-17; % identity: 62.34)
Query:  RKSTDKFAAIREPISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLVQANL
        ++ T K ++I EPIS +DH+++I++GLG EYN FVTSIQNR D  TLEDV  LLLAY+ RLEK ++VDQLN+VQAN+
Subjt:  RKSTDKFAAIREPISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLVQANL

A0A6J1DQX7 uncharacterized protein LOC111022315 (e value: 1.8e-30; % identity: 44.92)
Query:  LNVVLANRLHGFLDGLISAPPKVLDDHQLQPNPDFISWER----------------------------------------KST-----------------
        LN V+AN L G+LDG I  PP+ LD HQLQPNP + +WER                                        K+T                 
Subjt:  LNVVLANRLHGFLDGLISAPPKVLDDHQLQPNPDFISWER----------------------------------------KST-----------------

Query:  --------------DKFAAIREPISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLVQANL
                      DKFAA+ EP+SYRDHLAH+LDGLGSEYN FVTSI NR DSP+LEDV +LLLAYEARL+K + VDQLN+ QANL
Subjt:  --------------DKFAAIREPISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLVQANL

A5BPS3 Uncharacterized protein (e value: 2.0e-13; % identity: 34.94)
Query:  TPSTPST------TLNVVLANRLHGFLDGLISAPPKVLDDHQLQPNPDFISWERKST----------------------------------DKFAAIREP
        T S+P+T       +N    + L  F++G    P K LDD QLQ NP FI WER +                                   DK++A+ EP
Subjt:  TPSTPST------TLNVVLANRLHGFLDGLISAPPKVLDDHQLQPNPDFISWERKST----------------------------------DKFAAIREP

Query:  ISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLVQANLNS
        +SYRD L + L+GL  EY+ FVTSI NR D  +L++V +LL  Y   LE+ +   QL   Q NL +
Subjt:  ISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLVQANLNS

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGTTGGAGAAGTCGCTACCGCCAGTCGGTCAACGGAGGTCTTCGGCGACGAGCGGTTGGTGGAGGTCGACAACAACGGTAGTCGGAGGGGCAAGATGTGACCTACGAAA
CAAGGTAAATTTGCACACCGGTGTGGTGCTTGCCACACCACTGATGAGCAAGTTGAACTGGATTGGCCAAAGACTCCAGGAACCAAGCAGAGGGAGAGAGACTCAACCCA
CGCAAGTGGGCCGAGGCCAAGGCCATGAGATTGGGTCTTGGCCCAATCCCTTCAGTCTGTCTCTCCTCCGGAGTCTGCTTCTTGGTCGTATCCTCGGGGCCCTACCCTCT
CAATGGCACGAGATGGTTGGACCACAAACAGATTATTCATTAGAGGAGAACTGGCATGGCTACCGACATTTTAGGGAACTGTTTACTGGATTATTTTGGACGTTTTTACC
CTTGTATTTACGGTTGTTTACAGACGCCGTTTACGTCATTTTCGAGGACAAAGATGTGTTTTGGATGGTCAACGAATTTCAAAACGGTACATACAGAGGGTGTTGTGTAG
TCGTTGACATCCCGAGTGTCAATACTGGTATACACCCCGTGGGGACAGTTTCAAAGGGATCGTCGCGTAGGGGGAGAAACGACCGAGCATGCAAGAGATGGTTGTCGACG
CGGCGGATGAAGGTCGTTGGCACGACAGAGATGGTTCGTCAGCGTGGTATGAGCGGTAGGCAGTGTTATTCACTTTGTTTGGCGGTGGGAAATGGTGTGGAATGGAGGGT
TCGGGGGTGTGGAATGCTCTCAAGGGTTTCTACTTTTGGAGGAAGAAGATGGACAGACTCCGTTTATGTCATTTCCGAGGACAGATATGTGTTTTGGATGGTGAACGAAT
TTCAAAACGGTACATATAGGGGGTGCTGTGTAGTCGTTGACATCCCGAGTGTCAATACTGTGGGAAATGGTGTGGAATGGAGAGTTCGGGGTGTGGAATGCTCTCAAGGG
TTTCTACTTTTGGAGGAAGAAGATGGACGGGTGGGCTGCTGCATGAAGGCTATTCCGGTGGTGAGGGTGGGCGGTGACATGACGTTTTTACCCCTTGTATTTACGGTTAA
TTACAGACGTCGTTTACGTCATTTCCGAGGACAGAGAAGAAAATTTCCAGCGAAGAATCTCCAAGAGGTTGCTGTGTTTTTCATCGTTAGAGCATCGTTGGCGAAAATCG
GTCAAGTCTACAACGAATCTATGGTATTCAAAGCTTCTTCCTCTACCTCATCATCTGGACCACTCACACAATCTCTTGAAACTCCCTCAACGCCTAGCACCACTCTGAAT
GTTGTCTTGGCCAATAGACTCCATGGATTTCTTGATGGCTTGATTTCGGCTCCACCAAAGGTTTTGGATGATCATCAGCTTCAACCGAATCCTGATTTCATTTCTTGGGA
AAGGAAGTCCACCGATAAGTTTGCTGCCATAAGAGAACCCATCTCTTATCGAGATCATTTGGCTCACATCCTTGATGGTCTTGGGAGTGAGTACAATTTGTTTGTCACCT
CAATTCAGAATCGATTCGATAGCCCTACTTTAGAAGATGTCTGTAACTTGCTTCTTGCTTATGAAGCTCGGTTGGAAAAATATAGTAATGTTGACCAGTTAAATCTCGTG
CAAGCTAATCTTAATAGCGTTCACTACATATAG
mRNA sequence
ATGTTGGAGAAGTCGCTACCGCCAGTCGGTCAACGGAGGTCTTCGGCGACGAGCGGTTGGTGGAGGTCGACAACAACGGTAGTCGGAGGGGCAAGATGTGACCTACGAAA
CAAGGTAAATTTGCACACCGGTGTGGTGCTTGCCACACCACTGATGAGCAAGTTGAACTGGATTGGCCAAAGACTCCAGGAACCAAGCAGAGGGAGAGAGACTCAACCCA
CGCAAGTGGGCCGAGGCCAAGGCCATGAGATTGGGTCTTGGCCCAATCCCTTCAGTCTGTCTCTCCTCCGGAGTCTGCTTCTTGGTCGTATCCTCGGGGCCCTACCCTCT
CAATGGCACGAGATGGTTGGACCACAAACAGATTATTCATTAGAGGAGAACTGGCATGGCTACCGACATTTTAGGGAACTGTTTACTGGATTATTTTGGACGTTTTTACC
CTTGTATTTACGGTTGTTTACAGACGCCGTTTACGTCATTTTCGAGGACAAAGATGTGTTTTGGATGGTCAACGAATTTCAAAACGGTACATACAGAGGGTGTTGTGTAG
TCGTTGACATCCCGAGTGTCAATACTGGTATACACCCCGTGGGGACAGTTTCAAAGGGATCGTCGCGTAGGGGGAGAAACGACCGAGCATGCAAGAGATGGTTGTCGACG
CGGCGGATGAAGGTCGTTGGCACGACAGAGATGGTTCGTCAGCGTGGTATGAGCGGTAGGCAGTGTTATTCACTTTGTTTGGCGGTGGGAAATGGTGTGGAATGGAGGGT
TCGGGGGTGTGGAATGCTCTCAAGGGTTTCTACTTTTGGAGGAAGAAGATGGACAGACTCCGTTTATGTCATTTCCGAGGACAGATATGTGTTTTGGATGGTGAACGAAT
TTCAAAACGGTACATATAGGGGGTGCTGTGTAGTCGTTGACATCCCGAGTGTCAATACTGTGGGAAATGGTGTGGAATGGAGAGTTCGGGGTGTGGAATGCTCTCAAGGG
TTTCTACTTTTGGAGGAAGAAGATGGACGGGTGGGCTGCTGCATGAAGGCTATTCCGGTGGTGAGGGTGGGCGGTGACATGACGTTTTTACCCCTTGTATTTACGGTTAA
TTACAGACGTCGTTTACGTCATTTCCGAGGACAGAGAAGAAAATTTCCAGCGAAGAATCTCCAAGAGGTTGCTGTGTTTTTCATCGTTAGAGCATCGTTGGCGAAAATCG
GTCAAGTCTACAACGAATCTATGGTATTCAAAGCTTCTTCCTCTACCTCATCATCTGGACCACTCACACAATCTCTTGAAACTCCCTCAACGCCTAGCACCACTCTGAAT
GTTGTCTTGGCCAATAGACTCCATGGATTTCTTGATGGCTTGATTTCGGCTCCACCAAAGGTTTTGGATGATCATCAGCTTCAACCGAATCCTGATTTCATTTCTTGGGA
AAGGAAGTCCACCGATAAGTTTGCTGCCATAAGAGAACCCATCTCTTATCGAGATCATTTGGCTCACATCCTTGATGGTCTTGGGAGTGAGTACAATTTGTTTGTCACCT
CAATTCAGAATCGATTCGATAGCCCTACTTTAGAAGATGTCTGTAACTTGCTTCTTGCTTATGAAGCTCGGTTGGAAAAATATAGTAATGTTGACCAGTTAAATCTCGTG
CAAGCTAATCTTAATAGCGTTCACTACATATAG
Protein sequence
MLEKSLPPVGQRRSSATSGWWRSTTTVVGGARCDLRNKVNLHTGVVLATPLMSKLNWIGQRLQEPSRGRETQPTQVGRGQGHEIGSWPNPFSLSLLRSLLLGRILGALPS
QWHEMVGPQTDYSLEENWHGYRHFRELFTGLFWTFLPLYLRLFTDAVYVIFEDKDVFWMVNEFQNGTYRGCCVVVDIPSVNTGIHPVGTVSKGSSRRGRNDRACKRWLST
RRMKVVGTTEMVRQRGMSGRQCYSLCLAVGNGVEWRVRGCGMLSRVSTFGGRRWTDSVYVISEDRYVFWMVNEFQNGTYRGCCVVVDIPSVNTVGNGVEWRVRGVECSQG
FLLLEEEDGRVGCCMKAIPVVRVGGDMTFLPLVFTVNYRRRLRHFRGQRRKFPAKNLQEVAVFFIVRASLAKIGQVYNESMVFKASSSTSSSGPLTQSLETPSTPSTTLN
VVLANRLHGFLDGLISAPPKVLDDHQLQPNPDFISWERKSTDKFAAIREPISYRDHLAHILDGLGSEYNLFVTSIQNRFDSPTLEDVCNLLLAYEARLEKYSNVDQLNLV
QANLNSVHYI