; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0042062 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0042062
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr13:35789167..35791394
RNA-Seq ExpressionLag0042062
SyntenyLag0042062
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7032442.1 faeA, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-3963.57Show/hide
Query:  LHYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSPPENFGK
        LHYN SL T+IP    TRC+ L   R KF  K+RVLAFR+NPRGR G SSF CFS  G EV+NLS L+E ERPPFDINLAVILAGFAFEAYTSP ENFGK
Subjt:  LHYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSPPENFGK

Query:  REVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK
        REVDAAGC TV+LSE            RE+ + ++F+ +K
Subjt:  REVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK

XP_022933070.1 uncharacterized protein LOC111439777 isoform X1 [Cucurbita moschata]1.2e-3963.57Show/hide
Query:  LHYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSPPENFGK
        LHYN SL T+IP    TRC+ L   R KF  K+RVLAFR+NPRGR G SSF CFS  G EV+NLS L+E ERPPFDINLAVILAGFAFEAYTSP ENFGK
Subjt:  LHYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSPPENFGK

Query:  REVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK
        REVDAAGC TV+LSE            RE+ + ++F+ +K
Subjt:  REVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK

XP_022994042.1 uncharacterized protein LOC111489848 isoform X1 [Cucurbita maxima]1.5e-4065Show/hide
Query:  LHYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSPPENFGK
        LHYN SL T+IP    TRC  L   R KF  K+RVLAFR+NPRGR G SSF CFS TGAEV+NLS L+E ERPPFDINLAVILAGFAFEAYTSP ENFGK
Subjt:  LHYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSPPENFGK

Query:  REVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK
        REVDAAGC TV+LSE            RE+ + ++F+ +K
Subjt:  REVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK

XP_023530237.1 uncharacterized protein LOC111792861 isoform X1 [Cucurbita pepo subsp. pepo]2.5e-4064.29Show/hide
Query:  LHYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSPPENFGK
        LHYN SL T+IP    TRC+ L   R KF  K+RVLAFR+NPRGR G SSF CFS TG EV+NLS L+E ERPPFDINLAVILAGFAFEAYTSP ENFGK
Subjt:  LHYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSPPENFGK

Query:  REVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK
        REVDAAGC TV+LSE            RE+ + ++F+ +K
Subjt:  REVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK

XP_023530246.1 uncharacterized protein LOC111792861 isoform X2 [Cucurbita pepo subsp. pepo]2.5e-4064.29Show/hide
Query:  LHYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSPPENFGK
        LHYN SL T+IP    TRC+ L   R KF  K+RVLAFR+NPRGR G SSF CFS TG EV+NLS L+E ERPPFDINLAVILAGFAFEAYTSP ENFGK
Subjt:  LHYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSPPENFGK

Query:  REVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK
        REVDAAGC TV+LSE            RE+ + ++F+ +K
Subjt:  REVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK

TrEMBL top hitse value%identityAlignment
A0A1S3CCU0 uncharacterized protein LOC103499378 isoform X12.1e-3256.85Show/hide
Query:  MASLTL-HYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSP
        MASL   H + SL + IP    T  LH    RP F AK RVL FR++ + RLG SSF CF  +G E++N SL  + ER PFDINLAVILAGFAFEAYTSP
Subjt:  MASLTL-HYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSP

Query:  PENFGKREVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK
        PENFGKREVDAAGCTTVYLSE            RE+ + ++F+ +K
Subjt:  PENFGKREVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK

A0A5A7UEA7 Multiple C2 and transmembrane domain-containing protein 12.1e-3256.85Show/hide
Query:  MASLTL-HYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSP
        MASL   H + SL + IP    T  LH    RP F AK RVL FR++ + RLG SSF CF  +G E++N SL  + ER PFDINLAVILAGFAFEAYTSP
Subjt:  MASLTL-HYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSP

Query:  PENFGKREVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK
        PENFGKREVDAAGCTTVYLSE            RE+ + ++F+ +K
Subjt:  PENFGKREVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK

A0A6J1DNK8 uncharacterized protein LOC1110222921.6e-3760.81Show/hide
Query:  MASLTLH--YNFSLFTIIPKPPRTRCLHLP-HYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYT
        MASL  H  YN S   ++P   +TRCL +  + RP F  ++RVL FRINP GR+G SSF C  RTG EVENLSLL+  ERPPFDINLAVILAGFAFEAYT
Subjt:  MASLTLH--YNFSLFTIIPKPPRTRCLHLP-HYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYT

Query:  SPPENFGKREVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK
        SPPENFG+ EVDAAGCTTVYLSE  I         RE+ + ++F+ +K
Subjt:  SPPENFGKREVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK

A0A6J1F3W8 uncharacterized protein LOC111439777 isoform X16.0e-4063.57Show/hide
Query:  LHYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSPPENFGK
        LHYN SL T+IP    TRC+ L   R KF  K+RVLAFR+NPRGR G SSF CFS  G EV+NLS L+E ERPPFDINLAVILAGFAFEAYTSP ENFGK
Subjt:  LHYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSPPENFGK

Query:  REVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK
        REVDAAGC TV+LSE            RE+ + ++F+ +K
Subjt:  REVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK

A0A6J1JUL3 uncharacterized protein LOC111489848 isoform X17.1e-4165Show/hide
Query:  LHYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSPPENFGK
        LHYN SL T+IP    TRC  L   R KF  K+RVLAFR+NPRGR G SSF CFS TGAEV+NLS L+E ERPPFDINLAVILAGFAFEAYTSP ENFGK
Subjt:  LHYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSPPENFGK

Query:  REVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK
        REVDAAGC TV+LSE            RE+ + ++F+ +K
Subjt:  REVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVK

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.2e-0828.35Show/hide
Query:  INSDIAADLERELIEEEVFLAVKSLGSSKSPSLDGFTVEFFKHYWITIKYDIMSVVEDFFHLGVINKASNETYICLIPKK-LDGKSMTDYRPNSLIPCAY
        +N +    L R +   E+   + SL + KSP  DGFT EF++ Y   +   ++ + +     G++  +  E  I LIPK   D     ++RP SL+    
Subjt:  INSDIAADLERELIEEEVFLAVKSLGSSKSPSLDGFTVEFFKHYWITIKYDIMSVVEDFFHLGVINKASNETYICLIPKK-LDGKSMTDYRPNSLIPCAY

Query:  KIIARVLSNRLKHVLSHTIAINQMAFL
        KI+ ++L+NR++  +   I  +Q+ F+
Subjt:  KIIARVLSNRLKHVLSHTIAINQMAFL

P08548 LINE-1 reverse transcriptase homolog4.0e-0927.37Show/hide
Query:  DEIERPPFDINLAVILAGFAFEAYTSPPENFGKREVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVKSLGSSKSPSLDGFTVEFFKHYWITIKY
        DEI   P +I    IL  +  + Y+   EN  + +     C    LS+  +       L R +   E+   +++L   KSP  DGFT EF++ +   +  
Subjt:  DEIERPPFDINLAVILAGFAFEAYTSPPENFGKREVDAAGCTTVYLSECPINSDIAADLERELIEEEVFLAVKSLGSSKSPSLDGFTVEFFKHYWITIKY

Query:  DIMSVVEDFFHLGVINKASNETYICLIPKKLDGKSMT---DYRPNSLIPCAYKIIARVLSNRLKHVLSHTIAINQMAFL
         ++++ ++    G++     E  I LIPK   GK  T   +YRP SL+    KI+ ++L+NR++  +   I  +Q+ F+
Subjt:  DIMSVVEDFFHLGVINKASNETYICLIPKKLDGKSMT---DYRPNSLIPCAYKIIARVLSNRLKHVLSHTIAINQMAFL

P11369 LINE-1 retrotransposable element ORF2 protein1.2e-0829.77Show/hide
Query:  INSDIAADLERELIEEEVFLAVKSLGSSKSPSLDGFTVEFFKHYWITIKYDIMSVVEDFFHL----GVINKASNETYICLIPK-KLDGKSMTDYRPNSLI
        +N D    L   +  +E+   + SL + KSP  DGF+ EF++    T K D++ ++   FH     G +  +  E  I LIPK + D   + ++RP SL+
Subjt:  INSDIAADLERELIEEEVFLAVKSLGSSKSPSLDGFTVEFFKHYWITIKYDIMSVVEDFFHL----GVINKASNETYICLIPK-KLDGKSMTDYRPNSLI

Query:  PCAYKIIARVLSNRLKHVLSHTIAINQMAFL
            KI+ ++L+NR++  +   I  +Q+ F+
Subjt:  PCAYKIIARVLSNRLKHVLSHTIAINQMAFL

P14381 Transposon TX1 uncharacterized 149 kDa protein1.9e-1432.84Show/hide
Query:  LERELIEEEVFLAVKSLGSSKSPSLDGFTVEFFKHYWITIKYDIMSVVEDFFHLGVINKASNETYICLIPKKLDGKSMTDYRPNSLIPCAYKIIARVLSN
        LE  +  +E+  A++ +  +KSP LDG T+EFF+ +W T+  D   V+ + F  G +  +     + L+PKK D + + ++RP SL+   YKI+A+ +S 
Subjt:  LERELIEEEVFLAVKSLGSSKSPSLDGFTVEFFKHYWITIKYDIMSVVEDFFHLGVINKASNETYICLIPKKLDGKSMTDYRPNSLIPCAYKIIARVLSN

Query:  RLKHVLSHTIAINQMAFLDDRKILDASLIANEVI
        RLK VL+  I  +Q   +  R I D   +  +++
Subjt:  RLKHVLSHTIAINQMAFLDDRKILDASLIANEVI

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTCTTACTCTTCACTACAATTTTTCTCTCTTCACTATCATTCCCAAGCCCCCTCGGACCCGATGCCTCCACCTCCCCCATTACCGACCCAAGTTTTGTGCCAA
ATCTAGGGTTTTAGCTTTCCGGATAAATCCTAGGGGTCGCCTCGGAGGATCCTCTTTTCCTTGCTTCAGCAGAACAGGGGCTGAAGTTGAAAATTTATCGCTCCTGGATG
AAATTGAACGCCCGCCCTTCGATATCAATCTCGCCGTTATTCTCGCCGGTTTCGCTTTTGAAGCTTACACTAGTCCGCCTGAAAATTTTGGGAAGCGTGAAGTCGACGCT
GCAGGTTGTACGACTGTGTATCTTTCAGAATGCCCTATAAATTCGGATATCGCGGCTGATTTAGAACGAGAATTGATTGAAGAGGAAGTTTTTCTTGCTGTTAAATCTTT
GGGATCAAGTAAGTCTCCAAGCCTTGATGGTTTTACCGTTGAATTCTTTAAACATTATTGGATTACTATTAAATATGATATTATGTCGGTGGTTGAAGATTTCTTTCATT
TAGGTGTTATTAATAAGGCGTCGAATGAAACCTATATTTGCTTAATCCCTAAGAAATTGGATGGAAAATCAATGACAGATTATCGTCCTAATAGTCTTATTCCTTGTGCC
TACAAGATCATTGCAAGAGTCTTGTCTAACCGTTTAAAGCATGTCTTATCTCACACCATTGCAATAAACCAAATGGCTTTTCTAGACGATAGGAAAATTTTGGATGCATC
CTTAATTGCCAATGAGGTTATTGATGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTCTTACTCTTCACTACAATTTTTCTCTCTTCACTATCATTCCCAAGCCCCCTCGGACCCGATGCCTCCACCTCCCCCATTACCGACCCAAGTTTTGTGCCAA
ATCTAGGGTTTTAGCTTTCCGGATAAATCCTAGGGGTCGCCTCGGAGGATCCTCTTTTCCTTGCTTCAGCAGAACAGGGGCTGAAGTTGAAAATTTATCGCTCCTGGATG
AAATTGAACGCCCGCCCTTCGATATCAATCTCGCCGTTATTCTCGCCGGTTTCGCTTTTGAAGCTTACACTAGTCCGCCTGAAAATTTTGGGAAGCGTGAAGTCGACGCT
GCAGGTTGTACGACTGTGTATCTTTCAGAATGCCCTATAAATTCGGATATCGCGGCTGATTTAGAACGAGAATTGATTGAAGAGGAAGTTTTTCTTGCTGTTAAATCTTT
GGGATCAAGTAAGTCTCCAAGCCTTGATGGTTTTACCGTTGAATTCTTTAAACATTATTGGATTACTATTAAATATGATATTATGTCGGTGGTTGAAGATTTCTTTCATT
TAGGTGTTATTAATAAGGCGTCGAATGAAACCTATATTTGCTTAATCCCTAAGAAATTGGATGGAAAATCAATGACAGATTATCGTCCTAATAGTCTTATTCCTTGTGCC
TACAAGATCATTGCAAGAGTCTTGTCTAACCGTTTAAAGCATGTCTTATCTCACACCATTGCAATAAACCAAATGGCTTTTCTAGACGATAGGAAAATTTTGGATGCATC
CTTAATTGCCAATGAGGTTATTGATGTCTAG
Protein sequenceShow/hide protein sequence
MASLTLHYNFSLFTIIPKPPRTRCLHLPHYRPKFCAKSRVLAFRINPRGRLGGSSFPCFSRTGAEVENLSLLDEIERPPFDINLAVILAGFAFEAYTSPPENFGKREVDA
AGCTTVYLSECPINSDIAADLERELIEEEVFLAVKSLGSSKSPSLDGFTVEFFKHYWITIKYDIMSVVEDFFHLGVINKASNETYICLIPKKLDGKSMTDYRPNSLIPCA
YKIIARVLSNRLKHVLSHTIAINQMAFLDDRKILDASLIANEVIDV