; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008916 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008916
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr9:32270729..32273515
RNA-Seq ExpressionLag0008916
SyntenyLag0008916
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN79134.1 hypothetical protein VITISV_000843 [Vitis vinifera]2.7e-2775.31Show/hide
Query:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNSDWSLYQLDVKNAFLNDDLEEEVFMDLPP
        EEMNALK+NGTWE+++LP++KK VGCKWVFTIK    GS+ERYKARLVAKVNS+W L+QLDVKNAFLN DLE+EVFM   P
Subjt:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNSDWSLYQLDVKNAFLNDDLEEEVFMDLPP

RVW39241.1 RCC1 and BTB domain-containing protein 2 [Vitis vinifera]1.9e-2879.27Show/hide
Query:  MEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAK-VNSDWSLYQLDVKNAFLNDDLEEEVFMDLP
        MEEM ALK+N TWEIVELPK K PVGCKWVFTIKY +YG IERYKARLVAK  N DWSL+Q+DVKNAFLN +L+EEV+MDLP
Subjt:  MEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAK-VNSDWSLYQLDVKNAFLNDDLEEEVFMDLP

RVW92108.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]3.0e-2669.51Show/hide
Query:  MEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNSDWSLYQLDVKNAFLNDDLEEEVFMDLPP
        MEE+ AL++NGTWE++ LP+ KKPVGCKWVFT+KY A G++E+YKARLVAK   DW L+Q D+KNAFLN +LEEEVFM LPP
Subjt:  MEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNSDWSLYQLDVKNAFLNDDLEEEVFMDLPP

RVW95296.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.8e-2676.54Show/hide
Query:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNSDWSLYQLDVKNAFLNDDLEEEVFMDLPP
        EEMNAL +   WEIV+LPK+KK +GCKWVFTI Y   GSIERYKARLVAKV  +W L+QLDVKNAFLN DLEEEVFMDLPP
Subjt:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNSDWSLYQLDVKNAFLNDDLEEEVFMDLPP

RVX07801.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]4.0e-2676.83Show/hide
Query:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNSDWSLYQLDVKNAFLNDDLEEEVFMDLPPD
        EEMNAL +N TWEIV+LPK+KK V CKWV TIK    GSI+RYKARLVAKV+ +W L+QLDVKNAFLN DLEEEVFMDLPPD
Subjt:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNSDWSLYQLDVKNAFLNDDLEEEVFMDLPPD

TrEMBL top hitse value%identityAlignment
A0A438DUU0 RCC1 and BTB domain-containing protein 29.2e-2979.27Show/hide
Query:  MEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAK-VNSDWSLYQLDVKNAFLNDDLEEEVFMDLP
        MEEM ALK+N TWEIVELPK K PVGCKWVFTIKY +YG IERYKARLVAK  N DWSL+Q+DVKNAFLN +L+EEV+MDLP
Subjt:  MEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAK-VNSDWSLYQLDVKNAFLNDDLEEEVFMDLP

A0A438I5X4 Retrovirus-related Pol polyprotein from transposon RE11.5e-2669.51Show/hide
Query:  MEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNSDWSLYQLDVKNAFLNDDLEEEVFMDLPP
        MEE+ AL++NGTWE++ LP+ KKPVGCKWVFT+KY A G++E+YKARLVAK   DW L+Q D+KNAFLN +LEEEVFM LPP
Subjt:  MEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNSDWSLYQLDVKNAFLNDDLEEEVFMDLPP

A0A438IEX2 Retrovirus-related Pol polyprotein from transposon RE18.6e-2776.54Show/hide
Query:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNSDWSLYQLDVKNAFLNDDLEEEVFMDLPP
        EEMNAL +   WEIV+LPK+KK +GCKWVFTI Y   GSIERYKARLVAKV  +W L+QLDVKNAFLN DLEEEVFMDLPP
Subjt:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNSDWSLYQLDVKNAFLNDDLEEEVFMDLPP

A0A438JFR8 Retrovirus-related Pol polyprotein from transposon RE11.9e-2676.83Show/hide
Query:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNSDWSLYQLDVKNAFLNDDLEEEVFMDLPPD
        EEMNAL +N TWEIV+LPK+KK V CKWV TIK    GSI+RYKARLVAKV+ +W L+QLDVKNAFLN DLEEEVFMDLPPD
Subjt:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNSDWSLYQLDVKNAFLNDDLEEEVFMDLPPD

A5BNN1 Integrase catalytic domain-containing protein1.3e-2775.31Show/hide
Query:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNSDWSLYQLDVKNAFLNDDLEEEVFMDLPP
        EEMNALK+NGTWE+++LP++KK VGCKWVFTIK    GS+ERYKARLVAKVNS+W L+QLDVKNAFLN DLE+EVFM   P
Subjt:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNSDWSLYQLDVKNAFLNDDLEEEVFMDLPP

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.8e-1235.77Show/hide
Query:  EMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAK------------------------------VNSDWSLYQLDVKNAFLNDDL
        E+NA K N TW I + P++K  V  +WVF++KYN  G+  RYKARLVA+                              +  +  ++Q+DVK AFLN  L
Subjt:  EMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAK------------------------------VNSDWSLYQLDVKNAFLNDDL

Query:  EEEVFMDLP------PDNKLKAN
        +EE++M LP       DN  K N
Subjt:  EEEVFMDLP------PDNKLKAN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.8e-1440Show/hide
Query:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAK------------------------------VNSDWSLYQLDVKNAFLNDD
        EEM +L++NGT+++VELPK K+P+ CKWVF +K +    + RYKARLV K                               + D  + QLDVK AFL+ D
Subjt:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAK------------------------------VNSDWSLYQLDVKNAFLNDD

Query:  LEEEVFMDLP
        LEEE++M+ P
Subjt:  LEEEVFMDLP

P92520 Uncharacterized mitochondrial protein AtMg008204.3e-0750Show/hide
Query:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAK
        EE++AL +N TW +V  P ++  +GCKWVF  K ++ G+++R KARLVAK
Subjt:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.6e-1238.74Show/hide
Query:  EMNALKQNGTWEIVELPKDKKP-VGCKWVFTIKYNAYGSIERYKARLVAK------------------------------VNSDWSLYQLDVKNAFLNDD
        E+NA   N TW++V  P      VGC+W+FT KYN+ GS+ RYKARLVAK                              V+  W + QLDV NAFL   
Subjt:  EMNALKQNGTWEIVELPKDKKP-VGCKWVFTIKYNAYGSIERYKARLVAK------------------------------VNSDWSLYQLDVKNAFLNDD

Query:  LEEEVFMDLPP
        L ++V+M  PP
Subjt:  LEEEVFMDLPP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.4e-1238.74Show/hide
Query:  EMNALKQNGTWEIVELPKDKKP-VGCKWVFTIKYNAYGSIERYKARLVAK------------------------------VNSDWSLYQLDVKNAFLNDD
        E+NA   N TW++V  P      VGC+W+FT K+N+ GS+ RYKARLVAK                              V+  W + QLDV NAFL   
Subjt:  EMNALKQNGTWEIVELPKDKKP-VGCKWVFTIKYNAYGSIERYKARLVAK------------------------------VNSDWSLYQLDVKNAFLNDD

Query:  LEEEVFMDLPP
        L +EV+M  PP
Subjt:  LEEEVFMDLPP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.0e-2044.14Show/hide
Query:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNS------------------------------DWSLYQLDVKNAFLNDD
        +E+ A++   TWEI  LP +KKP+GCKWV+ IKYN+ G+IERYKARLVAK  +                              +++L+QLD+ NAFLN D
Subjt:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNS------------------------------DWSLYQLDVKNAFLNDD

Query:  LEEEVFMDLPP
        L+EE++M LPP
Subjt:  LEEEVFMDLPP

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.0e-0850Show/hide
Query:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAK
        EE++AL +N TW +V  P ++  +GCKWVF  K ++ G+++R KARLVAK
Subjt:  EEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAGATGAATGCTCTAAAACAAAATGGGACTTGGGAAATAGTAGAGTTGCCAAAAGATAAGAAACCAGTGGGTTGTAAATGGGTGTTCACCATAAAATACAATGC
ATATGGTAGTATTGAGAGATATAAGGCCAGACTAGTTGCTAAAGTCAATTCGGATTGGTCACTTTACCAACTTGATGTAAAAAATGCATTTCTGAATGACGATCTCGAGG
AAGAGGTTTTTATGGACTTACCCCCAGATAACAAATTAAAGGCTAATTTAGGGACATGGAGGACAGAATGTAAGGGGGTGGGAGTCAACAACATTGGCAGTGGAAGCAGA
CTAAGTATTACTCTTATCACTTTGCTTGGTACTCTCCCAATTTGCAGGCTTCCCATGGATTTTCCATGTCTCACGAGTGTGTTGAGGTCTGTTACAATAGTCACACCATA
CTTGAGGTTTGGTGGGTGGAACAGAAGCCTCAACCACCATGGCGGAGGTTTCAACTGTACTAACAGCTGTGAAAGATTGGTTTCACAGAAGCCTTATTTGGAATTTTTTG
GGTCAAATCTTCTGGTTGCAAGCCTCGAGCGCCCGAATCCAAGGCCTTCAAGAAATTGGATTTTTCTTTCTCTCATGGAGGCGGAAGCTTCAAACGTCCAGAGAACCTAT
GATCGACGCTTGCAGTCGGTTGAAACGTCCGTAGAAGACATCAAGAAGAGTGTGACTGACATTCAACAGTCCATTAGGCAACTTACACAACAGATGAGTACACTGACAGC
AAATCAACAAAATGTGGGAGACAACAGAATGGCAATAAACCCAAGAAACCCCAAGAAAACCAAGACAGATTGGTGGCTGGAGAAAGAAACACTCAAGAAAACAGAAATGA
AGTTGTTGTCCATCCAAGAAGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAGATGAATGCTCTAAAACAAAATGGGACTTGGGAAATAGTAGAGTTGCCAAAAGATAAGAAACCAGTGGGTTGTAAATGGGTGTTCACCATAAAATACAATGC
ATATGGTAGTATTGAGAGATATAAGGCCAGACTAGTTGCTAAAGTCAATTCGGATTGGTCACTTTACCAACTTGATGTAAAAAATGCATTTCTGAATGACGATCTCGAGG
AAGAGGTTTTTATGGACTTACCCCCAGATAACAAATTAAAGGCTAATTTAGGGACATGGAGGACAGAATGTAAGGGGGTGGGAGTCAACAACATTGGCAGTGGAAGCAGA
CTAAGTATTACTCTTATCACTTTGCTTGGTACTCTCCCAATTTGCAGGCTTCCCATGGATTTTCCATGTCTCACGAGTGTGTTGAGGTCTGTTACAATAGTCACACCATA
CTTGAGGTTTGGTGGGTGGAACAGAAGCCTCAACCACCATGGCGGAGGTTTCAACTGTACTAACAGCTGTGAAAGATTGGTTTCACAGAAGCCTTATTTGGAATTTTTTG
GGTCAAATCTTCTGGTTGCAAGCCTCGAGCGCCCGAATCCAAGGCCTTCAAGAAATTGGATTTTTCTTTCTCTCATGGAGGCGGAAGCTTCAAACGTCCAGAGAACCTAT
GATCGACGCTTGCAGTCGGTTGAAACGTCCGTAGAAGACATCAAGAAGAGTGTGACTGACATTCAACAGTCCATTAGGCAACTTACACAACAGATGAGTACACTGACAGC
AAATCAACAAAATGTGGGAGACAACAGAATGGCAATAAACCCAAGAAACCCCAAGAAAACCAAGACAGATTGGTGGCTGGAGAAAGAAACACTCAAGAAAACAGAAATGA
AGTTGTTGTCCATCCAAGAAGACTAG
Protein sequenceShow/hide protein sequence
MEEMNALKQNGTWEIVELPKDKKPVGCKWVFTIKYNAYGSIERYKARLVAKVNSDWSLYQLDVKNAFLNDDLEEEVFMDLPPDNKLKANLGTWRTECKGVGVNNIGSGSR
LSITLITLLGTLPICRLPMDFPCLTSVLRSVTIVTPYLRFGGWNRSLNHHGGGFNCTNSCERLVSQKPYLEFFGSNLLVASLERPNPRPSRNWIFLSLMEAEASNVQRTY
DRRLQSVETSVEDIKKSVTDIQQSIRQLTQQMSTLTANQQNVGDNRMAINPRNPKKTKTDWWLEKETLKKTEMKLLSIQED