; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014113 (gene) of Snake gourd v1 genome

Gene IDTan0014113
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationLG07:6883861..6885019
RNA-Seq ExpressionTan0014113
SyntenyTan0014113
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW57887.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]4.3e-2255.56Show/hide
Query:  MGLKTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQA
        +GL +QLQ+IKK+G+++S+YLA+IK V DK+SA+GEP+SY+D L H L+GL  +Y+ FV +I NRSD P LE+V SLL  YE RLE++   +QL L Q+
Subjt:  MGLKTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQA

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia]1.1e-2863.37Show/hide
Query:  QLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQANLSSLT
        ++Q++KKDGLSVSQYLA+IK++T K S+IGEPIS +DH+++I++GLG EYN FVT+IQNRSD   LEDVR+LLLAY+ RLE+Q +V+QLN+ QAN+++L 
Subjt:  QLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQANLSSLT

Query:  I
        +
Subjt:  I

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]1.8e-3964.12Show/hide
Query:  MGLKTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQAN
        MGLKT+LQ ++KDG SVSQYLA+IK++ DKF+A+GEP+SYRDHLAH+LDGLGSEYN FVT+I NR+D+P+LEDVRSLLLAYEARL++Q  V+QLN+AQAN
Subjt:  MGLKTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQAN

Query:  LSSLTIQHNIFLASPTHLLKNGLPDPLPLVP
        L +L++QHN     P     N      P  P
Subjt:  LSSLTIQHNIFLASPTHLLKNGLPDPLPLVP

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]3.9e-3165.09Show/hide
Query:  MGLKTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQAN
        MG  +QLQ+IKKDGL+VSQYLAQIKDV D F+AIGEP+SYRDHL++IL+GLGSEYN FV++I NR++ P++ DVR+LL+ Y++RLE+QTA + L L QAN
Subjt:  MGLKTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQAN

Query:  LSSLTI
        ++ L+I
Subjt:  LSSLTI

XP_038891713.1 uncharacterized protein LOC120081111 [Benincasa hispida]3.8e-3469.72Show/hide
Query:  MGLKTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQAN
        M LK +LQ+I+KD LS+SQYL+QIKDV DKFS +GE ISYRDHL HILDGLGSEYN FVT+IQN  DN ++EDV SLLL+YEA+LE+Q A++ LN+AQA 
Subjt:  MGLKTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQAN

Query:  LSSLTIQHN
        LS L+ QHN
Subjt:  LSSLTIQHN

TrEMBL top hitse value%identityAlignment
A0A438FD18 Retrovirus-related Pol polyprotein from transposon RE12.1e-2255.56Show/hide
Query:  MGLKTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQA
        +GL +QLQ+IKK+G+++S+YLA+IK V DK+SA+GEP+SY+D L H L+GL  +Y+ FV +I NRSD P LE+V SLL  YE RLE++   +QL L Q+
Subjt:  MGLKTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQA

A0A5C7IHH0 Uncharacterized protein4.7e-2256.98Show/hide
Query:  KTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQT
        ++QL  +KK+G +++QYL Q K++ DKF+AIGEP+SYRDHL ++L+GLG EY+ FVT+I+NR D P++EDV SLLL++E RL ++T
Subjt:  KTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQT

A0A6J1D6N7 uncharacterized protein LOC1110174385.2e-2963.37Show/hide
Query:  QLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQANLSSLT
        ++Q++KKDGLSVSQYLA+IK++T K S+IGEPIS +DH+++I++GLG EYN FVT+IQNRSD   LEDVR+LLLAY+ RLE+Q +V+QLN+ QAN+++L 
Subjt:  QLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQANLSSLT

Query:  I
        +
Subjt:  I

A0A6J1DQX7 uncharacterized protein LOC1110223158.5e-4064.12Show/hide
Query:  MGLKTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQAN
        MGLKT+LQ ++KDG SVSQYLA+IK++ DKF+A+GEP+SYRDHLAH+LDGLGSEYN FVT+I NR+D+P+LEDVRSLLLAYEARL++Q  V+QLN+AQAN
Subjt:  MGLKTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQAN

Query:  LSSLTIQHNIFLASPTHLLKNGLPDPLPLVP
        L +L++QHN     P     N      P  P
Subjt:  LSSLTIQHNIFLASPTHLLKNGLPDPLPLVP

A0A7J0E8R3 Uncharacterized protein8.0e-2247.06Show/hide
Query:  LKTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQANLS
        L+T LQ IKKDGL+   Y+ + + + +  ++IGEP++Y DHL + L GLG +YN FVT+IQ+++  P++E+V SLLL+Y+ARLERQ+A + L+  QANL+
Subjt:  LKTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQANLS

Query:  SLTIQHNIFLASPTHLLKN
        +LT Q   F    T+   N
Subjt:  SLTIQHNIFLASPTHLLKN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)3.8e-0830.39Show/hide
Query:  KTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQANLSS
        + +L+    D LSV +Y  ++K ++D  + +  PIS R  + H+L+GL  +Y+  +  I+++S  P+  + RS+LL  E+RL  ++   + +L+  N  S
Subjt:  KTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQANLSS

Query:  LT
        L+
Subjt:  LT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCTCAAAACACAGTTACAAAGAATCAAGAAAGATGGTCTATCAGTCAGTCAATACTTAGCCCAAATTAAGGATGTCACTGATAAATTCTCTGCGATTGGTGAACC
TATATCCTATAGAGATCATTTAGCTCACATCCTTGATGGTTTAGGTAGTGAATATAATGTTTTTGTCACTACAATCCAAAATCGATCTGATAATCCTGCTTTGGAAGATG
TTAGAAGCTTGTTGTTGGCCTATGAAGCTCGTTTAGAGCGACAGACTGCTGTTGAACAACTAAATTTGGCTCAAGCTAACCTCAGTAGTCTCACCATTCAACACAACATA
TTCTTGGCAAGCCCCACACACCTCCTCAAAAATGGTCTTCCAGACCCTCTTCCTCTCGTCCCCAATGCCAAATTTGTGATTGAGAAGATGAGTAATACTAAACTAGACTT
TAGGCACTTAAACAACTCAAAAGCAACAAAAATGAGCTTAGCAAAAGAGATACTAAATCTAGCTATCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTCTCAAAACACAGTTACAAAGAATCAAGAAAGATGGTCTATCAGTCAGTCAATACTTAGCCCAAATTAAGGATGTCACTGATAAATTCTCTGCGATTGGTGAACC
TATATCCTATAGAGATCATTTAGCTCACATCCTTGATGGTTTAGGTAGTGAATATAATGTTTTTGTCACTACAATCCAAAATCGATCTGATAATCCTGCTTTGGAAGATG
TTAGAAGCTTGTTGTTGGCCTATGAAGCTCGTTTAGAGCGACAGACTGCTGTTGAACAACTAAATTTGGCTCAAGCTAACCTCAGTAGTCTCACCATTCAACACAACATA
TTCTTGGCAAGCCCCACACACCTCCTCAAAAATGGTCTTCCAGACCCTCTTCCTCTCGTCCCCAATGCCAAATTTGTGATTGAGAAGATGAGTAATACTAAACTAGACTT
TAGGCACTTAAACAACTCAAAAGCAACAAAAATGAGCTTAGCAAAAGAGATACTAAATCTAGCTATCTAA
Protein sequenceShow/hide protein sequence
MGLKTQLQRIKKDGLSVSQYLAQIKDVTDKFSAIGEPISYRDHLAHILDGLGSEYNVFVTTIQNRSDNPALEDVRSLLLAYEARLERQTAVEQLNLAQANLSSLTIQHNI
FLASPTHLLKNGLPDPLPLVPNAKFVIEKMSNTKLDFRHLNNSKATKMSLAKEILNLAI