; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002041 (gene) of Snake gourd v1 genome

Gene IDTan0002041
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationLG01:102734017..102734412
RNA-Seq ExpressionTan0002041
SyntenyTan0002041
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035484.1 putative mitochondrial protein [Cucumis melo var. makuwa]3.3e-3860Show/hide
Query:  MVTRTKSDIFKPKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPD
        M+TR+K  IFKPKA L   ++ +T+P +++ A   PHW+K M+EE++AL KN+TW L S+  +QK+VGCKWVFKIKRNS  +I+RYK RLVAK FHQ+P+
Subjt:  MVTRTKSDIFKPKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPD

Query:  IDYFETFSPVVKPITIRILLTLALAFNWTI
        IDY ETFSPVVKP+TIR+LLT+A+   W+I
Subjt:  IDYFETFSPVVKPITIRILLTLALAFNWTI

RVW84602.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.1e-3656.39Show/hide
Query:  MVTRTKSDIFKPKALLAATNFV---ETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQ
        M TR K+ IFKPK  L+ T  +     +P S + A+K P W++ M  E++AL+ N TW LVS P +Q ++GC+WV+K+K   DGT+ RYKARLVAKGFHQ
Subjt:  MVTRTKSDIFKPKALLAATNFV---ETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQ

Query:  SPDIDYFETFSPVVKPITIRILLTLALAFNWTI
        +PD DYFETFSPVVKP TIR++L+LAL+ NW+I
Subjt:  SPDIDYFETFSPVVKPITIRILLTLALAFNWTI

TYK18915.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.2e-3760.77Show/hide
Query:  MVTRTKSDIFKPKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPD
        M+T++K DIFKPKA L   ++ +T+  + + A   PHW+K M+EE+ AL KN TW L+ +  +QK+VGCKWVFKIKRNS G+ISRYKARLVAKGFHQ+ +
Subjt:  MVTRTKSDIFKPKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPD

Query:  IDYFETFSPVVKPITIRILLTLALAFNWTI
        IDY ETFSPVVKPITIR+LLT+ +   W+I
Subjt:  IDYFETFSPVVKPITIRILLTLALAFNWTI

XP_008461310.2 PREDICTED: uncharacterized mitochondrial protein AtMg00820-like isoform X1 [Cucumis melo]3.3e-3860Show/hide
Query:  MVTRTKSDIFKPKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPD
        M+TR+K  IFKPKA L   ++ +T+P +++ A   PHW+K M+EE++AL KN+TW L S+  +QK+VGCKWVFKIKRNS  +I+RYK RLVAK FHQ+P+
Subjt:  MVTRTKSDIFKPKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPD

Query:  IDYFETFSPVVKPITIRILLTLALAFNWTI
        IDY ETFSPVVKP+TIR+LLT+A+   W+I
Subjt:  IDYFETFSPVVKPITIRILLTLALAFNWTI

XP_016902739.1 PREDICTED: uncharacterized mitochondrial protein AtMg00820-like isoform X2 [Cucumis melo]3.3e-3860Show/hide
Query:  MVTRTKSDIFKPKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPD
        M+TR+K  IFKPKA L   ++ +T+P +++ A   PHW+K M+EE++AL KN+TW L S+  +QK+VGCKWVFKIKRNS  +I+RYK RLVAK FHQ+P+
Subjt:  MVTRTKSDIFKPKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPD

Query:  IDYFETFSPVVKPITIRILLTLALAFNWTI
        IDY ETFSPVVKP+TIR+LLT+A+   W+I
Subjt:  IDYFETFSPVVKPITIRILLTLALAFNWTI

TrEMBL top hitse value%identityAlignment
A0A1S3CEF5 uncharacterized mitochondrial protein AtMg00820-like isoform X11.6e-3860Show/hide
Query:  MVTRTKSDIFKPKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPD
        M+TR+K  IFKPKA L   ++ +T+P +++ A   PHW+K M+EE++AL KN+TW L S+  +QK+VGCKWVFKIKRNS  +I+RYK RLVAK FHQ+P+
Subjt:  MVTRTKSDIFKPKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPD

Query:  IDYFETFSPVVKPITIRILLTLALAFNWTI
        IDY ETFSPVVKP+TIR+LLT+A+   W+I
Subjt:  IDYFETFSPVVKPITIRILLTLALAFNWTI

A0A1S4E3D4 uncharacterized mitochondrial protein AtMg00820-like isoform X21.6e-3860Show/hide
Query:  MVTRTKSDIFKPKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPD
        M+TR+K  IFKPKA L   ++ +T+P +++ A   PHW+K M+EE++AL KN+TW L S+  +QK+VGCKWVFKIKRNS  +I+RYK RLVAK FHQ+P+
Subjt:  MVTRTKSDIFKPKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPD

Query:  IDYFETFSPVVKPITIRILLTLALAFNWTI
        IDY ETFSPVVKP+TIR+LLT+A+   W+I
Subjt:  IDYFETFSPVVKPITIRILLTLALAFNWTI

A0A2N9FNX4 CCHC-type domain-containing protein1.8e-3759.54Show/hide
Query:  MVTRTKSDIFKPKALLAAT-NFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSP
        MVTR++ +I+KPK LL AT +   ++P  +  ALK P WR TM EE+DAL++N TWELV    +  ++GCKWVF+IKRN DG+ISRYKARLVAKGFHQ P
Subjt:  MVTRTKSDIFKPKALLAAT-NFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSP

Query:  DIDYFETFSPVVKPITIRILLTLALAFNWTI
         +DY +TFSPVVKP TIR++L LAL+  W +
Subjt:  DIDYFETFSPVVKPITIRILLTLALAFNWTI

A0A5A7SW14 Putative mitochondrial protein1.6e-3860Show/hide
Query:  MVTRTKSDIFKPKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPD
        M+TR+K  IFKPKA L   ++ +T+P +++ A   PHW+K M+EE++AL KN+TW L S+  +QK+VGCKWVFKIKRNS  +I+RYK RLVAK FHQ+P+
Subjt:  MVTRTKSDIFKPKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPD

Query:  IDYFETFSPVVKPITIRILLTLALAFNWTI
        IDY ETFSPVVKP+TIR+LLT+A+   W+I
Subjt:  IDYFETFSPVVKPITIRILLTLALAFNWTI

A0A5D3D5W0 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-3760.77Show/hide
Query:  MVTRTKSDIFKPKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPD
        M+T++K DIFKPKA L   ++ +T+  + + A   PHW+K M+EE+ AL KN TW L+ +  +QK+VGCKWVFKIKRNS G+ISRYKARLVAKGFHQ+ +
Subjt:  MVTRTKSDIFKPKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPD

Query:  IDYFETFSPVVKPITIRILLTLALAFNWTI
        IDY ETFSPVVKPITIR+LLT+ +   W+I
Subjt:  IDYFETFSPVVKPITIRILLTLALAFNWTI

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.7e-1643.33Show/hide
Query:  WRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPDIDYFETFSPVVKPITIRILLTLALAFN
        W + +  E +A   NNTW +  RP ++ +V  +WVF +K N  G   RYKARLVA+GF Q   IDY ETF+PV +  + R +L+L + +N
Subjt:  WRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPDIDYFETFSPVVKPITIRILLTLALAFN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.8e-1945.37Show/hide
Query:  ETKPRSIQVALKCP---HWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPDIDYFETFSPVVKPITIRIL
        + +P S++  L  P      K MQEE ++L KN T++LV  P  ++ + CKWVFK+K++ D  + RYKARLV KGF Q   ID+ E FSPVVK  +IR +
Subjt:  ETKPRSIQVALKCP---HWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPDIDYFETFSPVVKPITIRIL

Query:  LTLALAFN
        L+LA + +
Subjt:  LTLALAFN

P92520 Uncharacterized mitochondrial protein AtMg008209.2e-3154.4Show/hide
Query:  MVTRTKSDIFK--PKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQS
        M+TR+K+ I K  PK  L  T  ++ +P+S+  ALK P W + MQEE DAL +N TW LV  PV+Q ++GCKWVFK K +SDGT+ R KARLVAKGFHQ 
Subjt:  MVTRTKSDIFK--PKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQS

Query:  PDIDYFETFSPVVKPITIRILLTLA
          I + ET+SPVV+  TIR +L +A
Subjt:  PDIDYFETFSPVVKPITIRILLTLA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.3e-2849.62Show/hide
Query:  MVTRTKSDIFK--PKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQ-KVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQ
        M TR K+ I K  PK  LA +   E++PR+   ALK   WR  M  E +A + N+TW+LV  P S   +VGC+W+F  K NSDG+++RYKARLVAKG++Q
Subjt:  MVTRTKSDIFK--PKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQ-KVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQ

Query:  SPDIDYFETFSPVVKPITIRILLTLALAFNWTI
         P +DY ETFSPV+K  +IRI+L +A+  +W I
Subjt:  SPDIDYFETFSPVVKPITIRILLTLALAFNWTI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.8e-2647.37Show/hide
Query:  MVTRTKSDIFKP--KALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELV-SRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQ
        M TR K  I KP  K   A +    ++PR+   A+K   WR+ M  E +A + N+TW+LV   P S  +VGC+W+F  K NSDG+++RYKARLVAKG++Q
Subjt:  MVTRTKSDIFKP--KALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELV-SRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQ

Query:  SPDIDYFETFSPVVKPITIRILLTLALAFNWTI
         P +DY ETFSPV+K  +IRI+L +A+  +W I
Subjt:  SPDIDYFETFSPVVKPITIRILLTLALAFNWTI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.9e-2349.46Show/hide
Query:  WRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPDIDYFETFSPVVKPITIRILLTLALAFNWTI
        W   M +E  A+   +TWE+ + P ++K +GCKWV+KIK NSDGTI RYKARLVAKG+ Q   ID+ ETFSPV K  +++++L ++  +N+T+
Subjt:  WRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPDIDYFETFSPVVKPITIRILLTLALAFNWTI

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.5e-3254.4Show/hide
Query:  MVTRTKSDIFK--PKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQS
        M+TR+K+ I K  PK  L  T  ++ +P+S+  ALK P W + MQEE DAL +N TW LV  PV+Q ++GCKWVFK K +SDGT+ R KARLVAKGFHQ 
Subjt:  MVTRTKSDIFK--PKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQS

Query:  PDIDYFETFSPVVKPITIRILLTLA
          I + ET+SPVV+  TIR +L +A
Subjt:  PDIDYFETFSPVVKPITIRILLTLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTACTAGGACAAAGAGTGATATTTTTAAACCTAAGGCTTTACTAGCAGCCACTAATTTTGTTGAAACTAAACCTCGTAGTATTCAGGTAGCTTTAAAATGCCCTCA
TTGGCGTAAGACAATGCAAGAAGAGTATGATGCCTTACTTAAAAATAACACATGGGAGTTAGTATCAAGACCAGTGAGTCAGAAAGTAGTTGGTTGCAAGTGGGTATTCA
AAATCAAAAGGAATTCTGATGGCACCATTAGTAGATATAAGGCTCGATTGGTTGCAAAGGGGTTTCACCAATCACCAGATATAGATTATTTTGAAACTTTTAGTCCCGTT
GTGAAACCAATTACCATACGAATTCTACTCACCCTTGCCCTTGCTTTTAACTGGACTATTTGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTACTAGGACAAAGAGTGATATTTTTAAACCTAAGGCTTTACTAGCAGCCACTAATTTTGTTGAAACTAAACCTCGTAGTATTCAGGTAGCTTTAAAATGCCCTCA
TTGGCGTAAGACAATGCAAGAAGAGTATGATGCCTTACTTAAAAATAACACATGGGAGTTAGTATCAAGACCAGTGAGTCAGAAAGTAGTTGGTTGCAAGTGGGTATTCA
AAATCAAAAGGAATTCTGATGGCACCATTAGTAGATATAAGGCTCGATTGGTTGCAAAGGGGTTTCACCAATCACCAGATATAGATTATTTTGAAACTTTTAGTCCCGTT
GTGAAACCAATTACCATACGAATTCTACTCACCCTTGCCCTTGCTTTTAACTGGACTATTTGTTAG
Protein sequenceShow/hide protein sequence
MVTRTKSDIFKPKALLAATNFVETKPRSIQVALKCPHWRKTMQEEYDALLKNNTWELVSRPVSQKVVGCKWVFKIKRNSDGTISRYKARLVAKGFHQSPDIDYFETFSPV
VKPITIRILLTLALAFNWTIC