CuGenDBv2

Tan0021156 (gene) of Snake gourd v1 genome

Gene ID: Tan0021156
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase domain-containing protein
Genome location: LG02:23486121..23487331
RNA-Seq Expression: Tan0021156
Synteny: Tan0021156
Gene Ontology terms: GO:0016740 - transferase activity (molecular function)
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
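
The genome location field can be split into a sequence ID and a coordinate pair to get the length of the gene span. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG02:23486121..23487331")
print(seqid, span_length(start, end))  # prints: LG02 1211
```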


Homology
GenBank top hits (e value, % identity, alignment)
GAV76858.1 Exo_endo_phos domain-containing protein, partial [Cephalotus follicularis] | e value: 9.9e-16 | identity: 38.71%
Query:  EERLNIVVAVAAAMKT---FCWNVHGSGSSRAFRSVWAVAYACLWNEEIKVSIRSYSSHHID--VSVVWNDRSWRFTGLYGQPDQRLRNQTWELMCRLGN
        E RL+     A  +KT    C+ V+  G  R          A LWNEE+ +SI+S+S+ HID  + ++  D  WR TG+YG+ +  +R++TW L+C L  
Subjt:  EERLNIVVAVAAAMKT---FCWNVHGSGSSRAFRSVWAVAYACLWNEEIKVSIRSYSSHHID--VSVVWNDRSWRFTGLYGQPDQRLRNQTWELMCRLGN

Query:  HDPSPWVIGGDLNELLWDSEKWGG
        H  +PW+  GD NE+L   EK GG
Subjt:  HDPSPWVIGGDLNELLWDSEKWGG

KAF5477448.1 hypothetical protein F2P56_004088 [Juglans regia] | e value: 6.9e-17 | identity: 36.03%
Query:  AVAAAMKTFCWNVHGSGSSRAFRSV------------------------WAVAYACLWNEEIKVSIRSYSSHHIDVSVVWND-RSWRFTGLYGQPDQRLR
        A+   +KT CWN  G G+    R++                         +   + LWN E+ + ++SYS +HIDV V  +D R WRFTG+YG P+   R
Subjt:  AVAAAMKTFCWNVHGSGSSRAFRSV------------------------WAVAYACLWNEEIKVSIRSYSSHHIDVSVVWND-RSWRFTGLYGQPDQRLR

Query:  NQTWELMCRLGNHDPSPWVIGGDLNELLWDSEKWGG
          TW L+  L + +  PW++GGD NE+L+  EKWGG
Subjt:  NQTWELMCRLGNHDPSPWVIGGDLNELLWDSEKWGG

XP_012859003.1 PREDICTED: uncharacterized protein LOC105978134 [Erythranthe guttata] | e value: 3.8e-15 | identity: 41.07%
Query:  WNVHGSGSSRAFRSVWAVAYACLWNEEIKVSIRSYSSHHIDVSV--VWNDRSWRFTGLYGQPDQRLRNQTWELMCRLGNHDPSPWVIGGDLNELLWDSEK
        WN+ G    R          A LW++E+ V +RSYS + IDV+V  +   + WRFTG YG P+   +  +WEL+  LG  D  PW++GGD NE L  SEK
Subjt:  WNVHGSGSSRAFRSVWAVAYACLWNEEIKVSIRSYSSHHIDVSV--VWNDRSWRFTGLYGQPDQRLRNQTWELMCRLGNHDPSPWVIGGDLNELLWDSEK

Query:  WGG----PRFLK
         GG    P F++
Subjt:  WGG----PRFLK

XP_024195790.1 uncharacterized protein LOC112198938 [Rosa chinensis] | e value: 3.8e-15 | identity: 46.53%
Query:  HGSGSSRAFRSVWAVAYACLWNEEIKVSIRSYSSHHIDVSV--VWNDRSWRFTGLYGQPDQRLRNQTWELMCRLGNHDPSPWVIGGDLNELLWDSEKWGG
        HG   SR      A     LWN++I VS++SYS  HIDV V  +   + WRFTG+YGQP    R+ TW L+  LG     PW++GGD NE+L   EK GG
Subjt:  HGSGSSRAFRSVWAVAYACLWNEEIKVSIRSYSSHHIDVSV--VWNDRSWRFTGLYGQPDQRLRNQTWELMCRLGNHDPSPWVIGGDLNELLWDSEKWGG

Query:  P
        P
Subjt:  P

XP_042980077.1 uncharacterized protein LOC122310261 [Carya illinoinensis] | e value: 8.4e-15 | identity: 42.72%
Query:  CWNVHGSGSSRAFRSVWAVAYACLWNEEIKVSIRSYSSHHIDVSVVWND-RSWRFTGLYGQPDQRLRNQTWELMCRLGNHDPSPWVIGGDLNELLWDSEK
        C++V   G S             LWN +++V +RS+S +HIDV +  +D   WRFTGLYG PD   R  TW L+  L + +  PW++GGDLNE+L   EK
Subjt:  CWNVHGSGSSRAFRSVWAVAYACLWNEEIKVSIRSYSSHHIDVSVVWND-RSWRFTGLYGQPDQRLRNQTWELMCRLGNHDPSPWVIGGDLNELLWDSEK

Query:  WGG
         GG
Subjt:  WGG

TrEMBL top hits (e value, % identity, alignment)
A0A1Q3C9T4 Exo_endo_phos domain-containing protein (Fragment) | e value: 4.8e-16 | identity: 38.71%
Query:  EERLNIVVAVAAAMKT---FCWNVHGSGSSRAFRSVWAVAYACLWNEEIKVSIRSYSSHHID--VSVVWNDRSWRFTGLYGQPDQRLRNQTWELMCRLGN
        E RL+     A  +KT    C+ V+  G  R          A LWNEE+ +SI+S+S+ HID  + ++  D  WR TG+YG+ +  +R++TW L+C L  
Subjt:  EERLNIVVAVAAAMKT---FCWNVHGSGSSRAFRSVWAVAYACLWNEEIKVSIRSYSSHHID--VSVVWNDRSWRFTGLYGQPDQRLRNQTWELMCRLGN

Query:  HDPSPWVIGGDLNELLWDSEKWGG
        H  +PW+  GD NE+L   EK GG
Subjt:  HDPSPWVIGGDLNELLWDSEKWGG

A0A2N9EFF7 Uncharacterized protein | e value: 1.2e-14 | identity: 48.15%
Query:  ACLWNEEIKVSIRSYSSHHIDVSVVWND-RSWRFTGLYGQPDQRLRNQTWELMCRLGNHDPSPWVIGGDLNELLWDSEKWG
        A LW+ E+ V I+SYS  HID  ++ ND  SWRFTG YG PD   + ++W+L+ RLG+    PW+I GD NE++ + EK G
Subjt:  ACLWNEEIKVSIRSYSSHHIDVSVVWND-RSWRFTGLYGQPDQRLRNQTWELMCRLGNHDPSPWVIGGDLNELLWDSEKWG

A0A2N9F9I5 Uncharacterized protein | e value: 3.1e-15 | identity: 47.5%
Query:  ACLWNEEIKVSIRSYSSHHIDVSVVWNDRSWRFTGLYGQPDQRLRNQTWELMCRLGNHDPSPWVIGGDLNELLWDSEKWG
        A  W++E+ VSI SYS HHID  + ++  +WRFTG YG P    +   W+L+  L  H   PW+ GGD NELL   EKWG
Subjt:  ACLWNEEIKVSIRSYSSHHIDVSVVWNDRSWRFTGLYGQPDQRLRNQTWELMCRLGNHDPSPWVIGGDLNELLWDSEKWG

A0A2N9J6Y2 Uncharacterized protein | e value: 1.2e-14 | identity: 46.25%
Query:  ACLWNEEIKVSIRSYSSHHIDVSVVWNDRSWRFTGLYGQPDQRLRNQTWELMCRLGNHDPSPWVIGGDLNELLWDSEKWG
        A  W++E+ VSI SYS HHID  + +++ +WR TG YG P    ++  W+L+  L  H   PW+ GGD NELL   EKWG
Subjt:  ACLWNEEIKVSIRSYSSHHIDVSVVWNDRSWRFTGLYGQPDQRLRNQTWELMCRLGNHDPSPWVIGGDLNELLWDSEKWG

A0A2P6SDG4 Putative RNA-directed DNA polymerase | e value: 1.2e-14 | identity: 49.38%
Query:  LWNEEIKVSIRSYSSHHIDVSV--VWNDRSWRFTGLYGQPDQRLRNQTWELMCRLGNHDPSPWVIGGDLNELLWDSEKWGG
        LWN++ KVS++SYS++HIDV +    + R WRFTG+YG P    R++TW L+ +L  H   PWV+GGD NE+   S+K GG
Subjt:  LWNEEIKVSIRSYSSHHIDVSV--VWNDRSWRFTGLYGQPDQRLRNQTWELMCRLGNHDPSPWVIGGDLNELLWDSEKWGG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
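
The % identity values in the hit tables can be related to the middle line of each alignment block, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch under that reading of the BLAST-style match line; the function name `alignment_stats` is hypothetical, and it scores only the segment passed in, so a multi-block alignment would need its blocks concatenated first.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment segment.

    `query` is the aligned query row (gaps included) and `midline` is the
    match row beneath it. Trailing spaces are often lost when the text is
    copied, so the match row is padded back to the query length.
    """
    midline = midline.ljust(len(query))
    columns = len(query)
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {
        "columns": columns,
        "identities": identities,
        "positives": positives,
        "percent_identity": round(100.0 * identities / columns, 2),
    }


# Final segment of the Erythranthe guttata hit shown above.
stats = alignment_stats("WGG----PRFLK", " GG    P F++")
print(stats["identities"], stats["columns"], stats["percent_identity"])
# prints: 4 12 33.33
```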

Sequences
CDS sequence
ATGGCTACTTCTCCGTCCGATTTTTCGAAGTCAGAGGGGGGAAAATTAATTTTTTCAGCTACTGTGGGTGGACCAGATTCGCCGGTGACGTCGTCGGTTACAATCAATTC
CTTGATGGGCATTGCGCCATTATTAAATTCCAGCGTGAAGACATCAAGCCTCGGGGAAGAGGGTTCAAAGCGAAAACGCAAATCAGAGTTACCTTCTGATCTTCCAAACA
AGCGGTTTAATCTCGTCTCGAAGTCGACCAATGGCGAAGAGCGTCTTAATATTGTGGTAGCGGTTGCAGCAGCCATGAAAACGTTTTGTTGGAACGTTCATGGGTCGGGG
AGCTCTAGGGCATTTCGTTCTGTTTGGGCGGTGGCTTATGCCTGTTTGTGGAATGAGGAAATCAAGGTGTCAATTCGATCTTACTCCTCTCACCATATTGATGTGTCAGT
GGTGTGGAATGATAGAAGTTGGAGATTCACGGGCTTGTATGGTCAACCGGACCAACGATTGCGCAATCAAACATGGGAGTTGATGTGTAGATTGGGAAACCATGACCCAT
CACCATGGGTGATAGGAGGGGATCTAAATGAGCTCCTTTGGGACTCGGAAAAGTGGGGAGGTCCAAGATTTTTGAAAGGTGGTGGATGA
mRNA sequence
ATGGCTACTTCTCCGTCCGATTTTTCGAAGTCAGAGGGGGGAAAATTAATTTTTTCAGCTACTGTGGGTGGACCAGATTCGCCGGTGACGTCGTCGGTTACAATCAATTC
CTTGATGGGCATTGCGCCATTATTAAATTCCAGCGTGAAGACATCAAGCCTCGGGGAAGAGGGTTCAAAGCGAAAACGCAAATCAGAGTTACCTTCTGATCTTCCAAACA
AGCGGTTTAATCTCGTCTCGAAGTCGACCAATGGCGAAGAGCGTCTTAATATTGTGGTAGCGGTTGCAGCAGCCATGAAAACGTTTTGTTGGAACGTTCATGGGTCGGGG
AGCTCTAGGGCATTTCGTTCTGTTTGGGCGGTGGCTTATGCCTGTTTGTGGAATGAGGAAATCAAGGTGTCAATTCGATCTTACTCCTCTCACCATATTGATGTGTCAGT
GGTGTGGAATGATAGAAGTTGGAGATTCACGGGCTTGTATGGTCAACCGGACCAACGATTGCGCAATCAAACATGGGAGTTGATGTGTAGATTGGGAAACCATGACCCAT
CACCATGGGTGATAGGAGGGGATCTAAATGAGCTCCTTTGGGACTCGGAAAAGTGGGGAGGTCCAAGATTTTTGAAAGGTGGTGGATGA
Protein sequence
MATSPSDFSKSEGGKLIFSATVGGPDSPVTSSVTINSLMGIAPLLNSSVKTSSLGEEGSKRKRKSELPSDLPNKRFNLVSKSTNGEERLNIVVAVAAAMKTFCWNVHGSG
SSRAFRSVWAVAYACLWNEEIKVSIRSYSSHHIDVSVVWNDRSWRFTGLYGQPDQRLRNQTWELMCRLGNHDPSPWVIGGDLNELLWDSEKWGGPRFLKGGG
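
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which this record does not state explicitly; the `translate` helper is hypothetical and written out by hand rather than taken from any library.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code, table 1).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped text
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = ambiguous codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last codons of the CDS listed above.
print(translate("ATGGCTACTTCTCCGTCCGATTTTTCG"))  # prints: MATSPSDFS
print(translate("AAAGGTGGTGGATGA"))              # prints: KGGG
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the protein sequence, after joining its lines, is one way to confirm that the two entries agree.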