CuGenDBv2

Tan0008142 (gene) of Snake gourd v1 genome

Gene ID: Tan0008142
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase domain-containing protein
Genome location: LG11:61218678..61219364
RNA-Seq Expression: Tan0008142
Synteny: Tan0008142
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
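
The genome location above is written as `sequence:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span of the gene can be computed as follows. The helper names `parse_location` and `span_length` are illustrative and not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'LG11:61218678..61219364' style string into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seq_id, start, end = match.groups()
    return seq_id, int(start), int(end)


def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1


seq_id, start, end = parse_location("LG11:61218678..61219364")
print(seq_id, span_length(start, end))
```

Under that assumption the span is 687 bp, which equals the 228 residues of the listed protein plus one stop codon (228 * 3 + 3 = 687) and would be consistent with a gene model without introns.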


Homology
GenBank top hits (e-value, % identity, alignment)
XP_030478056.1 uncharacterized protein LOC115695105 [Cannabis sativa]; e-value 7.6e-30; identity 36.8%
Query:  MDSFGECIFICKLADAGYRGDKFTWSKSR-SIDATKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFDDTPIRSKKKVNITRFEENWIGNE
        M++F  C+  C L +  + GD FTWSK R ++   KERLD  F+N+ + N  N   + HL ++ SDH+ I+             K+ +  RFE+ W+ + 
Subjt:  MDSFGECIFICKLADAGYRGDKFTWSKSR-SIDATKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFDDTPIRSKKKVNITRFEENWIGNE

Query:  EARDLIRAHWL-SRECNTPNGLKENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSN--LPNPSNHAELIKAENELEELLLEEEKYWKIRSREDWL
        +  D+I  HWL S   +    +  NL S  + L+S   R+  G +K+ I+  ++++  L+N  + + S+  EL KAE  L+ELL +EE YW+ RSR DWL
Subjt:  EARDLIRAHWL-SRECNTPNGLKENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSN--LPNPSNHAELIKAENELEELLLEEEKYWKIRSREDWL

Query:  KWGDKNTKWFHHKANQRHKKNSIDKLVDPNG
          GD+NTK+FH KAN R   N I+ ++D  G
Subjt:  KWGDKNTKWFHHKANQRHKKNSIDKLVDPNG

XP_030487129.1 uncharacterized protein LOC115704047 [Cannabis sativa]; e-value 4.5e-30; identity 35.17%
Query:  MDSFGECIFICKLADAGYRGDKFTWSKSRSIDA-TKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFDD----TPIRSKKKVNITRFEENW
        +D+F E +  C L +  + G++FTW  + S  A  KERLD  F+N    + +    + HL F++SDH+ +LA +   +     P +S+      RFE+ W
Subjt:  MDSFGECIFICKLADAGYRGDKFTWSKSRSIDA-TKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFDD----TPIRSKKKVNITRFEENW

Query:  IGNEEARDLIRAHWLSRECNTPNGLK-ENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSNLPNPSNH---AELIKAENELEELLLEEEKYWKIRS
        +  ++  ++I  +W +   + P  L  +N++SC   L+SW R +  G L + I+   E++  L N  N ++H   ++L+++E  L++LL +EE YW  RS
Subjt:  IGNEEARDLIRAHWLSRECNTPNGLK-ENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSNLPNPSNH---AELIKAENELEELLLEEEKYWKIRS

Query:  REDWLKWGDKNTKWFHHKANQRHKKNSIDKLVDPNG
        R  WLK GD NTK+FH KA+ R   N I KL D NG
Subjt:  REDWLKWGDKNTKWFHHKANQRHKKNSIDKLVDPNG

XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa]; e-value 1.3e-29; identity 36.36%
Query:  MDSFGECIFICKLADAGYRGDKFTWSKSR-SIDATKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFDDTPIRSKKKVNITRFEENWIGNE
        M +F   +  C L +    GD+FTW+++R S  + KERLD  F+NH   +      ++HL ++ SDH+ +LA + F+ TP     +    RFE+ W+ ++
Subjt:  MDSFGECIFICKLADAGYRGDKFTWSKSR-SIDATKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFDDTPIRSKKKVNITRFEENWIGNE

Query:  EARDLIRAHWL-SRECNTPNGLKENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERI--LLLSNLPNPSNHAELIKAENELEELLLEEEKYWKIRSREDWL
        E  ++I   WL S E +  + L  +L  C   L  W  R+  G +K+ I+  ++ +  L  S   +P   A++  AE+ L+ELL  EE+YW+ RSR DWL
Subjt:  EARDLIRAHWL-SRECNTPNGLKENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERI--LLLSNLPNPSNHAELIKAENELEELLLEEEKYWKIRSREDWL

Query:  KWGDKNTKWFHHKANQRHKKNSIDKLVDPNG
        + GD+NTK+FH KA+ RH  N I  L D +G
Subjt:  KWGDKNTKWFHHKANQRHKKNSIDKLVDPNG

XP_030502555.1 uncharacterized protein LOC115717715 [Cannabis sativa]; e-value 3.1e-31; identity 35.34%
Query:  MDSFGECIFICKLADAGYRGDKFTWSKSR-SIDATKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFDDT-PIRSKKKVNITRFEENWIGN
        M +F   +  C LA+    GD+FTW+K+R  +   KERLD  F+NH   + +    +THL ++ SDH+ +LA++ F+ T P+++ +K    RFE+ W+ +
Subjt:  MDSFGECIFICKLADAGYRGDKFTWSKSR-SIDATKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFDDT-PIRSKKKVNITRFEENWIGN

Query:  EEARDLIRAHWL-SRECNTPNGLKENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSNLPN--PSNHAELIKAENELEELLLEEEKYWKIRSREDW
        +E  ++I + W  + + ++   L  +L+ C   L+ W  R+  G +K+ I+  ++ +  L+   N  P   +++  AE  L++LL  EE+YW+ RSR DW
Subjt:  EEARDLIRAHWL-SRECNTPNGLKENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSNLPN--PSNHAELIKAENELEELLLEEEKYWKIRSREDW

Query:  LKWGDKNTKWFHHKANQRHKKNSIDKLVDPNG
        L+ GD+NTK+FH KA+ RH  N I  L+D +G
Subjt:  LKWGDKNTKWFHHKANQRHKKNSIDKLVDPNG

XP_030933437.1 uncharacterized protein LOC115959237 [Quercus lobata]; e-value 1.4e-28; identity 37.07%
Query:  MDSFGECIFICKLADAGYRGDKFTWSKSR-SIDATKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFDDTPIR-SKKKVNITRFEENWIGN
        MD F   +  C LAD  + G+ FTW+  R  +D TKERLDR  ++    N+     +THL  H+ DH+PI+ H     T +R   K+ N  RFEE W+  
Subjt:  MDSFGECIFICKLADAGYRGDKFTWSKSR-SIDATKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFDDTPIR-SKKKVNITRFEENWIGN

Query:  EEARDLIRAHWLSRECNTPNGLK---ENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSNLPNP-SNHAELIKAENELEELLLEEEKYWKIRSRED
        ++   +I   W     N   GL+   E +  C  +L++W          E I+R ++R+ +LS   +   N AE++ A   L++LLL++E YW  RSR  
Subjt:  EEARDLIRAHWLSRECNTPNGLK---ENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSNLPNP-SNHAELIKAENELEELLLEEEKYWKIRSRED

Query:  WLKWGDKNTKWFHHKANQRHKKNSIDKLVDPN
        WL+ GDKNTK+FH KA+QR ++N I  + D N
Subjt:  WLKWGDKNTKWFHHKANQRHKKNSIDKLVDPN

TrEMBL top hits (e-value, % identity, alignment)
A0A803NG99 Uncharacterized protein; e-value 1.5e-31; identity 36.75%
Query:  MDSFGECIFICKLADAGYRGDKFTWSKS-RSIDATKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFD---DTPIRSKKKVNITRFEENWI
        M  F   +  C L +  Y GD FTW+K+ ++  A KERLD  F+N+  M+   + ++THL ++ SDH+ +L  + F    D P   +K+ +  RFEE W+
Subjt:  MDSFGECIFICKLADAGYRGDKFTWSKS-RSIDATKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFD---DTPIRSKKKVNITRFEENWI

Query:  GNEEARDLIRAHW-LSRECNTPNGLKENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSNLPNPSNH--AELIKAENELEELLLEEEKYWKIRSRE
           E  D+I + W L   C+    L + L SC + L+ W  R+  G +K  I++ ++ + +L+N          +++ AE  L++LL  EE+YWK RSR 
Subjt:  GNEEARDLIRAHW-LSRECNTPNGLKENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSNLPNPSNH--AELIKAENELEELLLEEEKYWKIRSRE

Query:  DWLKWGDKNTKWFHHKANQRHKKNSIDKLVDPNG
        DWLK GD+NTK+FH KA+ R   N I  L+D  G
Subjt:  DWLKWGDKNTKWFHHKANQRHKKNSIDKLVDPNG

A0A803NZC3 Uncharacterized protein; e-value 9.7e-31; identity 36.36%
Query:  MDSFGECIFICKLADAGYRGDKFTW-SKSRSIDATKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHL-CFDDTPIRSKKKVNITRFEENWIGN
        +D+F   +  C L +  + GD++TW +K       KERLD  F NH  ++     S+THL F  SDH+ +L  +   +D P++  K  +  RF++ W+  
Subjt:  MDSFGECIFICKLADAGYRGDKFTW-SKSRSIDATKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHL-CFDDTPIRSKKKVNITRFEENWIGN

Query:  EEARDLIRAHWLSRECNTPNGLKENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSNLPNPS--NHAELIKAENELEELLLEEEKYWKIRSREDWL
        ++  D+I  +W S   N  + + +NL+SC   L+ W R +  G L + IR  + ++  L+N+ + S  +  EL  +E  L+ELL +EE+YWK RSR  WL
Subjt:  EEARDLIRAHWLSRECNTPNGLKENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSNLPNPS--NHAELIKAENELEELLLEEEKYWKIRSREDWL

Query:  KWGDKNTKWFHHKANQRHKKNSIDKLVDPNG
        + GD NTK+FH KA+ R   NSI  L D +G
Subjt:  KWGDKNTKWFHHKANQRHKKNSIDKLVDPNG

A0A803P2K3 Uncharacterized protein; e-value 5.1e-32; identity 37.97%
Query:  MDSFGECIFICKLADAGYRGDKFTWSKSRSIDAT-KERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFDDTPIRSKKKVNIT----RFEENW
        MD F + +  C L +  Y GD FTW K R    T KERLD  F+N    +    ++  HL ++ SDH+ I    C +  P+ S  +  I     RFE+ W
Subjt:  MDSFGECIFICKLADAGYRGDKFTWSKSRSIDAT-KERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFDDTPIRSKKKVNIT----RFEENW

Query:  IGNEEARDLIRAHWLSRECNTPNG---LKENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSNLPN--PSNHAELIKAENELEELLLEEEKYWKIR
        +  EEA DLI+ +W  ++C++ N     K NLT C   L+ W R++  G+ K+ I   ++++  L+N+ +  P+   +L   EN L++LL +EE YW+ R
Subjt:  IGNEEARDLIRAHWLSRECNTPNG---LKENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSNLPN--PSNHAELIKAENELEELLLEEEKYWKIR

Query:  SREDWLKWGDKNTKWFHHKANQRHKKNSIDKLVDPNG
        SR DWL+ GD+NTK+FH  A+ R + NSI  L D NG
Subjt:  SREDWLKWGDKNTKWFHHKANQRHKKNSIDKLVDPNG

A0A803PNH7 Uncharacterized protein; e-value 5.7e-31; identity 36.36%
Query:  MDSFGECIFICKLADAGYRGDKFTWSKSRS-IDATKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFDDTPIRSKKKVNITRFEENWIGNE
        M++F + +  CKL +  + GD FTW K+RS +   K RLD  F+NH   N+     + HL + SSDH+ I A     ++ I+  ++ +  RFE+ W+ ++
Subjt:  MDSFGECIFICKLADAGYRGDKFTWSKSRS-IDATKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFDDTPIRSKKKVNITRFEENWIGNE

Query:  EARDLIRAHWLSRECNTP-NGLKENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSNLPN--PSNHAELIKAENELEELLLEEEKYWKIRSREDWL
        + +D+I A W S   + P + +  NL  C   L+ W   +  G++K  I   + ++  L+N P+  P+    L  +E+ L+ELL +EE YW+ RSR DWL
Subjt:  EARDLIRAHWLSRECNTP-NGLKENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSNLPN--PSNHAELIKAENELEELLLEEEKYWKIRSREDWL

Query:  KWGDKNTKWFHHKANQRHKKNSIDKLVDPNG
        + GD+NTK+FH KA+ R   N+I  L+D NG
Subjt:  KWGDKNTKWFHHKANQRHKKNSIDKLVDPNG

A0A803PW06 Uncharacterized protein; e-value 9.7e-31; identity 36.64%
Query:  MDSFGECIFICKLADAGYRGDKFTWSKSR-SIDATKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFDDTPIRSKKKVNITRFEENWIGNE
        MD+F   +  C L +  + GD FTW K R + D  KERLD  F+N         ++  HL ++ SDH+ +  H+   + P +   +    RFE+ W+  +
Subjt:  MDSFGECIFICKLADAGYRGDKFTWSKSR-SIDATKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFDDTPIRSKKKVNITRFEENWIGNE

Query:  EARDLIRAHWLSRECNTPNGL-KENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSNLPNPS--NHAELIKAENELEELLLEEEKYWKIRSREDWL
        EA  LI+ +W      T   L K NL  C   L+ W R++  G++K+ I + ++++  L+N  + S    AEL ++EN L++LL +EE YW+ RSR DWL
Subjt:  EARDLIRAHWLSRECNTPNGL-KENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSNLPNPS--NHAELIKAENELEELLLEEEKYWKIRSREDWL

Query:  KWGDKNTKWFHHKANQRHKKNSIDKLVDPNGT
        + GD+NTK+FH  A+ R K N+I  L D NG+
Subjt:  KWGDKNTKWFHHKANQRHKKNSIDKLVDPNGT

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGATTCTTTCGGCGAGTGCATTTTTATATGCAAGTTGGCTGATGCGGGGTACAGGGGAGATAAATTCACGTGGAGTAAGAGCAGAAGTATTGACGCAACAAAAGAGCG
CCTCGACAGGTACTTTTTAAACCATAGCATGATGAACCGGGTCAACAAGGTCAGTATCACTCATCTTTCTTTTCATTCGTCTGACCACAAACCCATTCTTGCTCACTTAT
GTTTTGATGATACTCCCATTAGATCCAAAAAAAAGGTTAACATAACTCGTTTTGAAGAAAACTGGATTGGTAATGAGGAAGCTAGGGACCTGATTAGAGCTCATTGGTTG
TCCAGAGAGTGCAACACTCCGAATGGTCTAAAGGAAAATCTTACCTCCTGCATCCAAAAATTGAAATCCTGGGATAGGCGTAGGCTTAAAGGTTCCTTAAAAGAGGCTAT
CAGAAGGAAAGAGGAGCGGATCCTTCTCCTTTCAAATCTCCCTAACCCGTCTAATCACGCTGAGCTTATTAAAGCTGAAAATGAGTTGGAGGAACTGCTTTTGGAGGAAG
AAAAATACTGGAAGATCAGATCTAGGGAAGATTGGCTCAAATGGGGGGATAAAAATACAAAGTGGTTCCATCACAAAGCCAACCAAAGGCATAAAAAGAACTCTATTGAT
AAGCTGGTTGACCCTAATGGAACCTAG
mRNA sequence
ATGGATTCTTTCGGCGAGTGCATTTTTATATGCAAGTTGGCTGATGCGGGGTACAGGGGAGATAAATTCACGTGGAGTAAGAGCAGAAGTATTGACGCAACAAAAGAGCG
CCTCGACAGGTACTTTTTAAACCATAGCATGATGAACCGGGTCAACAAGGTCAGTATCACTCATCTTTCTTTTCATTCGTCTGACCACAAACCCATTCTTGCTCACTTAT
GTTTTGATGATACTCCCATTAGATCCAAAAAAAAGGTTAACATAACTCGTTTTGAAGAAAACTGGATTGGTAATGAGGAAGCTAGGGACCTGATTAGAGCTCATTGGTTG
TCCAGAGAGTGCAACACTCCGAATGGTCTAAAGGAAAATCTTACCTCCTGCATCCAAAAATTGAAATCCTGGGATAGGCGTAGGCTTAAAGGTTCCTTAAAAGAGGCTAT
CAGAAGGAAAGAGGAGCGGATCCTTCTCCTTTCAAATCTCCCTAACCCGTCTAATCACGCTGAGCTTATTAAAGCTGAAAATGAGTTGGAGGAACTGCTTTTGGAGGAAG
AAAAATACTGGAAGATCAGATCTAGGGAAGATTGGCTCAAATGGGGGGATAAAAATACAAAGTGGTTCCATCACAAAGCCAACCAAAGGCATAAAAAGAACTCTATTGAT
AAGCTGGTTGACCCTAATGGAACCTAG
Protein sequence
MDSFGECIFICKLADAGYRGDKFTWSKSRSIDATKERLDRYFLNHSMMNRVNKVSITHLSFHSSDHKPILAHLCFDDTPIRSKKKVNITRFEENWIGNEEARDLIRAHWL
SRECNTPNGLKENLTSCIQKLKSWDRRRLKGSLKEAIRRKEERILLLSNLPNPSNHAELIKAENELEELLLEEEKYWKIRSREDWLKWGDKNTKWFHHKANQRHKKNSID
KLVDPNGT
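
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the function name `translate` is illustrative and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Final line of the CDS listed above.
print(translate("AAGCTGGTTGACCCTAATGGAACCTAG"))
```

Applied to the last CDS line, this yields `KLVDPNGT`, which matches the final line of the protein sequence; passing the concatenated CDS lines would check the whole record the same way.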