CuGenDBv2

Tan0021160 (gene) of Snake gourd v1 genome

Gene ID: Tan0021160
Organism: Trichosanthes anguina (Snake gourd v1)
Description: zf-RVT domain-containing protein
Genome location: LG05:3555205..3555769
RNA-Seq Expression: Tan0021160
Synteny: Tan0021160
Gene Ontology terms: NA
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain


Homology
GenBank top hits | e value | % identity | Alignment
KAF7845245.1 transcription factor SCREAM2-like isoform X1 [Senna tora] | 4.6e-04 | 27.63
Query:  SDHNETNT------LWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGVVTDSR---SGMQKTYGVRPGTLSSSLTTLVENNLTELVGGDQT---LIH
        + ++ TNT      +WK IWK+  +P  +S  W+    +LPT IN+  RGV  D+R    G+              L  L +N + ++ G   T    + 
Subjt:  SDHNETNT------LWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGVVTDSR---SGMQKTYGVRPGTLSSSLTTLVENNLTELVGGDQT---LIH

Query:  QISSSAQTSKH--------CWIPPLDGALKLNINASWCEILEKGGIGWIVRD
          S++ Q             W  P+   +K+N++AS  +   KGGIG +VR+
Subjt:  QISSSAQTSKH--------CWIPPLDGALKLNINASWCEILEKGGIGWIVRD

OMO90013.1 reverse transcriptase [Corchorus capsularis] | 4.4e-07 | 30.53
Query:  ETNTLWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGVVTDSRSGMQKTYGVRPGTLSSSLTTLVENNLTELVGGDQTLIHQ-ISSSAQTSKHCWIP
        E +  W+ +W S  +P  K  +W++INN+LPT   +  RG+           +      L ++   L+E    +LV   + L  +  S +   S H W P
Subjt:  ETNTLWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGVVTDSRSGMQKTYGVRPGTLSSSLTTLVENNLTELVGGDQTLIHQ-ISSSAQTSKHCWIP

Query:  PLDGALKLNINASWCEILEKGGIGWIVRDST
        P  G LKLN +AS+    E+ G+G + RD T
Subjt:  PLDGALKLNINASWCEILEKGGIGWIVRDST

TQE13975.1 hypothetical protein C1H46_000397 [Malus baccata] | 9.3e-05 | 46.43
Query:  TSPSDHNETNTLWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGVVTDSRSGM
        TS SD N    LWKS+WK+K  P  K  VWK   ++LPT +N+ K+GV  + R  M
Subjt:  TSPSDHNETNTLWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGVVTDSRSGM

XP_011470502.1 PREDICTED: uncharacterized protein LOC105353223 [Fragaria vesca subsp. vesca] | 4.2e-05 | 28.29
Query:  TSPSDHNETNTLWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGV---VTDSRSGMQKTYGVRPGTLSSSLTTLVENNLTELVGG-----------D
        +S S H    ++WK +WK    P  +   W+I+  VLPT   + K+GV   V     G  K  G+        +  L E +   L              D
Subjt:  TSPSDHNETNTLWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGV---VTDSRSGMQKTYGVRPGTLSSSLTTLVENNLTELVGG-----------D

Query:  QTLIHQISSSAQTSK-HC-WIPPLDGALKLNINASWCEILEKGGIGWIVRDS
        + + +   +  Q+ + H  W  P  G LK+NI+ S+ E+ E+GG+G +VRD+
Subjt:  QTLIHQISSSAQTSK-HC-WIPPLDGALKLNINASWCEILEKGGIGWIVRDS

XP_030963581.1 uncharacterized protein LOC115984707 [Quercus lobata] | 9.3e-05 | 27.16
Query:  SDHNETNTLWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGV-VTD--SRSGMQKTYGVRPGT------------LSSSLTTLVEN-----------
        S  +  N LW+ +W     PM +   WK+  NVLPT +N+ ++GV + D     GM+    +                L S +  L  N           
Subjt:  SDHNETNTLWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGV-VTD--SRSGMQKTYGVRPGT------------LSSSLTTLVEN-----------

Query:  NLTELVGGDQTLIHQISSSAQTSKHCWIPPLDGALKLNINASWCEILEKGGIGWIVRDSTEN
        NL  ++   + LI   S S   S+  WI P  G  K+N++ +  EI     +G ++RD+T N
Subjt:  NLTELVGGDQTLIHQISSSAQTSKHCWIPPLDGALKLNINASWCEILEKGGIGWIVRDSTEN

TrEMBL top hits | e value | % identity | Alignment
A0A1R3GC81 Reverse transcriptase | 6.5e-04 | 24.46
Query:  NTLWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGVVTDSRSGMQKTYGVR------PGTLSSSLTTLVENNLTELVGGDQTLIHQISSSAQTSKHC
        +++WK +W +   P  K  +W++I  +LP    + +RG+  +    +   Y  R       GT S +L + V +++ +++ G        ++  ++S+  
Subjt:  NTLWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGVVTDSRSGMQKTYGVR------PGTLSSSLTTLVENNLTELVGGDQTLIHQISSSAQTSKHC

Query:  WIPPLDGALKLNINASWCEILEKGGIGWIVRDSTENPIC
        W PP  G LK+N +A++     K G+G ++RD     +C
Subjt:  WIPPLDGALKLNINASWCEILEKGGIGWIVRDSTENPIC

A0A1R3J5A5 Reverse transcriptase | 2.2e-07 | 30.53
Query:  ETNTLWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGVVTDSRSGMQKTYGVRPGTLSSSLTTLVENNLTELVGGDQTLIHQ-ISSSAQTSKHCWIP
        E +  W+ +W S  +P  K  +W++INN+LPT   +  RG+           +      L ++   L+E    +LV   + L  +  S +   S H W P
Subjt:  ETNTLWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGVVTDSRSGMQKTYGVRPGTLSSSLTTLVENNLTELVGGDQTLIHQ-ISSSAQTSKHCWIP

Query:  PLDGALKLNINASWCEILEKGGIGWIVRDST
        P  G LKLN +AS+    E+ G+G + RD T
Subjt:  PLDGALKLNINASWCEILEKGGIGWIVRDST

A0A1Z5R427 Uncharacterized protein | 5.0e-04 | 26.98
Query:  LWKSIWKSKAIPMAKSCVWKIINNVLPTAI-NICKRGVVTDSRSGMQKTYGVRPGTLSSSLTTLVENNLTELVGGDQTLIHQISSSAQTSKHCWIPPLDG
        LW +IW     P  +   WK+  ++LPT + ++C   + T       ++ G        S   L      E+    Q++ H    +  T K  W PP++G
Subjt:  LWKSIWKSKAIPMAKSCVWKIINNVLPTAI-NICKRGVVTDSRSGMQKTYGVRPGTLSSSLTTLVENNLTELVGGDQTLIHQISSSAQTSKHCWIPPLDG

Query:  ALKLNINASWCEILEKGGIGWIVRDS
          KLN++ S+ E     G+G + RDS
Subjt:  ALKLNINASWCEILEKGGIGWIVRDS

A0A540L1B6 zf-RVT domain-containing protein | 6.5e-04 | 47.92
Query:  TSPSDHNETNTLWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGV
        TS SD N    LW+S+WK+K  P  K  VWK   ++LPT +N+ K+GV
Subjt:  TSPSDHNETNTLWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGV

A0A540NSN9 zf-RVT domain-containing protein | 4.5e-05 | 46.43
Query:  TSPSDHNETNTLWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGVVTDSRSGM
        TS SD N    LWKS+WK+K  P  K  VWK   ++LPT +N+ K+GV  + R  M
Subjt:  TSPSDHNETNTLWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGVVTDSRSGM

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGACGTCCCCCTCGGACCACAATGAAACCAATACCTTATGGAAATCAATCTGGAAGTCGAAGGCGATCCCCATGGCAAAATCTTGTGTTTGGAAAATTATTAATAATGT
CCTCCCTACTGCAATTAATATCTGTAAAAGGGGAGTGGTCACTGACAGTCGAAGTGGAATGCAAAAGACTTATGGGGTGCGACCGGGGACGCTCTCTAGTAGCCTTACAA
CATTAGTGGAGAATAATCTTACGGAACTCGTGGGAGGAGATCAAACTCTGATTCATCAGATATCTTCAAGTGCTCAAACTTCTAAGCATTGCTGGATCCCACCCCTTGAT
GGGGCCCTAAAGCTCAACATAAATGCGTCGTGGTGTGAGATATTAGAGAAAGGTGGCATTGGGTGGATTGTTCGCGACTCTACCGAAAACCCAATTTGTGTGGGGCTTTA
G
mRNA sequence
ATGACGTCCCCCTCGGACCACAATGAAACCAATACCTTATGGAAATCAATCTGGAAGTCGAAGGCGATCCCCATGGCAAAATCTTGTGTTTGGAAAATTATTAATAATGT
CCTCCCTACTGCAATTAATATCTGTAAAAGGGGAGTGGTCACTGACAGTCGAAGTGGAATGCAAAAGACTTATGGGGTGCGACCGGGGACGCTCTCTAGTAGCCTTACAA
CATTAGTGGAGAATAATCTTACGGAACTCGTGGGAGGAGATCAAACTCTGATTCATCAGATATCTTCAAGTGCTCAAACTTCTAAGCATTGCTGGATCCCACCCCTTGAT
GGGGCCCTAAAGCTCAACATAAATGCGTCGTGGTGTGAGATATTAGAGAAAGGTGGCATTGGGTGGATTGTTCGCGACTCTACCGAAAACCCAATTTGTGTGGGGCTTTA
G
Protein sequence
MTSPSDHNETNTLWKSIWKSKAIPMAKSCVWKIINNVLPTAINICKRGVVTDSRSGMQKTYGVRPGTLSSSLTTLVENNLTELVGGDQTLIHQISSSAQTSKHCWIPPLD
GALKLNINASWCEILEKGGIGWIVRDSTENPICVGL