CuGenDBv2

Tan0008552 (gene) of Snake gourd v1 genome

Gene ID: Tan0008552
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase domain-containing protein
Genome location: LG01:109515394..109516668
RNA-Seq Expression: Tan0008552
Synteny: Tan0008552
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR044730 - Ribonuclease H-like domain, plant type
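
The genome location in this record follows a seqid:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be parsed and the span measured as follows. The names Location and parse_location are hypothetical and introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Inclusive coordinates: both endpoints count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as 'LG01:109515394..109516668'."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("LG01:109515394..109516668")
print(loc.seqid, loc.span)
```

Under that assumption, the Tan0008552 locus spans 1275 bp on LG01.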


Homology
GenBank top hits: e value, % identity, alignment
GAY69209.1 hypothetical protein CUMW_270170 [Citrus unshiu] (e value: 5.5e-21; % identity: 29.87)
Query:  RWRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSVKSGYYLASS
        RW + +GK + V    W+ RP + +   P  +  D  V +LL  +G W+  ++++ F   D   ILKI  P     D+L+WH+ + G+YSVKSGY L + 
Subjt:  RWRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSVKSGYYLASS

Query:  LLEETSSLAEEQIQNWWKTLWHYSIPNKGKSA-IGVVVRNEKGEAMFTMTKQLPSLTEVDLVEAVAVREGMQLALEMGFKLLKVETDSTSVVHLLQQ---
           +           W+K     ++  +G  A +G ++RN +GE M T T + P L +V++ EA  + EG++LA ++    L VE+DS +V +  +    
Subjt:  LLEETSSLAEEQIQNWWKTLWHYSIPNKGKSA-IGVVVRNEKGEAMFTMTKQLPSLTEVDLVEAVAVREGMQLALEMGFKLLKVETDSTSVVHLLQQ---

Query:  KASNLPEMGNVIEEIKKIASRMVKCYLTWCN
         A  L +    +  +K    R  + + T+CN
Subjt:  KASNLPEMGNVIEEIKKIASRMVKCYLTWCN

TXG46769.1 hypothetical protein EZV62_026063 [Acer yangbiense] (e value: 4.9e-22; % identity: 43.08)
Query:  WRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSVKSGYYLASSL
        WRI DG SV + +D WL RP + K    + +     V+ LL  DG W+V +++  F  EDA  IL +PRP+   PD L+WHY K G YSV SGY+   +L
Subjt:  WRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSVKSGYYLASSL

Query:  LEETSSLAEEQIQNWWKTLWHYSIPNKGKS
            SS      ++WWK LWH  IP K K+
Subjt:  LEETSSLAEEQIQNWWKTLWHYSIPNKGKS

XP_015388020.1 uncharacterized protein LOC107177951 [Citrus sinensis] (e value: 1.1e-26; % identity: 29.9)
Query:  LRWRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSVKSGYYLAS
        LRWRI DGK V V Q  WL RP + K   P  + +D  V  L+  D  W    ++  FH+EDA QILKIP PR    D+++WHY K G YSVKSGY LA 
Subjt:  LRWRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSVKSGYYLAS

Query:  SLLEETSSLAEEQIQNWWKTLWHYSIPNKGKS--------------------------------------------------------------AIGVVV
         +       + ++ ++ W ++W+  IP K K                                                                +G+V+
Subjt:  SLLEETSSLAEEQIQNWWKTLWHYSIPNKGKS--------------------------------------------------------------AIGVVV

Query:  RNEKGEAMFTMTKQLPSLTEVDLVEAVAVREGMQLALEMGFKLLKVETDSTSVVHLLQQKASNLPEMGNVIEEIKKIASRMVKCYL----TWCNRRANTL
        RN +G+ +    K    L  +D  EA A R G+++A   G   L VETDS  V +L+  + S   E+  ++ E+++   R+    L      CN  A+TL
Subjt:  RNEKGEAMFTMTKQLPSLTEVDLVEAVAVREGMQLALEMGFKLLKVETDSTSVVHLLQQKASNLPEMGNVIEEIKKIASRMVKCYL----TWCNRRANTL

Query:  A
        A
Subjt:  A

XP_023923071.1 uncharacterized protein LOC112034488 [Quercus suber] (e value: 2.4e-24; % identity: 36.32)
Query:  WRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDV-KVESLLTVDGD-WDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSVKSGYYLAS
        WRI DGKSV +  D WL    S KI  P M    V KV SL+  D   W V  ++S F   +A  +  IP   T  P+ LIW +  +G YSVKSGY    
Subjt:  WRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDV-KVESLLTVDGD-WDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSVKSGYYLAS

Query:  SLLEETSSLA----EEQIQNWWKTLWHYSIPNKGK--SAIGVVVRNEKGEAMFTMTKQLPSLTEVDLVEAVAVREGMQLALEMGFKLLKVETDSTSVVHL
          LEE  +      +E ++  W+ +W+ S+P+K    + IGVV+RN  G+ M ++++Q+P  + V  VEA+A R+ + LA E+GF  + +E DS +++  
Subjt:  SLLEETSSLA----EEQIQNWWKTLWHYSIPNKGK--SAIGVVVRNEKGEAMFTMTKQLPSLTEVDLVEAVAVREGMQLALEMGFKLLKVETDSTSVVHL

Query:  LQQKASNLPEMGNVIEEIKKIAS
        L+ ++  L   G+++++I  +AS
Subjt:  LQQKASNLPEMGNVIEEIKKIAS

XP_024948107.1 uncharacterized protein LOC112495648 [Citrus sinensis] (e value: 2.1e-20; % identity: 32.7)
Query:  GQRAATGRLRWRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSV
        G++     LRWRI DGKS  + Q  W+ RPS+ K   P  + +D  V  L+  +  W   +++  F +E A QIL+IP+PRT  PD+ +WH+ K G Y+ 
Subjt:  GQRAATGRLRWRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSV

Query:  KSGYYLASSLL---EETSSLAEEQIQNWWKTLWHYSIPNKGKSAIGVVVRNEKGEAMFTMTKQL--PSLTEVDLVEAVAVREGMQLALEMGFKLLKVETD
        KSGY +A +     + TSS    + Q  W  +W   +P K K  +    RN    A+    +++      +    EA A+  G+++  + G   L VE+D
Subjt:  KSGYYLASSLL---EETSSLAEEQIQNWWKTLWHYSIPNKGKSAIGVVVRNEKGEAMFTMTKQL--PSLTEVDLVEAVAVREGMQLALEMGFKLLKVETD

Query:  STSVVHLLQQK
        S  +V L+  K
Subjt:  STSVVHLLQQK
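
In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, "+" for a conservative substitution, and a blank for a mismatch or gap. The sketch below counts those marks for the displayed blocks only. The function name midline_stats is hypothetical, and the result need not match the % identity reported for a hit, since this page does not say over which length that figure was computed.

```python
from typing import Iterable, Tuple


def midline_stats(blocks: Iterable[Tuple[str, str]]) -> Tuple[int, int, int]:
    """Count identities, positives and columns over (query, midline) blocks.

    Each block holds the residues of one 'Query:' line and the midline
    printed beneath it, with the 'Query:  ' label and indent removed.
    """
    identical = positives = columns = 0
    for query, midline in blocks:
        # Trailing blanks of the midline are often lost in copied text,
        # so pad it back to the width of the query line.
        midline = midline.ljust(len(query))[: len(query)]
        columns += len(query)
        for mark in midline:
            if mark.isalpha():      # identical residue
                identical += 1
                positives += 1
            elif mark == "+":       # conservative substitution
                positives += 1
    return identical, positives, columns


# First eight columns of the GAY69209.1 alignment, as shown above.
ident, pos, cols = midline_stats([("RWRIEDGK", "RW + +GK")])
print(f"{ident}/{cols} identical, {pos}/{cols} positive")
```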

TrEMBL top hits: e value, % identity, alignment
A0A803NGM9 Uncharacterized protein (e value: 2.2e-23; % identity: 39.19)
Query:  GQRAATGRLRWRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSV
        G++      RWR+ +G++V V++D WL RP S KI     +  ++ V  L   DG WD   V+S+F+ EDA  IL +P       DK++WHY K+G Y+V
Subjt:  GQRAATGRLRWRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSV

Query:  KSGYYLASSLLEETSSLAEEQIQNWWKTLWHYSIPNKGKSAIGVVVRN
        KSGY +A+SL  E     ++   +WWK LW   IP K K  +  +V N
Subjt:  KSGYYLASSLLEETSSLAEEQIQNWWKTLWHYSIPNKGKSAIGVVVRN

A0A803NPU7 Uncharacterized protein (e value: 6.3e-23; % identity: 27.2)
Query:  LRWRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSVKSGYYLAS
        LRW++ DG ++    D W+    + K     M S +  V S +T D  W++ ++ + F   D  +IL+IP       D+LIW++  +G Y+VKSGY LA+
Subjt:  LRWRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSVKSGYYLAS

Query:  SLLEETSSLAEEQIQNWWKTLWHYSIPNKGKSA--------------------------IGVVVRNEKGEAMFTMTKQLPSLTEVDLVEAVAVREGMQLA
        SL E+  +++  Q QNWW   W  ++P+K + A                           G ++RN  GE +   +K +      +++EA+++  G+Q  
Subjt:  SLLEETSSLAEEQIQNWWKTLWHYSIPNKGKSA--------------------------IGVVVRNEKGEAMFTMTKQLPSLTEVDLVEAVAVREGMQLA

Query:  LEMGFKLLKVETDSTSVVHLLQQKASNLPEMGNVIEEIKKIASRMVKCYLTWCNRRANTLA
        L+ G  +  +ETDS  V   LQ  ++ + +   ++ +I  + S      +    R AN+ A
Subjt:  LEMGFKLLKVETDSTSVVHLLQQKASNLPEMGNVIEEIKKIASRMVKCYLTWCNRRANTLA

A0A803PEK8 Uncharacterized protein (e value: 1.7e-23; % identity: 39.86)
Query:  GQRAATGRLRWRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSV
        G++      RWR+ +G++V V++D WL RP + KI     +  ++ V  L   DG WD   V+S+F+ EDA  IL  P       DK++WHY K+G Y+V
Subjt:  GQRAATGRLRWRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSV

Query:  KSGYYLASSLLEETSSLAEEQIQNWWKTLWHYSIPNKGKSAIGVVVRN
        KSGY +ASSL  E     ++   +WWKTLWH  IP K K  +  +  N
Subjt:  KSGYYLASSLLEETSSLAEEQIQNWWKTLWHYSIPNKGKSAIGVVVRN

A0A803Q0L5 Uncharacterized protein (e value: 6.3e-23; % identity: 36.49)
Query:  GQRAATGRLRWRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSV
        G++      RWR+ +G++V V++D WL RP + KI     +  ++ V  +   DG WD   ++ +F+ +DA  IL +P       DK++WHY K+G Y+V
Subjt:  GQRAATGRLRWRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSV

Query:  KSGYYLASSLLEETSSLAEEQIQNWWKTLWHYSIPNKGKSAIGVVVRN
        KSGY +ASSL+EE     ++ + +WWK LW + IP K K  +  +  N
Subjt:  KSGYYLASSLLEETSSLAEEQIQNWWKTLWHYSIPNKGKSAIGVVVRN

A0A803Q1K6 Uncharacterized protein (e value: 8.2e-23; % identity: 39.86)
Query:  GQRAATGRLRWRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSV
        G++      RWR+ +G+SV V++D WL RP + K+     +  ++ V  L   DG WD   ++SIF+  DA  IL IP       DK++WHY K+G YSV
Subjt:  GQRAATGRLRWRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSV

Query:  KSGYYLASSLLEETSSLAEEQIQNWWKTLWHYSIPNKGKSAIGVVVRN
        KSGY +A+SL  E     E  I  WWK LW  + P K K  +  V  N
Subjt:  KSGYYLASSLLEETSSLAEEQIQNWWKTLWHYSIPNKGKSAIGVVVRN

SwissProt top hits: e value, % identity, alignment
No hits found

Arabidopsis top hits: e value, % identity, alignment
AT3G09510.1 Ribonuclease H-like superfamily protein (e value: 1.7e-04; % identity: 28.85)
Query:  RWRIEDGKSVHV----VQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGD---WDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSVKS
        R  I DG+++ +    + D    RP + + TY  M      + +L    G    WD + +     + D   I +I   ++  PDK+IW+Y   G Y+V+S
Subjt:  RWRIEDGKSVHV----VQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGD---WDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSVKS

Query:  GYYL
        GY+L
Subjt:  GYYL

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 6.5e-04; % identity: 26.67)
Query:  KSAIGVVVRNEKGEAMFTMTKQLPSLTEVDLVEAVAVREGMQLALEMGFKLLKVETDSTSVVHLLQQKASNLPEMGNVIEEIKKIASRMVKCYLTWCNRR
        +  IG V+RNEKGE  +   + LP L  V   E  A+R  +       +  +  E+DS  ++ +L       P +   I++++++ S+  +    +  R 
Subjt:  KSAIGVVVRNEKGEAMFTMTKQLPSLTEVDLVEAVAVREGMQLALEMGFKLLKVETDSTSVVHLLQQKASNLPEMGNVIEEIKKIASRMVKCYLTWCNRR

Query:  ANTLA
         NTLA
Subjt:  ANTLA


Sequences
CDS sequence
ATGGGGCAGAGAGCTGCTACAGGAAGGCTAAGGTGGAGAATTGAAGATGGGAAGAGTGTGCATGTAGTTCAAGATATATGGCTCTACAGGCCATCCTCACTGAAAATTAC
CTATCCTTCGATGGTGTCTATGGATGTTAAGGTGGAGAGTTTGTTGACTGTTGATGGTGATTGGGATGTGACCGTAGTTAAATCTATATTTCATGAGGAGGATGCCTATC
AAATTCTCAAAATTCCAAGGCCTAGAACTATATGTCCTGATAAACTGATTTGGCACTACTTGAAAGATGGGATCTATTCTGTCAAGTCAGGATATTATCTAGCAAGCTCT
CTTCTAGAAGAAACATCAAGCTTGGCAGAGGAACAAATTCAGAATTGGTGGAAGACACTTTGGCATTATTCTATTCCGAATAAGGGAAAAAGTGCAATTGGGGTAGTTGT
TCGAAATGAGAAAGGGGAAGCGATGTTTACCATGACTAAGCAACTTCCTTCACTTACGGAAGTTGATTTGGTTGAAGCAGTTGCGGTTCGTGAAGGGATGCAACTTGCAC
TAGAAATGGGCTTCAAGCTTTTGAAGGTTGAAACGGATTCAACATCAGTCGTCCATCTCCTTCAACAAAAAGCTTCAAATCTTCCAGAAATGGGTAATGTGATTGAAGAA
ATCAAGAAGATAGCGTCGAGAATGGTGAAATGCTACTTAACCTGGTGCAATCGGAGAGCCAATACGTTAGCACTCTTTGGCGAAACATGCACTCGTTGA
mRNA sequence
ATGGGGCAGAGAGCTGCTACAGGAAGGCTAAGGTGGAGAATTGAAGATGGGAAGAGTGTGCATGTAGTTCAAGATATATGGCTCTACAGGCCATCCTCACTGAAAATTAC
CTATCCTTCGATGGTGTCTATGGATGTTAAGGTGGAGAGTTTGTTGACTGTTGATGGTGATTGGGATGTGACCGTAGTTAAATCTATATTTCATGAGGAGGATGCCTATC
AAATTCTCAAAATTCCAAGGCCTAGAACTATATGTCCTGATAAACTGATTTGGCACTACTTGAAAGATGGGATCTATTCTGTCAAGTCAGGATATTATCTAGCAAGCTCT
CTTCTAGAAGAAACATCAAGCTTGGCAGAGGAACAAATTCAGAATTGGTGGAAGACACTTTGGCATTATTCTATTCCGAATAAGGGAAAAAGTGCAATTGGGGTAGTTGT
TCGAAATGAGAAAGGGGAAGCGATGTTTACCATGACTAAGCAACTTCCTTCACTTACGGAAGTTGATTTGGTTGAAGCAGTTGCGGTTCGTGAAGGGATGCAACTTGCAC
TAGAAATGGGCTTCAAGCTTTTGAAGGTTGAAACGGATTCAACATCAGTCGTCCATCTCCTTCAACAAAAAGCTTCAAATCTTCCAGAAATGGGTAATGTGATTGAAGAA
ATCAAGAAGATAGCGTCGAGAATGGTGAAATGCTACTTAACCTGGTGCAATCGGAGAGCCAATACGTTAGCACTCTTTGGCGAAACATGCACTCGTTGA
Protein sequence
MGQRAATGRLRWRIEDGKSVHVVQDIWLYRPSSLKITYPSMVSMDVKVESLLTVDGDWDVTVVKSIFHEEDAYQILKIPRPRTICPDKLIWHYLKDGIYSVKSGYYLASS
LLEETSSLAEEQIQNWWKTLWHYSIPNKGKSAIGVVVRNEKGEAMFTMTKQLPSLTEVDLVEAVAVREGMQLALEMGFKLLKVETDSTSVVHLLQQKASNLPEMGNVIEE
IKKIASRMVKCYLTWCNRRANTLALFGETCTR
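
The CDS and protein sequences in this record can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly. The translate helper is a hypothetical name, not part of any CuGenDB tooling.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (ASSUMED: NCBI translation table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence can
    be pasted as is. Codons with letters other than A, C, G, T raise
    KeyError.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":          # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First nine codons of the Tan0008552 CDS.
print(translate("ATGGGGCAGAGAGCTGCTACAGGAAGG"))  # MGQRAATGR
```

The listed CDS is 759 nt long, that is 252 codons plus a TGA stop codon, which agrees with the 252-residue protein sequence.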