; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020828 (gene) of Snake gourd v1 genome

Gene IDTan0020828
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCCHC-type domain-containing protein
Genome locationLG05:75218043..75220856
RNA-Seq ExpressionTan0020828
SyntenyTan0020828
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB66613.1 hypothetical protein L484_024909 [Morus notabilis]1.9e-1534.19Show/hide
Query:  ERQSKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSFKS---LLEVDA
        E +  R+G++ G+ +G+ + V Q +  +CWG  +R+RV +D+T+P++R +K+ + +    +W  + YEKLP FC  CG+ GH  R+C  +    + ++  
Subjt:  ERQSKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSFKS---LLEVDA

Query:  NQFGSGLRHPTSDGGVH
          +GS LR P   GG H
Subjt:  NQFGSGLRHPTSDGGVH

EXB66629.1 hypothetical protein L484_024925 [Morus notabilis]1.9e-1534.19Show/hide
Query:  ERQSKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSFKS---LLEVDA
        E +  R+G++ G+ +G+ + V Q +  +CWG  +R+RV +D+T+P++R +K+ + +    +W  + YEKLP FC  CG+ GH  R+C  +    + ++  
Subjt:  ERQSKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSFKS---LLEVDA

Query:  NQFGSGLRHPTSDGGVH
          +GS LR P   GG H
Subjt:  NQFGSGLRHPTSDGGVH

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]6.1e-1433.55Show/hide
Query:  SKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSFKSLLEVD----ANQ
        +K +    GN +G F+ VD  E    WG+S+RIRV +D+T+PL+R +K+ I       W P+ YE+LP FC  CG IGH   DC  + L   D     ++
Subjt:  SKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSFKSLLEVD----ANQ

Query:  FGSGLRHPTSDGGVH----GWSRPRSFDLRGRGRGGRGRGPSDSTTGLEEESKDD
        +G  LR   S  G      G S  R           + RG  ++   L E++  D
Subjt:  FGSGLRHPTSDGGVH----GWSRPRSFDLRGRGRGGRGRGPSDSTTGLEEESKDD

XP_024018029.1 uncharacterized protein LOC112090594 [Morus notabilis]2.5e-1534.19Show/hide
Query:  ERQSKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSFKS---LLEVDA
        E +  R+G++ G+ +G+ + V Q +  +CWG  +R+RV +D+T+P++R +K+ + +    +W  + YEKLP FC  CG+ GH  R+C  +    + ++  
Subjt:  ERQSKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSFKS---LLEVDA

Query:  NQFGSGLRHPTSDGGVH
          +GS LR P   GG H
Subjt:  NQFGSGLRHPTSDGGVH

XP_028124075.1 uncharacterized protein LOC114321128 [Camellia sinensis]4.2e-1532.61Show/hide
Query:  SKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDC----SFKSLLEVDANQ
        +K++GQI GN +G F+ +D  +    WG +MRIRV +DV +PL+R +K+ +  +   IW    YE+LP++C  CGR+GH DR+C    SF     VD+ Q
Subjt:  SKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDC----SFKSLLEVDANQ

Query:  FGSGLRHPTSDGGVHGWSRPRSFD--LRGRGRGGRGRGPSDSTTGLEEESKDDPKMLSEFNSRRQNHSDKKSDGSGVTRQPMGP
        +G+ LR         G  R  S D  + G   GG+G       T   + ++++   L    +R    +D++ D   ++ Q + P
Subjt:  FGSGLRHPTSDGGVHGWSRPRSFD--LRGRGRGGRGRGPSDSTTGLEEESKDDPKMLSEFNSRRQNHSDKKSDGSGVTRQPMGP

TrEMBL top hitse value%identityAlignment
A0A6J1DU55 uncharacterized protein LOC1110231353.0e-1433.55Show/hide
Query:  SKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSFKSLLEVD----ANQ
        +K +    GN +G F+ VD  E    WG+S+RIRV +D+T+PL+R +K+ I       W P+ YE+LP FC  CG IGH   DC  + L   D     ++
Subjt:  SKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSFKSLLEVD----ANQ

Query:  FGSGLRHPTSDGGVH----GWSRPRSFDLRGRGRGGRGRGPSDSTTGLEEESKDD
        +G  LR   S  G      G S  R           + RG  ++   L E++  D
Subjt:  FGSGLRHPTSDGGVH----GWSRPRSFDLRGRGRGGRGRGPSDSTTGLEEESKDD

A0A6P9EID8 uncharacterized protein LOC1183488561.5e-1337.38Show/hide
Query:  GQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSF---KSLLEVDANQFGSGL
        G++ GN++G   +VD  +SE  WG  +RIRVR+++++PL R + VK+ +  S +W    YE+LP FC  CG++GH  +DC     +   + D   FG  L
Subjt:  GQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSF---KSLLEVDANQFGSGL

Query:  RHPTSDG
           T+ G
Subjt:  RHPTSDG

A0A7N2MBB4 Uncharacterized protein2.5e-1339.64Show/hide
Query:  RQSKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSF----KSLLEVDA
        +Q++  G+  G  +G   KVD+ E     G  +R+RVR+D++EP+ R   V+I   TS  W  + YE+LP+FC  CG++ H DRDCS     K  L ++ 
Subjt:  RQSKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSF----KSLLEVDA

Query:  NQFGSGLRHPT
         QFG  LR  T
Subjt:  NQFGSGLRHPT

W9RBJ0 CCHC-type domain-containing protein9.2e-1634.19Show/hide
Query:  ERQSKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSFKS---LLEVDA
        E +  R+G++ G+ +G+ + V Q +  +CWG  +R+RV +D+T+P++R +K+ + +    +W  + YEKLP FC  CG+ GH  R+C  +    + ++  
Subjt:  ERQSKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSFKS---LLEVDA

Query:  NQFGSGLRHPTSDGGVH
          +GS LR P   GG H
Subjt:  NQFGSGLRHPTSDGGVH

W9RRS1 CCHC-type domain-containing protein9.2e-1634.19Show/hide
Query:  ERQSKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSFKS---LLEVDA
        E +  R+G++ G+ +G+ + V Q +  +CWG  +R+RV +D+T+P++R +K+ + +    +W  + YEKLP FC  CG+ GH  R+C  +    + ++  
Subjt:  ERQSKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSFKS---LLEVDA

Query:  NQFGSGLRHPTSDGGVH
          +GS LR P   GG H
Subjt:  NQFGSGLRHPTSDGGVH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G36228.1 nucleic acid binding;zinc ion binding6.5e-0630.23Show/hide
Query:  SKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSF
        S+R  +I  + LG  + +D  E      + +R++VRMD TEPL+   +V+ + +  +      YEKL   C  C R+ H    C +
Subjt:  SKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGRIGHMDRDCSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACAACACAGCTTGGGCTTATGGTGGAACAACTCCATTTGAAAGCGGAAGAAGATAGAGCGATTGTGGTGGGAGAAGAAGAGATTGAAGAAAGACAATCGAAGAG
GTTGGGGCAAATCTTTGGGAACCGGCTAGGGACGTTTTTAAAGGTGGATCAAAGAGAGTCGGAACAGTGTTGGGGTAGCTCCATGCGAATCAGAGTTCGTATGGACGTTA
CAGAGCCTCTGAAGCGCGACTTGAAAGTGAAGATATGGGAAGCGACATCTAAGATCTGGTGCCCTGTTACCTACGAGAAGTTACCTGTTTTTTGCAATCGATGTGGTCGT
ATTGGGCATATGGACCGTGATTGTAGCTTCAAATCATTGCTGGAAGTGGATGCCAATCAGTTTGGCTCTGGTCTTCGACATCCTACCTCCGACGGAGGGGTTCATGGCTG
GTCACGACCTAGATCTTTTGATTTGCGAGGAAGAGGCAGAGGCGGTCGAGGAAGAGGACCCTCAGATTCGACGACGGGTTTAGAGGAGGAGTCGAAAGACGATCCGAAAA
TGCTTTCTGAATTCAACTCGAGACGGCAGAATCACTCTGATAAAAAATCGGACGGTAGTGGGGTGACGAGGCAACCGATGGGACCGAAGGGGCCGACGGACGGTGGTTTG
TTTGACCGGATTAACAACAAAGATTTTCAATTAATCTTAAATGCCATTAACTTCTCGGATTCGGCGATTCAGAAAGAGGCTGTTTCAATGGGACGAAGCCTGGACAATTA
CAAGATGTTTTCGAAGGAACCTCGCCGAATTTTTACTTCGGTGAAGAGAGGCATTACGAAATCCCTGTCATCTGACCCAGACTCTAGAGAGAAGACGGGAGACACGACGG
ATTTGGCGTTGGGAGATATACAGTTTGGGCCTCCTTCTCATTTGTTTGGGCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAACAACACAGCTTGGGCTTATGGTGGAACAACTCCATTTGAAAGCGGAAGAAGATAGAGCGATTGTGGTGGGAGAAGAAGAGATTGAAGAAAGACAATCGAAGAG
GTTGGGGCAAATCTTTGGGAACCGGCTAGGGACGTTTTTAAAGGTGGATCAAAGAGAGTCGGAACAGTGTTGGGGTAGCTCCATGCGAATCAGAGTTCGTATGGACGTTA
CAGAGCCTCTGAAGCGCGACTTGAAAGTGAAGATATGGGAAGCGACATCTAAGATCTGGTGCCCTGTTACCTACGAGAAGTTACCTGTTTTTTGCAATCGATGTGGTCGT
ATTGGGCATATGGACCGTGATTGTAGCTTCAAATCATTGCTGGAAGTGGATGCCAATCAGTTTGGCTCTGGTCTTCGACATCCTACCTCCGACGGAGGGGTTCATGGCTG
GTCACGACCTAGATCTTTTGATTTGCGAGGAAGAGGCAGAGGCGGTCGAGGAAGAGGACCCTCAGATTCGACGACGGGTTTAGAGGAGGAGTCGAAAGACGATCCGAAAA
TGCTTTCTGAATTCAACTCGAGACGGCAGAATCACTCTGATAAAAAATCGGACGGTAGTGGGGTGACGAGGCAACCGATGGGACCGAAGGGGCCGACGGACGGTGGTTTG
TTTGACCGGATTAACAACAAAGATTTTCAATTAATCTTAAATGCCATTAACTTCTCGGATTCGGCGATTCAGAAAGAGGCTGTTTCAATGGGACGAAGCCTGGACAATTA
CAAGATGTTTTCGAAGGAACCTCGCCGAATTTTTACTTCGGTGAAGAGAGGCATTACGAAATCCCTGTCATCTGACCCAGACTCTAGAGAGAAGACGGGAGACACGACGG
ATTTGGCGTTGGGAGATATACAGTTTGGGCCTCCTTCTCATTTGTTTGGGCTTTGA
Protein sequenceShow/hide protein sequence
METTQLGLMVEQLHLKAEEDRAIVVGEEEIEERQSKRLGQIFGNRLGTFLKVDQRESEQCWGSSMRIRVRMDVTEPLKRDLKVKIWEATSKIWCPVTYEKLPVFCNRCGR
IGHMDRDCSFKSLLEVDANQFGSGLRHPTSDGGVHGWSRPRSFDLRGRGRGGRGRGPSDSTTGLEEESKDDPKMLSEFNSRRQNHSDKKSDGSGVTRQPMGPKGPTDGGL
FDRINNKDFQLILNAINFSDSAIQKEAVSMGRSLDNYKMFSKEPRRIFTSVKRGITKSLSSDPDSREKTGDTTDLALGDIQFGPPSHLFGL