CuGenDBv2

Tan0003692 (gene) of Snake gourd v1 genome

Gene ID: Tan0003692
Organism: Trichosanthes anguina (Snake gourd v1)
Description: CCHC-type domain-containing protein
Genome location: LG02:14768966..14769763
RNA-Seq Expression: Tan0003692
Synteny: Tan0003692
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001878 - Zinc finger, CCHC-type
    IPR025836 - Zinc knuckle CX2CX4HX4C
    IPR040256 - Uncharacterized protein At4g02000-like
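
The InterPro entry IPR025836 names the zinc knuckle by its spacing pattern, CX2CX4HX4C. As a rough illustration, the sketch below reads that name literally as a regular expression (Cys, any 2 residues, Cys, any 4, His, any 4, Cys) and scans the protein sequence listed on this page. The function name `find_zinc_knuckles` is hypothetical, and a plain regex match is only a quick sanity check, not how InterPro assigns domains.

```python
import re

# Protein sequence copied from the "Protein sequence" field of this page.
PROTEIN = (
    "MEVDSLIPKRNNLTLTKEEEKVTYLREFVKTEKPVDLSAVLRRLGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFC"
    "GLLGHSQKECTLDTVSKEKPKGPSYKSGLRVTEFKKQGDNKKFNRKPETNSP"
)

# Literal reading of the name "CX2CX4HX4C" (an assumption, not InterPro's model).
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")


def find_zinc_knuckles(seq):
    """Return (1-based start, 1-based end, matched text) for each non-overlapping match."""
    return [(m.start() + 1, m.end(), m.group()) for m in ZINC_KNUCKLE.finditer(seq)]


hits = find_zinc_knuckles(PROTEIN)
for start, end, text in hits:
    print(start, end, text)
```

On this sequence the pattern matches once, at residues 107 to 120 (CFFCGLLGHSQKEC), which lies in the region covered by the alignments in the Homology section.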


Homology
GenBank top hits (columns: hit | e value | % identity; alignment shown below each hit)
TXG51551.1 hypothetical protein EZV62_024075 [Acer yangbiense] | 1.9e-14 | 48.1
Query:  LGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECTL
        LGQ  GEV+ +D    G  +G++ RVRV ID+++PL+R L++ LD++G +  +  RYEKL   CF CGLLGH+ +EC L
Subjt:  LGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECTL

TXG69574.1 hypothetical protein EZV62_004509 [Acer yangbiense] | 8.4e-15 | 41.58
Query:  LGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECT--LDTVSKEKPKGPSYKSGLRV
        +GQ  GEV+ +D    G  +G++ R+RV ID++ PL+R L+++LDD G +  +  RYEK+   CF CGLLGH  KEC+  L  + KE  K   + + LR 
Subjt:  LGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECT--LDTVSKEKPKGPSYKSGLRV

Query:  T
        +
Subjt:  T

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia] | 2.4e-14 | 48.72
Query:  RLGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKEC
        RLG   G  + VD +  G  WG   R+RV IDIT+PLRRG+KI +D      WIP +YE+L   C+FCG++GHS  +C
Subjt:  RLGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKEC

XP_030957736.1 uncharacterized protein LOC115979822 [Quercus lobata] | 1.4e-14 | 43.48
Query:  LGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECTLD--TVSKEKPKGPSYKSGLRV
        +G + GEVL+VD +  G  WG+  RVRVKID+T+ L RG KIK+ +EGVDRW+  +YE+L   C+ CGLL H  KEC  +    ++ K +   Y + +R 
Subjt:  LGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECTLD--TVSKEKPKGPSYKSGLRV

Query:  TEFKKQGDNKKFNRK
           KK G +  F +K
Subjt:  TEFKKQGDNKKFNRK

XP_031127667.1 uncharacterized protein LOC116029767 [Ipomoea triloba] | 1.1e-14 | 41.75
Query:  SAVLRRLGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECTLDTVSKEKPKGPSYKS
        +A ++ +G   G  ++VD +  GG W  F R+RV + +T+PL+R +K++L D G   W+  +YE+L+T CF CGLLGHS K C    +   +PK   Y S
Subjt:  SAVLRRLGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECTLDTVSKEKPKGPSYKS

Query:  GLR
        GLR
Subjt:  GLR
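
The % identity values can be cross-checked against the alignment midlines. The sketch below assumes BLAST-style conventions (a letter in the midline marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and takes the alignment length to be the length of the query line, gap characters included. The helper name `percent_identity` is hypothetical; the two strings are the query and midline of the first GenBank hit (TXG51551.1) with the `Query:` prefix and its padding removed.

```python
def percent_identity(query, midline):
    """Percent of alignment columns whose midline character is a letter.

    Assumes the alignment length equals len(query), counting any '-' gap
    characters, and that only identical residues appear as letters in the
    midline.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# First GenBank hit on this page (TXG51551.1).
QUERY = "LGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECTL"
MIDLINE = "LGQ  GEV+ +D    G  +G++ RVRV ID+++PL+R L++ LD++G +  +  RYEKL   CF CGLLGH+ +EC L"

identity = percent_identity(QUERY, MIDLINE)
print(f"{identity:.2f}")
```

For this hit the midline carries 38 identical residues over 79 columns, which agrees with the 48.1 reported in the table. Alignments that wrap over several lines would need their query and midline segments concatenated first.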

TrEMBL top hits (columns: hit | e value | % identity; alignment shown below each hit)
A0A5C7H3R3 CCHC-type domain-containing protein | 9.0e-15 | 48.1
Query:  LGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECTL
        LGQ  GEV+ +D    G  +G++ RVRV ID+++PL+R L++ LD++G +  +  RYEKL   CF CGLLGH+ +EC L
Subjt:  LGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECTL

A0A5C7IL40 CCHC-type domain-containing protein | 4.1e-15 | 41.58
Query:  LGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECT--LDTVSKEKPKGPSYKSGLRV
        +GQ  GEV+ +D    G  +G++ R+RV ID++ PL+R L+++LDD G +  +  RYEK+   CF CGLLGH  KEC+  L  + KE  K   + + LR 
Subjt:  LGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECT--LDTVSKEKPKGPSYKSGLRV

Query:  T
        +
Subjt:  T

A0A5C7IT66 CCHC-type domain-containing protein | 5.9e-14 | 42
Query:  LGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECTLD-TVSKEKPKGPSYKSGLRVT
        LG   GEV +VD  P+G   G+F RVRV +++  PLRR L++ +  +G +  +P +YE+L + CF CGL+GHS +ECT        K  G  Y + LR T
Subjt:  LGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECTLD-TVSKEKPKGPSYKSGLRVT

A0A6J1DU55 uncharacterized protein LOC111023135 | 1.2e-14 | 48.72
Query:  RLGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKEC
        RLG   G  + VD +  G  WG   R+RV IDIT+PLRRG+KI +D      WIP +YE+L   C+FCG++GHS  +C
Subjt:  RLGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKEC

A0A803N338 Uncharacterized protein | 2.6e-14 | 46.24
Query:  SAVLRRLGQTSGEVLQVDTS-PMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECTLDTVSKEK
        S   R +G   GE L+VD S P+G  W  + RV+V ++IT+PLRRGLK+ + + GV +WI  +YE+L   C+FCG++GH+ K+C    V KEK
Subjt:  SAVLRRLGQTSGEVLQVDTS-PMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECTLDTVSKEK

SwissProt top hits (columns: hit | e value | % identity; alignment shown below each hit)
No hits found
Arabidopsis top hits (columns: hit | e value | % identity; alignment shown below each hit)
AT2G01050.1 zinc ion binding;nucleic acid binding | 3.4e-06 | 31.82
Query:  VLRRLGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECTLDTVSK
        +L  + +  G  L+VD + +    GRFARV +++++ +PL+  + I  D   V       YE LS +C  CG+ GH    C  + V K
Subjt:  VLRRLGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECTLDTVSK

AT5G36228.1 nucleic acid binding;zinc ion binding | 9.3e-04 | 27.78
Query:  LGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECTLDTVSKEKPKGP
        +  T GEV+ +D +        F RV+V++D T+PLR   +++         I   YEKL  +C  C  + H    C      +E    P
Subjt:  LGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFCGLLGHSQKECTLDTVSKEKPKGP


Sequences
CDS sequence
ATGGAAGTTGACTCGTTGATCCCTAAGCGGAATAACCTGACATTGACAAAGGAAGAAGAAAAAGTTACTTATTTAAGGGAGTTCGTGAAAACGGAAAAGCCAGTGGACCT
ATCGGCAGTTTTGAGGAGACTAGGGCAAACATCAGGGGAAGTGCTACAGGTTGATACATCTCCTATGGGGGGAATATGGGGGAGATTTGCTAGGGTTAGGGTGAAAATTG
ACATAACGCAACCATTGAGAAGAGGGTTGAAAATCAAATTAGATGATGAGGGTGTTGATAGATGGATACCTTGTCGCTATGAAAAACTCTCAACCCTTTGCTTCTTTTGT
GGCCTACTTGGACATTCACAAAAAGAGTGTACTCTTGATACTGTTTCTAAAGAAAAACCTAAAGGGCCAAGCTATAAATCAGGGTTGAGGGTAACTGAATTTAAAAAACA
AGGAGACAACAAGAAATTCAATCGAAAACCTGAGACTAATTCCCCCTAA
mRNA sequence
ATGGAAGTTGACTCGTTGATCCCTAAGCGGAATAACCTGACATTGACAAAGGAAGAAGAAAAAGTTACTTATTTAAGGGAGTTCGTGAAAACGGAAAAGCCAGTGGACCT
ATCGGCAGTTTTGAGGAGACTAGGGCAAACATCAGGGGAAGTGCTACAGGTTGATACATCTCCTATGGGGGGAATATGGGGGAGATTTGCTAGGGTTAGGGTGAAAATTG
ACATAACGCAACCATTGAGAAGAGGGTTGAAAATCAAATTAGATGATGAGGGTGTTGATAGATGGATACCTTGTCGCTATGAAAAACTCTCAACCCTTTGCTTCTTTTGT
GGCCTACTTGGACATTCACAAAAAGAGTGTACTCTTGATACTGTTTCTAAAGAAAAACCTAAAGGGCCAAGCTATAAATCAGGGTTGAGGGTAACTGAATTTAAAAAACA
AGGAGACAACAAGAAATTCAATCGAAAACCTGAGACTAATTCCCCCTAA
Protein sequence
MEVDSLIPKRNNLTLTKEEEKVTYLREFVKTEKPVDLSAVLRRLGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFC
GLLGHSQKECTLDTVSKEKPKGPSYKSGLRVTEFKKQGDNKKFNRKPETNSP
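
The fields on this page can be checked against each other. The sketch below translates the CDS with the standard genetic code and compares the result with the listed protein, then parses the genome location to get the length of the genomic span. The helper names `translate` and `parse_location` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Sequences copied from the "CDS sequence" and "Protein sequence" fields.
CDS = (
    "ATGGAAGTTGACTCGTTGATCCCTAAGCGGAATAACCTGACATTGACAAAGGAAGAAGAAAAAGTTACTTATTTAAGGGAGTTCGTGAAAACGGAAAAGCCAGTGGACCT"
    "ATCGGCAGTTTTGAGGAGACTAGGGCAAACATCAGGGGAAGTGCTACAGGTTGATACATCTCCTATGGGGGGAATATGGGGGAGATTTGCTAGGGTTAGGGTGAAAATTG"
    "ACATAACGCAACCATTGAGAAGAGGGTTGAAAATCAAATTAGATGATGAGGGTGTTGATAGATGGATACCTTGTCGCTATGAAAAACTCTCAACCCTTTGCTTCTTTTGT"
    "GGCCTACTTGGACATTCACAAAAAGAGTGTACTCTTGATACTGTTTCTAAAGAAAAACCTAAAGGGCCAAGCTATAAATCAGGGTTGAGGGTAACTGAATTTAAAAAACA"
    "AGGAGACAACAAGAAATTCAATCGAAAACCTGAGACTAATTCCCCCTAA"
)
PROTEIN = (
    "MEVDSLIPKRNNLTLTKEEEKVTYLREFVKTEKPVDLSAVLRRLGQTSGEVLQVDTSPMGGIWGRFARVRVKIDITQPLRRGLKIKLDDEGVDRWIPCRYEKLSTLCFFC"
    "GLLGHSQKECTLDTVSKEKPKGPSYKSGLRVTEFKKQGDNKKFNRKPETNSP"
)


def translate(cds):
    """Translate an in-frame CDS; a stop codon is rendered as '*'."""
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def parse_location(loc):
    """Split 'LG02:14768966..14769763' into (sequence id, start, end)."""
    seqid, span = loc.split(":")
    start, end = (int(x) for x in span.split(".."))
    return seqid, start, end


translated = translate(CDS)
seqid, start, end = parse_location("LG02:14768966..14769763")
span_length = end - start + 1  # assumes 1-based inclusive coordinates

print(translated == PROTEIN + "*")  # CDS and protein fields agree
print(seqid, span_length, len(CDS))
```

Under these assumptions the translation reproduces the listed protein followed by a stop codon, and the genomic span (798 bp) is longer than the 489 bp CDS. The difference could reflect intronic sequence, but the page does not give the gene structure, so that remains a guess.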