; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016944 (gene) of Snake gourd v1 genome

Gene IDTan0016944
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCCHC-type domain-containing protein
Genome locationLG10:24142880..24144122
RNA-Seq ExpressionTan0016944
SyntenyTan0016944
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG47986.1 hypothetical protein EZV62_027280 [Acer yangbiense]2.8e-3037.71Show/hide
Query:  NSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKSGK
        +  PW FD SLL+L+       +A++ F   DFW+QI NA + CMT EM + +G LIG +++ID      +   ++R ++ +D++ P  R ++L+++ GK
Subjt:  NSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKSGK

Query:  EVWCPIKYEKLPDFCFECGIIGHSYRMCPSSK-----VTKEANLKYEFGDWLRASITKKPEGKSGIYTSRNSNDK
        E    ++YEKLP++CF CGI+GHSY+ C   +     VTKE    +EFG W+RAS        +G+Y  R    K
Subjt:  EVWCPIKYEKLPDFCFECGIIGHSYRMCPSSK-----VTKEANLKYEFGDWLRASITKKPEGKSGIYTSRNSNDK

TXG53482.1 hypothetical protein EZV62_022651 [Acer yangbiense]1.8e-2940.22Show/hide
Query:  SSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKSG-K
        S PW FD SL++L+       I  + F   DFWVQI N PM CMT E+A+ LG +IGEV E+D       +  F+R RVA+DIT P  R + + +    +
Subjt:  SSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKSG-K

Query:  EVWCPIKYEKLPDFCFECGIIGHSYRMCPSSKVTKEANLK-YEFGDWLRASITKKPEGKSGIYTSRNSNDKLKSELTLE
        E   PI+YE+LP FCF CG++GH+   CP        N K + +G W+RA+I  KP G        NS +  +    LE
Subjt:  EVWCPIKYEKLPDFCFECGIIGHSYRMCPSSKVTKEANLK-YEFGDWLRASITKKPEGKSGIYTSRNSNDKLKSELTLE

TXG60547.1 hypothetical protein EZV62_015120 [Acer yangbiense]9.7e-3143.93Show/hide
Query:  PWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQI-KSGKEV
        PW FDKSLL+L+  E   +I+ M F   DFWVQIHN P+ CM   MAK L   IGEV EI  +  + W + F R +V IDI+ P  R ++L + +SG  V
Subjt:  PWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQI-KSGKEV

Query:  WCPIKYEKLPDFCFECGIIGHSYRMCPSSKVTKEA--NLKYEFGDWLRASITKKPEGKSGIYTSRNSNDKLKS
           +KYE+LP+FCF CG IGH    CP  +  KEA       FG W+RA I++K + +S    S  S+ + +S
Subjt:  WCPIKYEKLPDFCFECGIIGHSYRMCPSSKVTKEA--NLKYEFGDWLRASITKKPEGKSGIYTSRNSNDKLKS

TXG69574.1 hypothetical protein EZV62_004509 [Acer yangbiense]4.8e-3041.18Show/hide
Query:  NSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKS-G
        +  PW FD  L++L+ ++   +I +M F    FW+QIHNAP+ CMT EM + +G LIGEV +ID          ++R RV ID+++P  R +++++   G
Subjt:  NSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKS-G

Query:  KEVWCPIKYEKLPDFCFECGIIGHSYRMCPS--SKVTKEANLKYEFGDWLRAS
         E    I+YEK+P FCF+CG++GH  + C      + KE N ++EFG WLRAS
Subjt:  KEVWCPIKYEKLPDFCFECGIIGHSYRMCPS--SKVTKEANLKYEFGDWLRAS

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]3.2e-4249.68Show/hide
Query:  MNSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKSG
        ++S PW F+KSLL+L +    +Q  +M F    FW+QIHN P  C++ EMA +LG  +G+VEEI+ DG + W  PF+R RV ID++ P  RGIKL+   G
Subjt:  MNSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKSG

Query:  KEVWCPIKYEKLPDFCFECGIIGHSYRMCPSSKVTKEANLKYEFGDWLRASITKK
        K++WCP++YEKLPDFC+ECG IGHS R C         N   ++GDWLRA++ KK
Subjt:  KEVWCPIKYEKLPDFCFECGIIGHSYRMCPSSKVTKEANLKYEFGDWLRASITKK

TrEMBL top hitse value%identityAlignment
A0A5C7GV57 Uncharacterized protein1.4e-3037.71Show/hide
Query:  NSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKSGK
        +  PW FD SLL+L+       +A++ F   DFW+QI NA + CMT EM + +G LIG +++ID      +   ++R ++ +D++ P  R ++L+++ GK
Subjt:  NSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKSGK

Query:  EVWCPIKYEKLPDFCFECGIIGHSYRMCPSSK-----VTKEANLKYEFGDWLRASITKKPEGKSGIYTSRNSNDK
        E    ++YEKLP++CF CGI+GHSY+ C   +     VTKE    +EFG W+RAS        +G+Y  R    K
Subjt:  EVWCPIKYEKLPDFCFECGIIGHSYRMCPSSK-----VTKEANLKYEFGDWLRASITKKPEGKSGIYTSRNSNDK

A0A5C7HA98 CCHC-type domain-containing protein8.9e-3040.22Show/hide
Query:  SSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKSG-K
        S PW FD SL++L+       I  + F   DFWVQI N PM CMT E+A+ LG +IGEV E+D       +  F+R RVA+DIT P  R + + +    +
Subjt:  SSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKSG-K

Query:  EVWCPIKYEKLPDFCFECGIIGHSYRMCPSSKVTKEANLK-YEFGDWLRASITKKPEGKSGIYTSRNSNDKLKSELTLE
        E   PI+YE+LP FCF CG++GH+   CP        N K + +G W+RA+I  KP G        NS +  +    LE
Subjt:  EVWCPIKYEKLPDFCFECGIIGHSYRMCPSSKVTKEANLK-YEFGDWLRASITKKPEGKSGIYTSRNSNDKLKSELTLE

A0A5C7HTV0 CCHC-type domain-containing protein4.7e-3143.93Show/hide
Query:  PWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQI-KSGKEV
        PW FDKSLL+L+  E   +I+ M F   DFWVQIHN P+ CM   MAK L   IGEV EI  +  + W + F R +V IDI+ P  R ++L + +SG  V
Subjt:  PWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQI-KSGKEV

Query:  WCPIKYEKLPDFCFECGIIGHSYRMCPSSKVTKEA--NLKYEFGDWLRASITKKPEGKSGIYTSRNSNDKLKS
           +KYE+LP+FCF CG IGH    CP  +  KEA       FG W+RA I++K + +S    S  S+ + +S
Subjt:  WCPIKYEKLPDFCFECGIIGHSYRMCPSSKVTKEA--NLKYEFGDWLRASITKKPEGKSGIYTSRNSNDKLKS

A0A5C7IL40 CCHC-type domain-containing protein2.3e-3041.18Show/hide
Query:  NSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKS-G
        +  PW FD  L++L+ ++   +I +M F    FW+QIHNAP+ CMT EM + +G LIGEV +ID          ++R RV ID+++P  R +++++   G
Subjt:  NSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKS-G

Query:  KEVWCPIKYEKLPDFCFECGIIGHSYRMCPS--SKVTKEANLKYEFGDWLRAS
         E    I+YEK+P FCF+CG++GH  + C      + KE N ++EFG WLRAS
Subjt:  KEVWCPIKYEKLPDFCFECGIIGHSYRMCPS--SKVTKEANLKYEFGDWLRAS

A0A6J1D765 uncharacterized protein LOC1110179021.6e-4249.68Show/hide
Query:  MNSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKSG
        ++S PW F+KSLL+L +    +Q  +M F    FW+QIHN P  C++ EMA +LG  +G+VEEI+ DG + W  PF+R RV ID++ P  RGIKL+   G
Subjt:  MNSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKSG

Query:  KEVWCPIKYEKLPDFCFECGIIGHSYRMCPSSKVTKEANLKYEFGDWLRASITKK
        K++WCP++YEKLPDFC+ECG IGHS R C         N   ++GDWLRA++ KK
Subjt:  KEVWCPIKYEKLPDFCFECGIIGHSYRMCPSSKVTKEANLKYEFGDWLRASITKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42140.1 zinc ion binding;nucleic acid binding5.7e-0522.73Show/hide
Query:  MNSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKSG
        +   PW F+  + +++   +    A   F    FW+QI   P+  +T  +   +G  +G                FL   +  D+++     +K Q    
Subjt:  MNSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKSG

Query:  KEVWCPIKYEKLPDFCFECGIIGHSYRMCPSS
                YEKL +FC  CG++ H    CP+S
Subjt:  KEVWCPIKYEKLPDFCFECGIIGHSYRMCPSS

AT5G25600.1 BEST Arabidopsis thaliana protein match is: nucleic acid binding6.7e-0627.45Show/hide
Query:  MNSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDIT--IPFLRGIKLQIK
        M   PW+F++  + L+  E+  +      T+ D WVQ+   P+   T+ + + + + +G V  +D+D E S    F+R +V I IT  + F R ++ + +
Subjt:  MNSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDIT--IPFLRGIKLQIK

Query:  SG
         G
Subjt:  SG

AT5G36228.1 nucleic acid binding;zinc ion binding3.1e-1126.15Show/hide
Query:  MNSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKSG
        +  +PWVF++  + L+  E  D       T  D WV I   P+  ++    +++ + +GEV  +D++ E +    F+R +V +D T P     +++  S 
Subjt:  MNSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKSG

Query:  KEVWCPIKYEKLPDFCFECGIIGHSYRMCP
        +      +YEKL   C  C  + H    CP
Subjt:  KEVWCPIKYEKLPDFCFECGIIGHSYRMCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTCCAGTCCATGGGTTTTCGATAAGTCTTTGCTAATACTGAAGAATGTGGAAGAAGATGATCAAATTGCGAATATGGTTTTTACCCATCAAGATTTTTGGGTTCA
AATTCATAATGCCCCTATGAATTGTATGACTGTGGAGATGGCTAAAGTGCTAGGTAATCTAATTGGAGAGGTGGAGGAGATTGATTGGGATGGAGAAAATAGTTGGGTTA
GACCTTTTCTACGTTTCAGAGTAGCCATAGATATTACTATTCCATTCCTTAGAGGGATCAAATTGCAAATCAAGTCGGGGAAAGAAGTTTGGTGTCCAATCAAATATGAA
AAGCTTCCTGACTTTTGCTTTGAGTGCGGGATTATTGGCCATTCTTATCGTATGTGCCCTTCTTCCAAAGTTACAAAAGAAGCCAATCTTAAGTATGAATTCGGCGACTG
GTTGCGTGCTTCAATAACTAAAAAACCAGAGGGGAAGTCTGGGATTTACACCTCGAGAAATTCCAATGACAAATTAAAAAGTGAGCTTACCTTAGAGAAAAAGGGCAGCA
TCATGTTAATGGAAACTGACTCCTTGCCTGAACTTGCTGAGGTCCACTCCCATCTGCATGAAGTCCCTTCTAATAATGGTGTAGATTCGGTATCTGGTAAGAATGATTCT
GAGTTGTCCAAGAATAGTGTTGTGGTGTTTACCAATGATCGGAGTTTAAGAACTTGTTTACAGGGCAAAGAGAGACAGAAATTGAAACCTAGACAGAGGAGGGAAGTCCC
TCTCCAAGGACAATCTGGGAATGTTAAACCCATGACTGGTGTTGAAGAAATAGTTGACAAGAAGAGGAAAGGGGAAGATTTCATCCAATCAGATTTTAAACAAAAAAGAT
TGTGTACTAGTGATGATAAGTTAATCCCAATGGAAGATGAAGAACGTTTGGCGGTGGCTGATTTTCAGCCCCGCCAGGGACAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAATTCCAGTCCATGGGTTTTCGATAAGTCTTTGCTAATACTGAAGAATGTGGAAGAAGATGATCAAATTGCGAATATGGTTTTTACCCATCAAGATTTTTGGGTTCA
AATTCATAATGCCCCTATGAATTGTATGACTGTGGAGATGGCTAAAGTGCTAGGTAATCTAATTGGAGAGGTGGAGGAGATTGATTGGGATGGAGAAAATAGTTGGGTTA
GACCTTTTCTACGTTTCAGAGTAGCCATAGATATTACTATTCCATTCCTTAGAGGGATCAAATTGCAAATCAAGTCGGGGAAAGAAGTTTGGTGTCCAATCAAATATGAA
AAGCTTCCTGACTTTTGCTTTGAGTGCGGGATTATTGGCCATTCTTATCGTATGTGCCCTTCTTCCAAAGTTACAAAAGAAGCCAATCTTAAGTATGAATTCGGCGACTG
GTTGCGTGCTTCAATAACTAAAAAACCAGAGGGGAAGTCTGGGATTTACACCTCGAGAAATTCCAATGACAAATTAAAAAGTGAGCTTACCTTAGAGAAAAAGGGCAGCA
TCATGTTAATGGAAACTGACTCCTTGCCTGAACTTGCTGAGGTCCACTCCCATCTGCATGAAGTCCCTTCTAATAATGGTGTAGATTCGGTATCTGGTAAGAATGATTCT
GAGTTGTCCAAGAATAGTGTTGTGGTGTTTACCAATGATCGGAGTTTAAGAACTTGTTTACAGGGCAAAGAGAGACAGAAATTGAAACCTAGACAGAGGAGGGAAGTCCC
TCTCCAAGGACAATCTGGGAATGTTAAACCCATGACTGGTGTTGAAGAAATAGTTGACAAGAAGAGGAAAGGGGAAGATTTCATCCAATCAGATTTTAAACAAAAAAGAT
TGTGTACTAGTGATGATAAGTTAATCCCAATGGAAGATGAAGAACGTTTGGCGGTGGCTGATTTTCAGCCCCGCCAGGGACAATGA
Protein sequenceShow/hide protein sequence
MNSSPWVFDKSLLILKNVEEDDQIANMVFTHQDFWVQIHNAPMNCMTVEMAKVLGNLIGEVEEIDWDGENSWVRPFLRFRVAIDITIPFLRGIKLQIKSGKEVWCPIKYE
KLPDFCFECGIIGHSYRMCPSSKVTKEANLKYEFGDWLRASITKKPEGKSGIYTSRNSNDKLKSELTLEKKGSIMLMETDSLPELAEVHSHLHEVPSNNGVDSVSGKNDS
ELSKNSVVVFTNDRSLRTCLQGKERQKLKPRQRREVPLQGQSGNVKPMTGVEEIVDKKRKGEDFIQSDFKQKRLCTSDDKLIPMEDEERLAVADFQPRQGQ