CuGenDBv2

Tan0012122 (gene) of Snake gourd v1 genome

Gene ID: Tan0012122
Organism: Trichosanthes anguina (Snake gourd v1)
Description: CCHC-type domain-containing protein
Genome location: LG07:67866408..67867341
RNA-Seq Expression: Tan0012122
Synteny: Tan0012122
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like
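
The Genome location field above uses a `sequence:start..end` notation. As a minimal sketch, it could be split into its parts as shown below. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re

# Pattern for a location string of the form "LG07:67866408..67867341".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"Unrecognised location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(location: str) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


if __name__ == "__main__":
    print(parse_location("LG07:67866408..67867341"))
    print(span_length("LG07:67866408..67867341"))
```

Under the inclusive-coordinate assumption, the span for this gene works out to 934 bp.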


Homology
GenBank top hits
TXG50019.1 hypothetical protein EZV62_025894 [Acer yangbiense]  e value: 1.2e-30  % identity: 31.98
Query:  MEDEEFNNRLASLNLKEEELGGVVEIDDNELEDFDKRNQDEVVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVKGGP
        M  EE  N   +L+LKE E G ++ +     +   +R    +  K+L+ K ++ E F S++P+IW       +IEI   N+F  +F+ ++++  +++GGP
Subjt:  MEDEEFNNRLASLNLKEEELGGVVEIDDNELEDFDKRNQDEVVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVKGGP

Query:  WTFDRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIG------------------RFEGVESAKQPLRRAVKLKIGSMGDELW
        W+FD+ L+V EE  G  +++ M+F    FW+    +P +C T +    LG+ IG                  R   V    +PLRR +++ +   G E  
Subjt:  WTFDRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIG------------------RFEGVESAKQPLRRAVKLKIGSMGDELW

Query:  VQVRYEKLPDFCYGCGIIGHLVKDCGTEVGGIKT---NLQFGDWLKA
        + +RYE+LPD CY CG IGH+V+DC      ++    NL FG WL+A
Subjt:  VQVRYEKLPDFCYGCGIIGHLVKDCGTEVGGIKT---NLQFGDWLKA

TXG66887.1 hypothetical protein EZV62_008162 [Acer yangbiense]  e value: 2.6e-30  % identity: 32.18
Query:  MEDEEFNNRLASLNL--KEEELGGVVEIDDNELEDFDKRNQDE-VVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVK
        M+ EE     ASL+L  K+E L  V E     L+D   R  D  +V K+LT K ++ E F++++PKIW        +E+   N F+  FR++ ++  I+ 
Subjt:  MEDEEFNNRLASLNL--KEEELGGVVEIDDNELEDFDKRNQDE-VVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVK

Query:  GGPWTFDRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIGRFEGVE------------------SAKQPLRRAVKLKIGSMGD
        GGPWTFD  L+V E+  G  ++ ++ F    FW+   + P +C T++  E +G+ IG    ++                     +PL+R ++L++   G 
Subjt:  GGPWTFDRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIGRFEGVE------------------SAKQPLRRAVKLKIGSMGD

Query:  ELWVQVRYEKLPDFCYGCGIIGHLVKDC----GTEVGGIKTNLQFGDWLKANARMGGEELT
        E  + +RYEKLP++C+ CGIIGH  +DC    G ++GG K + ++G W++A+   G   +T
Subjt:  ELWVQVRYEKLPDFCYGCGIIGHLVKDC----GTEVGGIKTNLQFGDWLKANARMGGEELT

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]  e value: 8.5e-34  % identity: 33.61
Query:  EEFNNRLASLNLKEEELGGVVEIDDNELEDFDKRNQDEVVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVKGGPWTF
        EE+ N      L  EE    V+ID + LE   K  +  ++CK+L+ ++I   V K+ +   W ++     ++I G N+F+ +F   +++ RI++ GPWTF
Subjt:  EEFNNRLASLNLKEEELGGVVEIDDNELEDFDKRNQDEVVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVKGGPWTF

Query:  DRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIGRFEGVES------------------AKQPLRRAVKLKIGSMGDELWVQV
        DR L++ +          M+FR    WVHF DL   C  +  A  LGN+IG FE VES                    +PL R +KL +       W+ +
Subjt:  DRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIGRFEGVES------------------AKQPLRRAVKLKIGSMGDELWVQV

Query:  RYEKLPDFCYGCGIIGHLVKDCG-TEVGGIKTNLQFGDWLK
        +YE+LPDF Y CG + H++KDC    V  +  NLQ+G WL+
Subjt:  RYEKLPDFCYGCGIIGHLVKDCG-TEVGGIKTNLQFGDWLK

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]  e value: 1.5e-30  % identity: 35.12
Query:  VVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVKGGPWTFDRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCF
        VV K+ T+K I AE  +S++  +W +  N  + E  G N+++  F+S +EK R++  GPWTF++ L+V            M F +  FW+   ++P  C 
Subjt:  VVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVKGGPWTFDRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCF

Query:  TRKTAEALGNSIGRFEGVE------------------SAKQPLRRAVKLKIGSMGDELWVQVRYEKLPDFCYGCGIIGHLVKDCGTEVGGIKTNL--QFG
        + + A  LG  +G  E +E                     +PLRR +KLK  S G ++W  +RYEKLPDFCY CG IGH  ++C      + TN   Q+G
Subjt:  TRKTAEALGNSIGRFEGVE------------------SAKQPLRRAVKLKIGSMGDELWVQVRYEKLPDFCYGCGIIGHLVKDCGTEVGGIKTNL--QFG

Query:  DWLKA
        DWL+A
Subjt:  DWLKA

XP_024953751.1 uncharacterized protein LOC112498094 [Citrus sinensis]  e value: 6.8e-31  % identity: 33.33
Query:  MEDEEFNNRLASLNLKEEELGGVVEIDDNELEDFDKRNQDEVVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVKGGP
        ME EE   R  ++ L +EE GG V       +  +K     ++ K++ T+ +  E  K  + ++W     + KIE  G N+F+  F S+A+K  I+ GGP
Subjt:  MEDEEFNNRLASLNLKEEELGGVVEIDDNELEDFDKRNQDEVVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVKGGP

Query:  WTFDRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIGRFEGVES------------------AKQPLRRAVKL-KIGSMGDEL
        W FDR L+V  E  G  ++K  +F +  FWV   D+P +C T++   ALG +IG+ E VE+                    +PL++ ++L + G   +++
Subjt:  WTFDRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIGRFEGVES------------------AKQPLRRAVKL-KIGSMGDEL

Query:  WVQVRYEKLPDFCYGCGIIGHLVKDCGTEVGGIKTNLQFGDWLKAN
         +QV YE+LPDFC+ CG IGH  ++C       K  L +G WLKA+
Subjt:  WVQVRYEKLPDFCYGCGIIGHLVKDCGTEVGGIKTNLQFGDWLKAN

TrEMBL top hits
A0A5C7GZQ4 CCHC-type domain-containing protein  e value: 5.6e-31  % identity: 31.98
Query:  MEDEEFNNRLASLNLKEEELGGVVEIDDNELEDFDKRNQDEVVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVKGGP
        M  EE  N   +L+LKE E G ++ +     +   +R    +  K+L+ K ++ E F S++P+IW       +IEI   N+F  +F+ ++++  +++GGP
Subjt:  MEDEEFNNRLASLNLKEEELGGVVEIDDNELEDFDKRNQDEVVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVKGGP

Query:  WTFDRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIG------------------RFEGVESAKQPLRRAVKLKIGSMGDELW
        W+FD+ L+V EE  G  +++ M+F    FW+    +P +C T +    LG+ IG                  R   V    +PLRR +++ +   G E  
Subjt:  WTFDRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIG------------------RFEGVESAKQPLRRAVKLKIGSMGDELW

Query:  VQVRYEKLPDFCYGCGIIGHLVKDCGTEVGGIKT---NLQFGDWLKA
        + +RYE+LPD CY CG IGH+V+DC      ++    NL FG WL+A
Subjt:  VQVRYEKLPDFCYGCGIIGHLVKDCGTEVGGIKT---NLQFGDWLKA

A0A5C7ICE5 CCHC-type domain-containing protein  e value: 1.2e-30  % identity: 32.18
Query:  MEDEEFNNRLASLNL--KEEELGGVVEIDDNELEDFDKRNQDE-VVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVK
        M+ EE     ASL+L  K+E L  V E     L+D   R  D  +V K+LT K ++ E F++++PKIW        +E+   N F+  FR++ ++  I+ 
Subjt:  MEDEEFNNRLASLNL--KEEELGGVVEIDDNELEDFDKRNQDE-VVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVK

Query:  GGPWTFDRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIGRFEGVE------------------SAKQPLRRAVKLKIGSMGD
        GGPWTFD  L+V E+  G  ++ ++ F    FW+   + P +C T++  E +G+ IG    ++                     +PL+R ++L++   G 
Subjt:  GGPWTFDRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIGRFEGVE------------------SAKQPLRRAVKLKIGSMGD

Query:  ELWVQVRYEKLPDFCYGCGIIGHLVKDC----GTEVGGIKTNLQFGDWLKANARMGGEELT
        E  + +RYEKLP++C+ CGIIGH  +DC    G ++GG K + ++G W++A+   G   +T
Subjt:  ELWVQVRYEKLPDFCYGCGIIGHLVKDC----GTEVGGIKTNLQFGDWLKANARMGGEELT

A0A5C7ISU8 Uncharacterized protein  e value: 1.6e-30  % identity: 29.64
Query:  MEDEEFNNRLASLNLKEEELGGVVEIDDNELEDFDKRNQDEVVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVKGGP
        M+ EE     A+L++K ++   +  + D   E   ++    +V KIL+ K ++ +VF++++PKIW ++    +IE+   N F+  FRS++++CR++ GGP
Subjt:  MEDEEFNNRLASLNLKEEELGGVVEIDDNELEDFDKRNQDEVVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVKGGP

Query:  WTFDRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIG------------------RFEGVESAKQPLRRAVKLKIGSMGDELW
        W+FD  L+V E++ G  ++  + F    FW+  ++   +C T++    LG  +G                  R + V    +PL+R ++L +   G E  
Subjt:  WTFDRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIG------------------RFEGVESAKQPLRRAVKLKIGSMGDELW

Query:  VQVRYEKLPDFCYGCGIIGHLVKDC-GTEVGGIK---TNLQFGDWLKANARMG
        + +RYEKLP++C+ CG++GH  + C     GG K   T+ +FG W++A++  G
Subjt:  VQVRYEKLPDFCYGCGIIGHLVKDC-GTEVGGIK---TNLQFGDWLKANARMG

A0A6J1BSZ1 uncharacterized protein LOC111005481  e value: 4.1e-34  % identity: 33.61
Query:  EEFNNRLASLNLKEEELGGVVEIDDNELEDFDKRNQDEVVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVKGGPWTF
        EE+ N      L  EE    V+ID + LE   K  +  ++CK+L+ ++I   V K+ +   W ++     ++I G N+F+ +F   +++ RI++ GPWTF
Subjt:  EEFNNRLASLNLKEEELGGVVEIDDNELEDFDKRNQDEVVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVKGGPWTF

Query:  DRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIGRFEGVES------------------AKQPLRRAVKLKIGSMGDELWVQV
        DR L++ +          M+FR    WVHF DL   C  +  A  LGN+IG FE VES                    +PL R +KL +       W+ +
Subjt:  DRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIGRFEGVES------------------AKQPLRRAVKLKIGSMGDELWVQV

Query:  RYEKLPDFCYGCGIIGHLVKDCG-TEVGGIKTNLQFGDWLK
        +YE+LPDF Y CG + H++KDC    V  +  NLQ+G WL+
Subjt:  RYEKLPDFCYGCGIIGHLVKDCG-TEVGGIKTNLQFGDWLK

A0A6J1D765 uncharacterized protein LOC111017902  e value: 7.3e-31  % identity: 35.12
Query:  VVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVKGGPWTFDRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCF
        VV K+ T+K I AE  +S++  +W +  N  + E  G N+++  F+S +EK R++  GPWTF++ L+V            M F +  FW+   ++P  C 
Subjt:  VVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVKGGPWTFDRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCF

Query:  TRKTAEALGNSIGRFEGVE------------------SAKQPLRRAVKLKIGSMGDELWVQVRYEKLPDFCYGCGIIGHLVKDCGTEVGGIKTNL--QFG
        + + A  LG  +G  E +E                     +PLRR +KLK  S G ++W  +RYEKLPDFCY CG IGH  ++C      + TN   Q+G
Subjt:  TRKTAEALGNSIGRFEGVE------------------SAKQPLRRAVKLKIGSMGDELWVQVRYEKLPDFCYGCGIIGHLVKDCGTEVGGIKTNL--QFG

Query:  DWLKA
        DWL+A
Subjt:  DWLKA

SwissProt top hits
No hits found
Arabidopsis top hits
AT3G42140.1 zinc ion binding;nucleic acid binding  e value: 3.3e-07  % identity: 25.81
Query:  FRSKAEKCRIVKGGPWTFDRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIGRFEGVESAKQPLRRAVKLKIGSMGDELWVQV
        F+S+     I++ GPW+F+  + V +     L+  A EF+   FW+    +P    T +   ++G  +G F               L+     D   ++ 
Subjt:  FRSKAEKCRIVKGGPWTFDRGLVVFEEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIGRFEGVESAKQPLRRAVKLKIGSMGDELWVQV

Query:  RYEKLPDFCYGCGIIGHLVKDCGT
        +YEKL +FC  CG++ H   +C T
Subjt:  RYEKLPDFCYGCGIIGHLVKDCGT


Sequences
CDS sequence
ATGGAAGATGAAGAGTTCAACAACCGGTTAGCAAGTCTGAACCTCAAAGAGGAGGAACTAGGAGGAGTGGTGGAGATCGACGACAATGAACTCGAGGACTTCGATAAAAG
AAATCAAGACGAAGTGGTTTGCAAAATTCTAACTACAAAAACCATTCACGCAGAAGTCTTCAAGAGCATAGTTCCTAAGATATGGAATATGGAGGGAAATATGAAGAAGA
TAGAAATCTTTGGAAGAAATGTTTTTATGTGCTCATTCAGAAGCAAGGCTGAGAAATGTAGAATCGTCAAAGGAGGTCCATGGACCTTCGACAGGGGCCTGGTGGTTTTC
GAAGAAATCAAAGGAGCTCTCAACCTAAAGGCAATGGAATTCAGGTATGCTTTATTTTGGGTGCATTTTCTTGATCTCCCCAGAGTGTGTTTTACCAGGAAAACAGCGGA
GGCTCTAGGGAATTCCATCGGGAGGTTCGAAGGAGTTGAATCTGCGAAACAACCGTTGAGAAGAGCTGTTAAGCTCAAGATTGGGTCGATGGGAGACGAGCTGTGGGTTC
AAGTCAGATATGAAAAGCTCCCAGACTTTTGCTATGGATGTGGAATAATAGGGCATCTGGTCAAGGATTGCGGGACAGAGGTAGGAGGAATCAAAACCAACCTGCAGTTT
GGAGACTGGCTGAAAGCTAACGCTCGAATGGGAGGAGAAGAGCTAACTAGGCGCATAATGAGAAACTTCGTAACCCATGGAAGGGGTCGGGAAGGGGTACTCACCGCCAT
GAGAGAGAGAGCGAAAGAGAGGGGAGTATCTGGAGGAGCAGGAAGAAGTGCTGTTGCGGGAGGGGCCGGAGATCAGTGGTGGGCGATATGCCGTAAAAATGGGATCTAA
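
A CDS such as the one above can be translated to check it against the Protein sequence entry of this record. The sketch below is a minimal illustration, not the database's own pipeline: it assumes the standard genetic code (NCBI translation table 1), reads from the first base in frame, and stops at the first stop codon. The name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing
    characters outside TCAG are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 27 bases of the CDS shown above.
    print(translate("ATGGAAGATGAAGAGTTCAACAACCGG"))
```

Pasting the full CDS (line breaks included) into `translate` gives a string that can be compared directly with the protein sequence listed for this gene.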
mRNA sequence
ATGGAAGATGAAGAGTTCAACAACCGGTTAGCAAGTCTGAACCTCAAAGAGGAGGAACTAGGAGGAGTGGTGGAGATCGACGACAATGAACTCGAGGACTTCGATAAAAG
AAATCAAGACGAAGTGGTTTGCAAAATTCTAACTACAAAAACCATTCACGCAGAAGTCTTCAAGAGCATAGTTCCTAAGATATGGAATATGGAGGGAAATATGAAGAAGA
TAGAAATCTTTGGAAGAAATGTTTTTATGTGCTCATTCAGAAGCAAGGCTGAGAAATGTAGAATCGTCAAAGGAGGTCCATGGACCTTCGACAGGGGCCTGGTGGTTTTC
GAAGAAATCAAAGGAGCTCTCAACCTAAAGGCAATGGAATTCAGGTATGCTTTATTTTGGGTGCATTTTCTTGATCTCCCCAGAGTGTGTTTTACCAGGAAAACAGCGGA
GGCTCTAGGGAATTCCATCGGGAGGTTCGAAGGAGTTGAATCTGCGAAACAACCGTTGAGAAGAGCTGTTAAGCTCAAGATTGGGTCGATGGGAGACGAGCTGTGGGTTC
AAGTCAGATATGAAAAGCTCCCAGACTTTTGCTATGGATGTGGAATAATAGGGCATCTGGTCAAGGATTGCGGGACAGAGGTAGGAGGAATCAAAACCAACCTGCAGTTT
GGAGACTGGCTGAAAGCTAACGCTCGAATGGGAGGAGAAGAGCTAACTAGGCGCATAATGAGAAACTTCGTAACCCATGGAAGGGGTCGGGAAGGGGTACTCACCGCCAT
GAGAGAGAGAGCGAAAGAGAGGGGAGTATCTGGAGGAGCAGGAAGAAGTGCTGTTGCGGGAGGGGCCGGAGATCAGTGGTGGGCGATATGCCGTAAAAATGGGATCTAA
Protein sequence
MEDEEFNNRLASLNLKEEELGGVVEIDDNELEDFDKRNQDEVVCKILTTKTIHAEVFKSIVPKIWNMEGNMKKIEIFGRNVFMCSFRSKAEKCRIVKGGPWTFDRGLVVF
EEIKGALNLKAMEFRYALFWVHFLDLPRVCFTRKTAEALGNSIGRFEGVESAKQPLRRAVKLKIGSMGDELWVQVRYEKLPDFCYGCGIIGHLVKDCGTEVGGIKTNLQF
GDWLKANARMGGEELTRRIMRNFVTHGRGREGVLTAMRERAKERGVSGGAGRSAVAGGAGDQWWAICRKNGI
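
The InterPro entry IPR025836 listed for this gene is named after the zinc knuckle spacing CX2CX4HX4C. As a rough sketch, that spacing can be searched for in the protein sequence with a regular expression. This is only a literal reading of the pattern name, not InterPro's actual detection method, and `find_zinc_knuckles` is a hypothetical helper.

```python
import re

# Literal reading of "CX2CX4HX4C": Cys, 2 residues, Cys, 4 residues,
# His, 4 residues, Cys.
ZINC_KNUCKLE_RE = re.compile(r"C.{2}C.{4}H.{4}C")


def find_zinc_knuckles(protein: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched text) for each non-overlapping match."""
    seq = "".join(protein.split()).upper()
    return [(m.start(), m.group()) for m in ZINC_KNUCKLE_RE.finditer(seq)]


if __name__ == "__main__":
    # Fragment copied from the protein sequence above.
    fragment = "EKLPDFCYGCGIIGHLVKDCGTEVGG"
    print(find_zinc_knuckles(fragment))
```

In the fragment used here, the stretch CYGCGIIGHLVKDC fits the spacing, which is consistent with the CCHC-type zinc finger annotation.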