; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000060 (gene) of Snake gourd v1 genome

Gene IDTan0000060
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCCHC-type domain-containing protein
Genome locationLG02:73107627..73108891
RNA-Seq ExpressionTan0000060
SyntenyTan0000060
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG72599.1 hypothetical protein EZV62_001178 [Acer yangbiense]2.3e-2239.53Show/hide
Query:  PIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEI
        P H    ++ L++ +G    +++ F    FWI +H++P    + + A+    ++GE  ++ L++S  CWGN LRV+VR+D++KPLKR L++K+G+  E I
Subjt:  PIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEI

Query:  WCPITYEKLPDFCYNCGRIRHEDRECGFE
           + YE+LP+FCY CGRI H  +ECG E
Subjt:  WCPITYEKLPDFCYNCGRIRHEDRECGFE

TXG73549.1 hypothetical protein EZV62_002128 [Acer yangbiense]6.0e-2332.09Show/hide
Query:  MEATKLGMLMEHLCLSEEENMAIVIAGDDIDENTRQFLNTLICKSLSPK----------PIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTG
        M +T++  L + L L EE+   + +A D   E  +     L+ K LS K          P H    ++ L++  G+   S L F    FWI +H +P   
Subjt:  MEATKLGMLMEHLCLSEEENMAIVIAGDDIDENTRQFLNTLICKSLSPK----------PIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTG

Query:  QSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEIWCPITYEKLPDFCYNCGRIRHEDREC----GFEPQPDREAK
         + ++A+   +++GE  ++  ++S  CWG  +RV+VR+D+ KPLKR L+ K G+  E I   + YE+LPDFC+ CGRI H  +EC      +   D    
Subjt:  QSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEIWCPITYEKLPDFCYNCGRIRHEDREC----GFEPQPDREAK

Query:  QFGSALRHPFWSGGR
        ++GS L+ P    G+
Subjt:  QFGSALRHPFWSGGR

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]8.3e-2537.71Show/hide
Query:  LSPKPIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSM
        LS  P   N  +L+L       +  D++F + +FWI +HN+P    S ++A   G ++G+ ++++ + ++   G  +RVRV++DV+KPL+RG+K+K  S 
Subjt:  LSPKPIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSM

Query:  AEEIWCPITYEKLPDFCYNCGRIRHEDRECGFEPQ--PDREAKQFGSALR---------HP----FWSGGRRFGQ
         ++IWCP+ YEKLPDFCY CG+I H  REC    +       +Q+G  LR         HP    FW GG RFG+
Subjt:  AEEIWCPITYEKLPDFCYNCGRIRHEDRECGFEPQ--PDREAKQFGSALR---------HP----FWSGGRRFGQ

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]2.7e-2337.5Show/hide
Query:  PIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEI
        P   +  +++L +   +   S+L F  V+FWIHL +LP +  +  +A   GN +G F  VD N+    WG SLR+RV +D+TKPL+RG+K+ +       
Subjt:  PIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEI

Query:  WCPITYEKLPDFCYNCGRIRHEDRECG---FEPQPD-REAKQFGSALRHPFWSGGRRFGQ
        W PI YE+LPDFCY CG I H   +C       Q D R   ++G  LR      G + G+
Subjt:  WCPITYEKLPDFCYNCGRIRHEDRECG---FEPQPD-REAKQFGSALRHPFWSGGRRFGQ

XP_039815364.1 uncharacterized protein LOC120678302 [Panicum virgatum]1.3e-2237.75Show/hide
Query:  KSLSPKPIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVG
        +++   P  I+   L++ + + +    ++ F  +  WI +  LP      ++A   G  +GEF +VDL   E   G  LRV+VR+++ KPL+RG+ + VG
Subjt:  KSLSPKPIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVG

Query:  SMAEEIWCPITYEKLPDFCYNCGRIRHEDRECGFEPQPDREAKQFGSALRH
          A+E WCPITYE LPDFCY CGRI H D+ C  +     +A  FG  LR+
Subjt:  SMAEEIWCPITYEKLPDFCYNCGRIRHEDRECGFEPQPDREAKQFGSALRH

TrEMBL top hitse value%identityAlignment
A0A5C7IU01 CCHC-type domain-containing protein1.1e-2239.53Show/hide
Query:  PIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEI
        P H    ++ L++ +G    +++ F    FWI +H++P    + + A+    ++GE  ++ L++S  CWGN LRV+VR+D++KPLKR L++K+G+  E I
Subjt:  PIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEI

Query:  WCPITYEKLPDFCYNCGRIRHEDRECGFE
           + YE+LP+FCY CGRI H  +ECG E
Subjt:  WCPITYEKLPDFCYNCGRIRHEDRECGFE

A0A5C7IWT1 CCHC-type domain-containing protein2.9e-2332.09Show/hide
Query:  MEATKLGMLMEHLCLSEEENMAIVIAGDDIDENTRQFLNTLICKSLSPK----------PIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTG
        M +T++  L + L L EE+   + +A D   E  +     L+ K LS K          P H    ++ L++  G+   S L F    FWI +H +P   
Subjt:  MEATKLGMLMEHLCLSEEENMAIVIAGDDIDENTRQFLNTLICKSLSPK----------PIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTG

Query:  QSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEIWCPITYEKLPDFCYNCGRIRHEDREC----GFEPQPDREAK
         + ++A+   +++GE  ++  ++S  CWG  +RV+VR+D+ KPLKR L+ K G+  E I   + YE+LPDFC+ CGRI H  +EC      +   D    
Subjt:  QSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEIWCPITYEKLPDFCYNCGRIRHEDREC----GFEPQPDREAK

Query:  QFGSALRHPFWSGGR
        ++GS L+ P    G+
Subjt:  QFGSALRHPFWSGGR

A0A6J1D765 uncharacterized protein LOC1110179024.0e-2537.71Show/hide
Query:  LSPKPIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSM
        LS  P   N  +L+L       +  D++F + +FWI +HN+P    S ++A   G ++G+ ++++ + ++   G  +RVRV++DV+KPL+RG+K+K  S 
Subjt:  LSPKPIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSM

Query:  AEEIWCPITYEKLPDFCYNCGRIRHEDRECGFEPQ--PDREAKQFGSALR---------HP----FWSGGRRFGQ
         ++IWCP+ YEKLPDFCY CG+I H  REC    +       +Q+G  LR         HP    FW GG RFG+
Subjt:  AEEIWCPITYEKLPDFCYNCGRIRHEDRECGFEPQ--PDREAKQFGSALR---------HP----FWSGGRRFGQ

A0A6J1DU55 uncharacterized protein LOC1110231351.3e-2337.5Show/hide
Query:  PIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEI
        P   +  +++L +   +   S+L F  V+FWIHL +LP +  +  +A   GN +G F  VD N+    WG SLR+RV +D+TKPL+RG+K+ +       
Subjt:  PIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEI

Query:  WCPITYEKLPDFCYNCGRIRHEDRECG---FEPQPD-REAKQFGSALRHPFWSGGRRFGQ
        W PI YE+LPDFCY CG I H   +C       Q D R   ++G  LR      G + G+
Subjt:  WCPITYEKLPDFCYNCGRIRHEDRECG---FEPQPD-REAKQFGSALRHPFWSGGRRFGQ

W9RRS1 CCHC-type domain-containing protein2.5e-2239.19Show/hide
Query:  VVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEIWCPITY
        +V+L    NG    SD  F Y  FW+ LHNLP  G+  ++ +  G+ +G    V  +    CWG  +RVRV +D+TKP++R +K+ +G     +W  + Y
Subjt:  VVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEIWCPITY

Query:  EKLPDFCYNCGRIRHEDREC---GFEPQPDREAKQFGSALRHPFWSGG
        EKLPDFCY CG+  H  REC   G     D     +GS LR P   GG
Subjt:  EKLPDFCYNCGRIRHEDREC---GFEPQPDREAKQFGSALRHPFWSGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G13450.1 unknown protein7.8e-0524.24Show/hide
Query:  VSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEIWCPITYEKLPDFCYNCGRIRHEDREC
        +  W+ +  +P      + A    + +GE   +D + S T     +RVR+R  +T  L+  L++ +    E       YE+L   C +C R+ H    C
Subjt:  VSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEIWCPITYEKLPDFCYNCGRIRHEDREC

AT3G42140.1 zinc ion binding;nucleic acid binding1.0e-0425.47Show/hide
Query:  SDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEIWCPITYEKLPDFCYNCGRIR
        SD  FK + FWI +  +P    + ++  + G R+G F + +L +              V V K                      YEKL +FC  CG + 
Subjt:  SDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEIWCPITYEKLPDFCYNCGRIR

Query:  HEDREC
        H+  EC
Subjt:  HEDREC

AT5G36228.1 nucleic acid binding;zinc ion binding2.8e-1029.41Show/hide
Query:  YVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEIWCPITYEKLPDFCYNCGRIRHEDREC
        ++  W+H+  +P    S +  +   + +GE   +D N+  T     +RV+VR+D T+PL+   +V+  S  E       YEKL   C NC R+ H+   C
Subjt:  YVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVDLNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEIWCPITYEKLPDFCYNCGRIRHEDREC

Query:  GF
         +
Subjt:  GF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCGACAAAATTAGGAATGCTAATGGAGCATTTATGTTTATCAGAGGAGGAAAACATGGCTATTGTGATAGCTGGCGATGATATTGATGAAAACACTCGCCAATT
CCTAAATACCTTGATTTGCAAGAGTCTGTCACCGAAGCCCATTCATATTAACGTGGTAGTTCTTCTTCTAGATGAGCGAAATGGGGCATGCCGGTTTTCAGATCTACATT
TCAAATATGTGTCCTTCTGGATTCACTTACATAATTTGCCCCCTACAGGTCAATCCCTCAAATTAGCACAGACATATGGAAATCGAGTAGGAGAGTTCAAAAAGGTGGAC
TTGAACAAATCGGAGACCTGTTGGGGAAATTCCTTGCGAGTGCGCGTGCGAGTAGATGTCACGAAACCGTTGAAACGAGGATTGAAGGTGAAGGTAGGTTCAATGGCGGA
AGAAATCTGGTGTCCTATCACTTACGAAAAGCTCCCTGATTTCTGTTATAATTGTGGGAGAATCAGGCATGAAGATCGTGAATGTGGTTTTGAGCCGCAACCAGATCGGG
AAGCTAAACAATTTGGTTCAGCACTTCGGCATCCCTTTTGGAGTGGAGGTCGTCGATTTGGACAAACCGCAGGTCCTCAAATTTTCAAGGTAAAAGCAGAGGAGGCCGAG
AAACTGATGGAGAGTGGAGATCGACAACAACCACCGGAAGAGGGTGCCGACAGTGATGATCCACCACAAGATAACAATGATTTCCAAATCATCTTAAATGCAATTAAATT
CTCAAGTTCAACGGTTGGATCTGTGGCTGATGTGTCCAATTTCAACGACCATACAAAAACTCCTAGGAAGGTAAAAGTTTCGGTTATCCAAGACTTGTCTAAAGCTTTAA
ATTTTGACAATTGCAGGGCTTTGAATCCTAGAGTTATGGAGGAGAGAGATCTGAAAACGAATGAGGATGAAGGGTCGGTCAAGTTGGGTATGGAGCTGGCATTTCAAGGG
ACCAAAGGTGAAGGTGGTAGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGCGACAAAATTAGGAATGCTAATGGAGCATTTATGTTTATCAGAGGAGGAAAACATGGCTATTGTGATAGCTGGCGATGATATTGATGAAAACACTCGCCAATT
CCTAAATACCTTGATTTGCAAGAGTCTGTCACCGAAGCCCATTCATATTAACGTGGTAGTTCTTCTTCTAGATGAGCGAAATGGGGCATGCCGGTTTTCAGATCTACATT
TCAAATATGTGTCCTTCTGGATTCACTTACATAATTTGCCCCCTACAGGTCAATCCCTCAAATTAGCACAGACATATGGAAATCGAGTAGGAGAGTTCAAAAAGGTGGAC
TTGAACAAATCGGAGACCTGTTGGGGAAATTCCTTGCGAGTGCGCGTGCGAGTAGATGTCACGAAACCGTTGAAACGAGGATTGAAGGTGAAGGTAGGTTCAATGGCGGA
AGAAATCTGGTGTCCTATCACTTACGAAAAGCTCCCTGATTTCTGTTATAATTGTGGGAGAATCAGGCATGAAGATCGTGAATGTGGTTTTGAGCCGCAACCAGATCGGG
AAGCTAAACAATTTGGTTCAGCACTTCGGCATCCCTTTTGGAGTGGAGGTCGTCGATTTGGACAAACCGCAGGTCCTCAAATTTTCAAGGTAAAAGCAGAGGAGGCCGAG
AAACTGATGGAGAGTGGAGATCGACAACAACCACCGGAAGAGGGTGCCGACAGTGATGATCCACCACAAGATAACAATGATTTCCAAATCATCTTAAATGCAATTAAATT
CTCAAGTTCAACGGTTGGATCTGTGGCTGATGTGTCCAATTTCAACGACCATACAAAAACTCCTAGGAAGGTAAAAGTTTCGGTTATCCAAGACTTGTCTAAAGCTTTAA
ATTTTGACAATTGCAGGGCTTTGAATCCTAGAGTTATGGAGGAGAGAGATCTGAAAACGAATGAGGATGAAGGGTCGGTCAAGTTGGGTATGGAGCTGGCATTTCAAGGG
ACCAAAGGTGAAGGTGGTAGGTGA
Protein sequenceShow/hide protein sequence
MEATKLGMLMEHLCLSEEENMAIVIAGDDIDENTRQFLNTLICKSLSPKPIHINVVVLLLDERNGACRFSDLHFKYVSFWIHLHNLPPTGQSLKLAQTYGNRVGEFKKVD
LNKSETCWGNSLRVRVRVDVTKPLKRGLKVKVGSMAEEIWCPITYEKLPDFCYNCGRIRHEDRECGFEPQPDREAKQFGSALRHPFWSGGRRFGQTAGPQIFKVKAEEAE
KLMESGDRQQPPEEGADSDDPPQDNNDFQIILNAIKFSSSTVGSVADVSNFNDHTKTPRKVKVSVIQDLSKALNFDNCRALNPRVMEERDLKTNEDEGSVKLGMELAFQG
TKGEGGR