; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001752 (gene) of Snake gourd v1 genome

Gene IDTan0001752
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCCHC-type domain-containing protein
Genome locationLG01:74992912..74993757
RNA-Seq ExpressionTan0001752
SyntenyTan0001752
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG66887.1 hypothetical protein EZV62_008162 [Acer yangbiense]1.3e-2338.82Show/hide
Query:  VLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQLNMEG
        +L G PW  D  LL L+KPS       L FN VAFW+   + PL C  KEM + +G+ +G   + D G     +G  MR+K+ ++++KPL+  ++L +E 
Subjt:  VLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQLNMEG

Query:  PMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAKFHQYGQNLR
            +L+ ++YEKLP++C HCGIIGH ++DC       I G+K  +YG  +R
Subjt:  PMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAKFHQYGQNLR

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]1.7e-3143.67Show/hide
Query:  RVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQLNME
        R+L   PW  DR L+ +  P   +KP D+ F  V+ WV F D+ LAC NK M  +LGN +G+FE+ +    N  WG  +RV++R ++ KPL  GI+LN++
Subjt:  RVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQLNME

Query:  GPMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAKFHQYGQNLRFVGRQ
        GPMGG  I I+YE+LP F  HCG + H  KDC+    + +  +K  QYG  LRF G +
Subjt:  GPMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAKFHQYGQNLRFVGRQ

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]9.1e-3337.93Show/hide
Query:  DCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQL
        D  RV+   PW  D+ L+ LQKP      S+L FN VAFW+   D+P++  NK M  +LGN +G F + D  E   SWG S+R+++ I+ITKPLR GI++
Subjt:  DCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQL

Query:  NMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAK-FHQYGQNLRFVGRQMNI-----ARSPSSVDR--RKSHPNENRGHQELLWLRL
        N++GPMGG  I I+YE+LP FC  CG+IGH   DC+  Y      ++   +YG  LRFVG +         +SP+  D     S  ++ RG +E      
Subjt:  NMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAK-FHQYGQNLRFVGRQMNI-----ARSPSSVDR--RKSHPNENRGHQELLWLRL

Query:  QLTFRIQRRRSRQRRNLEGF-SQVSISVTEDS
               +++  ++ N +GF SQ +   T D+
Subjt:  QLTFRIQRRRSRQRRNLEGF-SQVSISVTEDS

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.4e-2840.62Show/hide
Query:  MDCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQ
        +D  ++    PW  DR L+ + KP     PS+L F  +  WV F D+PL C  ++M  +LGN +G FEE D  + N  WG ++RV++ ++I+KPLR GI+
Subjt:  MDCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQ

Query:  LNMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAKFHQYGQNLRFVG
        LN++GP+GG  I I+YE+LP FC HCG               +    K HQYG  LR+ G
Subjt:  LNMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAKFHQYGQNLRFVG

XP_028117212.1 uncharacterized protein LOC114314884 [Camellia sinensis]3.8e-2336.65Show/hide
Query:  MDCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQ
        +D  RV+   PW  D+ L+ L++ +   +PS++ F+ + FWV   ++PL    +++   LGNT+G F + + G+G ++WG ++ ++I INI KPLR G++
Subjt:  MDCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQ

Query:  LNMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAKFH----QYGQNLR
        L + G    V I+++YE+LP FC HCG++GH   DC      +++GA       QYG  LR
Subjt:  LNMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAKFH----QYGQNLR

TrEMBL top hitse value%identityAlignment
A0A5C7H9Y2 CCHC-type domain-containing protein3.2e-2334.43Show/hide
Query:  DCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQL
        D  R+L G PWL D+ LL L++ S   K +DL F +V FW+   ++PLAC N+E+   LG  +G  +E D GE     G  +R+++ I++  PL+ G+++
Subjt:  DCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQL

Query:  NMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAKFHQYGQNLRFVGRQMNIARSPSSVDRRKSHPNENRG
         +        + I YE+LP FC +CG IGH  +DC L  K +   + F ++G  +R V R     RS  + +++ S      G
Subjt:  NMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAKFHQYGQNLRFVGRQMNIARSPSSVDRRKSHPNENRG

A0A5C7ICE5 CCHC-type domain-containing protein6.4e-2438.82Show/hide
Query:  VLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQLNMEG
        +L G PW  D  LL L+KPS       L FN VAFW+   + PL C  KEM + +G+ +G   + D G     +G  MR+K+ ++++KPL+  ++L +E 
Subjt:  VLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQLNMEG

Query:  PMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAKFHQYGQNLR
            +L+ ++YEKLP++C HCGIIGH ++DC       I G+K  +YG  +R
Subjt:  PMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAKFHQYGQNLR

A0A6J1BSZ1 uncharacterized protein LOC1110054818.3e-3243.67Show/hide
Query:  RVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQLNME
        R+L   PW  DR L+ +  P   +KP D+ F  V+ WV F D+ LAC NK M  +LGN +G+FE+ +    N  WG  +RV++R ++ KPL  GI+LN++
Subjt:  RVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQLNME

Query:  GPMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAKFHQYGQNLRFVGRQ
        GPMGG  I I+YE+LP F  HCG + H  KDC+    + +  +K  QYG  LRF G +
Subjt:  GPMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAKFHQYGQNLRFVGRQ

A0A6J1DU55 uncharacterized protein LOC1110231354.4e-3337.93Show/hide
Query:  DCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQL
        D  RV+   PW  D+ L+ LQKP      S+L FN VAFW+   D+P++  NK M  +LGN +G F + D  E   SWG S+R+++ I+ITKPLR GI++
Subjt:  DCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQL

Query:  NMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAK-FHQYGQNLRFVGRQMNI-----ARSPSSVDR--RKSHPNENRGHQELLWLRL
        N++GPMGG  I I+YE+LP FC  CG+IGH   DC+  Y      ++   +YG  LRFVG +         +SP+  D     S  ++ RG +E      
Subjt:  NMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAK-FHQYGQNLRFVGRQMNI-----ARSPSSVDR--RKSHPNENRGHQELLWLRL

Query:  QLTFRIQRRRSRQRRNLEGF-SQVSISVTEDS
               +++  ++ N +GF SQ +   T D+
Subjt:  QLTFRIQRRRSRQRRNLEGF-SQVSISVTEDS

A0A6J1DX30 uncharacterized protein LOC1110248746.6e-2940.62Show/hide
Query:  MDCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQ
        +D  ++    PW  DR L+ + KP     PS+L F  +  WV F D+PL C  ++M  +LGN +G FEE D  + N  WG ++RV++ ++I+KPLR GI+
Subjt:  MDCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQ

Query:  LNMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAKFHQYGQNLRFVG
        LN++GP+GG  I I+YE+LP FC HCG               +    K HQYG  LR+ G
Subjt:  LNMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAKFHQYGQNLRFVG

SwissProt top hitse value%identityAlignment
O04244 Uncharacterized protein At4g020005.2e-0727.34Show/hide
Query:  MDCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNF---VAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRW
        +D L VL  EPWL + + +   +  +     +L F+    +  WV    IPL    +E   ++ + +G     D  +   +    +RV+IR  IT  LR+
Subjt:  MDCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNF---VAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRW

Query:  GIQLNMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDC
          ++  +      LIS +YE+L + CS C  + HH   C
Subjt:  GIQLNMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDC

Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding5.7e-0929.1Show/hide
Query:  LRVLHGEPW-LLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQLN
        +  L G PW +L  +LL     S F    D +      WV   +IP   +++ +  ++   +G   + D    N   G   RV I +N+ KPL+  + +N
Subjt:  LRVLHGEPW-LLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQLN

Query:  MEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDC
              G    + YE L K CS CGI GH    C
Subjt:  MEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDC

AT2G13450.1 unknown protein2.0e-0928.06Show/hide
Query:  MDCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNF---VAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRW
        +D L VL  EPWL + + +  Q+  +     +L F+    +  WV    IPL    +E   ++ + +G     D  +   +    +RV+IR  IT  LR+
Subjt:  MDCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNF---VAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRW

Query:  GIQLNMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDC
         +++  +      LIS +YE+L + CS C  + HH   C
Subjt:  GIQLNMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDC

AT3G42140.1 zinc ion binding;nucleic acid binding1.1e-0422.9Show/hide
Query:  VLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQLNMEG
        +L   PW  + ++  +Q+ +     SD  F  + FW+    IPL      +   +G  MG+F      E NL   +S                       
Subjt:  VLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQLNMEG

Query:  PMGGVLISIKYEKLPKFCSHCGIIGHHFKDC
             ++  +YEKL  FC+ CG++ H   +C
Subjt:  PMGGVLISIKYEKLPKFCSHCGIIGHHFKDC

AT4G02000.1 unknown protein3.7e-0827.34Show/hide
Query:  MDCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNF---VAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRW
        +D L VL  EPWL + + +   +  +     +L F+    +  WV    IPL    +E   ++ + +G     D  +   +    +RV+IR  IT  LR+
Subjt:  MDCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNF---VAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRW

Query:  GIQLNMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDC
          ++  +      LIS +YE+L + CS C  + HH   C
Subjt:  GIQLNMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDC

AT5G36228.1 nucleic acid binding;zinc ion binding1.6e-1127.94Show/hide
Query:  MDCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQ
        +D L  L   PW+ + + +ALQ+   F  P++    F+  WV    IPL   ++   + + +T+G     D  E   S    +RVK+R++ T+PLR+  +
Subjt:  MDCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQ

Query:  LNMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDC
        +         +I  +YEKL + C++C  + H    C
Subjt:  LNMEGPMGGVLISIKYEKLPKFCSHCGIIGHHFKDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTGTCTCAGAGTGCTCCATGGAGAACCTTGGCTTCTTGACCGATTTCTTTTAGCTTTACAGAAACCTTCTCTATTTTCGAAACCTTCAGACTTGGTGTTCAATTT
TGTGGCCTTTTGGGTACTTTTTCTCGACATACCTTTGGCTTGTTTCAATAAGGAAATGACACAACAATTAGGCAACACAATGGGCATTTTTGAAGAGTTTGATGAAGGTG
AAGGAAATTTGAGTTGGGGCATGAGTATGAGGGTGAAAATCCGTATAAACATTACTAAACCGTTGAGATGGGGCATACAGTTGAATATGGAGGGGCCAATGGGAGGAGTT
TTGATATCAATAAAGTATGAGAAGTTACCAAAATTTTGCTCCCATTGTGGAATCATAGGCCACCATTTTAAGGACTGCAACTTGTTCTACAAAAATATGATACAAGGGGC
AAAATTTCACCAGTATGGTCAAAACCTTCGTTTTGTGGGTCGCCAAATGAATATTGCAAGATCACCATCATCTGTTGATCGAAGAAAATCCCATCCGAATGAAAACAGAG
GGCATCAGGAACTCTTATGGCTACGACTCCAATTAACCTTTCGAATACAGAGGAGAAGATCTCGTCAGCGAAGAAACTTAGAGGGTTTCAGTCAGGTGTCGATTTCAGTT
ACAGAAGATTCATTGATGGAGATGTTGCCATTAATGCAGATGAGCGACAAATCAGGATTTACAAAAGGGGACAAGTTGAAACGACAGAATATTGAGTTTGCAAAAAAACT
CTGCTTTGATGAGGGTCAATTGGGCATGATTCACGGGATACTGAGTGAAGAGACGTTGGAGGAAGACCTAATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACTGTCTCAGAGTGCTCCATGGAGAACCTTGGCTTCTTGACCGATTTCTTTTAGCTTTACAGAAACCTTCTCTATTTTCGAAACCTTCAGACTTGGTGTTCAATTT
TGTGGCCTTTTGGGTACTTTTTCTCGACATACCTTTGGCTTGTTTCAATAAGGAAATGACACAACAATTAGGCAACACAATGGGCATTTTTGAAGAGTTTGATGAAGGTG
AAGGAAATTTGAGTTGGGGCATGAGTATGAGGGTGAAAATCCGTATAAACATTACTAAACCGTTGAGATGGGGCATACAGTTGAATATGGAGGGGCCAATGGGAGGAGTT
TTGATATCAATAAAGTATGAGAAGTTACCAAAATTTTGCTCCCATTGTGGAATCATAGGCCACCATTTTAAGGACTGCAACTTGTTCTACAAAAATATGATACAAGGGGC
AAAATTTCACCAGTATGGTCAAAACCTTCGTTTTGTGGGTCGCCAAATGAATATTGCAAGATCACCATCATCTGTTGATCGAAGAAAATCCCATCCGAATGAAAACAGAG
GGCATCAGGAACTCTTATGGCTACGACTCCAATTAACCTTTCGAATACAGAGGAGAAGATCTCGTCAGCGAAGAAACTTAGAGGGTTTCAGTCAGGTGTCGATTTCAGTT
ACAGAAGATTCATTGATGGAGATGTTGCCATTAATGCAGATGAGCGACAAATCAGGATTTACAAAAGGGGACAAGTTGAAACGACAGAATATTGAGTTTGCAAAAAAACT
CTGCTTTGATGAGGGTCAATTGGGCATGATTCACGGGATACTGAGTGAAGAGACGTTGGAGGAAGACCTAATTTAG
Protein sequenceShow/hide protein sequence
MDCLRVLHGEPWLLDRFLLALQKPSLFSKPSDLVFNFVAFWVLFLDIPLACFNKEMTQQLGNTMGIFEEFDEGEGNLSWGMSMRVKIRINITKPLRWGIQLNMEGPMGGV
LISIKYEKLPKFCSHCGIIGHHFKDCNLFYKNMIQGAKFHQYGQNLRFVGRQMNIARSPSSVDRRKSHPNENRGHQELLWLRLQLTFRIQRRRSRQRRNLEGFSQVSISV
TEDSLMEMLPLMQMSDKSGFTKGDKLKRQNIEFAKKLCFDEGQLGMIHGILSEETLEEDLI