CuGenDBv2

Moc01g28470 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g28470
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: CCHC-type domain-containing protein
Genome location: chr1:20210697..20211553
RNA-Seq Expression: Moc01g28470
Synteny: Moc01g28470
Gene Ontology terms:
GO:0110165 - cellular anatomical structure (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily
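As a small illustration of the Genome location field, the sketch below splits a `chrom:start..end` string into its parts. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption rather than something this record states.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr1:20210697..20211553")
# Span length under the assumed 1-based, inclusive convention.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 857 bp on chr1.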


Homology
GenBank top hits (e-value, % identity, alignment)
KAF5758504.1 putative RNA-directed DNA polymerase [Helianthus annuus] (e-value: 3.8e-47; identity: 45.29%)
Query:  SPVKIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQSEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNILANVHG
        SP++  ++K+DGRINFGLWQVQVKDVLIQSGLHKAL+G+ +  +SK  SG                 S   DE+WE++DLRAASAIR  LAKN+LANVHG
Subjt:  SPVKIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQSEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNILANVHG

Query:  ISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRME----------------------------------------------------------------
        ISTAK+LWEKLE LYQ KGISNRLYLKEQFHTLRM+                                                                
Subjt:  ISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRME----------------------------------------------------------------

Query:  ---EEERRLKSEGRTSHEDSALVARNWKKKDSVQKKACCWGCGQSGHMKKDCPNRAGSSKGFGWDADNVSLIRGDD
            EE+RL S G TS E + L+  N KKK   QK   CW CGQSGH+K++CP  A S+      A+NV+++ GDD
Subjt:  ---EEERRLKSEGRTSHEDSALVARNWKKKDSVQKKACCWGCGQSGHMKKDCPNRAGSSKGFGWDADNVSLIRGDD

KAF5765959.1 putative RNA-directed DNA polymerase [Helianthus annuus] (e-value: 1.3e-47; identity: 45.29%)
Query:  SPVKIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQSEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNILANVHG
        SP++ D++K+DGRINFGLWQVQVKDVLIQSGLHKAL+G+ +  +SK  SG                 S   DE+WE++DLRAASAIR  LAKN+LANVHG
Subjt:  SPVKIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQSEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNILANVHG

Query:  ISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRME----------------------------------------------------------------
        ISTAK+LWEKLE LYQ KGI NRLYLKEQFHTLRM+                                                                
Subjt:  ISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRME----------------------------------------------------------------

Query:  ---EEERRLKSEGRTSHEDSALVARNWKKKDSVQKKACCWGCGQSGHMKKDCPNRAGSSKGFGWDADNVSLIRGDD
            EE+RL S G TS E + L+  N KKK   QK   CW CGQSGH+K++CP  A S+      A+NV+++ GDD
Subjt:  ---EEERRLKSEGRTSHEDSALVARNWKKKDSVQKKACCWGCGQSGHMKKDCPNRAGSSKGFGWDADNVSLIRGDD

KAF7802225.1 cytochrome p450 [Senna tora] (e-value: 1.2e-48; identity: 51.23%)
Query:  MSFFMSPVKIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQSEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNIL
        MS F S VK D++KFDGRINFGLWQVQVKDVLIQSGLHKAL+G+ S   S+K                   +SSMSD DWEE+DLRAAS IR SLAKN+L
Subjt:  MSFFMSPVKIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQSEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNIL

Query:  ANVHGISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRMEEEER--------------------RLKSEGRT---------SHEDSALVARNWKKKDSV
        ANV GISTAKELW+KLE LYQAKGISN L LKEQFHTL M+E  +                    ++  E +T         S+E    +  + +KK   
Subjt:  ANVHGISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRMEEEER--------------------RLKSEGRT---------SHEDSALVARNWKKKDSV

Query:  QKKACCWGCGQSGHMKKDCPNRAGSSKGFGWDADNVSLIRGDDD
         K   CW CG+SGH+KK+CP  A  + G   DA +VSL+RG+ D
Subjt:  QKKACCWGCGQSGHMKKDCPNRAGSSKGFGWDADNVSLIRGDDD

KAF7810708.1 cytochrome p450 [Senna tora] (e-value: 2.0e-51; identity: 52.89%)
Query:  MSFFMSPVKIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQSEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNIL
        MS F S +K D++KFDGRINFG WQVQVKDVLIQSGL KAL+G+ S   S+K                   +SSMSD DWEE+DLRAAS IR  LAKN+L
Subjt:  MSFFMSPVKIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQSEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNIL

Query:  ANVHGISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRMEE--------------------------EERRLKSEGRTSHEDSALVARNWKK-KDSVQK
        ANV GISTAKELW+KLE LYQAKGISNRL LKEQFHTLRM E                          EE+R+K E R S  DS +V +N    +    K
Subjt:  ANVHGISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRMEE--------------------------EERRLKSEGRTSHEDSALVARNWKK-KDSVQK

Query:  KACCWGCGQSGHMKKDCPNRAGSSKGFGWDADNVSLIRGDDD
           CW CG+SGH+KK+CP  A  + G   DA +VSL+RG+ D
Subjt:  KACCWGCGQSGHMKKDCPNRAGSSKGFGWDADNVSLIRGDDD

XP_022139673.1 uncharacterized protein LOC111010521 [Momordica charantia] (e-value: 9.1e-81; identity: 68.2%)
Query:  MSFFMSPVKIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQSEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNIL
        MSFFMSPVKID++KFDG INFGLWQVQVKDVLIQS LHKALKGR SEGAS+KLS DGG M+ SGGSSRGSKKSSMS EDWEEMDLRAASAIRTSLAKNIL
Subjt:  MSFFMSPVKIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQSEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNIL

Query:  ANVHGISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRMEE----------------------------------------------------------
        ANVH ISTAKELWEKLEALYQAKGISNRLYLKEQFHTL+MEE                                                          
Subjt:  ANVHGISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRMEE----------------------------------------------------------

Query:  ---------EERRLKSEGRTSHEDSALVARNW-KKKDSVQKKACCWGCGQSGHMKKDCPNR
                 EERRLKSEGRTSHEDSALV  NW KKKDSVQKKACCWGCGQSGHMKKDCPNR
Subjt:  ---------EERRLKSEGRTSHEDSALVARNW-KKKDSVQKKACCWGCGQSGHMKKDCPNR

TrEMBL top hits (e-value, % identity, alignment)
A0A6A2Y9V1 CCHC-type domain-containing protein (e-value: 2.7e-46; identity: 56.72%)
Query:  KIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQ---SEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNILANVHG
        + D++KFDGRINFGLWQVQVKD+LIQSGL+KALKG+    SEG       D         SS    KS MS+E+WEE+D+RAAS IR  LAKN+LANV  
Subjt:  KIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQ---SEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNILANVHG

Query:  ISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRMEE--------EERRLKSEGRTSHEDSAL-VARNWKKKDSVQKKACCWGCGQSGHMKKDCPNRAGS
         S+  ELWEKLE +YQAK +SNRLYLKE+FH L+MEE        EERRLK+    S E  AL V  N KK    +KK  CWGCGQ GH+KKDC N   +
Subjt:  ISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRMEE--------EERRLKSEGRTSHEDSAL-VARNWKKKDSVQKKACCWGCGQSGHMKKDCPNRAGS

Query:  S
        S
Subjt:  S

A0A6A2YS90 Transcription initiation factor IIA subunit 2 (e-value: 1.2e-43; identity: 43.97%)
Query:  KIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQ---SEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNILANVHG
        + D++KFDGRINFGLWQVQVKD+LIQSGL+KALKG+    SEG       D         SS    KS MS+E+WEE+D+RAAS IR  LAKN+LANV  
Subjt:  KIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQ---SEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNILANVHG

Query:  ISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRMEE---------------------------------------------------------------
         S+ KELWEKLE +YQAK +SNRLYLKE+FH L+MEE                                                               
Subjt:  ISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRMEE---------------------------------------------------------------

Query:  ----EERRLKSEGRTSHEDSAL-VARNWKKKDSVQKKACCWGCGQSGHMKKDCPN-RAGSSKGFGWDADNVSLIRGDDDQFL
            EERRLK+    S E  AL V  N KK    +KK  CWGCGQ GH+KKDC N  A S+ G   DA NV +   +DD+F+
Subjt:  ----EERRLKSEGRTSHEDSAL-VARNWKKKDSVQKKACCWGCGQSGHMKKDCPN-RAGSSKGFGWDADNVSLIRGDDDQFL

A0A6A3BK59 CCHC-type domain-containing protein (e-value: 1.2e-43; identity: 43.97%)
Query:  KIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQ---SEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNILANVHG
        + D++KFDGRINFGLWQVQVKD+LIQSGL+KALKG+    SEG       D         SS    KS MS+E+WEE+D+RAAS IR  LAKN+LANV  
Subjt:  KIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQ---SEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNILANVHG

Query:  ISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRMEE---------------------------------------------------------------
         S+ KELWEKLE +YQAK +SNRLYLKE+FH L+MEE                                                               
Subjt:  ISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRMEE---------------------------------------------------------------

Query:  ----EERRLKSEGRTSHEDSAL-VARNWKKKDSVQKKACCWGCGQSGHMKKDCPN-RAGSSKGFGWDADNVSLIRGDDDQFL
            EERRLK+    S E  AL V  N KK    +KK  CWGCGQ GH+KKDC N  A S+ G   DA NV +   +DD+F+
Subjt:  ----EERRLKSEGRTSHEDSAL-VARNWKKKDSVQKKACCWGCGQSGHMKKDCPN-RAGSSKGFGWDADNVSLIRGDDDQFL

A0A6A3CWI3 CCHC-type domain-containing protein (e-value: 6.2e-43; identity: 43.62%)
Query:  KIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQ---SEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNILANVHG
        + D++KFDGRINFGLWQVQVKD+LIQSGL+KALKG+    SEG       D         SS    KS MS+E+WEE+D+RAAS IR  LAKN+LANV  
Subjt:  KIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQ---SEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNILANVHG

Query:  ISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRMEE---------------------------------------------------------------
         S+ KELWEKLE +YQAK +SNRLYLKE+FH L+MEE                                                               
Subjt:  ISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRMEE---------------------------------------------------------------

Query:  ----EERRLKSEGRTSHEDSAL-VARNWKKKDSVQKKACCWGCGQSGHMKKDCPN-RAGSSKGFGWDADNVSLIRGDDDQFL
            EERRLK+    S E  AL V  N KK    +KK  CWGCGQ GH+KKDC N  A  + G   DA NV +   +DD+F+
Subjt:  ----EERRLKSEGRTSHEDSAL-VARNWKKKDSVQKKACCWGCGQSGHMKKDCPN-RAGSSKGFGWDADNVSLIRGDDDQFL

A0A6J1CG82 uncharacterized protein LOC111010521 (e-value: 4.4e-81; identity: 68.2%)
Query:  MSFFMSPVKIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQSEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNIL
        MSFFMSPVKID++KFDG INFGLWQVQVKDVLIQS LHKALKGR SEGAS+KLS DGG M+ SGGSSRGSKKSSMS EDWEEMDLRAASAIRTSLAKNIL
Subjt:  MSFFMSPVKIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQSEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNIL

Query:  ANVHGISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRMEE----------------------------------------------------------
        ANVH ISTAKELWEKLEALYQAKGISNRLYLKEQFHTL+MEE                                                          
Subjt:  ANVHGISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRMEE----------------------------------------------------------

Query:  ---------EERRLKSEGRTSHEDSALVARNW-KKKDSVQKKACCWGCGQSGHMKKDCPNR
                 EERRLKSEGRTSHEDSALV  NW KKKDSVQKKACCWGCGQSGHMKKDCPNR
Subjt:  ---------EERRLKSEGRTSHEDSALVARNW-KKKDSVQKKACCWGCGQSGHMKKDCPNR

SwissProt top hits (e-value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 3.6e-16; identity: 25%)
Query:  MSPVKIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQSEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNILANVH
        MS VK ++ KF+G   F  WQ +++D+LIQ GLHK L                             K  +M  EDW ++D RAASAIR  L+ +++ N+ 
Subjt:  MSPVKIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQSEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNILANVH

Query:  GISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRM---------------------------EEEERRL------------------------------
           TA+ +W +LE+LY +K ++N+LYLK+Q + L M                           EEE++ +                              
Subjt:  GISTAKELWEKLEALYQAKGISNRLYLKEQFHTLRM---------------------------EEEERRL------------------------------

Query:  -------KSEGRTSHEDSALV-------------------ARNWKKKDSVQKKACCWGCGQSGHMKKDCPN---RAGSSKGFGWDADNVSLIRGDDDQFL
               K   +  ++  AL+                   AR   K  S  +   C+ C Q GH K+DCPN     G + G   D +  ++++ +D+  L
Subjt:  -------KSEGRTSHEDSALV-------------------ARNWKKKDSVQKKACCWGCGQSGHMKKDCPN---RAGSSKGFGWDADNVSLIRGDDDQFL
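In the alignment blocks above, the unlabeled middle row follows the usual BLAST pairwise convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. A minimal sketch of estimating percent identity from one such row follows. The function name `midline_identity` is hypothetical, and dividing by the number of alignment columns is an assumption; the percentages reported on this page may be computed over a different denominator.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter."""
    if not query:
        raise ValueError("empty alignment row")
    # Trailing spaces are often lost in plain text, so pad to the query length.
    midline = midline.ljust(len(query))[: len(query)]
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Toy rows (not taken from the hits above): 4 of 5 columns are identical.
print(midline_identity("MSPVK", "MS+VK"))
```

To apply this to a real hit, the `Query:` prefix and the matching leading spaces of the midline would first need to be removed, and the rows of each block concatenated.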

Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCATTCTTTATGAGTCCAGTGAAGATTGACATGGATAAATTTGACGGAAGGATCAACTTTGGCTTGTGGCAAGTGCAAGTCAAGGATGTGCTGATACAATCTGGGTT
ACACAAGGCATTGAAGGGAAGACAGAGTGAAGGTGCTTCGAAAAAGCTAAGCGGTGATGGTGGTTCAATGAAGTTCAGTGGTGGTTCCAGCAGAGGTTCTAAGAAGTCTA
GCATGAGTGATGAAGATTGGGAGGAAATGGATTTGAGAGCTGCAAGCGCGATACGAACAAGTTTGGCTAAGAATATTCTTGCGAATGTGCATGGAATTTCGACAGCCAAA
GAACTTTGGGAGAAGCTCGAAGCGTTGTATCAGGCAAAGGGTATCTCAAATCGGCTGTACCTGAAGGAGCAGTTTCACACGCTGCGAATGGAGGAAGAGGAAAGAAGGCT
GAAGAGTGAAGGGCGTACTTCACATGAAGATTCGGCACTGGTAGCTCGCAATTGGAAGAAGAAAGACTCCGTACAAAAGAAAGCTTGTTGCTGGGGATGCGGACAGTCTG
GACACATGAAGAAAGATTGTCCCAACAGAGCCGGTTCGTCAAAGGGCTTTGGGTGGGATGCTGACAATGTTTCTCTCATCAGAGGAGACGATGATCAGTTCCTTTGA
mRNA sequence
ATGTCATTCTTTATGAGTCCAGTGAAGATTGACATGGATAAATTTGACGGAAGGATCAACTTTGGCTTGTGGCAAGTGCAAGTCAAGGATGTGCTGATACAATCTGGGTT
ACACAAGGCATTGAAGGGAAGACAGAGTGAAGGTGCTTCGAAAAAGCTAAGCGGTGATGGTGGTTCAATGAAGTTCAGTGGTGGTTCCAGCAGAGGTTCTAAGAAGTCTA
GCATGAGTGATGAAGATTGGGAGGAAATGGATTTGAGAGCTGCAAGCGCGATACGAACAAGTTTGGCTAAGAATATTCTTGCGAATGTGCATGGAATTTCGACAGCCAAA
GAACTTTGGGAGAAGCTCGAAGCGTTGTATCAGGCAAAGGGTATCTCAAATCGGCTGTACCTGAAGGAGCAGTTTCACACGCTGCGAATGGAGGAAGAGGAAAGAAGGCT
GAAGAGTGAAGGGCGTACTTCACATGAAGATTCGGCACTGGTAGCTCGCAATTGGAAGAAGAAAGACTCCGTACAAAAGAAAGCTTGTTGCTGGGGATGCGGACAGTCTG
GACACATGAAGAAAGATTGTCCCAACAGAGCCGGTTCGTCAAAGGGCTTTGGGTGGGATGCTGACAATGTTTCTCTCATCAGAGGAGACGATGATCAGTTCCTTTGA
Protein sequence
MSFFMSPVKIDMDKFDGRINFGLWQVQVKDVLIQSGLHKALKGRQSEGASKKLSGDGGSMKFSGGSSRGSKKSSMSDEDWEEMDLRAASAIRTSLAKNILANVHGISTAK
ELWEKLEALYQAKGISNRLYLKEQFHTLRMEEEERRLKSEGRTSHEDSALVARNWKKKDSVQKKACCWGCGQSGHMKKDCPNRAGSSKGFGWDADNVSLIRGDDDQFL
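The CDS and protein sequences listed above can be cross-checked by translating one into the other. The sketch below assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical and not part of CuGenDB.

```python
BASES = "TCAG"
# Amino acids of the standard genetic code, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGTCATTCTTTATGAGTCCAGTG"))
```

The first eight codons give `MSFFMSPV`, which matches the start of the protein sequence. Passing the full CDS (line breaks included) allows the same comparison over the whole record.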