; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC08g2152 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC08g2152
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionzinc finger protein 830-like isoform X1
Genome locationMC08:30458510..30462414
RNA-Seq ExpressionMC08g2152
SyntenyMC08g2152
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR003604 - Matrin/U1-C-like, C2H2-type zinc finger
IPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158136.1 uncharacterized protein LOC111024695 [Momordica charantia]1.55e-23795.3Show/hide
Query:  MEFKFRASDLRPPPPPPRQYGSPLPPAVYCFSKQGFTD-----------------PCLRQNVTRKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEE
        MEFKFRASDLRPPPPPPRQYGSPLPPAVYCFSKQGFTD                 PCLRQNVTRKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEE
Subjt:  MEFKFRASDLRPPPPPPRQYGSPLPPAVYCFSKQGFTD-----------------PCLRQNVTRKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEE

Query:  ARRELMLAEQELAIRGAAQAAGYSFREQRWGTSFSAAAPVVRSHEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKPQRQLQVQAEKSELI
        ARRELMLAEQELAIRGAAQAAGYSFREQRWGTSFSAAAPVVRSHEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKPQRQLQVQAEKSELI
Subjt:  ARRELMLAEQELAIRGAAQAAGYSFREQRWGTSFSAAAPVVRSHEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKPQRQLQVQAEKSELI

Query:  ILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKMSGSAGA
        ILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKMSGSAGA
Subjt:  ILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKMSGSAGA

Query:  EKELKSSQCEKTGNKEFKFWCKICKVGAQATEVMEAHLNGKRHKARNLSAIAAAAVTPSMED
        EKELKSSQCEKTGNKEFKFWCKICKVGAQATEVMEAHLNGKRHKARNLSAIAAAAVTPSMED
Subjt:  EKELKSSQCEKTGNKEFKFWCKICKVGAQATEVMEAHLNGKRHKARNLSAIAAAAVTPSMED

XP_022937456.1 uncharacterized protein LOC111443862 [Cucurbita moschata]1.29e-10756.95Show/hide
Query:  MEFKFRASDLRPPPPPPR-QYGSPLPP-AVYCFSKQGFTDPCLRQNVT----RKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEEARRELMLAEQE
        MEF+FRA D RPPPP P  QY SP PP AV+C SKQGF+D CLR +      RKPFD +EAMHCE+EL  LR+EKLLVE+ERQ+FLKE+ARREL+L E+E
Subjt:  MEFKFRASDLRPPPPPPR-QYGSPLPP-AVYCFSKQGFTDPCLRQNVT----RKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEEARRELMLAEQE

Query:  LAIRGAAQAAGYSF-REQRWGTSFSA-AAP---------VVRSHEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKPQRQLQVQAEKSELI
        +AIRG      Y F R  RWG  FSA AAP         V  S+EW  +EQL+SSDR G  AV +P       PR+ P  ADDK  +RQ+ +Q ++ ELI
Subjt:  LAIRGAAQAAGYSF-REQRWGTSFSA-AAP---------VVRSHEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKPQRQLQVQAEKSELI

Query:  ILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKM------
        ILEKPDP++FREKRKAE P     DDVQP  VK+NPKDEW C LC+VTV S+ TFDQHL GKKH+RKEAGLRAQKASNV   AP P+  KRRK+      
Subjt:  ILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKM------

Query:  SGSAGAE----KELKSSQCEKTGN-------------------KEFKFWCKICKVGAQATEVMEAHLNGKRHKA
        SG   AE    K+ ++ QCEKTG+                   K+FKFWC+ CKVGA ATEVM  HLNGK+HKA
Subjt:  SGSAGAE----KELKSSQCEKTGN-------------------KEFKFWCKICKVGAQATEVMEAHLNGKRHKA

XP_022969583.1 uncharacterized protein LOC111468562 [Cucurbita maxima]2.05e-10957.1Show/hide
Query:  MEFKFRASDLRPPPPPPR-QYGSPLPPAVYCFSKQGFTDPCLRQNVT----RKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEEARRELMLAEQEL
        MEF+FRA D RPPPP P  QY SP PPAV+C SKQGF+D CLR +      RKPFD +E MHCE+ELMRLR+EKLLVE+ERQ+FLKE+ARREL+L E+E+
Subjt:  MEFKFRASDLRPPPPPPR-QYGSPLPPAVYCFSKQGFTDPCLRQNVT----RKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEEARRELMLAEQEL

Query:  AIRGAAQAAGYSF-REQRWGTSFSA-AAP---------VVRSHEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKPQRQLQVQAEKSELII
        AIRG      Y F R  RWG  FSA AAP         V  S+EW  +EQL+SSDR GF        P P PPR+ P  ADDK  +RQ+ +Q ++ ELII
Subjt:  AIRGAAQAAGYSF-REQRWGTSFSA-AAP---------VVRSHEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKPQRQLQVQAEKSELII

Query:  LEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKM------S
        LEKPDP +FREKRKAE P     +DVQP  VK+NPKDEW C LC+VTV S+ TFDQHL GKKH+RKEA LRAQKASNV   AP P+  KRRK+      S
Subjt:  LEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKM------S

Query:  GSAGAE----KELKSSQCEKTGN-------------------KEFKFWCKICKVGAQATEVMEAHLNGKRHKA
        G   AE    K+ ++ QCEKTG+                   K+FKFWC+ CKVGA ATEVM  HLNGK+HKA
Subjt:  GSAGAE----KELKSSQCEKTGN-------------------KEFKFWCKICKVGAQATEVMEAHLNGKRHKA

XP_023536654.1 uncharacterized protein LOC111797967 [Cucurbita pepo subsp. pepo]1.42e-11258.29Show/hide
Query:  MEFKFRASDLRPPPPPPR-QYGSPLPP-AVYCFSKQGFTDPCLRQNVT----RKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEEARRELMLAEQE
        MEF+FRA D RPPPP P  QY SP PP AVYC SKQGF+D CLR +      RKPFD +EAMHCE+ELMRLR+EKLLVE+ERQ+FLKE+ARREL+L E+E
Subjt:  MEFKFRASDLRPPPPPPR-QYGSPLPP-AVYCFSKQGFTDPCLRQNVT----RKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEEARRELMLAEQE

Query:  LAIRGAAQAAGYSF-REQRWGTSFSA-AAP---------VVRSHEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKPQRQLQVQAEKSELI
        +AIRG      Y F R  RWG  FSA AAP         V  S+EW  +EQL+SSDR GF AV       P PPR+ P   DDK  +RQ+ +Q ++ ELI
Subjt:  LAIRGAAQAAGYSF-REQRWGTSFSA-AAP---------VVRSHEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKPQRQLQVQAEKSELI

Query:  ILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKM------
        ILEKPDP +FREKRKAE P     DDVQP  VK+NPKDEW C LC+VTV S+ TFDQHL GKKH+RKEAGLRAQKASNV   AP P+  KRRK+      
Subjt:  ILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKM------

Query:  SGSAGAE----KELKSSQCEKTGN-------------------KEFKFWCKICKVGAQATEVMEAHLNGKRHKA
        SGS  AE    K+ ++ QCEKTG+                   K+FKFWC+ CKVGA ATEVM  HLNGK+HKA
Subjt:  SGSAGAE----KELKSSQCEKTGN-------------------KEFKFWCKICKVGAQATEVMEAHLNGKRHKA

XP_038890954.1 uncharacterized protein LOC120080381 isoform X1 [Benincasa hispida]1.22e-10352.51Show/hide
Query:  MEFKFRASDLRPPPPPPRQYGSPLPPAVYCFSKQGFTDPCLRQNVTRKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEEARRELMLAEQELAIRGA
        MEFKFRA D RPPPPPP+QY  P P AVYC S Q F +PCLR+  T    D N+ M  E+ELMRLREEKLL E+ERQRFLKEEARRELML E+E+AIRG 
Subjt:  MEFKFRASDLRPPPPPPRQYGSPLPPAVYCFSKQGFTDPCLRQNVTRKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEEARRELMLAEQELAIRGA

Query:  AQAAGYSFRE-QRWGTSFSAAAP--------------VVRS-HEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKPQRQLQV-QAEKSELI
        AQ AG+  R+ QRWG  FSAAA               VV+S HEW+ MEQ KSSDR GFRAVA   LPPPPPPR+ P   +DKK   +L+V +A K ELI
Subjt:  AQAAGYSFRE-QRWGTSFSAAAP--------------VVRS-HEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKPQRQLQV-QAEKSELI

Query:  ILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRK------M
        +LEKPDP +FREKRKA    +L  +D+Q S VKK  KDEWSC L +VT +++  F+QHL GKKHQRKEA LRAQK  N+S+AAP  +LKKRRK      M
Subjt:  ILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRK------M

Query:  SGSAGAEKELKSSQCE----------------------KTGNKE--------------------------FKFWCKICKVGAQATEVMEAHLNGKRHK
          SA    E K S+ E                      K G+KE                          F FWC+ CKVGA  T+VM AH+NGK+H+
Subjt:  SGSAGAEKELKSSQCE----------------------KTGNKE--------------------------FKFWCKICKVGAQATEVMEAHLNGKRHK

TrEMBL top hitse value%identityAlignment
A0A0A0LJ38 Uncharacterized protein3.37e-8445.58Show/hide
Query:  MEFKFRASDLR--PPPPPPRQ--YGSPLPP----------AVYCFSKQGFTD------PCLRQNVT----RKPFDINEAMHCEIELMRLREEKLLVEVER
        MEFKFR  DLR  PPPPPP Q  +  P+PP          AVYC S+QGF D         RQ       R+PF++NE MH E+E MRLREEKL+ E+ER
Subjt:  MEFKFRASDLR--PPPPPPRQ--YGSPLPP----------AVYCFSKQGFTD------PCLRQNVT----RKPFDINEAMHCEIELMRLREEKLLVEVER

Query:  QRFLKEEARRELMLAEQELAIRGAAQAA-GYSFRE-QRW---------GTSFSAAAP---------VVRS-HEWQHMEQLKSSDRRGFRAVAVPLLPPPP
        +RFLKEEARREL L E+E+AIRG  Q+A GY F++ QRW         G   SA A          VV+S HEWQ+MEQ+K+SDR GF AVA+       
Subjt:  QRFLKEEARRELMLAEQELAIRGAAQAA-GYSFRE-QRW---------GTSFSAAAP---------VVRS-HEWQHMEQLKSSDRRGFRAVAVPLLPPPP

Query:  PPRLPPFTADDKKPQRQLQVQAEKSELIILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLR
         PR+ P   +DKK        A + +LI+LEKP P  FRE+RKAE   +  I  + PS VKK  KDEWSCALC+VT + E++F+ HL+GKKH+RKEA LR
Subjt:  PPRLPPFTADDKKPQRQLQVQAEKSELIILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLR

Query:  AQKASNVSQAAPMPVLKKRRKM------SGSAGAE-KELKSSQCE-------------------KTGNKE-------------------FKFWCKICKVG
        A+K S VS+ A  P+ KKRRK+      +   GAE KE K  + +                   K  NK+                   F FWC+ CKVG
Subjt:  AQKASNVSQAAPMPVLKKRRKM------SGSAGAE-KELKSSQCE-------------------KTGNKE-------------------FKFWCKICKVG

Query:  AQATEVMEAHLNGKRHKAR
        A  T+VM AH+NGK+H+A+
Subjt:  AQATEVMEAHLNGKRHKAR

A0A6J1E048 uncharacterized protein LOC1110246957.48e-23895.3Show/hide
Query:  MEFKFRASDLRPPPPPPRQYGSPLPPAVYCFSKQGFTD-----------------PCLRQNVTRKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEE
        MEFKFRASDLRPPPPPPRQYGSPLPPAVYCFSKQGFTD                 PCLRQNVTRKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEE
Subjt:  MEFKFRASDLRPPPPPPRQYGSPLPPAVYCFSKQGFTD-----------------PCLRQNVTRKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEE

Query:  ARRELMLAEQELAIRGAAQAAGYSFREQRWGTSFSAAAPVVRSHEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKPQRQLQVQAEKSELI
        ARRELMLAEQELAIRGAAQAAGYSFREQRWGTSFSAAAPVVRSHEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKPQRQLQVQAEKSELI
Subjt:  ARRELMLAEQELAIRGAAQAAGYSFREQRWGTSFSAAAPVVRSHEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKPQRQLQVQAEKSELI

Query:  ILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKMSGSAGA
        ILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKMSGSAGA
Subjt:  ILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKMSGSAGA

Query:  EKELKSSQCEKTGNKEFKFWCKICKVGAQATEVMEAHLNGKRHKARNLSAIAAAAVTPSMED
        EKELKSSQCEKTGNKEFKFWCKICKVGAQATEVMEAHLNGKRHKARNLSAIAAAAVTPSMED
Subjt:  EKELKSSQCEKTGNKEFKFWCKICKVGAQATEVMEAHLNGKRHKARNLSAIAAAAVTPSMED

A0A6J1FAE1 uncharacterized protein LOC1114438626.25e-10856.95Show/hide
Query:  MEFKFRASDLRPPPPPPR-QYGSPLPP-AVYCFSKQGFTDPCLRQNVT----RKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEEARRELMLAEQE
        MEF+FRA D RPPPP P  QY SP PP AV+C SKQGF+D CLR +      RKPFD +EAMHCE+EL  LR+EKLLVE+ERQ+FLKE+ARREL+L E+E
Subjt:  MEFKFRASDLRPPPPPPR-QYGSPLPP-AVYCFSKQGFTDPCLRQNVT----RKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEEARRELMLAEQE

Query:  LAIRGAAQAAGYSF-REQRWGTSFSA-AAP---------VVRSHEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKPQRQLQVQAEKSELI
        +AIRG      Y F R  RWG  FSA AAP         V  S+EW  +EQL+SSDR G  AV +P       PR+ P  ADDK  +RQ+ +Q ++ ELI
Subjt:  LAIRGAAQAAGYSF-REQRWGTSFSA-AAP---------VVRSHEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKPQRQLQVQAEKSELI

Query:  ILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKM------
        ILEKPDP++FREKRKAE P     DDVQP  VK+NPKDEW C LC+VTV S+ TFDQHL GKKH+RKEAGLRAQKASNV   AP P+  KRRK+      
Subjt:  ILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKM------

Query:  SGSAGAE----KELKSSQCEKTGN-------------------KEFKFWCKICKVGAQATEVMEAHLNGKRHKA
        SG   AE    K+ ++ QCEKTG+                   K+FKFWC+ CKVGA ATEVM  HLNGK+HKA
Subjt:  SGSAGAE----KELKSSQCEKTGN-------------------KEFKFWCKICKVGAQATEVMEAHLNGKRHKA

A0A6J1I1E4 uncharacterized protein LOC1114685629.94e-11057.1Show/hide
Query:  MEFKFRASDLRPPPPPPR-QYGSPLPPAVYCFSKQGFTDPCLRQNVT----RKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEEARRELMLAEQEL
        MEF+FRA D RPPPP P  QY SP PPAV+C SKQGF+D CLR +      RKPFD +E MHCE+ELMRLR+EKLLVE+ERQ+FLKE+ARREL+L E+E+
Subjt:  MEFKFRASDLRPPPPPPR-QYGSPLPPAVYCFSKQGFTDPCLRQNVT----RKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEEARRELMLAEQEL

Query:  AIRGAAQAAGYSF-REQRWGTSFSA-AAP---------VVRSHEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKPQRQLQVQAEKSELII
        AIRG      Y F R  RWG  FSA AAP         V  S+EW  +EQL+SSDR GF        P P PPR+ P  ADDK  +RQ+ +Q ++ ELII
Subjt:  AIRGAAQAAGYSF-REQRWGTSFSA-AAP---------VVRSHEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKPQRQLQVQAEKSELII

Query:  LEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKM------S
        LEKPDP +FREKRKAE P     +DVQP  VK+NPKDEW C LC+VTV S+ TFDQHL GKKH+RKEA LRAQKASNV   AP P+  KRRK+      S
Subjt:  LEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKM------S

Query:  GSAGAE----KELKSSQCEKTGN-------------------KEFKFWCKICKVGAQATEVMEAHLNGKRHKA
        G   AE    K+ ++ QCEKTG+                   K+FKFWC+ CKVGA ATEVM  HLNGK+HKA
Subjt:  GSAGAE----KELKSSQCEKTGN-------------------KEFKFWCKICKVGAQATEVMEAHLNGKRHKA

A0A6J1KKD2 myb-like protein D isoform X52.70e-7450.61Show/hide
Query:  MEFKFRASDLRPPPPP-----------------PRQYGSPLPPAVYCFSKQGFTDPCLRQNVT----RKPFDINEAMHCEIELMRLREEKLLVEVERQRF
        MEFKFRA D R PPPP                 P Q   P+PP  +C  KQGF D  L +       R PFD NE MHCE+ELMRLR+EKL++E+ERQRF
Subjt:  MEFKFRASDLRPPPPP-----------------PRQYGSPLPPAVYCFSKQGFTDPCLRQNVT----RKPFDINEAMHCEIELMRLREEKLLVEVERQRF

Query:  LKEEARRELMLAEQELAIRGAAQAAGYSFRE-QRWGTSFSAAAP--------VVRS-HEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKP
        L+E+ARRELML E+E AI   AQ+ GY  R+ Q WG  F+AA P        +V+S  EWQ M++ KSS+  GF   A+P       PR+ PF A+  + 
Subjt:  LKEEARRELMLAEQELAIRGAAQAAGYSFRE-QRWGTSFSAAAP--------VVRS-HEWQHMEQLKSSDRRGFRAVAVPLLPPPPPPRLPPFTADDKKP

Query:  QRQLQVQAEKSELIILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPM-
        +RQ+ VQ ++++LI+LEKPDP +FREKRKAE   +   D+VQ S +KK PK E SCALC+VTV++E+ F++HL GKKH+R+EAGLRAQ AS + QAAP  
Subjt:  QRQLQVQAEKSELIILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPM-

Query:  PVLKKRRKMS---GSAGAEKELKSSQ
        P+  KR K+    GSAGA  ELK S+
Subjt:  PVLKKRRKMS---GSAGAEKELKSSQ

SwissProt top hitse value%identityAlignment
O64571 UBP1-associated proteins 1C2.1e-0425Show/hide
Query:  EWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKMSGSAGAEKELKSSQCEKT----GNKEFKFWCKICKVGAQATEVM
        +W C+LC  T++ E+ +  H+ GKKHQ K            ++ A M   K++   S     +K   + Q +       + ++ ++C +C + A + + +
Subjt:  EWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKMSGSAGAEKELKSSQCEKT----GNKEFKFWCKICKVGAQATEVM

Query:  EAHLNGKRHKAR
         AH NGK+H+ +
Subjt:  EAHLNGKRHKAR

O88532 Zinc finger RNA-binding protein6.0e-0422.03Show/hide
Query:  VQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKMSGSAGAEKELKSSQCEKTGNKEFKFWCKICKV
        ++P    K P+  + C +C+++ +  +T+ +HL+G+KH++KEA L+A + ++ S              + + G + +L+               C++C V
Subjt:  VQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKMSGSAGAEKELKSSQCEKTGNKEFKFWCKICKV

Query:  GAQATEVMEAHLNGKRHK
             +   AH+ G +H+
Subjt:  GAQATEVMEAHLNGKRHK

Q5REX3 Zinc finger RNA-binding protein6.0e-0422.03Show/hide
Query:  VQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKMSGSAGAEKELKSSQCEKTGNKEFKFWCKICKV
        ++P    K P+  + C +C+++ +  +T+ +HL+G+KH++KEA L+A + ++ S              + + G + +L+               C++C V
Subjt:  VQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKMSGSAGAEKELKSSQCEKTGNKEFKFWCKICKV

Query:  GAQATEVMEAHLNGKRHK
             +   AH+ G +H+
Subjt:  GAQATEVMEAHLNGKRHK

Q6PCR6 Zinc finger RNA-binding protein1.6e-0422.88Show/hide
Query:  VQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKMSGSAGAEKELKSSQCEKTGNKEFKFWCKICKV
        ++P    K P+  + C +C+++ +  +T+ +HL+G+KH++KEA L+  ++S+ S              S + G + +L+               C++C V
Subjt:  VQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKMSGSAGAEKELKSSQCEKTGNKEFKFWCKICKV

Query:  GAQATEVMEAHLNGKRHK
             +   AH+ G +H+
Subjt:  GAQATEVMEAHLNGKRHK

Q96KR1 Zinc finger RNA-binding protein2.7e-0422.88Show/hide
Query:  VQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKMSGSAGAEKELKSSQCEKTGNKEFKFWCKICKV
        ++P    K P+  + C +C+++ +  +T+ +HL+G+KH++KEA L+A + ++ S              S + G + +L+               C++C V
Subjt:  VQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKMSGSAGAEKELKSSQCEKTGNKEFKFWCKICKV

Query:  GAQATEVMEAHLNGKRHK
             +   AH+ G +H+
Subjt:  GAQATEVMEAHLNGKRHK

Arabidopsis top hitse value%identityAlignment
AT2G19380.1 RNA recognition motif (RRM)-containing protein1.5e-0525Show/hide
Query:  EWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKMSGSAGAEKELKSSQCEKT----GNKEFKFWCKICKVGAQATEVM
        +W C+LC  T++ E+ +  H+ GKKHQ K            ++ A M   K++   S     +K   + Q +       + ++ ++C +C + A + + +
Subjt:  EWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQAAPMPVLKKRRKMSGSAGAEKELKSSQCEKT----GNKEFKFWCKICKVGAQATEVM

Query:  EAHLNGKRHKAR
         AH NGK+H+ +
Subjt:  EAHLNGKRHKAR

AT2G24030.1 zinc ion binding;nucleic acid binding1.1e-0524.15Show/hide
Query:  MEFKFRASDLRPPPPPPRQYGSP-LPPAVYCFSKQGFTDPCLRQNVTRKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEEARRELMLAEQELAIRG
        MEF++RA D   PPP      SP L P     S++            +    + EA+  EIE  ++R+E ++ E  R+R L  E  +E M  E+E+AIR 
Subjt:  MEFKFRASDLRPPPPPPRQYGSP-LPPAVYCFSKQGFTDPCLRQNVTRKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEEARRELMLAEQELAIRG

Query:  AAQAAGYSFREQRWGTSFSAAAPVVRSHEWQHMEQLKSS------DRRGFRAVAV-PLLPPPPPPRLPPFTADDKKPQRQLQVQAEKSELIILEKPDPEL
         +     S  E+          P    +++ +  + K S      +   + ++   P++  P   ++   T           +++ K  LI+L + D  +
Subjt:  AAQAAGYSFREQRWGTSFSAAAPVVRSHEWQHMEQLKSS------DRRGFRAVAV-PLLPPPPPPRLPPFTADDKKPQRQLQVQAEKSELIILEKPDPEL

Query:  FREKRKAEA------PLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERT-------FDQHLQGKKHQRKEAGLRAQKASN---VSQAAPMPVLKKRRK
           K KA++      P    + +   + V ++ K+++        V  +R         ++ LQ K+ + KE+  +A        VS   P        K
Subjt:  FREKRKAEA------PLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERT-------FDQHLQGKKHQRKEAGLRAQKASN---VSQAAPMPVLKKRRK

Query:  MSGSAGAEKELKSSQCEKTGNKEFKFWCKICKVGAQATEVMEAHLNGKRHKA
        +     AE +L+++Q         KFWC+ICKVG     VM  H  GK+HKA
Subjt:  MSGSAGAEKELKSSQCEKTGNKEFKFWCKICKVGAQATEVMEAHLNGKRHKA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CAACTCCGATTTCTCTCACGCGCCAACAATGGCGGTGGCTTCACTTCTTTCCTTTTCTCTTTAATCTTTCAAAATCTAACTATTCCTTCCGTATCTCCTGATCTTCTTAG
CCCCTCCCTCTCGCCGCTCGCCGGAAGCTATAATCGGAACCGGCGGCTTCATTCTGACATTCTGCGTTTCTTCCTCCGCCGTGATCGGATGGAGTTTAAGTTCCGGGCCA
GCGATCTCCGACCGCCGCCTCCGCCGCCGCGGCAGTATGGTTCTCCTTTGCCGCCGGCCGTCTACTGCTTCTCCAAGCAAGGCTTTACAGATCCCTGTCTCAGACAAAAC
GTAACGAGGAAGCCGTTCGATATCAATGAGGCGATGCACTGCGAGATCGAGTTGATGCGATTGAGGGAAGAGAAATTGCTGGTGGAAGTTGAGAGGCAGCGATTTCTGAA
AGAGGAAGCGAGGAGAGAACTGATGTTGGCCGAACAAGAGCTGGCGATTCGAGGAGCTGCGCAGGCGGCGGGCTATTCTTTCCGTGAGCAGCGATGGGGAACGTCGTTCA
GCGCCGCGGCGCCGGTGGTACGATCGCATGAATGGCAGCATATGGAGCAACTGAAAAGTTCCGACCGCCGCGGGTTTCGTGCAGTAGCCGTACCCTTACTCCCGCCGCCG
CCTCCGCCGCGGCTTCCGCCGTTTACGGCCGATGACAAGAAGCCGCAGCGCCAACTTCAAGTACAGGCTGAAAAAAGTGAACTGATTATACTGGAGAAACCTGACCCAGA
ACTATTTAGGGAAAAGCGGAAGGCCGAGGCGCCATTGGCATTGGACATTGACGACGTTCAACCTTCGGGTGTGAAGAAAAATCCCAAGGATGAGTGGAGCTGTGCGCTTT
GTCGAGTTACCGTTTCGAGTGAAAGGACTTTCGATCAACACCTTCAGGGCAAGAAGCACCAGCGCAAAGAAGCAGGGTTGAGAGCCCAGAAGGCGAGCAACGTTTCGCAA
GCTGCACCCATGCCAGTACTGAAGAAACGGAGAAAGATGAGTGGCTCTGCAGGTGCAGAAAAGGAGCTGAAATCATCTCAATGTGAGAAAACTGGAAATAAGGAGTTTAA
GTTTTGGTGTAAAATATGTAAAGTTGGTGCTCAAGCTACAGAGGTGATGGAGGCTCATCTGAATGGGAAGAGGCACAAGGCAAGAAATTTGTCAGCCATTGCAGCTGCAG
CAGTTACACCATCAATGGAAGATTAA
mRNA sequenceShow/hide mRNA sequence
CCAACTCCGATTTCTCTCACGCGCCAACAATGGCGGTGGCTTCACTTCTTTCCTTTTCTCTTTAATCTTTCAAAATCTAACTATTCCTTCCGTATCTCCTGATCTTCTTA
GCCCCTCCCTCTCGCCGCTCGCCGGAAGCTATAATCGGAACCGGCGGCTTCATTCTGACATTCTGCGTTTCTTCCTCCGCCGTGATCGGATGGAGTTTAAGTTCCGGGCC
AGCGATCTCCGACCGCCGCCTCCGCCGCCGCGGCAGTATGGTTCTCCTTTGCCGCCGGCCGTCTACTGCTTCTCCAAGCAAGGCTTTACAGATCCCTGTCTCAGACAAAA
CGTAACGAGGAAGCCGTTCGATATCAATGAGGCGATGCACTGCGAGATCGAGTTGATGCGATTGAGGGAAGAGAAATTGCTGGTGGAAGTTGAGAGGCAGCGATTTCTGA
AAGAGGAAGCGAGGAGAGAACTGATGTTGGCCGAACAAGAGCTGGCGATTCGAGGAGCTGCGCAGGCGGCGGGCTATTCTTTCCGTGAGCAGCGATGGGGAACGTCGTTC
AGCGCCGCGGCGCCGGTGGTACGATCGCATGAATGGCAGCATATGGAGCAACTGAAAAGTTCCGACCGCCGCGGGTTTCGTGCAGTAGCCGTACCCTTACTCCCGCCGCC
GCCTCCGCCGCGGCTTCCGCCGTTTACGGCCGATGACAAGAAGCCGCAGCGCCAACTTCAAGTACAGGCTGAAAAAAGTGAACTGATTATACTGGAGAAACCTGACCCAG
AACTATTTAGGGAAAAGCGGAAGGCCGAGGCGCCATTGGCATTGGACATTGACGACGTTCAACCTTCGGGTGTGAAGAAAAATCCCAAGGATGAGTGGAGCTGTGCGCTT
TGTCGAGTTACCGTTTCGAGTGAAAGGACTTTCGATCAACACCTTCAGGGCAAGAAGCACCAGCGCAAAGAAGCAGGGTTGAGAGCCCAGAAGGCGAGCAACGTTTCGCA
AGCTGCACCCATGCCAGTACTGAAGAAACGGAGAAAGATGAGTGGCTCTGCAGGTGCAGAAAAGGAGCTGAAATCATCTCAATGTGAGAAAACTGGAAATAAGGAGTTTA
AGTTTTGGTGTAAAATATGTAAAGTTGGTGCTCAAGCTACAGAGGTGATGGAGGCTCATCTGAATGGGAAGAGGCACAAGGCAAGAAATTTGTCAGCCATTGCAGCTGCA
GCAGTTACACCATCAATGGAAGATTAAGAAGAAAGCTCAGAATGCTGTTGATGTAGATACTGGACAGACGGGGAACGATCAACGGGCAAATGTCGCTCTTCGATCAGGCG
AATATAAAGGTGTGATCCCTCGGAGGAGATTAGCGACTAATTTTAGTAATTCGACATTCTTGAATAAAGTGGAT
Protein sequenceShow/hide protein sequence
QLRFLSRANNGGGFTSFLFSLIFQNLTIPSVSPDLLSPSLSPLAGSYNRNRRLHSDILRFFLRRDRMEFKFRASDLRPPPPPPRQYGSPLPPAVYCFSKQGFTDPCLRQN
VTRKPFDINEAMHCEIELMRLREEKLLVEVERQRFLKEEARRELMLAEQELAIRGAAQAAGYSFREQRWGTSFSAAAPVVRSHEWQHMEQLKSSDRRGFRAVAVPLLPPP
PPPRLPPFTADDKKPQRQLQVQAEKSELIILEKPDPELFREKRKAEAPLALDIDDVQPSGVKKNPKDEWSCALCRVTVSSERTFDQHLQGKKHQRKEAGLRAQKASNVSQ
AAPMPVLKKRRKMSGSAGAEKELKSSQCEKTGNKEFKFWCKICKVGAQATEVMEAHLNGKRHKARNLSAIAAAAVTPSMED