CuGenDBv2

CmoCh02G017040 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh02G017040
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: G-patch domain-containing protein
Genome location: Cmo_Chr02:9757835..9765291
RNA-Seq Expression: CmoCh02G017040
Synteny: CmoCh02G017040
Gene Ontology terms:
  GO:0005634 - nucleus (cellular component)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR000467 - G-patch domain
  IPR005824 - KOW
  IPR026822 - Spp2/MOS2, G-patch domain
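
The genome location field above packs the chromosome name and both coordinates into a single string. The following is a minimal sketch of how such a string might be split and the locus length computed. The helper names `parse_location` and `span_length` are hypothetical, and the length formula assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts.

    The 'chrom:start..end' layout is taken from the record above;
    the function name is illustrative, not part of any database API.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Cmo_Chr02:9757835..9765291")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 7457 bp, which is longer than the mRNA listed further down; any difference would be intronic sequence.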


Homology
GenBank top hits (columns: e-value, % identity, alignment)
KAE8650865.1 hypothetical protein Csa_000747 [Cucumis sativus]; e-value 1.1e-164; identity 75.24%
Query:  DSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPA
        DSK+YVNEFDASK LSET GKSRN+VIP+++NEWRPL RMKNLE PL QSDES LKFE+ASGLD  DDSKMSYGLNVRQSVD +K +DESKS EEPPRPA
Subjt:  DSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPA

Query:  PLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRD
        PLEVIMLEKFKADL+RLPEDRGFEDFE+VPVESF +ALM  YGW++G+GIGRNA+EDVKV+E++ RTDKQGLGFV D+P  +  KEEEKD GR R R RD
Subjt:  PLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRD

Query:  GARVKENRDRESNGLA-IGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDE---VKFRTKSKARDG--------------------------EEVER
          RVKENRDRES+GLA IGKHVRIV GRDAG KG+++EKLD  WLVLKLS RDE   +K R    A  G                           EVE+
Subjt:  GARVKENRDRESNGLA-IGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDE---VKFRTKSKARDG--------------------------EEVER

Query:  VEEKRENGQRDEEKRT-RLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSL
        V EKRENG RD+EKRT RL WLTSHIRVRIISK+FKGGKFYLKKGEIVDVVGP+ICD+SID SRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSL
Subjt:  VEEKRENGQRDEEKRT-RLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSL

Query:  VERDLDKETGVVRDADSHEL
        VERDLDKETGVVRDADSHEL
Subjt:  VERDLDKETGVVRDADSHEL

KAG7036234.1 Protein MOS2 [Cucurbita argyrosperma subsp. argyrosperma]; e-value 2.1e-229; identity 83.59%
Query:  GDSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRP
        GDSKEYVNEFDASKSLSETRGKS NVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDS MSYGLNVRQSVDDVKSADESKSAEEPPRP
Subjt:  GDSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRP

Query:  APLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNR
        APLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFN RTDKQGLGFVGD+PASLPNKEEEKDNGRARGRNR
Subjt:  APLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNR

Query:  DGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDEVKF--------------RTKSKARDGEEVERVEEKRENGQRDEEKR
        DGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDEVK               RTKSKARDGEEVE VEEKRENGQRDEEKR
Subjt:  DGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDEVKF--------------RTKSKARDGEEVERVEEKRENGQRDEEKR

Query:  TRLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDAD
        TRL WLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDAD
Subjt:  TRLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDAD

Query:  SHELY--------------------DAGPVLVHTVCTPEFGIKTMVTRSCDRLENPGYEYADSCNELGCEGLWEISAISPAGDTGSMERVTSTEDSSQFL
        SHEL                     DAGPVLVHT                       Y Y +  +      +  +   SPAGDTGSMERVTSTEDSSQFL
Subjt:  SHELY--------------------DAGPVLVHTVCTPEFGIKTMVTRSCDRLENPGYEYADSCNELGCEGLWEISAISPAGDTGSMERVTSTEDSSQFL

Query:  EFLRCLRFGCSDQWRSPI
        EFLRCLRFGCSDQWR PI
Subjt:  EFLRCLRFGCSDQWRSPI

XP_004144463.3 protein MOS2 [Cucumis sativus]; e-value 1.1e-164; identity 75.24%
Query:  DSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPA
        DSK+YVNEFDASK LSET GKSRN+VIP+++NEWRPL RMKNLE PL QSDES LKFE+ASGLD  DDSKMSYGLNVRQSVD +K +DESKS EEPPRPA
Subjt:  DSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPA

Query:  PLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRD
        PLEVIMLEKFKADL+RLPEDRGFEDFE+VPVESF +ALM  YGW++G+GIGRNA+EDVKV+E++ RTDKQGLGFV D+P  +  KEEEKD GR R R RD
Subjt:  PLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRD

Query:  GARVKENRDRESNGLA-IGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDE---VKFRTKSKARDG--------------------------EEVER
          RVKENRDRES+GLA IGKHVRIV GRDAG KG+++EKLD  WLVLKLS RDE   +K R    A  G                           EVE+
Subjt:  GARVKENRDRESNGLA-IGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDE---VKFRTKSKARDG--------------------------EEVER

Query:  VEEKRENGQRDEEKRT-RLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSL
        V EKRENG RD+EKRT RL WLTSHIRVRIISK+FKGGKFYLKKGEIVDVVGP+ICD+SID SRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSL
Subjt:  VEEKRENGQRDEEKRT-RLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSL

Query:  VERDLDKETGVVRDADSHEL
        VERDLDKETGVVRDADSHEL
Subjt:  VERDLDKETGVVRDADSHEL

XP_022931004.1 LOW QUALITY PROTEIN: protein MOS2-like [Cucurbita moschata]; e-value 1.6e-221; identity 86.79%
Query:  QNKPYPDVGRCKKQPPGTTWEEANSGTQFSI--LIRKALQWLTDLIALNCLDIS-------GDSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLM
        +NKPYPDVGRCKKQPPGTTWEEANSGTQFS   L  K+      + A    D         GDSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLM
Subjt:  QNKPYPDVGRCKKQPPGTTWEEANSGTQFSI--LIRKALQWLTDLIALNCLDIS-------GDSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLM

Query:  RMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPAPLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSAL
        RMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPAPLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSAL
Subjt:  RMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPAPLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSAL

Query:  MESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRDGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEK
        MESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRDGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEK
Subjt:  MESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRDGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEK

Query:  LDLKWLVLKLSNRDEVKFRTKSKARDGE--------------------------EVERVEEKRENGQRDEEKRTRLCWLTSHIRVRIISKDFKGGKFYLK
        LDLKWLVLKLSNRDEVK      A  G                           EVERVEEKRENGQRDEEKRTRLCWLTSHIRVRIISKDFKGGKFYLK
Subjt:  LDLKWLVLKLSNRDEVKFRTKSKARDGE--------------------------EVERVEEKRENGQRDEEKRTRLCWLTSHIRVRIISKDFKGGKFYLK

Query:  KGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDADSHEL
        KGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDADSHEL
Subjt:  KGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDADSHEL

XP_038888213.1 protein MOS2 [Benincasa hispida]; e-value 2.7e-168; identity 77.2%
Query:  GDSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRP
        GDSKEYVNEFDASK LSET GKSR +VIP++ENEWRPL RMKNLE PL QS ES LKFE+ASGLD  +DSKMSYGLNVRQSVD +K  DESKSAEEPPRP
Subjt:  GDSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRP

Query:  APLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNR
        APLEVIMLEKFKADL+RLPEDRGFEDFE+VPVESF +ALME YGW++GRGIGRNA+EDVKVKE+N RTDKQGLGFV D+P  + +KEEEKD GR R RNR
Subjt:  APLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNR

Query:  DGARVKENRDRESNGLA-IGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDE-VKFRTKS---------------------KARD-------GEEVE
        DG  VKENRDRESNGLA IGKHVRIVGGRDAG KGKI+EKLD  WLVLKLS RDE +K + ++                     K +D         E E
Subjt:  DGARVKENRDRESNGLA-IGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDE-VKFRTKS---------------------KARD-------GEEVE

Query:  RVEEKRENGQRDEEKRT-RLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGS
        RVEEKRENG RD+EKR  RL WLTSHIRVRIISK+FKGGKFYLKKGEIVDVVGP+ICD+SID SRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGS
Subjt:  RVEEKRENGQRDEEKRT-RLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGS

Query:  LVERDLDKETGVVRDADSHEL
        LVERDLDKETGVVRDADSHEL
Subjt:  LVERDLDKETGVVRDADSHEL

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A0A0LCH0 G-patch domain-containing protein; e-value 5.2e-165; identity 75.24%
Query:  DSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPA
        DSK+YVNEFDASK LSET GKSRN+VIP+++NEWRPL RMKNLE PL QSDES LKFE+ASGLD  DDSKMSYGLNVRQSVD +K +DESKS EEPPRPA
Subjt:  DSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPA

Query:  PLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRD
        PLEVIMLEKFKADL+RLPEDRGFEDFE+VPVESF +ALM  YGW++G+GIGRNA+EDVKV+E++ RTDKQGLGFV D+P  +  KEEEKD GR R R RD
Subjt:  PLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRD

Query:  GARVKENRDRESNGLA-IGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDE---VKFRTKSKARDG--------------------------EEVER
          RVKENRDRES+GLA IGKHVRIV GRDAG KG+++EKLD  WLVLKLS RDE   +K R    A  G                           EVE+
Subjt:  GARVKENRDRESNGLA-IGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDE---VKFRTKSKARDG--------------------------EEVER

Query:  VEEKRENGQRDEEKRT-RLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSL
        V EKRENG RD+EKRT RL WLTSHIRVRIISK+FKGGKFYLKKGEIVDVVGP+ICD+SID SRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSL
Subjt:  VEEKRENGQRDEEKRT-RLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSL

Query:  VERDLDKETGVVRDADSHEL
        VERDLDKETGVVRDADSHEL
Subjt:  VERDLDKETGVVRDADSHEL

A0A1S3CCE8 protein MOS2; e-value 1.2e-164; identity 75%
Query:  DSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPA
        DSKEYVNEFDASK LSET GKSR +VIP++ENEWRPL RMKNLE PL QSDES LKFE+ SGLD  DDSKMSYGLNVRQSVD +K +DESKS EEPPRPA
Subjt:  DSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPA

Query:  PLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRD
        PLEVIMLEKFKADL+RLPEDRGFEDFE+VPVESF +ALME YGW++G+GIGRNA+EDVKVKE++ RTDKQGLGFV D+P  +  KEEEKD GR R RNRD
Subjt:  PLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRD

Query:  GARVKENRDRESNGLA-IGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDE-VKFRTKS---------------------KARD-------GEEVER
          RVKENRDRES+GLA I KHVRI+ GRDAG KG+++EKLD  WLVLKLS RDE VK + ++                     K +D         EVER
Subjt:  GARVKENRDRESNGLA-IGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDE-VKFRTKS---------------------KARD-------GEEVER

Query:  VEEKRENGQRDEEKR-TRLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSL
        V EKRENG RD+EKR +RL WLTSHIRVRIISK+FKGGKFYLKKGEIVDVVGP+ICD+SID SRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSL
Subjt:  VEEKRENGQRDEEKR-TRLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSL

Query:  VERDLDKETGVVRDADSHEL
        VERDLD+ETGVVRDADSH+L
Subjt:  VERDLDKETGVVRDADSHEL

A0A5A7VL15 Protein MOS2; e-value 1.2e-164; identity 75%
Query:  DSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPA
        DSKEYVNEFDASK LSET GKSR +VIP++ENEWRPL RMKNLE PL QSDES LKFE+ SGLD  DDSKMSYGLNVRQSVD +K +DESKS EEPPRPA
Subjt:  DSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPA

Query:  PLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRD
        PLEVIMLEKFKADL+RLPEDRGFEDFE+VPVESF +ALME YGW++G+GIGRNA+EDVKVKE++ RTDKQGLGFV D+P  +  KEEEKD GR R RNRD
Subjt:  PLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRD

Query:  GARVKENRDRESNGLA-IGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDE-VKFRTKS---------------------KARD-------GEEVER
          RVKENRDRES+GLA I KHVRI+ GRDAG KG+++EKLD  WLVLKLS RDE VK + ++                     K +D         EVER
Subjt:  GARVKENRDRESNGLA-IGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDE-VKFRTKS---------------------KARD-------GEEVER

Query:  VEEKRENGQRDEEKR-TRLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSL
        V EKRENG RD+EKR +RL WLTSHIRVRIISK+FKGGKFYLKKGEIVDVVGP+ICD+SID SRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSL
Subjt:  VEEKRENGQRDEEKR-TRLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSL

Query:  VERDLDKETGVVRDADSHEL
        VERDLD+ETGVVRDADSH+L
Subjt:  VERDLDKETGVVRDADSHEL

A0A6J1ET34 LOW QUALITY PROTEIN: protein MOS2-like; e-value 7.9e-222; identity 86.79%
Query:  QNKPYPDVGRCKKQPPGTTWEEANSGTQFSI--LIRKALQWLTDLIALNCLDIS-------GDSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLM
        +NKPYPDVGRCKKQPPGTTWEEANSGTQFS   L  K+      + A    D         GDSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLM
Subjt:  QNKPYPDVGRCKKQPPGTTWEEANSGTQFSI--LIRKALQWLTDLIALNCLDIS-------GDSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLM

Query:  RMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPAPLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSAL
        RMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPAPLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSAL
Subjt:  RMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPAPLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSAL

Query:  MESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRDGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEK
        MESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRDGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEK
Subjt:  MESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRDGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEK

Query:  LDLKWLVLKLSNRDEVKFRTKSKARDGE--------------------------EVERVEEKRENGQRDEEKRTRLCWLTSHIRVRIISKDFKGGKFYLK
        LDLKWLVLKLSNRDEVK      A  G                           EVERVEEKRENGQRDEEKRTRLCWLTSHIRVRIISKDFKGGKFYLK
Subjt:  LDLKWLVLKLSNRDEVKFRTKSKARDGE--------------------------EVERVEEKRENGQRDEEKRTRLCWLTSHIRVRIISKDFKGGKFYLK

Query:  KGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDADSHEL
        KGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDADSHEL
Subjt:  KGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDADSHEL

A0A6J1HTX0 protein MOS2-like; e-value 1.4e-162; identity 74.22%
Query:  DSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPA
        DSKEYVNEFDASK  SETR  SRN+VIP+++NEWRPL RMKNLE PLGQSDES LKFE+ASGLD P+DSKMS+GLNVRQSVD +KSAD+S+S EE PRPA
Subjt:  DSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPA

Query:  PLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRD
        PLEV+MLEKFKADLKRLPEDRGFEDFE+VPVESF +ALME YGW++GRGIGRNA+EDVKVKE+N RTDKQGLGFV D+P  L NK++EKD  R R +NRD
Subjt:  PLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRD

Query:  GARVKENRDRESNGLA-IGKHVRIVGGRDAGSKGKIVEKLDLKWLVLK-----------------LSNRDEVKFRTKSKARDGEEVER-------VEEKR
        G RVKENRDR S+GL+ IGKHVRI+GGRDAG KGKIVEKLD  WLVLK                 L +++E +F  K +    ++V +       V EKR
Subjt:  GARVKENRDRESNGLA-IGKHVRIVGGRDAGSKGKIVEKLDLKWLVLK-----------------LSNRDEVKFRTKSKARDGEEVER-------VEEKR

Query:  ENGQRDEEKRT-RLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDL
        ENG RDEE++  R+ W+TSHIRVRIISKDFKGGKFYLKKGEIVDVVGP++CD+SID SRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDL
Subjt:  ENGQRDEEKRT-RLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDL

Query:  DKETGVVRDADSHEL
        DKETGVVRDADSHEL
Subjt:  DKETGVVRDADSHEL

SwissProt top hits (columns: e-value, % identity, alignment)
Q21924 G-patch domain and KOW motifs-containing protein homolog 1; e-value 4.7e-14; identity 24.05%
Query:  DFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRDGARVKENRDRESNGLAIGKHVRIV
        D+  +P+ESFG A++    W+ G GIG+N ++ V +K  N R    GLG     P                G+N++     +  + +   + +G  +++V
Subjt:  DFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRDGARVKENRDRESNGLAIGKHVRIV

Query:  GGRDAGSKGKIVEKLD----------LKWLVLKLSNRDEVKFRTKSKARDG---------EEVERVEEKRE---------------------------NG
         GR+ G  GK+  + D          +    +K+S    V    K   RD          +E +R+E +R+                           + 
Subjt:  GGRDAGSKGKIVEKLD----------LKWLVLKLSNRDEVKFRTKSKARDG---------EEVERVEEKRE---------------------------NG

Query:  QRDEEKRTRLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESR-ELVQGVSQELLETALPRR-GGPVLVLYGKHKGVYGSLVERDLDK
           E +R    W  + + VR I +DFK G  Y +K  IVDV G    D++I++ R      + Q  LET +PR  G  ++++ GK  G    ++++D  K
Subjt:  QRDEEKRTRLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESR-ELVQGVSQELLETALPRR-GGPVLVLYGKHKGVYGSLVERDLDK

Query:  ETGVVRDADSHELYDA
        E    R   ++++  A
Subjt:  ETGVVRDADSHELYDA

Q6NU07 G-patch domain and KOW motifs-containing protein; e-value 1.5e-20; identity 25.65%
Query:  KEYVNEFDASKSLSETRG-KSRNVVIPAI-ENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPA
        KEY+   +  + LS     +S+ +VIP I +N W    + K  ++     DE+ L                     V++ +++ + A E  S        
Subjt:  KEYVNEFDASKSLSETRG-KSRNVVIPAI-ENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPA

Query:  PLEVI------MLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRA
        PL +         ++ K D+   P+     D++ VPV+ +G A++   GW++G GIGR  ++DVK  E  LR   +GLG   D  A L + E +K     
Subjt:  PLEVI------MLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRA

Query:  RGRNRDGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEKLD------------------LKWLVLKLSN--------------------------
            R   +  E  + ES GL  G  V+I  G      GK VE +D                  +    L+L N                          
Subjt:  RGRNRDGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEKLD------------------LKWLVLKLSN--------------------------

Query:  RD----------------------EVKFRTKSKARDGEEVERVEEKRENGQRDEEKRTR---LCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTIC
        RD                      +VK    S   D    E+   +R   Q  E+K+ R     WL   IRVR I K++KGGK+Y  K  + DV+ PT C
Subjt:  RD----------------------EVKFRTKSKARDGEEVERVEEKRENGQRDEEKRTR---LCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTIC

Query:  DLSIDESRELVQGVSQELLETALPRRGGP-VLVLYGKHKGVYGSLVERDLDKETGVVRDADSHE
         +   E+  +++ + Q++LET +P+  G  V+V+ GK++G+ G ++ RD  K   +V+    H+
Subjt:  DLSIDESRELVQGVSQELLETALPRRGGP-VLVLYGKHKGVYGSLVERDLDKETGVVRDADSHE

Q90X38 G-patch domain and KOW motifs-containing protein; e-value 2.2e-19; identity 28.02%
Query:  EKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLG----FVGDMPASLP----------NKEEE-----
        +K   DL+  PE     D+E VPVE++G A+++  GW++  GIGR  ++DVK  E  LR    GLG     + D+   +P           KEEE     
Subjt:  EKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLG----FVGDMPASLP----------NKEEE-----

Query:  ------------KD-NGRARGRNRDGARVKENRDRESNGLAIGKH-VRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDEVKFRTKSKARDGEEVERVEEK
                    KD  G+  G + D  RV          + I +H +++V  ++     K     DL     +LS   + K R K + +  E+ +R+  K
Subjt:  ------------KD-NGRARGRNRDGARVKENRDRESNGLAIGKH-VRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDEVKFRTKSKARDGEEVERVEEK

Query:  RENGQ--------RDEEKRTRL----------------------CWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELL
         E G+        RD++KR                          WL   +RVR I K FKGGK+Y  K  + DV+ P  C    +E R ++  + Q++L
Subjt:  RENGQ--------RDEEKRTRL----------------------CWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELL

Query:  ETALPRRGGP-VLVLYGKHKGVYGSLVERDLDKETGVVR
        ET +P+     ++V+ G+H+G  G +++RD +K   +V+
Subjt:  ETALPRRGGP-VLVLYGKHKGVYGSLVERDLDKETGVVR

Q92917 G-patch domain and KOW motifs-containing protein; e-value 9.5e-15; identity 26.91%
Query:  DFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRDGARVKENRDRESNGLAIGKHVRIV
        ++E VPVE++G A++   GW+ G GIGR   + VK +  +LR    GLG      A+L   +     G +R    D  + K+  D +  GL  G  V ++
Subjt:  DFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRDGARVKENRDRESNGLAIGKHVRIV

Query:  GGRDAGSKGKIVEKLDLKWL--VLKLSNRDEV----KFRTKSKARDGEEVERVEEKRENG---------------QRDEEKRTR----------------
         G   G  GK VE LD   +  +++L+    V    ++  +  ++   +   ++ +++NG               Q+D  +R R                
Subjt:  GGRDAGSKGKIVEKLDLKWL--VLKLSNRDEV----KFRTKSKARDGEEVERVEEKRENG---------------QRDEEKRTR----------------

Query:  -----LCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPR-RGGPVLVLYGKHKGVYGSLVERDLDKETGVV
               WL   +RVR +   +KGG++Y  K  I DV+ P  C    DE R +++G+ +++LET +P+  G  V+V+ G   G  G L+ RD  +   +V
Subjt:  -----LCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPR-RGGPVLVLYGKHKGVYGSLVERDLDKETGVV

Query:  R
        +
Subjt:  R

Q9C801 Protein MOS2; e-value 7.6e-89; identity 46.26%
Query:  NCLDISGDSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDE-SGLKFE---TASGLDAPDDSKMSYGLNVRQSVDDVKSADE
        N +D  G SKE+V EFD SK+L+ +  K    VIP IEN WRP  +MKNL+ PL   +  SGL+FE      G + PD+  +SYGLN+RQ V D     +
Subjt:  NCLDISGDSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDE-SGLKFE---TASGLDAPDDSKMSYGLNVRQSVDDVKSADE

Query:  SKSAEEPPRPAPLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEK
           A E  + +  E +ML+  + DL  L +D   EDFE VPV+ FG+ALM  YGW+ G+GIG+NA+EDV++KE+   T K+GLGF  D    +  K + K
Subjt:  SKSAEEPPRPAPLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEK

Query:  DNGR--ARGRNRDGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLS-NRDEVKF------------------------------
        ++ +   +G   +G  V            +GK VRI+ GRD G KGKIVEK    + V+K+S + +EVK                               
Subjt:  DNGR--ARGRNRDGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLS-NRDEVKF------------------------------

Query:  --RTKSKARDGEEVERVE----EKRENGQRDEEKRTRLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPR
          +T  + R  E   R E    EK++ GQ   E++ +  WL SHI+VRI+SKD+KGG+ YLKKG++VDVVGPT CD+++DE++ELVQGV QELLETALPR
Subjt:  --RTKSKARDGEEVERVE----EKRENGQRDEEKRTRLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPR

Query:  RGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDADSHELYD
        RGGPVLVL GKHKGVYG+LVE+DLDKETGVVRD D+H++ D
Subjt:  RGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDADSHELYD

Arabidopsis top hits (columns: e-value, % identity, alignment)
AT1G33520.1 D111/G-patch domain-containing protein; e-value 5.4e-90; identity 46.26%
Query:  NCLDISGDSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDE-SGLKFE---TASGLDAPDDSKMSYGLNVRQSVDDVKSADE
        N +D  G SKE+V EFD SK+L+ +  K    VIP IEN WRP  +MKNL+ PL   +  SGL+FE      G + PD+  +SYGLN+RQ V D     +
Subjt:  NCLDISGDSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDE-SGLKFE---TASGLDAPDDSKMSYGLNVRQSVDDVKSADE

Query:  SKSAEEPPRPAPLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEK
           A E  + +  E +ML+  + DL  L +D   EDFE VPV+ FG+ALM  YGW+ G+GIG+NA+EDV++KE+   T K+GLGF  D    +  K + K
Subjt:  SKSAEEPPRPAPLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEK

Query:  DNGR--ARGRNRDGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLS-NRDEVKF------------------------------
        ++ +   +G   +G  V            +GK VRI+ GRD G KGKIVEK    + V+K+S + +EVK                               
Subjt:  DNGR--ARGRNRDGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLS-NRDEVKF------------------------------

Query:  --RTKSKARDGEEVERVE----EKRENGQRDEEKRTRLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPR
          +T  + R  E   R E    EK++ GQ   E++ +  WL SHI+VRI+SKD+KGG+ YLKKG++VDVVGPT CD+++DE++ELVQGV QELLETALPR
Subjt:  --RTKSKARDGEEVERVE----EKRENGQRDEEKRTRLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPR

Query:  RGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDADSHELYD
        RGGPVLVL GKHKGVYG+LVE+DLDKETGVVRD D+H++ D
Subjt:  RGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDADSHELYD

AT1G55460.1 DNA/RNA-binding protein Kin17, conserved region; e-value 1.4e-05; identity 26.49%
Query:  DGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDEVKFRTKSKARDGEEVERVEEKR----ENGQRDEEKRTRL----CWL
        D  R K+   R  +G+ +G    + GG    + GK  E+ +   L+      D+V+ R + + R G+     +E+R    E  + +E+K+ R+     WL
Subjt:  DGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDEVKFRTKSKARDGEEVERVEEKR----ENGQRDEEKRTRL----CWL

Query:  TSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDK
           I V+++SK      +Y +KG +  V+   + ++ + +S+ +++ V Q+ LET LP+ GG V ++ G ++G    L+  D +K
Subjt:  TSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDK

AT4G25020.1 D111/G-patch domain-containing protein; e-value 2.2e-67; identity 39.9%
Query:  SGDSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPR
        +G+ K++V EFD S++L++++ K    VIP IE+  R     ++++ PL  +  SGL+FE         D+ ++YGLN+RQ V++V+            +
Subjt:  SGDSKEYVNEFDASKSLSETRGKSRNVVIPAIENEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPR

Query:  PAPLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRN
        P P+E ++L+  + DL+ LP+    EDFE  PV+ FG AL+  YGW+ G+GIG  A+EDVK+ E+   +  +G GF                     G++
Subjt:  PAPLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMESYGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRN

Query:  RDGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDEVKFRTKSKARDGEEVERVEEKRENGQRDEEKRTRLCWLTSHIRVR
            ++ +N+      +  G H  +  G +       +E ++   +V K +   E + RT+ KA           K+    +  E R +  WL SHI+VR
Subjt:  RDGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNRDEVKFRTKSKARDGEEVERVEEKRENGQRDEEKRTRLCWLTSHIRVR

Query:  IISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDADSHEL
        IISKD KGG+ YLKK  + DVVGPT CD+++DE++ELVQG+ QELLETALPRRGG VLVL G+HKGVYG LVE+DLDKETGVV DADS E+
Subjt:  IISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDADSHEL


Sequences
CDS sequence
ATGGCCAATTGGACGATACAACCAATTATGGTCTGGAGCGGCGAGAGCGACGACGGTAGGGCTGCCAATTTCCGGCGAAGGGGATTGGCTGAAACATTTGCAAATTCTTC
TTGGACCGGAGCTCAGAACCTTTCGAATTTGGTAAACGTGAGTTCATCTTCAGTCTTTGTTAGAACTCTGCAAGGAGACGGAAGATTTCATCAGCCAGGGCAAAAGCACT
TGGACATATTGTGCAATCCTACTAGAGGTTATGGGTTTGGATGCTTTAAAAAGTACCTGAAGGAGTCTTATACATCTCTCATTTTAGGGGTGAAAAGAATGCTTACATGC
TCAAATTATTCCACAGTTGAATCTTATCCTAACAATGAGTCGTGGATGATATACTGCTTTTCCCCAAAAGAGTATGAGCAGAACAAACCATATCCCGATGTGGGAAGATG
TAAGAAACAGCCACCGGGCACGACATGGGAAGAGGCAAATTCTGGAACTCAGTTTAGCATCTTGATACGAAAGGCATTGCAGTGGTTAACTGATCTAATAGCTTTGAATT
GCCTTGACATTTCAGGGGATTCCAAAGAGTATGTTAATGAATTCGATGCTTCGAAATCCTTGTCCGAAACCAGGGGTAAATCCAGAAACGTGGTTATTCCCGCCATTGAG
AACGAATGGAGGCCTTTGATGAGAATGAAGAATCTCGAATCTCCGCTTGGTCAATCGGACGAGTCTGGTCTCAAGTTCGAGACCGCCTCTGGATTAGATGCGCCCGATGA
TTCCAAGATGTCGTACGGTCTCAATGTCAGGCAGTCCGTCGACGATGTGAAGAGTGCCGATGAGTCGAAATCTGCGGAGGAGCCGCCGCGGCCTGCTCCTTTGGAGGTTA
TTATGTTGGAGAAATTTAAAGCTGACCTGAAGAGGCTGCCTGAGGATAGAGGCTTTGAGGATTTTGAGGACGTTCCTGTGGAAAGTTTTGGTTCCGCTTTGATGGAAAGC
TATGGCTGGCAGAAGGGTCGGGGAATTGGGAGAAATGCCCGGGAGGATGTTAAAGTTAAGGAATTTAATCTAAGAACAGACAAGCAAGGGCTAGGATTTGTTGGCGACAT
GCCTGCTAGCCTTCCAAACAAGGAGGAAGAGAAAGATAACGGGAGAGCGCGTGGGAGAAACAGAGATGGAGCTAGAGTTAAGGAAAACAGAGATCGAGAAAGCAATGGAT
TGGCTATTGGGAAGCATGTTAGGATTGTTGGTGGAAGAGATGCAGGTTCGAAAGGTAAAATTGTAGAGAAATTGGATCTCAAATGGCTTGTCCTTAAGCTTTCTAACAGA
GACGAAGTTAAATTCAGGACGAAATCAAAGGCTAGAGACGGAGAAGAAGTTGAACGAGTAGAGGAAAAGCGTGAAAATGGTCAAAGGGATGAAGAGAAGAGGACTAGATT
GTGTTGGCTTACCAGTCACATTCGTGTAAGGATCATCAGCAAAGATTTCAAGGGAGGAAAGTTTTATCTCAAAAAAGGAGAGATAGTTGACGTAGTTGGACCCACCATTT
GTGATCTATCCATTGATGAAAGTAGAGAGCTTGTTCAAGGGGTCTCTCAAGAGCTTCTTGAGACAGCACTTCCCAGACGTGGGGGACCCGTTCTTGTTCTGTATGGTAAG
CACAAGGGCGTTTATGGGAGTTTAGTTGAGAGGGATCTTGACAAAGAGACAGGGGTAGTGCGTGATGCTGATAGCCATGAATTATATGACGCTGGACCTGTCTTAGTGCA
TACTGTGTGTACACCAGAGTTTGGGATAAAAACCATGGTGACACGTAGCTGTGACAGACTGGAAAATCCTGGATATGAATATGCTGATTCATGCAATGAATTGGGCTGTG
AAGGGCTATGGGAGATATCAGCCATCTCGCCAGCCGGAGACACAGGGTCAATGGAAAGAGTAACGAGCACAGAAGACTCGTCTCAATTCCTCGAGTTTCTCCGGTGTCTG
CGGTTTGGGTGTTCTGATCAATGGCGAAGTCCAATATGA
mRNA sequence
ATGGCCAATTGGACGATACAACCAATTATGGTCTGGAGCGGCGAGAGCGACGACGGTAGGGCTGCCAATTTCCGGCGAAGGGGATTGGCTGAAACATTTGCAAATTCTTC
TTGGACCGGAGCTCAGAACCTTTCGAATTTGGTAAACGTGAGTTCATCTTCAGTCTTTGTTAGAACTCTGCAAGGAGACGGAAGATTTCATCAGCCAGGGCAAAAGCACT
TGGACATATTGTGCAATCCTACTAGAGGTTATGGGTTTGGATGCTTTAAAAAGTACCTGAAGGAGTCTTATACATCTCTCATTTTAGGGGTGAAAAGAATGCTTACATGC
TCAAATTATTCCACAGTTGAATCTTATCCTAACAATGAGTCGTGGATGATATACTGCTTTTCCCCAAAAGAGTATGAGCAGAACAAACCATATCCCGATGTGGGAAGATG
TAAGAAACAGCCACCGGGCACGACATGGGAAGAGGCAAATTCTGGAACTCAGTTTAGCATCTTGATACGAAAGGCATTGCAGTGGTTAACTGATCTAATAGCTTTGAATT
GCCTTGACATTTCAGGGGATTCCAAAGAGTATGTTAATGAATTCGATGCTTCGAAATCCTTGTCCGAAACCAGGGGTAAATCCAGAAACGTGGTTATTCCCGCCATTGAG
AACGAATGGAGGCCTTTGATGAGAATGAAGAATCTCGAATCTCCGCTTGGTCAATCGGACGAGTCTGGTCTCAAGTTCGAGACCGCCTCTGGATTAGATGCGCCCGATGA
TTCCAAGATGTCGTACGGTCTCAATGTCAGGCAGTCCGTCGACGATGTGAAGAGTGCCGATGAGTCGAAATCTGCGGAGGAGCCGCCGCGGCCTGCTCCTTTGGAGGTTA
TTATGTTGGAGAAATTTAAAGCTGACCTGAAGAGGCTGCCTGAGGATAGAGGCTTTGAGGATTTTGAGGACGTTCCTGTGGAAAGTTTTGGTTCCGCTTTGATGGAAAGC
TATGGCTGGCAGAAGGGTCGGGGAATTGGGAGAAATGCCCGGGAGGATGTTAAAGTTAAGGAATTTAATCTAAGAACAGACAAGCAAGGGCTAGGATTTGTTGGCGACAT
GCCTGCTAGCCTTCCAAACAAGGAGGAAGAGAAAGATAACGGGAGAGCGCGTGGGAGAAACAGAGATGGAGCTAGAGTTAAGGAAAACAGAGATCGAGAAAGCAATGGAT
TGGCTATTGGGAAGCATGTTAGGATTGTTGGTGGAAGAGATGCAGGTTCGAAAGGTAAAATTGTAGAGAAATTGGATCTCAAATGGCTTGTCCTTAAGCTTTCTAACAGA
GACGAAGTTAAATTCAGGACGAAATCAAAGGCTAGAGACGGAGAAGAAGTTGAACGAGTAGAGGAAAAGCGTGAAAATGGTCAAAGGGATGAAGAGAAGAGGACTAGATT
GTGTTGGCTTACCAGTCACATTCGTGTAAGGATCATCAGCAAAGATTTCAAGGGAGGAAAGTTTTATCTCAAAAAAGGAGAGATAGTTGACGTAGTTGGACCCACCATTT
GTGATCTATCCATTGATGAAAGTAGAGAGCTTGTTCAAGGGGTCTCTCAAGAGCTTCTTGAGACAGCACTTCCCAGACGTGGGGGACCCGTTCTTGTTCTGTATGGTAAG
CACAAGGGCGTTTATGGGAGTTTAGTTGAGAGGGATCTTGACAAAGAGACAGGGGTAGTGCGTGATGCTGATAGCCATGAATTATATGACGCTGGACCTGTCTTAGTGCA
TACTGTGTGTACACCAGAGTTTGGGATAAAAACCATGGTGACACGTAGCTGTGACAGACTGGAAAATCCTGGATATGAATATGCTGATTCATGCAATGAATTGGGCTGTG
AAGGGCTATGGGAGATATCAGCCATCTCGCCAGCCGGAGACACAGGGTCAATGGAAAGAGTAACGAGCACAGAAGACTCGTCTCAATTCCTCGAGTTTCTCCGGTGTCTG
CGGTTTGGGTGTTCTGATCAATGGCGAAGTCCAATATGAGCTCTATGAGCTCCAGCAAGGGTGACTACAAAAAATTATTTGAGAATAGACTAGAGATGTTCATTTAACCC
GTGTGATGAGAGATTAAATGGAGAAGGAAGGAGTGAGTGAGGAGAGATTATTTTCTGTTAGCTAAATGAAATTAGAGTCGGAACTGGGGATCTTGTGTTTCTCTAGGTTT
CAAGAATTTTAAGGATCACA
Protein sequence
MANWTIQPIMVWSGESDDGRAANFRRRGLAETFANSSWTGAQNLSNLVNVSSSSVFVRTLQGDGRFHQPGQKHLDILCNPTRGYGFGCFKKYLKESYTSLILGVKRMLTC
SNYSTVESYPNNESWMIYCFSPKEYEQNKPYPDVGRCKKQPPGTTWEEANSGTQFSILIRKALQWLTDLIALNCLDISGDSKEYVNEFDASKSLSETRGKSRNVVIPAIE
NEWRPLMRMKNLESPLGQSDESGLKFETASGLDAPDDSKMSYGLNVRQSVDDVKSADESKSAEEPPRPAPLEVIMLEKFKADLKRLPEDRGFEDFEDVPVESFGSALMES
YGWQKGRGIGRNAREDVKVKEFNLRTDKQGLGFVGDMPASLPNKEEEKDNGRARGRNRDGARVKENRDRESNGLAIGKHVRIVGGRDAGSKGKIVEKLDLKWLVLKLSNR
DEVKFRTKSKARDGEEVERVEEKRENGQRDEEKRTRLCWLTSHIRVRIISKDFKGGKFYLKKGEIVDVVGPTICDLSIDESRELVQGVSQELLETALPRRGGPVLVLYGK
HKGVYGSLVERDLDKETGVVRDADSHELYDAGPVLVHTVCTPEFGIKTMVTRSCDRLENPGYEYADSCNELGCEGLWEISAISPAGDTGSMERVTSTEDSSQFLEFLRCL
RFGCSDQWRSPI
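
The CDS and protein sequences above should correspond codon by codon. As a rough consistency check, the sketch below translates a nucleotide string with the standard genetic code and stops at the first stop codon. The `translate` helper is hypothetical and written from scratch; it is not the tool the database used, and it assumes the standard nuclear code applies to this gene.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the multi-line
    sequence block can be pasted in directly. Unknown codons become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt and last 39 nt of the CDS listed above.
print(translate("ATGGCCAATTGGACGATACAACCA"))
print(translate("CGGTTTGGGTGTTCTGATCAATGGCGAAGTCCAATATGA"))
```

The two fragments translate to `MANWTIQP` and `RFGCSDQWRSPI`, matching the start and end of the protein sequence on this page. Passing the full CDS block to `translate` would allow the same comparison over the whole length.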