; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC05G100010 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC05G100010
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
Descriptionprotein MRG1-like isoform X1
Genome locationCiama_Chr05:30940871..30949479
RNA-Seq ExpressionCaUC05G100010
SyntenyCaUC05G100010
Gene Ontology termsGO:0006325 - chromatin organization (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0016573 - histone acetylation (biological process)
GO:0000123 - histone acetyltransferase complex (cellular component)
InterPro domainsIPR008676 - MRG
IPR016197 - Chromo-like domain superfamily
IPR026541 - MRG domain
IPR038217 - MRG, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064420.1 protein MRG1-like isoform X1 [Cucumis melo var. makuwa]1.2e-13277.78Show/hide
Query:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR
        MGSP+NRVASDSAD TSKNDSEDDDDNGVQ+PPSHPCPFSEGEKVLAFHSFVIYEAK      VL++  +    RC                 +  L  R
Subjt:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR

Query:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK--------------IPVKLKKQLVDDSEFVTHLG
        +        WDEWVGLDRLLKFTEENV+KQQELNEKRGTDKKASRAS  KPKNV+KG+KRKNDASK              IPVKLKKQLVDDSEFVTHLG
Subjt:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK--------------IPVKLKKQLVDDSEFVTHLG

Query:  KLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEET
        KLVKLPRTPNV+DI+KKYLEYRLKKD TKDESVGEIVKGLICYFDKALP MLLYKSERQQYEEL++NDVSPSS+YGAEHLLRLFVRLPELLSQANIEEET
Subjt:  KLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEET

Query:  LIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD
        L+ELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD
Subjt:  LIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD

XP_004141278.1 protein MRG1 isoform X2 [Cucumis sativus]5.3e-13377.49Show/hide
Query:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR
        MGSP+NRVASDSAD TSKNDSEDDDDNGVQ+PPSHPCPFSEGEKVLAFHSFVIYEAK      VL++  +    RC           + Y    K     
Subjt:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR

Query:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK--------------IPVKLKKQLVDDSEFVTHLG
                 WDEWVGLDRLLKFTEENV+KQQELNEKRGTDKKASRAS  KPKNV+KG+KRKNDASK              IPVKLKKQLVDDSEFVTHLG
Subjt:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK--------------IPVKLKKQLVDDSEFVTHLG

Query:  KLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEET
        KLVKLPRTPNV+DI+KKYLEYRLKKD TKDES+GEIVKGLICYFDKALP MLLYKSERQQYEEL++NDVSPSS+YGAEHLLRLFVRLPELLSQANIEEET
Subjt:  KLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEET

Query:  LIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD
        L+ELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD
Subjt:  LIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD

XP_008452608.1 PREDICTED: protein MRG1-like isoform X1 [Cucumis melo]4.0e-13377.78Show/hide
Query:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR
        MGSP+NRVASDSAD TSKNDSEDDDDNGVQ+PPSHPCPFSEGEKVLAFHSFVIYEAK      VL++  +    RC           + Y    K     
Subjt:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR

Query:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK--------------IPVKLKKQLVDDSEFVTHLG
                 WDEWVGLDRLLKFTEENV+KQQELNEKRGTDKKASRAS  KPKNV+KG+KRKNDASK              IPVKLKKQLVDDSEFVTHLG
Subjt:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK--------------IPVKLKKQLVDDSEFVTHLG

Query:  KLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEET
        KLVKLPRTPNV+DI+KKYLEYRLKKD TKDESVGEIVKGLICYFDKALP MLLYKSERQQYEEL++NDVSPSS+YGAEHLLRLFVRLPELLSQANIEEET
Subjt:  KLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEET

Query:  LIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD
        L+ELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD
Subjt:  LIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD

XP_022936817.1 protein MRG1-like isoform X3 [Cucurbita moschata]2.6e-13279.12Show/hide
Query:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR
        MGSPD+RVASDSAD TSKNDSEDDD+NGV SPPSHPCPFSEGEKVLAFHSF+IYEAK  DF  +LQ+ +E      + +D ++L         LK  +  
Subjt:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR

Query:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK------------IPVKLKKQLVDDSEFVTHLGKL
        V        WDEWVGLDRLLKFTEENV+KQ ELNEKRGTDKKA+RASQ KPKNV+KGRKRKNDASK            IP+KLKKQLVDDSEFVTHLGKL
Subjt:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK------------IPVKLKKQLVDDSEFVTHLGKL

Query:  VKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEETLI
        VKLPRTPNVEDI+KKYLEYRLKKD TKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIV+DVSPSS+YGAEHLLRLFVRLPELLSQANIEEETL 
Subjt:  VKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEETLI

Query:  ELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD
        ELQQKLVDLLKFLRKNQN FFLSSYHVPENMETSTNNADD
Subjt:  ELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD

XP_038897432.1 protein MRG2-like isoform X1 [Benincasa hispida]3.4e-13278.53Show/hide
Query:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR
        MGSP+NRVASDSAD TSKNDSEDDDDNGVQ+PPS PCPFSEGEKVLAFHSFVIYEAK      VL++  +    RC           + Y    K     
Subjt:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR

Query:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK--------------IPVKLKKQLVDDSEFVTHLG
                 WDEWVGLDRLLKFTEENV+KQQELNEKRGT+KKASRASQ KPKNV KGRKRKNDASK              IPVKLKKQLVDDSEFVTHLG
Subjt:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK--------------IPVKLKKQLVDDSEFVTHLG

Query:  KLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEET
        KLVKLPRTPNVEDI+KKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEEL++NDVSPSS+YGAEHLLRLFVRLPELLSQANIEEET
Subjt:  KLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEET

Query:  LIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNA
        L+ELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNA
Subjt:  LIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNA

TrEMBL top hitse value%identityAlignment
A0A0A0L0E2 MRG domain-containing protein2.6e-13377.49Show/hide
Query:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR
        MGSP+NRVASDSAD TSKNDSEDDDDNGVQ+PPSHPCPFSEGEKVLAFHSFVIYEAK      VL++  +    RC           + Y    K     
Subjt:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR

Query:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK--------------IPVKLKKQLVDDSEFVTHLG
                 WDEWVGLDRLLKFTEENV+KQQELNEKRGTDKKASRAS  KPKNV+KG+KRKNDASK              IPVKLKKQLVDDSEFVTHLG
Subjt:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK--------------IPVKLKKQLVDDSEFVTHLG

Query:  KLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEET
        KLVKLPRTPNV+DI+KKYLEYRLKKD TKDES+GEIVKGLICYFDKALP MLLYKSERQQYEEL++NDVSPSS+YGAEHLLRLFVRLPELLSQANIEEET
Subjt:  KLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEET

Query:  LIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD
        L+ELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD
Subjt:  LIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD

A0A1S3BU93 protein MRG1-like isoform X12.0e-13377.78Show/hide
Query:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR
        MGSP+NRVASDSAD TSKNDSEDDDDNGVQ+PPSHPCPFSEGEKVLAFHSFVIYEAK      VL++  +    RC           + Y    K     
Subjt:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR

Query:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK--------------IPVKLKKQLVDDSEFVTHLG
                 WDEWVGLDRLLKFTEENV+KQQELNEKRGTDKKASRAS  KPKNV+KG+KRKNDASK              IPVKLKKQLVDDSEFVTHLG
Subjt:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK--------------IPVKLKKQLVDDSEFVTHLG

Query:  KLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEET
        KLVKLPRTPNV+DI+KKYLEYRLKKD TKDESVGEIVKGLICYFDKALP MLLYKSERQQYEEL++NDVSPSS+YGAEHLLRLFVRLPELLSQANIEEET
Subjt:  KLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEET

Query:  LIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD
        L+ELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD
Subjt:  LIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD

A0A5A7VB09 Protein MRG1-like isoform X15.7e-13377.78Show/hide
Query:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR
        MGSP+NRVASDSAD TSKNDSEDDDDNGVQ+PPSHPCPFSEGEKVLAFHSFVIYEAK      VL++  +    RC                 +  L  R
Subjt:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR

Query:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK--------------IPVKLKKQLVDDSEFVTHLG
        +        WDEWVGLDRLLKFTEENV+KQQELNEKRGTDKKASRAS  KPKNV+KG+KRKNDASK              IPVKLKKQLVDDSEFVTHLG
Subjt:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK--------------IPVKLKKQLVDDSEFVTHLG

Query:  KLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEET
        KLVKLPRTPNV+DI+KKYLEYRLKKD TKDESVGEIVKGLICYFDKALP MLLYKSERQQYEEL++NDVSPSS+YGAEHLLRLFVRLPELLSQANIEEET
Subjt:  KLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEET

Query:  LIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD
        L+ELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD
Subjt:  LIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD

A0A6J1F8J0 protein MRG1-like isoform X22.2e-13278.65Show/hide
Query:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR
        MGSPD+RVASDSAD TSKNDSEDDD+NGV SPPSHPCPFSEGEKVLAFHSF+IYEAK  DF  +LQ+ +E      + +D ++L         LK  +  
Subjt:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR

Query:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK--------------IPVKLKKQLVDDSEFVTHLG
        V        WDEWVGLDRLLKFTEENV+KQ ELNEKRGTDKKA+RASQ KPKNV+KGRKRKNDASK              IP+KLKKQLVDDSEFVTHLG
Subjt:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK--------------IPVKLKKQLVDDSEFVTHLG

Query:  KLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEET
        KLVKLPRTPNVEDI+KKYLEYRLKKD TKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIV+DVSPSS+YGAEHLLRLFVRLPELLSQANIEEET
Subjt:  KLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEET

Query:  LIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD
        L ELQQKLVDLLKFLRKNQN FFLSSYHVPENMETSTNNADD
Subjt:  LIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD

A0A6J1F9F7 protein MRG1-like isoform X31.3e-13279.12Show/hide
Query:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR
        MGSPD+RVASDSAD TSKNDSEDDD+NGV SPPSHPCPFSEGEKVLAFHSF+IYEAK  DF  +LQ+ +E      + +D ++L         LK  +  
Subjt:  MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVR

Query:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK------------IPVKLKKQLVDDSEFVTHLGKL
        V        WDEWVGLDRLLKFTEENV+KQ ELNEKRGTDKKA+RASQ KPKNV+KGRKRKNDASK            IP+KLKKQLVDDSEFVTHLGKL
Subjt:  VEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASK------------IPVKLKKQLVDDSEFVTHLGKL

Query:  VKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEETLI
        VKLPRTPNVEDI+KKYLEYRLKKD TKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIV+DVSPSS+YGAEHLLRLFVRLPELLSQANIEEETL 
Subjt:  VKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEETLI

Query:  ELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD
        ELQQKLVDLLKFLRKNQN FFLSSYHVPENMETSTNNADD
Subjt:  ELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD

SwissProt top hitse value%identityAlignment
Q4V3E2 Protein MRG22.0e-5845.59Show/hide
Query:  DSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVRVEINVNEDRW
        DS  +T  N   D +D  +   P+ P  F EGE+VLA HS   YEAK      VL+  +E+     +  +    +  I +N                  W
Subjt:  DSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVRVEINVNEDRW

Query:  DEWVGLDRLLKFTEENVKKQQELNEKRGTDKKAS--RASQTKPK--NVIKGRKRKND---------------ASKIPVKLKKQLVDDSEFVTHLGKLVKL
        DEW+ LD LLK ++EN++KQ+E   K+   K A   + S+ KP+  NV +GRKRK D               +  IP  L+KQL+DD EFVT + KLV+L
Subjt:  DEWVGLDRLLKFTEENVKKQQELNEKRGTDKKAS--RASQTKPK--NVIKGRKRKND---------------ASKIPVKLKKQLVDDSEFVTHLGKLVKL

Query:  PRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEETLIELQ
        PR+PNV+ ILKKY++ ++KK     +S+ EI+KGL CYFDKALP MLLY +ER+QYEE +   VSPS+VYGAEHLLRLFV+LPELL   N+ EETL ELQ
Subjt:  PRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEETLIELQ

Query:  QKLVDLLKFLRKNQNAFFLSSYHVPENME
           VD+L+FLRKNQ+  F+S+Y   E ME
Subjt:  QKLVDLLKFLRKNQNAFFLSSYHVPENME

Q6BT38 Chromatin modification-related protein EAF38.2e-2832.55Show/hide
Query:  FSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCR-LEDASILLAIISYNSMLKLLKVRVEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEK-
        F    +VLA+H  +IYEAK        ++F+E    + + +E  ++     S N+        V     + +WDEWVG DR+L++ E NV+ Q+EL E+ 
Subjt:  FSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCR-LEDASILLAIISYNSMLKLLKVRVEINVNEDRWDEWVGLDRLLKFTEENVKKQQELNEK-

Query:  -----------------RGTDKK----ASRASQTKPKNVIKGRKRKNDASKIPVKLKKQLVDDSEFVTHLGKLVKLPRTPNVEDILKKYLEYRLKKDVTK
                          GT K+    +S ++ TK K     R  +      P +LK  LVDD EF+T   K++ +P +  V  IL  YL+ +  +D + 
Subjt:  -----------------RGTDKK----ASRASQTKPKNVIKGRKRKNDASKIPVKLKKQLVDDSEFVTHLGKLVKLPRTPNVEDILKKYLEYRLKKDVTK

Query:  D--ESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIV---NDVSPSSVYGAEHLLRLFVRLPELLSQANIEEETLIELQQKLVDLLKFLRKNQNAF
           + + EI++GL  YF+K+L  +LLYK ER QY  L+    +D+ PS +YG EHLLRLFV LP L++Q  ++  ++  L ++  D+L+F+  N + +
Subjt:  D--ESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIV---NDVSPSSVYGAEHLLRLFVRLPELLSQANIEEETLIELQQKLVDLLKFLRKNQNAF

Q6C9M9 Chromatin modification-related protein EAF36.2e-2835.94Show/hide
Query:  EDRWDEWVGLDRLLKFTEENVKKQQELN------EKRGTD-----------KKASRASQTKPKNV------------IKGR----------------KRK
        ++ WDEWVG +R+L   E+N+K Q+EL        K+G D           + AS A  TK K++            +K R                KRK
Subjt:  EDRWDEWVGLDRLLKFTEENVKKQQELN------EKRGTD-----------KKASRASQTKPKNV------------IKGR----------------KRK

Query:  NDASKIPVKLKKQLVDDSEFVTHLGKLVKLPRTPNVEDILKKY---LEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIV----ND
          A  +P KLK QLVDD EFVT   +LV LPR   V DILK++    E + +      + + E+V G+  YFD++L ++LLY+ ER+QY ++      ++
Subjt:  NDASKIPVKLKKQLVDDSEFVTHLGKLVKLPRTPNVEDILKKY---LEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIV----ND

Query:  VSPSSVYGAEHLLRLFVRLPELLSQANIEEETLIELQQKLVDLLKFLRKNQNAFFL
         + S VYGAEHLLRLFV LP L++  N++ +++  L++ L D ++FL  +Q  +FL
Subjt:  VSPSSVYGAEHLLRLFVRLPELLSQANIEEETLIELQQKLVDLLKFLRKNQNAFFL

Q94C32 Protein MRG11.2e-5841.91Show/hide
Query:  NTSKNDSEDDDD--NGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVRVEINVNEDRWDE
        ++SK ++  D D  +G  SP +    FSEGE+VLA+H   +Y AK    E              R ++    +  + +N                  WDE
Subjt:  NTSKNDSEDDDD--NGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVRVEINVNEDRWDE

Query:  WVGLDRLLKFTEENVKKQQELNEKRGTDK--KASRASQTKPK--------------NVIKGRKRKNDAS--------------KIPVKLKKQLVDDSEFV
        WV  DRLLK TEEN+ KQ+ L++K+G +K  K+ R++QTK +              N  KG+KRK+++               +IP  LKKQL DD E++
Subjt:  WVGLDRLLKFTEENVKKQQELNEKRGTDK--KASRASQTKPK--------------NVIKGRKRKNDAS--------------KIPVKLKKQLVDDSEFV

Query:  THLGKLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANI
            K+VKLPR+PNV++IL KYLE++ KKD    +SV EI+KG+  YFDKALP MLLYK ER+QY+E IV+D SPS+VYGAEHLLRLFV+LP+L S  N+
Subjt:  THLGKLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANI

Query:  EEETLIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD
        EEET   +QQ L D LKF++KNQ+ F L S +  + +        D
Subjt:  EEETLIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD

Q9UBU8 Mortality factor 4-like protein 14.5e-2631.05Show/hide
Query:  SPPSHPCP-FSEGEKVLAFHSFVIYEAKNPDF---ETVLQSFLEY------AMCRCRLEDASILLAIISYNSMLKLLKV-----RVEINVNEDRWDEWVG
        +P   P P F EGE+VL FH  ++YEAK       +  ++ F+ Y      +  R R  + S+     ++  ++ L  V      V   +    WDEWV 
Subjt:  SPPSHPCP-FSEGEKVLAFHSFVIYEAKNPDF---ETVLQSFLEY------AMCRCRLEDASILLAIISYNSMLKLLKV-----RVEINVNEDRWDEWVG

Query:  LDRLLKFTEENVKKQQEL---NEKRGTDKK------ASRASQTKPKNV-IKGRKRKNDAS----------------------------------------
          R+LK+ + N++KQ+EL   N+++  + K        + S  + KNV +K +K K                                            
Subjt:  LDRLLKFTEENVKKQQEL---NEKRGTDKK------ASRASQTKPKNV-IKGRKRKNDAS----------------------------------------

Query:  KIPVKLKKQLVDDSEFVTHLGKLVKLPRTPNVEDILKKYLEYRLKKDVT--KDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVN--DVSPSSVY
        KIP +LK  LVDD + +T   +L  LP   NV+ IL+ Y  Y+  +  T  K+ +V E+V G+  YF+  L   LLYK ER QY E++ +  D   S VY
Subjt:  KIPVKLKKQLVDDSEFVTHLGKLVKLPRTPNVEDILKKYLEYRLKKDVT--KDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVN--DVSPSSVY

Query:  GAEHLLRLFVRLPELLSQANIEEETLIELQQKLVDLLKFLRKNQNAFFLSS
        GA HLLRLFVR+  +L+   ++E++L  L   L D LK+L KN    F +S
Subjt:  GAEHLLRLFVRLPELLSQANIEEETLIELQQKLVDLLKFLRKNQNAFFLSS

Arabidopsis top hitse value%identityAlignment
AT1G02740.1 MRG family protein1.4e-5945.59Show/hide
Query:  DSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVRVEINVNEDRW
        DS  +T  N   D +D  +   P+ P  F EGE+VLA HS   YEAK      VL+  +E+     +  +    +  I +N                  W
Subjt:  DSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVRVEINVNEDRW

Query:  DEWVGLDRLLKFTEENVKKQQELNEKRGTDKKAS--RASQTKPK--NVIKGRKRKND---------------ASKIPVKLKKQLVDDSEFVTHLGKLVKL
        DEW+ LD LLK ++EN++KQ+E   K+   K A   + S+ KP+  NV +GRKRK D               +  IP  L+KQL+DD EFVT + KLV+L
Subjt:  DEWVGLDRLLKFTEENVKKQQELNEKRGTDKKAS--RASQTKPK--NVIKGRKRKND---------------ASKIPVKLKKQLVDDSEFVTHLGKLVKL

Query:  PRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEETLIELQ
        PR+PNV+ ILKKY++ ++KK     +S+ EI+KGL CYFDKALP MLLY +ER+QYEE +   VSPS+VYGAEHLLRLFV+LPELL   N+ EETL ELQ
Subjt:  PRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEETLIELQ

Query:  QKLVDLLKFLRKNQNAFFLSSYHVPENME
           VD+L+FLRKNQ+  F+S+Y   E ME
Subjt:  QKLVDLLKFLRKNQNAFFLSSYHVPENME

AT4G37280.1 MRG family protein8.3e-6041.91Show/hide
Query:  NTSKNDSEDDDD--NGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVRVEINVNEDRWDE
        ++SK ++  D D  +G  SP +    FSEGE+VLA+H   +Y AK    E              R ++    +  + +N                  WDE
Subjt:  NTSKNDSEDDDD--NGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVRVEINVNEDRWDE

Query:  WVGLDRLLKFTEENVKKQQELNEKRGTDK--KASRASQTKPK--------------NVIKGRKRKNDAS--------------KIPVKLKKQLVDDSEFV
        WV  DRLLK TEEN+ KQ+ L++K+G +K  K+ R++QTK +              N  KG+KRK+++               +IP  LKKQL DD E++
Subjt:  WVGLDRLLKFTEENVKKQQELNEKRGTDK--KASRASQTKPK--------------NVIKGRKRKNDAS--------------KIPVKLKKQLVDDSEFV

Query:  THLGKLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANI
            K+VKLPR+PNV++IL KYLE++ KKD    +SV EI+KG+  YFDKALP MLLYK ER+QY+E IV+D SPS+VYGAEHLLRLFV+LP+L S  N+
Subjt:  THLGKLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVGEIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANI

Query:  EEETLIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD
        EEET   +QQ L D LKF++KNQ+ F L S +  + +        D
Subjt:  EEETLIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGCCCTGACAATCGGGTTGCTTCCGACTCCGCCGATAACACATCCAAGAATGACTCCGAAGACGACGACGACAATGGAGTTCAGAGCCCGCCTTCCCATCCTTG
TCCTTTCTCAGAAGGAGAGAAGGTCCTCGCTTTTCATAGCTTCGTCATATATGAAGCTAAGAACCCCGATTTTGAAACAGTACTTCAATCTTTTCTTGAATATGCAATGT
GCCGATGTAGATTAGAGGATGCCTCAATCTTGCTTGCAATAATTTCATATAATTCAATGCTGAAACTTCTTAAAGTTCGTGTAGAGATCAATGTGAATGAAGATCGTTGG
GATGAGTGGGTAGGACTTGATAGATTACTGAAGTTTACTGAAGAGAATGTGAAGAAACAGCAGGAACTCAATGAGAAGAGGGGGACTGACAAGAAGGCATCTCGAGCATC
CCAAACTAAGCCTAAGAATGTGATAAAAGGGAGGAAGCGAAAGAATGATGCTAGTAAGATTCCAGTTAAACTGAAGAAGCAGCTAGTTGATGACAGTGAGTTTGTAACAC
ATCTTGGAAAGCTGGTAAAGCTCCCACGCACACCTAATGTAGAAGACATACTGAAGAAATATCTTGAATACAGGCTGAAGAAGGATGTAACGAAAGATGAATCGGTTGGA
GAAATCGTGAAGGGGTTGATTTGTTACTTCGATAAAGCTCTACCTGCAATGCTTCTATACAAGAGCGAACGCCAACAATATGAGGAATTGATCGTCAATGATGTATCCCC
TTCTTCCGTATATGGTGCTGAACACCTTCTACGGCTATTTGTTAGGTTGCCTGAGTTATTGTCTCAAGCCAATATTGAAGAAGAAACTTTGATAGAACTGCAACAGAAGT
TAGTTGACTTGCTCAAGTTTTTAAGGAAGAATCAGAATGCGTTTTTCTTGTCATCGTACCATGTGCCTGAGAATATGGAAACAAGTACCAACAATGCCGATGACTAA
mRNA sequenceShow/hide mRNA sequence
TGAGAACAGAAATCCCATGGTCCATTGTTGAATGAGTAAAGTTTTAGCATTACGTCGTCTCTTTGTGAGAAACGTATTGTTAGAGAGTAGAGGAGGTTTTGTTTTCCAAA
TTTTCCGATAGAGAAAACCCCAATTAGTCGGCTAAAACCCACCATTGCAGCCAGAAACCCCAATTGGAGTCATTGCTTCTTGTTTTTGCAGTTCCCAGACACCATTCTCT
CATAACCGGACCCCAACACGGTTCTCAATTCCCATTGAACACATTAGCTATCTAAATGGGAAGCCCTGACAATCGGGTTGCTTCCGACTCCGCCGATAACACATCCAAGA
ATGACTCCGAAGACGACGACGACAATGGAGTTCAGAGCCCGCCTTCCCATCCTTGTCCTTTCTCAGAAGGAGAGAAGGTCCTCGCTTTTCATAGCTTCGTCATATATGAA
GCTAAGAACCCCGATTTTGAAACAGTACTTCAATCTTTTCTTGAATATGCAATGTGCCGATGTAGATTAGAGGATGCCTCAATCTTGCTTGCAATAATTTCATATAATTC
AATGCTGAAACTTCTTAAAGTTCGTGTAGAGATCAATGTGAATGAAGATCGTTGGGATGAGTGGGTAGGACTTGATAGATTACTGAAGTTTACTGAAGAGAATGTGAAGA
AACAGCAGGAACTCAATGAGAAGAGGGGGACTGACAAGAAGGCATCTCGAGCATCCCAAACTAAGCCTAAGAATGTGATAAAAGGGAGGAAGCGAAAGAATGATGCTAGT
AAGATTCCAGTTAAACTGAAGAAGCAGCTAGTTGATGACAGTGAGTTTGTAACACATCTTGGAAAGCTGGTAAAGCTCCCACGCACACCTAATGTAGAAGACATACTGAA
GAAATATCTTGAATACAGGCTGAAGAAGGATGTAACGAAAGATGAATCGGTTGGAGAAATCGTGAAGGGGTTGATTTGTTACTTCGATAAAGCTCTACCTGCAATGCTTC
TATACAAGAGCGAACGCCAACAATATGAGGAATTGATCGTCAATGATGTATCCCCTTCTTCCGTATATGGTGCTGAACACCTTCTACGGCTATTTGTTAGGTTGCCTGAG
TTATTGTCTCAAGCCAATATTGAAGAAGAAACTTTGATAGAACTGCAACAGAAGTTAGTTGACTTGCTCAAGTTTTTAAGGAAGAATCAGAATGCGTTTTTCTTGTCATC
GTACCATGTGCCTGAGAATATGGAAACAAGTACCAACAATGCCGATGACTAATAGAATACATATGCCATTCTTACAATATCAAATTTAGCTATTCAAGCCCTTACTCCCT
TGTACCTTTTCATATCACTGTATATAAATCTAATATCGTCTGAACTTCAATGCGAACGATAATAAATAAACGTTAGGTTAGCCCTTTATATCAAATTCCAGCACTTTTTA
CTTTTCTTTGGATCTATCTCTAATCTTTGAGTTCGTACATCTCCTTTTACTTTAATAATTTAGTGTGAGAGTTTGTGTCTTCGTGATTTGATCTTAAAACTGTACTCCAC
AATCCACATTTCCAAATCCCAAGTGGTTGCTCAACTATTATATTATTATATACTATTTTGTTTTGTTCTTTAACCATTTGATTTATCTACAAAGCCATTTTTTGATTATT
CAATTCTTGTCCACCCACCACAGTTCCCACTTGTGTCTTTTCTTGGATTTTCGTCATCCTGTTCTGCATCTCGAAACTCTCTCGTTTCAAAAAAGATGCAAATCCAAAAA
ATTCCAAAGATTTTTTTTTTCTTTTTTTTTTTTTTTTGGGTTATTATTCTTTAGATACATGTAAATTAAAAACGAACAGGAATGTTGCCTAAATTTGCTAAAAGATCACC
CTCTTGAACCGATGATCTAGTTCTATCATATACGATATATTCGAAGTTCTCTACCTTGTTCTTCAACCCACAAGCCCCCAATTTCTTCATTCCTTCTGTACACTACCAAA
ATTTATTGCCGTCTATAGTTTTTGGGGGAGGGGAAAAGGAGAAAAAAAAAAAGAAAAAAAAAAAAAAAAAAAAAANAAAAAAAACAGAATAGGAGATCCTTTGAGTGATG
GGCGGCGTTAACAGGATGACTTCTTTGTCATCTCATTTCTAATCGAGCTTGCTTATCCCAATCCGTTTTACGACTGCGTTTGTGATAGCCTCCGTTTGCAAACACAGACC
ATTTTGTTCGATCCCAGACATGTAACTGTGCACTTCCTTTCTAATCATTTGCTGCATCACACCCAAGAACTCTGCGCTGAAGAACTGTCTCTCAATCCCCACCTTGTTCC
CACTCTGAACTGCCTGCCTGGGTTCCGACGCGAGTTTCGGTACCTCACCGGAGTCGGATCCAGGCAGAGAGAGGGTGAGAGAGGTGGGAGGATCGGTGGCCGTCTCCATC
GGCGGCTGATCTGAGTGAGAAACGAGAGGAGCGGACGGGAAAGGCTTACAGACACTGGCGGATCTCTTGAGAGGGTGTTCGTCTTCGGGGATGGAGAATGAGCATTTACG
CTTGAGAGTAGAGTTCCAGTGGTTTTTGATAGCGTTGTCGGTGCGACCATTAAGGAGACGAGCAATGGTAGCCCACTTGTTTCCGAATCTGGAGTGAGCCTGGATAATGG
TATCATCTTCATGGGGAGTGAAGGGACGGTGCTCCACCTCGGGCGAAAGCTGGTTGCACCACCTGAGACGGCAAGACTTACCGGATCGACCGGGAATAGACTTGCTTATA
AGAGACCAATTCCTAGGGCCGTAGTTATGAACAAGGCGGCGCAGCAACTCATCCTCGTCGGGGCTCCAAGGACCCTTTATCCGGTCAACCACTACATTAGTATTGGGAGT
GGTAGCCATAAATGAGAAGATTGTTTTGTAAGAAAGAAGAAGAAAAGAAGCGGAGGCGTAGCCTCAGCCTGTAGTAAGAGAGAAGAAGAAGAGAAAGAAGGAAGAAGGAA
TTTATAAAGAGAGTGAGTGGTGACCGGTGATAGCCGTTTGTATTGTAGTTAGAAGGAAAGGTAAGGAAAGCAGTTAGTAGGAAATGCGTATAACAGCTGTCAAAGGACAC
GCGTCGTGCAACGGTGGCGAGATGAACACACGTATTATGAAGCTGTTTGGGTTCCTTTTCAAAACCGCCTTGGCTTTTGATTTTATTAATTAATTGCACACGTCCTTCAA
AACACCTTGTTTCTACAATAACTAACCAAATGGGTTTTGGGAATTAATATTGAAAAAAGAAAAAGGGAAATAGGAATTAGTTAGAATTAGATAAATGGTAATAATGAAAT
TAGAGAAAAAAATAGAGAGAGAGACAGAGAGAGACAGAGAGAGGGTGTAGAAGGAAAGAAGGAAGAAAGGATACGGTTGGGAGTTGGATGGGAGCGGACGACTTTGACTG
AGGCAGCCAGGCTGTATAGCATTTACCAAAAAAAAAAAAAAAAAGTAGGGATATTTTACATCCGAAAAGGACGCACGGAAGTGGTGAGCTGGGTTGGG
Protein sequenceShow/hide protein sequence
MGSPDNRVASDSADNTSKNDSEDDDDNGVQSPPSHPCPFSEGEKVLAFHSFVIYEAKNPDFETVLQSFLEYAMCRCRLEDASILLAIISYNSMLKLLKVRVEINVNEDRW
DEWVGLDRLLKFTEENVKKQQELNEKRGTDKKASRASQTKPKNVIKGRKRKNDASKIPVKLKKQLVDDSEFVTHLGKLVKLPRTPNVEDILKKYLEYRLKKDVTKDESVG
EIVKGLICYFDKALPAMLLYKSERQQYEELIVNDVSPSSVYGAEHLLRLFVRLPELLSQANIEEETLIELQQKLVDLLKFLRKNQNAFFLSSYHVPENMETSTNNADD