; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG01G017100 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG01G017100
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionChromatin modification-related protein EAF3-like
Genome locationCG_Chr01:31636874..31652614
RNA-Seq ExpressionClCG01G017100
SyntenyClCG01G017100
Gene Ontology termsGO:0006325 - chromatin organization (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
InterPro domainsIPR000953 - Chromo/chromo shadow domain
IPR008676 - MRG
IPR016197 - Chromo-like domain superfamily
IPR025995 - RNA binding activity-knot of a chromodomain
IPR026541 - MRG domain
IPR038217 - MRG, C-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7034319.1 Protein MRG1, partial [Cucurbita argyrosperma subsp. argyrosperma]3.5e-17287.26Show/hide
Query:  KDSGPTPGAGHSHSSEPHASSRGSGRASYLAGSLAISECSRMVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRY
        K+ G        HSSE HA+SRGSGR  YLA SLA+S+CSRMVNSSKDDTAT+GDMS GDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQ+VELRKKEWRY
Subjt:  KDSGPTPGAGHSHSSEPHASSRGSGRASYLAGSLAISECSRMVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRY

Query:  FLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEKSSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSV
        FLHYLGWNKNWDEWVSVDRLMKCT+ENRLKQRALEKGYVEKSSKSGRSA AKPKN I            DARVEKED KNNAPKGKKRKNDS TKDNQSV
Subjt:  FLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEKSSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSV

Query:  EKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDILTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPST
        EK IKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDILTKYLEYRSKRDGTITDSLGE+LKGIRCYFDKALPVLLLYNKEREQYHELVVD+VSPST
Subjt:  EKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDILTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPST

Query:  IYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKFLQKNQSTFFVSTYEGCKGTEGKGKGKND
        IYGAEHLLRLFVKLP LLAYVNIEDETQ RLH KLLDFLKFLQKNQSTFFVS YEG K  EGKGKGKND
Subjt:  IYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKFLQKNQSTFFVSTYEGCKGTEGKGKGKND

XP_008440326.1 PREDICTED: protein MRG1 [Cucumis melo]1.2e-16793.29Show/hide
Query:  MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK
        MVNSSKDD ATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQ+VELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK
Subjt:  MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK

Query:  SSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI
        SSKSGRSAQAKPKN             TDARVEKE+HKNNAPKGKKRKNDSGTKDNQSVEK IKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI
Subjt:  SSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI

Query:  LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF
        LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKER+QYH+LVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQ RLHQKLLDFLKF
Subjt:  LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF

Query:  LQKNQSTFFVSTYEGCKGTEGKGKGKND
        LQKNQSTFFVS YEGCKGTEGKGK KND
Subjt:  LQKNQSTFFVSTYEGCKGTEGKGKGKND

XP_022132780.1 protein MRG1 isoform X2 [Momordica charantia]2.2e-16692.68Show/hide
Query:  MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK
        MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENR KQ+ALEKGY EK
Subjt:  MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK

Query:  SSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI
        SSKSGRSAQAKPKNS            TDARVEKEDHKNN  KGKKRKNDSG KDNQSVEK IKIQIPSTLRKQLVDDWEFVTQQDKLVKLPR+PTVDDI
Subjt:  SSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI

Query:  LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF
        LTKYLE+RSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVD+VSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF
Subjt:  LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF

Query:  LQKNQSTFFVSTYEGCKGTEGKGKGKND
        LQKNQSTFFVS Y+GCKGTEGKGKGKND
Subjt:  LQKNQSTFFVSTYEGCKGTEGKGKGKND

XP_022963263.1 protein MRG1-like [Cucurbita moschata]1.6e-16491.77Show/hide
Query:  MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK
        MVNSSKDD ATDGDMSSG SPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVS+DRLMKCTDENRLKQRALEKGYVEK
Subjt:  MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK

Query:  SSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI
        SSKSGRSAQAKPKNSI            DA+VEKED KNNAPKGKKRKNDSGTKDNQSVEK IKIQIPSTLRKQLVDDW+FVTQQDK+VKLPR+PTVDDI
Subjt:  SSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI

Query:  LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF
        LTKYLEYRSKRDG+ITDSLGEVLKGIRCYFDKALPVLLLYNKEREQY ELV D+VSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF
Subjt:  LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF

Query:  LQKNQSTFFVSTYEGCKGTEGKGKGKND
        LQKNQST+FVS YEGCKGTEGKGKGKND
Subjt:  LQKNQSTFFVSTYEGCKGTEGKGKGKND

XP_038881659.1 protein MRG1 [Benincasa hispida]6.2e-16994.21Show/hide
Query:  MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK
        MVNSSKDD ATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK
Subjt:  MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK

Query:  SSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI
        SSKSGRSAQAKPKNSI            DARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEK IKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI
Subjt:  SSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI

Query:  LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF
        LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHE VV++VSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF
Subjt:  LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF

Query:  LQKNQSTFFVSTYEGCKGTEGKGKGKND
        LQKNQSTFFVS YEGCKGTEGKGK KND
Subjt:  LQKNQSTFFVSTYEGCKGTEGKGKGKND

TrEMBL top hitse value%identityAlignment
A0A1S3B0U2 protein MRG15.6e-16893.29Show/hide
Query:  MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK
        MVNSSKDD ATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQ+VELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK
Subjt:  MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK

Query:  SSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI
        SSKSGRSAQAKPKN             TDARVEKE+HKNNAPKGKKRKNDSGTKDNQSVEK IKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI
Subjt:  SSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI

Query:  LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF
        LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKER+QYH+LVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQ RLHQKLLDFLKF
Subjt:  LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF

Query:  LQKNQSTFFVSTYEGCKGTEGKGKGKND
        LQKNQSTFFVS YEGCKGTEGKGK KND
Subjt:  LQKNQSTFFVSTYEGCKGTEGKGKGKND

A0A5D3CNW9 Protein MRG15.6e-16893.29Show/hide
Query:  MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK
        MVNSSKDD ATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQ+VELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK
Subjt:  MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK

Query:  SSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI
        SSKSGRSAQAKPKN             TDARVEKE+HKNNAPKGKKRKNDSGTKDNQSVEK IKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI
Subjt:  SSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI

Query:  LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF
        LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKER+QYH+LVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQ RLHQKLLDFLKF
Subjt:  LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF

Query:  LQKNQSTFFVSTYEGCKGTEGKGKGKND
        LQKNQSTFFVS YEGCKGTEGKGK KND
Subjt:  LQKNQSTFFVSTYEGCKGTEGKGKGKND

A0A6J1BTF5 protein MRG1 isoform X21.1e-16692.68Show/hide
Query:  MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK
        MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENR KQ+ALEKGY EK
Subjt:  MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK

Query:  SSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI
        SSKSGRSAQAKPKNS            TDARVEKEDHKNN  KGKKRKNDSG KDNQSVEK IKIQIPSTLRKQLVDDWEFVTQQDKLVKLPR+PTVDDI
Subjt:  SSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI

Query:  LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF
        LTKYLE+RSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVD+VSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF
Subjt:  LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF

Query:  LQKNQSTFFVSTYEGCKGTEGKGKGKND
        LQKNQSTFFVS Y+GCKGTEGKGKGKND
Subjt:  LQKNQSTFFVSTYEGCKGTEGKGKGKND

A0A6J1HH91 protein MRG1-like7.6e-16591.77Show/hide
Query:  MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK
        MVNSSKDD ATDGDMSSG SPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVS+DRLMKCTDENRLKQRALEKGYVEK
Subjt:  MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK

Query:  SSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI
        SSKSGRSAQAKPKNSI            DA+VEKED KNNAPKGKKRKNDSGTKDNQSVEK IKIQIPSTLRKQLVDDW+FVTQQDK+VKLPR+PTVDDI
Subjt:  SSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI

Query:  LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF
        LTKYLEYRSKRDG+ITDSLGEVLKGIRCYFDKALPVLLLYNKEREQY ELV D+VSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF
Subjt:  LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF

Query:  LQKNQSTFFVSTYEGCKGTEGKGKGKND
        LQKNQST+FVS YEGCKGTEGKGKGKND
Subjt:  LQKNQSTFFVSTYEGCKGTEGKGKGKND

A0A6J1KPL7 protein MRG1-like7.6e-16591.46Show/hide
Query:  MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK
        MVNS KDD ATDGDMSSG SPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVS+DRLMKCTDENRLKQRALEKGYVEK
Subjt:  MVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEK

Query:  SSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI
        SSKSGRSAQAKPKNSI            DA+VEKEDHKNNAPKGKKRKNDSGTKDNQSVEK IKIQIPSTLRKQLVDDW+FVTQQDK+VKLPR+PTVDDI
Subjt:  SSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDI

Query:  LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF
        LTKYLEYRSKRDG+ITDSLGEVLKGIRCYFDKALPVLLLYNKEREQY ELV D+VSPSTIYGAEHLLRLFVKLPELL YVNIEDETQTRLHQKLLDFLKF
Subjt:  LTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKF

Query:  LQKNQSTFFVSTYEGCKGTEGKGKGKND
        LQKNQST+FVS YEGCKGTEGKGKGKND
Subjt:  LQKNQSTFFVSTYEGCKGTEGKGKGKND

SwissProt top hitse value%identityAlignment
Q4V3E2 Protein MRG21.2e-7448.01Show/hide
Query:  SKDDTATDGDMSSGDSP----PSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVE-
        S  DT T+ D    D P    P+    + EGE+VLA H    YEAKV +VE +  EW+YF+HY+GWNK+WDEW+ +D L+K +DEN  KQ+  E+G  + 
Subjt:  SKDDTATDGDMSSGDSP----PSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVE-

Query:  --KSSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSV--EKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSP
          KS+ + + ++ KP++                         N  +G+KRK DS   +   +  +  +   IP  LRKQL+DD+EFVTQ  KLV+LPRSP
Subjt:  --KSSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSV--EKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSP

Query:  TVDDILTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLL
         VD IL KY++ + K+ G +TDSL E+LKG+RCYFDKALPV+LLYN ER+QY E V   VSPST+YGAEHLLRLFVKLPELL +VN+ +ET   L    +
Subjt:  TVDDILTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLL

Query:  DFLKFLQKNQSTFFVSTYEGCKGTEGK
        D L+FL+KNQS  FVSTY+  +  E K
Subjt:  DFLKFLQKNQSTFFVSTYEGCKGTEGK

Q5NVP9 Mortality factor 4-like protein 11.3e-4737.75Show/hide
Query:  YSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEKSSKSGRSAQAKPKNSIGFSCESAKAST
        + EGE+VL +HGP +YEAK  +V ++ K+ +YF+H+ GWNKNWDEWV   R++K  D N  KQR L+K   E+ ++      A  K + G   ++    T
Subjt:  YSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEKSSKSGRSAQAKPKNSIGFSCESAKAST

Query:  TDARVEKEDHKNNA-------PKGKKRKNDSGTKDNQSV---EKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDILTKYLEYRSKRDGTITD
           + +   + +         P  KKR     T +N+        +K++IP  L+  LVDDW+ +T+Q +L  LP    VD IL  Y  Y+  R  T   
Subjt:  TDARVEKEDHKNNA-------PKGKKRKNDSGTKDNQSV---EKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDILTKYLEYRSKRDGTITD

Query:  --SLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVD--DVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKFLQKNQSTFF-VST
          ++ EV+ GI+ YF+  L   LLY  ER QY E++ D  D   S +YGA HLLRLFV++  +LAY  +++++   L   L DFLK+L KN +T F  S 
Subjt:  --SLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVD--DVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKFLQKNQSTFF-VST

Query:  YE
        YE
Subjt:  YE

Q6AYU1 Mortality factor 4-like protein 11.9e-4838.08Show/hide
Query:  YSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEKSSKSGRSAQAKPKNSIGFSCESAKAST
        + EGE+VL +HGP +YEAK  +V ++ K+ +YF+HY GWNKNWDEWV   R++K  D N  KQR L+K   E+ ++      A  K + G   ++ +  T
Subjt:  YSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEKSSKSGRSAQAKPKNSIGFSCESAKAST

Query:  TDARVEKEDHKNNA-------PKGKKRKNDSGTKDNQSV---EKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDILTKYLEYRSKRDGTITD
           + +   + +         P  KKR     T +N+        +K++IP  L+  LVDDW+ +T+Q +L  LP    VD IL  Y  Y+  R  T   
Subjt:  TDARVEKEDHKNNA-------PKGKKRKNDSGTKDNQSV---EKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDILTKYLEYRSKRDGTITD

Query:  --SLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVD--DVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKFLQKNQSTFF-VST
          ++ EV+ GI+ YF+  L   LLY  ER QY E++ D  D   S +YGA HLLRLFV++  +LAY  +++++   L   L DFLK+L KN +T F  S 
Subjt:  --SLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVD--DVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKFLQKNQSTFF-VST

Query:  YE
        YE
Subjt:  YE

Q94C32 Protein MRG11.4e-11563.86Show/hide
Query:  MVNSSKDDTATDGDMSSGDSPPSNSS-LYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGY-V
        M +SSK++TA+DGD +SG + PSN   L+SEGE+VLAYHGPR+Y AKVQ+VELRKKEW+YF+HYLGWNKNWDEWVS DRL+K T+EN +KQ+AL+K   V
Subjt:  MVNSSKDDTATDGDMSSGDSPPSNSS-LYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGY-V

Query:  EKSSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGT-KDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTV
        EK +KSGRSAQ K +            S+ D + +K+D K NA KGKKRK++SG  KDN + EK +KIQIP++L+KQL DDWE++ Q+DK+VKLPRSP V
Subjt:  EKSSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGT-KDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTV

Query:  DDILTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDF
        D+IL+KYLE+++K+DG +TDS+ E+LKGIR YFDKALPV+LLY KER QY E +VDD SPST+YGAEHLLRLFVKLP+L +YVN+E+ET +R+ Q L DF
Subjt:  DDILTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDF

Query:  LKFLQKNQSTFFV-STYEGCKGTEGKGKGKND
        LKF+QKNQSTF + S Y+  K ++GKGKGK+D
Subjt:  LKFLQKNQSTFFV-STYEGCKGTEGKGKGKND

Q9UBU8 Mortality factor 4-like protein 14.6e-4233.43Show/hide
Query:  YSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNK---------------------------------------NWDEWVSVDRLMKCTDENRL
        + EGE+VL +HGP +YEAK  +V ++ K+ +YF+HY GWNK                                       +WDEWV   R++K  D N  
Subjt:  YSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNK---------------------------------------NWDEWVSVDRLMKCTDENRL

Query:  KQRALEKGYVEKSSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNA-------PKGKKRKNDSGTKDNQSV---EKAIKIQIPSTLRKQLVDD
        KQR L+K   E+ ++      A  K + G   ++ +  T   + +   + +         P  KKR     T +N+        +K++IP  L+  LVDD
Subjt:  KQRALEKGYVEKSSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNA-------PKGKKRKNDSGTKDNQSV---EKAIKIQIPSTLRKQLVDD

Query:  WEFVTQQDKLVKLPRSPTVDDILTKYLEYRSKRDGTITD--SLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVD--DVSPSTIYGAEHLLRLFVKLP
        W+ +T+Q +L  LP    VD IL  Y  Y+  R  T     ++ EV+ GI+ YF+  L   LLY  ER QY E++ D  D   S +YGA HLLRLFV++ 
Subjt:  WEFVTQQDKLVKLPRSPTVDDILTKYLEYRSKRDGTITD--SLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVD--DVSPSTIYGAEHLLRLFVKLP

Query:  ELLAYVNIEDETQTRLHQKLLDFLKFLQKNQSTFF-VSTYE
         +LAY  +++++   L   L DFLK+L KN +T F  S YE
Subjt:  ELLAYVNIEDETQTRLHQKLLDFLKFLQKNQSTFF-VSTYE

Arabidopsis top hitse value%identityAlignment
AT1G02740.1 MRG family protein8.5e-7648.01Show/hide
Query:  SKDDTATDGDMSSGDSP----PSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVE-
        S  DT T+ D    D P    P+    + EGE+VLA H    YEAKV +VE +  EW+YF+HY+GWNK+WDEW+ +D L+K +DEN  KQ+  E+G  + 
Subjt:  SKDDTATDGDMSSGDSP----PSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVE-

Query:  --KSSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSV--EKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSP
          KS+ + + ++ KP++                         N  +G+KRK DS   +   +  +  +   IP  LRKQL+DD+EFVTQ  KLV+LPRSP
Subjt:  --KSSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSV--EKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSP

Query:  TVDDILTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLL
         VD IL KY++ + K+ G +TDSL E+LKG+RCYFDKALPV+LLYN ER+QY E V   VSPST+YGAEHLLRLFVKLPELL +VN+ +ET   L    +
Subjt:  TVDDILTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLL

Query:  DFLKFLQKNQSTFFVSTYEGCKGTEGK
        D L+FL+KNQS  FVSTY+  +  E K
Subjt:  DFLKFLQKNQSTFFVSTYEGCKGTEGK

AT2G23270.1 unknown protein7.6e-0844.32Show/hide
Query:  MGKILI-TFILVVALLGNGGLEGRPLSLMASGSGSAAAIAKDFFEGLSLGAIKQSGPSAGGDGHKFVN-YDTFGGIKDSGP-TPGAGH
        M K+++ +F+  + +  N  +E RPL L  +     A     FF+GLSLGAIK+SGPS+GG+GH+FV+  +T    K SGP T G GH
Subjt:  MGKILI-TFILVVALLGNGGLEGRPLSLMASGSGSAAAIAKDFFEGLSLGAIKQSGPSAGGDGHKFVN-YDTFGGIKDSGP-TPGAGH

AT4G37280.1 MRG family protein1.0e-11663.86Show/hide
Query:  MVNSSKDDTATDGDMSSGDSPPSNSS-LYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGY-V
        M +SSK++TA+DGD +SG + PSN   L+SEGE+VLAYHGPR+Y AKVQ+VELRKKEW+YF+HYLGWNKNWDEWVS DRL+K T+EN +KQ+AL+K   V
Subjt:  MVNSSKDDTATDGDMSSGDSPPSNSS-LYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGY-V

Query:  EKSSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGT-KDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTV
        EK +KSGRSAQ K +            S+ D + +K+D K NA KGKKRK++SG  KDN + EK +KIQIP++L+KQL DDWE++ Q+DK+VKLPRSP V
Subjt:  EKSSKSGRSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGT-KDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTV

Query:  DDILTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDF
        D+IL+KYLE+++K+DG +TDS+ E+LKGIR YFDKALPV+LLY KER QY E +VDD SPST+YGAEHLLRLFVKLP+L +YVN+E+ET +R+ Q L DF
Subjt:  DDILTKYLEYRSKRDGTITDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDF

Query:  LKFLQKNQSTFFV-STYEGCKGTEGKGKGKND
        LKF+QKNQSTF + S Y+  K ++GKGKGK+D
Subjt:  LKFLQKNQSTFFV-STYEGCKGTEGKGKGKND

AT4G37290.1 unknown protein1.7e-0745.98Show/hide
Query:  MGKILITFILVVALLGNGGLEGRPLSLMASGSGSAAAIAKDFFEGLSLGAIKQSGPSAGGDGHKFVN-YDTFGGIKDSGPTP-GAGH
        M K +++ IL   L+G+  +E RPL L  +     A++    F+GLSLG+IK SGPS  G+GHK V+  DTF  +K SGP+P G GH
Subjt:  MGKILITFILVVALLGNGGLEGRPLSLMASGSGSAAAIAKDFFEGLSLGAIKQSGPSAGGDGHKFVN-YDTFGGIKDSGPTP-GAGH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGATTTTAATCACTTTCATTCTGGTGGTTGCGCTGTTGGGCAATGGCGGCCTAGAAGGCAGGCCACTCAGCCTCATGGCCTCAGGCTCGGGCTCTGCGGCCGC
CATCGCTAAAGATTTCTTCGAGGGGCTTTCTCTGGGAGCCATTAAGCAGTCGGGTCCAAGCGCCGGTGGCGATGGTCACAAATTCGTCAACTATGACACGTTTGGAGGTA
TTAAGGATTCTGGCCCCACTCCTGGTGCTGGACATAGCCACTCGTCAGAGCCCCACGCGAGCTCACGAGGAAGTGGCCGAGCCTCGTATCTGGCTGGTTCTCTTGCTATC
TCAGAGTGTTCGAGAATGGTAAACTCCTCCAAGGACGACACAGCCACCGACGGCGACATGTCCAGCGGTGATTCTCCACCCTCCAATTCCAGCCTCTACTCCGAAGGCGA
GAAGGTTCTTGCCTATCACGGTCCCCGCATCTATGAAGCCAAGGTCCAAAGAGTTGAGCTTAGGAAGAAGGAATGGAGATACTTTCTGCACTATCTTGGCTGGAATAAAA
ATTGGGACGAATGGGTAAGCGTAGATCGGCTGATGAAATGTACTGATGAGAACCGTTTGAAGCAGCGAGCCCTTGAGAAAGGGTATGTAGAAAAGAGCTCAAAGTCTGGA
CGCTCTGCACAAGCAAAGCCGAAAAACTCAATCGGTTTTTCATGCGAGTCAGCTAAAGCTTCAACCACTGATGCAAGAGTGGAGAAAGAGGACCACAAGAACAATGCACC
AAAGGGGAAGAAACGAAAGAATGACTCAGGCACCAAGGACAATCAATCCGTAGAGAAAGCTATCAAGATTCAAATACCTTCAACACTTAGAAAGCAACTTGTTGATGACT
GGGAATTTGTTACACAGCAGGATAAGCTAGTCAAGCTTCCACGCTCACCTACAGTTGATGACATTTTGACAAAGTACTTGGAATACAGATCGAAGAGGGATGGCACGATT
ACTGACTCGCTGGGAGAAGTCTTGAAAGGAATACGGTGTTATTTTGATAAAGCATTGCCTGTACTGCTCTTGTACAACAAAGAACGTGAACAGTATCATGAATTAGTCGT
AGATGATGTCTCTCCATCAACTATATATGGTGCGGAACATCTGCTTCGACTCTTTGTTAAGTTGCCAGAGCTTTTGGCATATGTGAATATTGAAGATGAAACACAGACCC
GGCTACACCAGAAGTTGCTCGACTTTCTGAAATTTCTGCAGAAGAATCAGAGTACTTTCTTTGTCTCAACATATGAGGGCTGTAAAGGTACTGAGGGAAAGGGTAAGGGC
AAGAATGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAGATTTTAATCACTTTCATTCTGGTGGTTGCGCTGTTGGGCAATGGCGGCCTAGAAGGCAGGCCACTCAGCCTCATGGCCTCAGGCTCGGGCTCTGCGGCCGC
CATCGCTAAAGATTTCTTCGAGGGGCTTTCTCTGGGAGCCATTAAGCAGTCGGGTCCAAGCGCCGGTGGCGATGGTCACAAATTCGTCAACTATGACACGTTTGGAGGTA
TTAAGGATTCTGGCCCCACTCCTGGTGCTGGACATAGCCACTCGTCAGAGCCCCACGCGAGCTCACGAGGAAGTGGCCGAGCCTCGTATCTGGCTGGTTCTCTTGCTATC
TCAGAGTGTTCGAGAATGGTAAACTCCTCCAAGGACGACACAGCCACCGACGGCGACATGTCCAGCGGTGATTCTCCACCCTCCAATTCCAGCCTCTACTCCGAAGGCGA
GAAGGTTCTTGCCTATCACGGTCCCCGCATCTATGAAGCCAAGGTCCAAAGAGTTGAGCTTAGGAAGAAGGAATGGAGATACTTTCTGCACTATCTTGGCTGGAATAAAA
ATTGGGACGAATGGGTAAGCGTAGATCGGCTGATGAAATGTACTGATGAGAACCGTTTGAAGCAGCGAGCCCTTGAGAAAGGGTATGTAGAAAAGAGCTCAAAGTCTGGA
CGCTCTGCACAAGCAAAGCCGAAAAACTCAATCGGTTTTTCATGCGAGTCAGCTAAAGCTTCAACCACTGATGCAAGAGTGGAGAAAGAGGACCACAAGAACAATGCACC
AAAGGGGAAGAAACGAAAGAATGACTCAGGCACCAAGGACAATCAATCCGTAGAGAAAGCTATCAAGATTCAAATACCTTCAACACTTAGAAAGCAACTTGTTGATGACT
GGGAATTTGTTACACAGCAGGATAAGCTAGTCAAGCTTCCACGCTCACCTACAGTTGATGACATTTTGACAAAGTACTTGGAATACAGATCGAAGAGGGATGGCACGATT
ACTGACTCGCTGGGAGAAGTCTTGAAAGGAATACGGTGTTATTTTGATAAAGCATTGCCTGTACTGCTCTTGTACAACAAAGAACGTGAACAGTATCATGAATTAGTCGT
AGATGATGTCTCTCCATCAACTATATATGGTGCGGAACATCTGCTTCGACTCTTTGTTAAGTTGCCAGAGCTTTTGGCATATGTGAATATTGAAGATGAAACACAGACCC
GGCTACACCAGAAGTTGCTCGACTTTCTGAAATTTCTGCAGAAGAATCAGAGTACTTTCTTTGTCTCAACATATGAGGGCTGTAAAGGTACTGAGGGAAAGGGTAAGGGC
AAGAATGACTGA
Protein sequenceShow/hide protein sequence
MGKILITFILVVALLGNGGLEGRPLSLMASGSGSAAAIAKDFFEGLSLGAIKQSGPSAGGDGHKFVNYDTFGGIKDSGPTPGAGHSHSSEPHASSRGSGRASYLAGSLAI
SECSRMVNSSKDDTATDGDMSSGDSPPSNSSLYSEGEKVLAYHGPRIYEAKVQRVELRKKEWRYFLHYLGWNKNWDEWVSVDRLMKCTDENRLKQRALEKGYVEKSSKSG
RSAQAKPKNSIGFSCESAKASTTDARVEKEDHKNNAPKGKKRKNDSGTKDNQSVEKAIKIQIPSTLRKQLVDDWEFVTQQDKLVKLPRSPTVDDILTKYLEYRSKRDGTI
TDSLGEVLKGIRCYFDKALPVLLLYNKEREQYHELVVDDVSPSTIYGAEHLLRLFVKLPELLAYVNIEDETQTRLHQKLLDFLKFLQKNQSTFFVSTYEGCKGTEGKGKG
KND