; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026827 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026827
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRING-type domain-containing protein
Genome locationtig00153047:1368466..1378542
RNA-Seq ExpressionSgr026827
SyntenySgr026827
Gene Ontology termsGO:0016567 - protein ubiquitination (biological process)
GO:0030014 - CCR4-NOT complex (cellular component)
GO:0004842 - ubiquitin-protein transferase activity (molecular function)
InterPro domainsIPR001841 - Zinc finger, RING-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR039515 - NOT4, modified RING finger, HC subclass (C4C4-type)
IPR039780 - CCR4-NOT transcription complex subunit 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057903.1 suppressor protein SRP40 isoform X2 [Cucumis melo var. makuwa]1.8e-15271.36Show/hide
Query:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF
        MVTDSI NASISLAPNARDLPLPKKKRVRN+KLKQSKLDVRREQWLSRGAVKNKKWNEED+RLDSVVIKT KDDNLSS+KINMRE+GEEN  SVHH SD+
Subjt:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLT-EEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG-------------
        S+SPSNSPPSL SSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGS+T EEEEEEGDDDCLDDWEAIADALAATDKQ DQ SE SP G             
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLT-EEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG-------------

Query:  -------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
                           RA +N RAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGG+PWACGGV+ VPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
Subjt:  -------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF

Query:  CHKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMISSFLWYGGISHFYTRYSFAVDIDLAGANVFLKFRM------HLQLFSNF---PI
        CHKRILEEDGRCPGCRKPY+RDPADNETN+  GS T PLARSCSMIS             R      + +AG +   +F M       L ++ ++     
Subjt:  CHKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMISSFLWYGGISHFYTRYSFAVDIDLAGANVFLKFRM------HLQLFSNF---PI

Query:  SSRRRTRPISMNKAPAPQQRWSAVSV
         S    R ISM++A    QRW AVSV
Subjt:  SSRRRTRPISMNKAPAPQQRWSAVSV

TYJ98591.1 suppressor protein SRP40 isoform X2 [Cucumis melo var. makuwa]2.3e-15282.18Show/hide
Query:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF
        MVTDSI NASISLAPNARDLPLPKKKRVRN+KLKQSKLDVRREQWLSRGAVKNKKWNEED+RLDSVVIKT KDDNLSS+KINMRE+GEEN  SVHH SD+
Subjt:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLT-EEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG-------------
        S+SPSNSPPSL SSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGS+T EEEEEEGDDDCLDDWEAIADALAATDKQ DQ SE SP G             
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLT-EEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG-------------

Query:  -------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
                           RA +N RAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGG+PWACGGV+ VPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
Subjt:  -------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF

Query:  CHKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMISS
        CHKRILEEDGRCPGCRKPY+RDPADNETN+  GS T PLARSCSMISS
Subjt:  CHKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMISS

XP_004138290.2 uncharacterized protein LOC101211244 [Cucumis sativus]5.6e-15181.84Show/hide
Query:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF
        MVTDSI NASISLAPNARDLPLPKKKRVRN+KLKQSKLDVRREQWLSRGAVKNKKWNEED+RLDSVVIKT KDD+LSS+KINMRE+GEEN  SVHH SD+
Subjt:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLT-EEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG-------------
        S+SPSNSPPSL SSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGS+T EEEEEEGDDDCLDDWEAIADALAATDKQHDQ SE SP G             
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLT-EEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG-------------

Query:  -------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
                           RA +N RAWRPDDAFRPQSLPTLSKQLSLP TDRRFGCGG+ WACGGV+ VPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
Subjt:  -------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF

Query:  CHKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMIS
        CHKRILEEDGRCPGCRKPY+RDPADNETNV  GS T PLARSCSMIS
Subjt:  CHKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMIS

XP_008453149.1 PREDICTED: uncharacterized protein LOC103493947 [Cucumis melo]6.7e-15282.13Show/hide
Query:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF
        MVTDSI NASISLAPNARDLPLPKKKRVRN+KLKQSKLDVRREQWLSRGAVKNKKWNEED+RLDSVVIKT KDDNLSS+KINMRE+GEEN  SVHH SD+
Subjt:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLT-EEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG-------------
        S+SPSNSPPSL SSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGS+T EEEEEEGDDDCLDDWEAIADALAATDKQ DQ SE SP G             
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLT-EEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG-------------

Query:  -------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
                           RA +N RAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGG+PWACGGV+ VPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
Subjt:  -------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF

Query:  CHKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMIS
        CHKRILEEDGRCPGCRKPY+RDPADNETN+  GS T PLARSCSMIS
Subjt:  CHKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMIS

XP_022134843.1 uncharacterized protein LOC111007016 [Momordica charantia]8.4e-15583.82Show/hide
Query:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF
        MVTDSITNASISLAP+ARDLPLPKKKRV+N+KLKQSKLDVRREQWLSRGAVKNKKWNEED+RLDSVVIKTSKDDNLSSKKINMRE+G +N GSVHHDSDF
Subjt:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLTEEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG--------------
        SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGS+T   EEEGDDD LDDWEAIADALAATDKQHDQ SE SP G              
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLTEEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG--------------

Query:  ------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFC
                          RAP++ RAWRPDDAFRPQSLPTLSKQLSLPNTDRR+GCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFC
Subjt:  ------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFC

Query:  HKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMIS
        HKRILEEDGRCPGCRKPYE DPAD+ETNVHGGS T PLARSCSMIS
Subjt:  HKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMIS

TrEMBL top hitse value%identityAlignment
A0A1S3BVJ1 uncharacterized protein LOC1034939473.2e-15282.13Show/hide
Query:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF
        MVTDSI NASISLAPNARDLPLPKKKRVRN+KLKQSKLDVRREQWLSRGAVKNKKWNEED+RLDSVVIKT KDDNLSS+KINMRE+GEEN  SVHH SD+
Subjt:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLT-EEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG-------------
        S+SPSNSPPSL SSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGS+T EEEEEEGDDDCLDDWEAIADALAATDKQ DQ SE SP G             
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLT-EEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG-------------

Query:  -------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
                           RA +N RAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGG+PWACGGV+ VPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
Subjt:  -------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF

Query:  CHKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMIS
        CHKRILEEDGRCPGCRKPY+RDPADNETN+  GS T PLARSCSMIS
Subjt:  CHKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMIS

A0A5A7UPV5 Suppressor protein SRP40 isoform X28.5e-15371.36Show/hide
Query:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF
        MVTDSI NASISLAPNARDLPLPKKKRVRN+KLKQSKLDVRREQWLSRGAVKNKKWNEED+RLDSVVIKT KDDNLSS+KINMRE+GEEN  SVHH SD+
Subjt:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLT-EEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG-------------
        S+SPSNSPPSL SSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGS+T EEEEEEGDDDCLDDWEAIADALAATDKQ DQ SE SP G             
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLT-EEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG-------------

Query:  -------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
                           RA +N RAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGG+PWACGGV+ VPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
Subjt:  -------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF

Query:  CHKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMISSFLWYGGISHFYTRYSFAVDIDLAGANVFLKFRM------HLQLFSNF---PI
        CHKRILEEDGRCPGCRKPY+RDPADNETN+  GS T PLARSCSMIS             R      + +AG +   +F M       L ++ ++     
Subjt:  CHKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMISSFLWYGGISHFYTRYSFAVDIDLAGANVFLKFRM------HLQLFSNF---PI

Query:  SSRRRTRPISMNKAPAPQQRWSAVSV
         S    R ISM++A    QRW AVSV
Subjt:  SSRRRTRPISMNKAPAPQQRWSAVSV

A0A5D3BIS7 Suppressor protein SRP40 isoform X21.1e-15282.18Show/hide
Query:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF
        MVTDSI NASISLAPNARDLPLPKKKRVRN+KLKQSKLDVRREQWLSRGAVKNKKWNEED+RLDSVVIKT KDDNLSS+KINMRE+GEEN  SVHH SD+
Subjt:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLT-EEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG-------------
        S+SPSNSPPSL SSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGS+T EEEEEEGDDDCLDDWEAIADALAATDKQ DQ SE SP G             
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLT-EEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG-------------

Query:  -------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
                           RA +N RAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGG+PWACGGV+ VPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
Subjt:  -------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF

Query:  CHKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMISS
        CHKRILEEDGRCPGCRKPY+RDPADNETN+  GS T PLARSCSMISS
Subjt:  CHKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMISS

A0A6J1C357 uncharacterized protein LOC1110070164.1e-15583.82Show/hide
Query:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF
        MVTDSITNASISLAP+ARDLPLPKKKRV+N+KLKQSKLDVRREQWLSRGAVKNKKWNEED+RLDSVVIKTSKDDNLSSKKINMRE+G +N GSVHHDSDF
Subjt:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLTEEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG--------------
        SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGS+T   EEEGDDD LDDWEAIADALAATDKQHDQ SE SP G              
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLTEEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG--------------

Query:  ------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFC
                          RAP++ RAWRPDDAFRPQSLPTLSKQLSLPNTDRR+GCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFC
Subjt:  ------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFC

Query:  HKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMIS
        HKRILEEDGRCPGCRKPYE DPAD+ETNVHGGS T PLARSCSMIS
Subjt:  HKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMIS

A0A6J1JM47 uncharacterized protein LOC1114856375.5e-14477.75Show/hide
Query:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF
        MVTDSI NASISLAPNARDLPL KK+R RN+KLKQSKLDVRREQWLSRGA+KNKKWNEED+RLDS V+KT +DDNLSSKKIN RE+GEE+  SVHH SD 
Subjt:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLTEEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG--------------
        SESPSNSPPS+ SSILGGNDSG HFTGSSSSSSCRSSSSGCRSGS+TEEEE+E DDDCL+DWEA AD L ATDKQHDQ SE SP G              
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLTEEEEEEGDDDCLDDWEAIADALAATDKQHDQFSE-SPLG--------------

Query:  ------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFC
                          RA +N RAWRPDDA RPQSLPTLSKQLSLPNTDRRFGCG +PWACGGV+ VPTSCPICFEDLDLTDS+FLPC CGFRLCLFC
Subjt:  ------------------RAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFC

Query:  HKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMIS
        HKRILEEDGRCPGCRKPY+RDPADNETN H GS TFPLARSCSMIS
Subjt:  HKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMIS

SwissProt top hitse value%identityAlignment
O95628 CCR4-NOT transcription complex subunit 43.7e-1252.73Show/hide
Query:  PTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKRI-LEEDGRCPGCRKPYERDPA
        P  CP+C E L++ D +F PC CG+++C FC  RI  +E+G CP CRKPY  DPA
Subjt:  PTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKRI-LEEDGRCPGCRKPYERDPA

P34909 General negative regulator of transcription subunit 42.9e-0948Show/hide
Query:  CPICFEDLDLTDSSFLPCFCGFRLCLFCHKRIL---EEDGRCPGCRKPYE
        CP+C E +D+TD +F PC CG+++C FC+  I    E +GRCP CR+ Y+
Subjt:  CPICFEDLDLTDSSFLPCFCGFRLCLFCHKRIL---EEDGRCPGCRKPYE

Q09818 Putative general negative regulator of transcription C16C9.04c3.8e-0948Show/hide
Query:  CPICFEDLDLTDSSFLPCFCGFRLCLFCHKRILEE-DGRCPGCRKPYERD
        CP+C E++D++D +F PC CG+R+C FC   I E+ +GRCP CR+ Y  +
Subjt:  CPICFEDLDLTDSSFLPCFCGFRLCLFCHKRILEE-DGRCPGCRKPYERD

Q8BT14 CCR4-NOT transcription complex subunit 43.7e-1252.73Show/hide
Query:  PTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKRI-LEEDGRCPGCRKPYERDPA
        P  CP+C E L++ D +F PC CG+++C FC  RI  +E+G CP CRKPY  DPA
Subjt:  PTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKRI-LEEDGRCPGCRKPYERDPA

Arabidopsis top hitse value%identityAlignment
AT1G74870.1 RING/U-box superfamily protein2.1e-1831.05Show/hide
Query:  KKRVRN--AKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDFSESPSNSPPSLASSILGGNDSG
        KKR  N   KLKQ K+D RR+QW+S+    N    E   RL S++ K +   +  + +I+  +  E++   +   S F+ SP        +S+L   DS 
Subjt:  KKRVRN--AKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDFSESPSNSPPSLASSILGGNDSG

Query:  PHFTGSSSSSSCRSSSSGCRSGSLTEEEEEEGDDDCLDDWEAIADALAATDKQHDQFSESPLGRAPINFRAWRPDDAFR---------PQSLPTL---SK
                     S    C S  +TEEEEE   DD  D+W+   DAL + +  +++ S               PD + R         P +  T+   S 
Subjt:  PHFTGSSSSSSCRSSSSGCRSGSLTEEEEEEGDDDCLDDWEAIADALAATDKQHDQFSESPLGRAPINFRAWRPDDAFR---------PQSLPTL---SK

Query:  QLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKRILEEDGRCPGCRKPYER
        +    N++++ G G               CPIC E +D TD  F PC CGFR+CLFCH +I E + RCP CRK Y++
Subjt:  QLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKRILEEDGRCPGCRKPYER

AT3G45630.1 RNA binding (RRM/RBD/RNP motifs) family protein2.7e-1041.07Show/hide
Query:  SCPICFEDLDLTDSSFLPCFCGFRLCLFCHKRIL------EEDGRCPGCRKPYERD
        +CP+C E++DLTD    PC CG+++C++C   I+      + +GRCP CR PY+++
Subjt:  SCPICFEDLDLTDSSFLPCFCGFRLCLFCHKRIL------EEDGRCPGCRKPYERD

AT3G48070.1 RING/U-box superfamily protein3.3e-6448.09Show/hide
Query:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF
        M++DSITNAS + AP ARD    KK+  R+AKLKQSKL +RREQWLS+ A+ NK   +++    +  I + K D     +  +  + E+N G+  H+S F
Subjt:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLTEEEEEEGDDDCLDDWEAIADALAATDKQHDQFSESP---------------LG
         ES SNSP    +SIL G +S P+F+ SSS     S S G  SG++TEEE+ + DD CLDDWEAIADALAA D++H++  E+P               +G
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLTEEEEEEGDDDCLDDWEAIADALAATDKQHDQFSESP---------------LG

Query:  RAPINFR---------------AWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKR
         A +  R               AWRPDD  RPQ LP L KQ S P  +  F           V  VP+SCPIC+EDLDLTDS+FLPC CGFRLCLFCHK 
Subjt:  RAPINFR---------------AWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKR

Query:  ILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSM
        I + DGRCPGCRKPYER+    E +V GG  T  LARS SM
Subjt:  ILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSM

AT3G48070.2 RING/U-box superfamily protein3.3e-6448.09Show/hide
Query:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF
        M++DSITNAS + AP ARD    KK+  R+AKLKQSKL +RREQWLS+ A+ NK   +++    +  I + K D     +  +  + E+N G+  H+S F
Subjt:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLTEEEEEEGDDDCLDDWEAIADALAATDKQHDQFSESP---------------LG
         ES SNSP    +SIL G +S P+F+ SSS     S S G  SG++TEEE+ + DD CLDDWEAIADALAA D++H++  E+P               +G
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLTEEEEEEGDDDCLDDWEAIADALAATDKQHDQFSESP---------------LG

Query:  RAPINFR---------------AWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKR
         A +  R               AWRPDD  RPQ LP L KQ S P  +  F           V  VP+SCPIC+EDLDLTDS+FLPC CGFRLCLFCHK 
Subjt:  RAPINFR---------------AWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKR

Query:  ILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSM
        I + DGRCPGCRKPYER+    E +V GG  T  LARS SM
Subjt:  ILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSM

AT5G62910.1 RING/U-box superfamily protein2.5e-6446.76Show/hide
Query:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF
        M+TDSITNAS + AP  RD    KK+  ++AK+KQ+KL +RREQWLS+ AV NK+  EE S     V ++ K  + SS K+   E        +HH+S F
Subjt:  MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLTEEEEEEGDDD-CLDDWEAIADALAATDK----------QHDQFS---------
         ESPSNS        +GG  S  +F+G SS SS  SSSSG  SG++TEEE  + DDD C+DDWEA+ADALAA ++            +Q S         
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLTEEEEEEGDDD-CLDDWEAIADALAATDK----------QHDQFS---------

Query:  -------------ESP-------LGRAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCG
                     E P         R   + RAWR DD  RPQ LP L+KQLS P  D+RF            +++P+SCPIC+EDLDLTDS+FLPC CG
Subjt:  -------------ESP-------LGRAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRRFGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCG

Query:  FRLCLFCHKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMISSF
        FRLCLFCHK I + DGRCPGCRKPYER+    ET++ GG  T  LARS SM   F
Subjt:  FRLCLFCHKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMISSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTACTGATTCTATCACCAATGCTTCAATTTCTTTGGCCCCAAACGCCCGGGATTTACCCTTGCCCAAGAAGAAGAGGGTCAGGAATGCCAAATTGAAGCAGTCCAA
GCTTGATGTTCGTCGGGAACAATGGCTTTCTCGAGGTGCTGTAAAGAACAAGAAATGGAACGAAGAAGACAGTCGCCTTGATTCAGTAGTCATTAAAACGAGCAAAGACG
ATAATCTTTCCTCCAAGAAAATAAATATGAGGGAAGTAGGAGAAGAGAATGCCGGATCGGTCCATCACGACAGCGATTTTTCAGAGTCGCCGTCTAATAGCCCCCCTAGT
CTTGCCAGTAGTATCTTGGGTGGTAATGATTCGGGTCCTCATTTTACTGGGAGTAGCAGTAGCAGTAGCTGCCGTAGCAGCTCTAGTGGTTGTCGCTCGGGGAGTCTAAC
GGAGGAGGAGGAGGAAGAAGGAGATGATGACTGCTTGGATGACTGGGAGGCTATTGCCGATGCCCTAGCCGCCACTGACAAGCAACACGATCAATTTTCCGAGTCTCCCC
TAGGGAGGGCTCCAATAAATTTCCGAGCATGGAGGCCTGATGATGCCTTTCGTCCTCAGAGTTTGCCCACTTTATCGAAGCAGCTTAGTTTACCGAATACAGACCGGCGT
TTCGGATGTGGAGGCATCCCTTGGGCCTGTGGTGGTGTCATGTCTGTGCCTACATCTTGTCCAATTTGCTTTGAAGACTTGGACCTTACAGACTCAAGTTTTTTACCCTG
TTTCTGTGGATTCCGGCTGTGCCTTTTCTGTCATAAGAGGATTCTTGAAGAGGATGGGCGCTGTCCTGGGTGCAGGAAGCCATATGAGCGTGATCCTGCTGACAACGAGA
CAAATGTACATGGTGGCAGCTCAACGTTCCCATTGGCTCGTTCTTGTAGCATGATTTCAAGCTTCCTCTGGTATGGAGGTATTAGCCACTTCTATACTAGGTATTCATTT
GCTGTTGATATTGACCTGGCTGGTGCAAACGTGTTTCTAAAATTTAGGATGCACCTTCAATTGTTTTCTAATTTCCCAATATCTTCTAGAAGACGAACCAGACCTATCTC
CATGAACAAGGCCCCTGCTCCTCAGCAAAGGTGGTCGGCTGTTTCGGTCGGAGATTTCAAGCGTACAGTCTGGTTTTGCTTTGGATACGAAGCTCCTCAACCTTTGACCA
AAGACTTGATTCGTCACTGGGCTCTATCATCTTTTGATTATAGGCTCAACGAGACGGCGAAAGCTGGTTGCAAGTTGTATCCATTCGTCATTGCCCTCGACATCTGGGTT
GGTCACCACAACCGCTTTCCCACTCTCAGTACAGAAAATGTAAGTGCCAAAAGGCCTGCGGCATTGCTTGCCACAGCCGACGCATTTCTCATCACCGTCATGAAGAATGT
GGTAGCCACATTCCCGTTCAAGGCGGACCCGAGTGATCTCGAGTTCAAGGAGTTGAGGATGGTCTGCAATTGTGGTTGCAGGGAAGGTAGAGGAGCGAGATGGATTCGTG
GGACGATGTCGTACCTCATGACGAAAGCCAAATGGTTGCAAGAATGGCTATCGGACCTCCGGCTGAGTGGCCGGCAAACACCACCTGCTTATTCACTTTCACAACCTGAT
AAACAACATCCGATCACCAACAGTCGAAGAAGCCATGGAATTGAGGTGGGCATAATCGCCGACGCCGATGCTCCTGATGGAAGGGAACAGCTGACTATTGATCTCTTTTC
TCCAAAAGATTCGCCGGAAGCAGAAAACCAGCCATCCGGCGACCAAGATCCGGGGAAACTGATGATGGAGGAGTCGCGGTTCTTGTCGACGAGAAACGACTTCTCCGGGT
GCTTGTGGGCTTTAATGGCGGACGAGCAAGCATTTCGGAAGAGCTCTTCCTTCAACCCGGTAGCGTCTTCCAGCCTCGTTCCAACCATCTTCTTCAGCTCCGATGATGCT
CTCTGCTCCTGCTCTGCTCTGCTTGAGGTTGACGAAGACTCAGAGTCTTCGGCATATATGAATATGATAAGTTACCGTTTACCTGAGATAGTTGCAGAAATCCTTGGCAA
TGATCGAAAGGTGGCGGCGACGGCGTCGGAGACGACAGGGGCTGTGGTCATGATCAAGACGGAGCTTCCTCTTCCCTCTATCTTCTCCAGCTTCTCCTTTCATGATGAAA
TCTCTCAATCGTTTCACATCAGACAGCTTCACTGGCTTCACTATGTATTCTTCAGCCCCCTCCTTCAAGCAACTGCTATCATCAACGGCGAGAACATGAAGTTGCCGCGA
ACTACTCTGGTTCGAATCATCGACAACCTGACTCCGTCGGTGGGGTTTCGCCGTATGAGCTCTCCAACCGTCGCCATGGCCGGAAAAAACAGTCTGTCGGAGGGGCCTGC
AGCGGGAGCAGAGAATAAGGAAGCGGATGGGGAGAGATGGGGTCATGTTATAAGTGGGAAATGCGAGAGAATCCAAATCCCACCGCAGAGAAGATTAGGCCCACTTGTCA
ATTGTGGCGCTGATCAGCATCAGTCACCAAAGCCAAGCACTGTCAGATCAGAGCACTGTGCAGATTGGCCGTACGCTCTATATGCCGCGCTGTCCACTCGGTCTCGGCCA
TCGTTTGGGCACACCATTGACTTGACCATCCTATTCATTTTTTCAACTCCCTTTTTGCCCATTTTTCCAGGAGAGCGAGGGGAGGGGATAGGGGGGACGCAACAGAGAAT
GGACGGTGGTGCAACATTCGATTCTGTTTGGGAATTGGATACGCAGATTTCTTTGGTCCAATCGGTTGTAGTCGACCTCCATCTGAGCTCGTCTGATCTTACCAATGTTC
ATCTGTGTCCTAGCACTGCATAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTACTGATTCTATCACCAATGCTTCAATTTCTTTGGCCCCAAACGCCCGGGATTTACCCTTGCCCAAGAAGAAGAGGGTCAGGAATGCCAAATTGAAGCAGTCCAA
GCTTGATGTTCGTCGGGAACAATGGCTTTCTCGAGGTGCTGTAAAGAACAAGAAATGGAACGAAGAAGACAGTCGCCTTGATTCAGTAGTCATTAAAACGAGCAAAGACG
ATAATCTTTCCTCCAAGAAAATAAATATGAGGGAAGTAGGAGAAGAGAATGCCGGATCGGTCCATCACGACAGCGATTTTTCAGAGTCGCCGTCTAATAGCCCCCCTAGT
CTTGCCAGTAGTATCTTGGGTGGTAATGATTCGGGTCCTCATTTTACTGGGAGTAGCAGTAGCAGTAGCTGCCGTAGCAGCTCTAGTGGTTGTCGCTCGGGGAGTCTAAC
GGAGGAGGAGGAGGAAGAAGGAGATGATGACTGCTTGGATGACTGGGAGGCTATTGCCGATGCCCTAGCCGCCACTGACAAGCAACACGATCAATTTTCCGAGTCTCCCC
TAGGGAGGGCTCCAATAAATTTCCGAGCATGGAGGCCTGATGATGCCTTTCGTCCTCAGAGTTTGCCCACTTTATCGAAGCAGCTTAGTTTACCGAATACAGACCGGCGT
TTCGGATGTGGAGGCATCCCTTGGGCCTGTGGTGGTGTCATGTCTGTGCCTACATCTTGTCCAATTTGCTTTGAAGACTTGGACCTTACAGACTCAAGTTTTTTACCCTG
TTTCTGTGGATTCCGGCTGTGCCTTTTCTGTCATAAGAGGATTCTTGAAGAGGATGGGCGCTGTCCTGGGTGCAGGAAGCCATATGAGCGTGATCCTGCTGACAACGAGA
CAAATGTACATGGTGGCAGCTCAACGTTCCCATTGGCTCGTTCTTGTAGCATGATTTCAAGCTTCCTCTGGTATGGAGGTATTAGCCACTTCTATACTAGGTATTCATTT
GCTGTTGATATTGACCTGGCTGGTGCAAACGTGTTTCTAAAATTTAGGATGCACCTTCAATTGTTTTCTAATTTCCCAATATCTTCTAGAAGACGAACCAGACCTATCTC
CATGAACAAGGCCCCTGCTCCTCAGCAAAGGTGGTCGGCTGTTTCGGTCGGAGATTTCAAGCGTACAGTCTGGTTTTGCTTTGGATACGAAGCTCCTCAACCTTTGACCA
AAGACTTGATTCGTCACTGGGCTCTATCATCTTTTGATTATAGGCTCAACGAGACGGCGAAAGCTGGTTGCAAGTTGTATCCATTCGTCATTGCCCTCGACATCTGGGTT
GGTCACCACAACCGCTTTCCCACTCTCAGTACAGAAAATGTAAGTGCCAAAAGGCCTGCGGCATTGCTTGCCACAGCCGACGCATTTCTCATCACCGTCATGAAGAATGT
GGTAGCCACATTCCCGTTCAAGGCGGACCCGAGTGATCTCGAGTTCAAGGAGTTGAGGATGGTCTGCAATTGTGGTTGCAGGGAAGGTAGAGGAGCGAGATGGATTCGTG
GGACGATGTCGTACCTCATGACGAAAGCCAAATGGTTGCAAGAATGGCTATCGGACCTCCGGCTGAGTGGCCGGCAAACACCACCTGCTTATTCACTTTCACAACCTGAT
AAACAACATCCGATCACCAACAGTCGAAGAAGCCATGGAATTGAGGTGGGCATAATCGCCGACGCCGATGCTCCTGATGGAAGGGAACAGCTGACTATTGATCTCTTTTC
TCCAAAAGATTCGCCGGAAGCAGAAAACCAGCCATCCGGCGACCAAGATCCGGGGAAACTGATGATGGAGGAGTCGCGGTTCTTGTCGACGAGAAACGACTTCTCCGGGT
GCTTGTGGGCTTTAATGGCGGACGAGCAAGCATTTCGGAAGAGCTCTTCCTTCAACCCGGTAGCGTCTTCCAGCCTCGTTCCAACCATCTTCTTCAGCTCCGATGATGCT
CTCTGCTCCTGCTCTGCTCTGCTTGAGGTTGACGAAGACTCAGAGTCTTCGGCATATATGAATATGATAAGTTACCGTTTACCTGAGATAGTTGCAGAAATCCTTGGCAA
TGATCGAAAGGTGGCGGCGACGGCGTCGGAGACGACAGGGGCTGTGGTCATGATCAAGACGGAGCTTCCTCTTCCCTCTATCTTCTCCAGCTTCTCCTTTCATGATGAAA
TCTCTCAATCGTTTCACATCAGACAGCTTCACTGGCTTCACTATGTATTCTTCAGCCCCCTCCTTCAAGCAACTGCTATCATCAACGGCGAGAACATGAAGTTGCCGCGA
ACTACTCTGGTTCGAATCATCGACAACCTGACTCCGTCGGTGGGGTTTCGCCGTATGAGCTCTCCAACCGTCGCCATGGCCGGAAAAAACAGTCTGTCGGAGGGGCCTGC
AGCGGGAGCAGAGAATAAGGAAGCGGATGGGGAGAGATGGGGTCATGTTATAAGTGGGAAATGCGAGAGAATCCAAATCCCACCGCAGAGAAGATTAGGCCCACTTGTCA
ATTGTGGCGCTGATCAGCATCAGTCACCAAAGCCAAGCACTGTCAGATCAGAGCACTGTGCAGATTGGCCGTACGCTCTATATGCCGCGCTGTCCACTCGGTCTCGGCCA
TCGTTTGGGCACACCATTGACTTGACCATCCTATTCATTTTTTCAACTCCCTTTTTGCCCATTTTTCCAGGAGAGCGAGGGGAGGGGATAGGGGGGACGCAACAGAGAAT
GGACGGTGGTGCAACATTCGATTCTGTTTGGGAATTGGATACGCAGATTTCTTTGGTCCAATCGGTTGTAGTCGACCTCCATCTGAGCTCGTCTGATCTTACCAATGTTC
ATCTGTGTCCTAGCACTGCATAG
Protein sequenceShow/hide protein sequence
MVTDSITNASISLAPNARDLPLPKKKRVRNAKLKQSKLDVRREQWLSRGAVKNKKWNEEDSRLDSVVIKTSKDDNLSSKKINMREVGEENAGSVHHDSDFSESPSNSPPS
LASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSLTEEEEEEGDDDCLDDWEAIADALAATDKQHDQFSESPLGRAPINFRAWRPDDAFRPQSLPTLSKQLSLPNTDRR
FGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKRILEEDGRCPGCRKPYERDPADNETNVHGGSSTFPLARSCSMISSFLWYGGISHFYTRYSF
AVDIDLAGANVFLKFRMHLQLFSNFPISSRRRTRPISMNKAPAPQQRWSAVSVGDFKRTVWFCFGYEAPQPLTKDLIRHWALSSFDYRLNETAKAGCKLYPFVIALDIWV
GHHNRFPTLSTENVSAKRPAALLATADAFLITVMKNVVATFPFKADPSDLEFKELRMVCNCGCREGRGARWIRGTMSYLMTKAKWLQEWLSDLRLSGRQTPPAYSLSQPD
KQHPITNSRRSHGIEVGIIADADAPDGREQLTIDLFSPKDSPEAENQPSGDQDPGKLMMEESRFLSTRNDFSGCLWALMADEQAFRKSSSFNPVASSSLVPTIFFSSDDA
LCSCSALLEVDEDSESSAYMNMISYRLPEIVAEILGNDRKVAATASETTGAVVMIKTELPLPSIFSSFSFHDEISQSFHIRQLHWLHYVFFSPLLQATAIINGENMKLPR
TTLVRIIDNLTPSVGFRRMSSPTVAMAGKNSLSEGPAAGAENKEADGERWGHVISGKCERIQIPPQRRLGPLVNCGADQHQSPKPSTVRSEHCADWPYALYAALSTRSRP
SFGHTIDLTILFIFSTPFLPIFPGERGEGIGGTQQRMDGGATFDSVWELDTQISLVQSVVVDLHLSSSDLTNVHLCPSTA