; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g0156 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g0156
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionRING-type domain-containing protein
Genome locationMC06:1239541..1245875
RNA-Seq ExpressionMC06g0156
SyntenyMC06g0156
Gene Ontology termsGO:0016567 - protein ubiquitination (biological process)
GO:0030014 - CCR4-NOT complex (cellular component)
GO:0004842 - ubiquitin-protein transferase activity (molecular function)
InterPro domainsIPR001841 - Zinc finger, RING-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR039515 - NOT4, modified RING finger, HC subclass (C4C4-type)
IPR039780 - CCR4-NOT transcription complex subunit 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057903.1 suppressor protein SRP40 isoform X2 [Cucumis melo var. makuwa]3.39e-21989.08Show/hide
Query:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF
        MVTDSI NASISLAP+ARDLPLPKKKRV+NSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKT KDDNLSS+KINMREIG +N  SVHH SD+
Subjt:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE----GDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSR
        S+SPSNSPPSL SSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE    GDDD LDDWEAIADALAATDKQ DQCSESSPRG+ VSQLDSCGD R
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE----GDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSR

Query:  NELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
        NELGVGDGDS +E GRI QRA M+CRAWRPDDAFRPQSLPTLSKQLSLPNTDRR+GCGG+PWACGGV+ VPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
Subjt:  NELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF

Query:  CHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISR
        CHKRILEEDGRCPGCRKPY+ DPAD+ETN+  GSPTLPLARSCSMISR
Subjt:  CHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISR

XP_004138290.2 uncharacterized protein LOC101211244 [Cucumis sativus]1.54e-21988.54Show/hide
Query:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF
        MVTDSI NASISLAP+ARDLPLPKKKRV+NSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKT KDD+LSS+KINMREIG +N  SVHH SD+
Subjt:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE----GDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSR
        S+SPSNSPPSL SSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE    GDDD LDDWEAIADALAATDKQHDQCSESSPRG+ +SQLDSCGD R
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE----GDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSR

Query:  NELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
        NELGVGDGDS +E GRI QRA M+CRAWRPDDAFRPQSLPTLSKQLSLP TDRR+GCGG+ WACGGV+ VPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
Subjt:  NELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF

Query:  CHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISRS
        CHKRILEEDGRCPGCRKPY+ DPAD+ETNV  GSPTLPLARSCSMISRS
Subjt:  CHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISRS

XP_008453149.1 PREDICTED: uncharacterized protein LOC103493947 [Cucumis melo]6.57e-22189.11Show/hide
Query:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF
        MVTDSI NASISLAP+ARDLPLPKKKRV+NSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKT KDDNLSS+KINMREIG +N  SVHH SD+
Subjt:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE----GDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSR
        S+SPSNSPPSL SSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE    GDDD LDDWEAIADALAATDKQ DQCSESSPRG+ VSQLDSCGD R
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE----GDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSR

Query:  NELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
        NELGVGDGDS +E GRI QRA M+CRAWRPDDAFRPQSLPTLSKQLSLPNTDRR+GCGG+PWACGGV+ VPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
Subjt:  NELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF

Query:  CHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISRS
        CHKRILEEDGRCPGCRKPY+ DPAD+ETN+  GSPTLPLARSCSMISRS
Subjt:  CHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISRS

XP_022134843.1 uncharacterized protein LOC111007016 [Momordica charantia]5.35e-251100Show/hide
Query:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF
        MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF
Subjt:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEEGDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSRNELG
        SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEEGDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSRNELG
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEEGDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSRNELG

Query:  VGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKR
        VGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKR
Subjt:  VGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKR

Query:  ILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISRS
        ILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISRS
Subjt:  ILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISRS

XP_038878939.1 uncharacterized protein LOC120071026 [Benincasa hispida]1.53e-21788.57Show/hide
Query:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF
        MVTDSI NASISLAP+ARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKT +DDNLSSKKINMR+IG +N  SVH  SDF
Subjt:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE----GDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSR
        S+SPSNSPPSL SSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE    GDDD LDDWEAIADALAATDKQHDQCSESSPRG+ VSQLDSCGDSR
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE----GDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSR

Query:  NELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGG-IPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCL
        NELGVGDGDS ++ GRI QRA M+CRAWRPDDAFRPQSLP LSKQLSLPNTDRR+GCGG IPW CGGV+ VPTSCPICFEDLDLTDSSFLPCFCGFRLCL
Subjt:  NELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGG-IPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCL

Query:  FCHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISRS
        FCHKRI+EEDGRCPGCRKPY+ DPADSETNV GGSP   LARSCSMISRS
Subjt:  FCHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISRS

TrEMBL top hitse value%identityAlignment
A0A1S3BVJ1 uncharacterized protein LOC1034939473.18e-22189.11Show/hide
Query:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF
        MVTDSI NASISLAP+ARDLPLPKKKRV+NSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKT KDDNLSS+KINMREIG +N  SVHH SD+
Subjt:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE----GDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSR
        S+SPSNSPPSL SSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE    GDDD LDDWEAIADALAATDKQ DQCSESSPRG+ VSQLDSCGD R
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE----GDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSR

Query:  NELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
        NELGVGDGDS +E GRI QRA M+CRAWRPDDAFRPQSLPTLSKQLSLPNTDRR+GCGG+PWACGGV+ VPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
Subjt:  NELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF

Query:  CHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISRS
        CHKRILEEDGRCPGCRKPY+ DPAD+ETN+  GSPTLPLARSCSMISRS
Subjt:  CHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISRS

A0A5A7UPV5 Suppressor protein SRP40 isoform X21.64e-21989.08Show/hide
Query:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF
        MVTDSI NASISLAP+ARDLPLPKKKRV+NSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKT KDDNLSS+KINMREIG +N  SVHH SD+
Subjt:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE----GDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSR
        S+SPSNSPPSL SSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE    GDDD LDDWEAIADALAATDKQ DQCSESSPRG+ VSQLDSCGD R
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE----GDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSR

Query:  NELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
        NELGVGDGDS +E GRI QRA M+CRAWRPDDAFRPQSLPTLSKQLSLPNTDRR+GCGG+PWACGGV+ VPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
Subjt:  NELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF

Query:  CHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISR
        CHKRILEEDGRCPGCRKPY+ DPAD+ETN+  GSPTLPLARSCSMISR
Subjt:  CHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISR

A0A5D3BIS7 Suppressor protein SRP40 isoform X21.03e-21789.05Show/hide
Query:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF
        MVTDSI NASISLAP+ARDLPLPKKKRV+NSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKT KDDNLSS+KINMREIG +N  SVHH SD+
Subjt:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE----GDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSR
        S+SPSNSPPSL SSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE    GDDD LDDWEAIADALAATDKQ DQCSESSPRG+ VSQLDSCGD R
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE----GDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSR

Query:  NELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
        NELGVGDGDS +E GRI QRA M+CRAWRPDDAFRPQSLPTLSKQLSLPNTDRR+GCGG+PWACGGV+ VPTSCPICFEDLDLTDSSFLPCFCGFRLCLF
Subjt:  NELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLF

Query:  CHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMIS
        CHKRILEEDGRCPGCRKPY+ DPAD+ETN+  GSPTLPLARSCSMIS
Subjt:  CHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMIS

A0A6J1C357 uncharacterized protein LOC1110070162.59e-251100Show/hide
Query:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF
        MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF
Subjt:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEEGDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSRNELG
        SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEEGDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSRNELG
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEEGDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSRNELG

Query:  VGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKR
        VGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKR
Subjt:  VGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKR

Query:  ILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISRS
        ILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISRS
Subjt:  ILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISRS

A0A6J1JM47 uncharacterized protein LOC1114856371.29e-20684.48Show/hide
Query:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF
        MVTDSI NASISLAP+ARDLPL KK+R +NSKLKQSKLDVRREQWLSRGA+KNKKWNEEDNRLDS V+KT +DDNLSSKKIN REIG ++  SVHH SD 
Subjt:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEEGD---DDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSRN
        SESPSNSPPS+ SSILGGNDSG HFTGSSSSSSCRSSSSGCRSGSITEEE D   DD L+DWEA AD L ATDKQHDQCSESSPRGD VSQL SCGDSRN
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEEGD---DDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSRN

Query:  ELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFC
        ELGVGDGDS +E GRIA+RA M+ RAWRPDDA RPQSLPTLSKQLSLPNTDRR+GCG +PWACGGV+ VPTSCPICFEDLDLTDS+FLPC CGFRLCLFC
Subjt:  ELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFC

Query:  HKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISRS
        HKRILEEDGRCPGCRKPY+ DPAD+ETN H GSPT PLARSCSMISRS
Subjt:  HKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISRS

SwissProt top hitse value%identityAlignment
A2YU42 Cellulose synthase-like protein D21.7e-0437.93Show/hide
Query:  DLTDSSFLPCFCGFRLCLFCHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPL
        D      LPC C F++C  C    ++  G CPGC+ PY+    D   +V G  PTL L
Subjt:  DLTDSSFLPCFCGFRLCLFCHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPL

O95628 CCR4-NOT transcription complex subunit 43.9e-1252.73Show/hide
Query:  PTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKRI-LEEDGRCPGCRKPYECDPA
        P  CP+C E L++ D +F PC CG+++C FC  RI  +E+G CP CRKPY  DPA
Subjt:  PTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKRI-LEEDGRCPGCRKPYECDPA

P34909 General negative regulator of transcription subunit 41.1e-0948Show/hide
Query:  CPICFEDLDLTDSSFLPCFCGFRLCLFCHKRIL---EEDGRCPGCRKPYE
        CP+C E +D+TD +F PC CG+++C FC+  I    E +GRCP CR+ Y+
Subjt:  CPICFEDLDLTDSSFLPCFCGFRLCLFCHKRIL---EEDGRCPGCRKPYE

Q09818 Putative general negative regulator of transcription C16C9.04c1.4e-0951.06Show/hide
Query:  CPICFEDLDLTDSSFLPCFCGFRLCLFCHKRILEE-DGRCPGCRKPY
        CP+C E++D++D +F PC CG+R+C FC   I E+ +GRCP CR+ Y
Subjt:  CPICFEDLDLTDSSFLPCFCGFRLCLFCHKRILEE-DGRCPGCRKPY

Q8BT14 CCR4-NOT transcription complex subunit 43.9e-1252.73Show/hide
Query:  PTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKRI-LEEDGRCPGCRKPYECDPA
        P  CP+C E L++ D +F PC CG+++C FC  RI  +E+G CP CRKPY  DPA
Subjt:  PTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKRI-LEEDGRCPGCRKPYECDPA

Arabidopsis top hitse value%identityAlignment
AT1G74870.1 RING/U-box superfamily protein3.8e-1527.83Show/hide
Query:  KKRVKN--SKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDFSESPSNSPPSLASSILGGNDSG
        KKR  N   KLKQ K+D RR+QW+S+    N    E   RL S++      + L+ +K    +   D    +  D + + S ++SP    +S+L   DS 
Subjt:  KKRVKN--SKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDFSESPSNSPPSLASSILGGNDSG

Query:  PHFTGSSSSSSCRSSSSGCRSGSITEEEGD--DDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSRNELGVGDGDSKLECGRIAQRAPMS
                     S    C S  +TEEE +  DD  D+W+   DAL + +  +++ S          + D   D+   +         +C +  + AP +
Subjt:  PHFTGSSSSSSCRSSSSGCRSGSITEEEGD--DDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSRNELGVGDGDSKLECGRIAQRAPMS

Query:  CRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKRILEEDGRCPGCRKPYECDPA
                     ++   S +    N++++ G G               CPIC E +D TD  F PC CGFR+CLFCH +I E + RCP CRK Y+    
Subjt:  CRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKRILEEDGRCPGCRKPYECDPA

Query:  DSET--NVHGGSPTLPLARSCSMISRS
         S        G  T+PL+ S   + R+
Subjt:  DSET--NVHGGSPTLPLARSCSMISRS

AT3G48070.1 RING/U-box superfamily protein2.0e-6449Show/hide
Query:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNK--KWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDS
        M++DSITNAS + AP+ARD    KK+  +++KLKQSKL +RREQWLS+ A+ NK  K   E NR     I + K D     +  +  +  DN G+  H+S
Subjt:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNK--KWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDS

Query:  DFSESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE-GDDDG-LDDWEAIADALAATDKQHDQCS--ESSPRGDAVSQLDSCGD
         F ES SNSP    +SIL G +S P+F+ SSS     S S G  SG+ITEEE  DDDG LDDWEAIADALAA D++H++ +  ES    + + Q    G 
Subjt:  DFSESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE-GDDDG-LDDWEAIADALAATDKQHDQCS--ESSPRGDAVSQLDSCGD

Query:  SRNELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLC
        +   L     DS+       +R   S +AWRPDD  RPQ LP L KQ S P  +  +           V  VP+SCPIC+EDLDLTDS+FLPC CGFRLC
Subjt:  SRNELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLC

Query:  LFCHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISRS
        LFCHK I + DGRCPGCRKPYE +    E +V GG  T+ LARS SM  RS
Subjt:  LFCHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISRS

AT3G48070.2 RING/U-box superfamily protein5.8e-6448.86Show/hide
Query:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNK--KWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDS
        M++DSITNAS + AP+ARD    KK+  +++KLKQSKL +RREQWLS+ A+ NK  K   E NR     I + K D     +  +  +  DN G+  H+S
Subjt:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNK--KWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDS

Query:  DFSESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE-GDDDG-LDDWEAIADALAATDKQHDQCS--ESSPRGDAVSQLDSCGD
         F ES SNSP    +SIL G +S P+F+ SSS     S S G  SG+ITEEE  DDDG LDDWEAIADALAA D++H++ +  ES    + + Q    G 
Subjt:  DFSESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEE-GDDDG-LDDWEAIADALAATDKQHDQCS--ESSPRGDAVSQLDSCGD

Query:  SRNELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLC
        +   L     DS+       +R   S +AWRPDD  RPQ LP L KQ S P  +  +           V  VP+SCPIC+EDLDLTDS+FLPC CGFRLC
Subjt:  SRNELGVGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLC

Query:  LFCHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISR
        LFCHK I + DGRCPGCRKPYE +    E +V GG  T+ LARS SM  R
Subjt:  LFCHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISR

AT5G60170.1 RNA binding (RRM/RBD/RNP motifs) family protein1.7e-1037.84Show/hide
Query:  SCPICFEDLDLTDSSFLPCFCGFRLCLFCHKRILEE------DGRCPGCRKPYE----------CDPADSETNV
        +CP+C E++DLTD    PC CG+++C++C   I++       +GRCP CR PY+          CD   SE N+
Subjt:  SCPICFEDLDLTDSSFLPCFCGFRLCLFCHKRILEE------DGRCPGCRKPYE----------CDPADSETNV

AT5G62910.1 RING/U-box superfamily protein1.0e-6849.15Show/hide
Query:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF
        M+TDSITNAS + AP  RD    KK+  K++K+KQ+KL +RREQWLS+ AV NK+  EE +     V ++ K  + SS K+   E        +HH+S F
Subjt:  MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDF

Query:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEEG---DDDG-LDDWEAIADALAATD--KQHDQCSESSPRGDAVSQLDS--C
         ESPSNS        +GG  S  +F+G SS SS  SSSSG  SG+ITEEE    DDDG +DDWEA+ADALAA +  ++  +  ES     +V Q  S  C
Subjt:  SESPSNSPPSLASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEEG---DDDG-LDDWEAIADALAATD--KQHDQCSESSPRGDAVSQLDS--C

Query:  GDSRNELG--VGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCG
          S ++    VG  D K EC R++ R   S RAWR DD  RPQ LP L+KQLS P  D+R+            +++P+SCPIC+EDLDLTDS+FLPC CG
Subjt:  GDSRNELG--VGDGDSKLECGRIAQRAPMSCRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCG

Query:  FRLCLFCHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISR
        FRLCLFCHK I + DGRCPGCRKPYE +   +ET++ GG  T+ LARS SM  +
Subjt:  FRLCLFCHKRILEEDGRCPGCRKPYECDPADSETNVHGGSPTLPLARSCSMISR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAACTGATTCGATCACCAATGCTTCAATTTCCTTGGCCCCAAGCGCCCGGGATTTACCCTTGCCCAAGAAGAAGAGGGTCAAGAATTCCAAATTGAAGCAGTCCAA
GCTTGATGTTCGTCGGGAACAATGGCTTTCTCGAGGTGCTGTAAAGAACAAAAAATGGAACGAAGAAGACAATCGCCTTGATTCAGTGGTCATTAAAACGAGCAAAGACG
ATAATCTTTCCTCCAAGAAAATAAATATGAGGGAAATAGGAGGAGACAATGGTGGATCGGTTCACCACGACAGCGATTTTTCGGAGTCGCCGTCTAACAGCCCCCCTAGT
CTTGCCAGTAGTATCTTGGGTGGTAATGATTCCGGTCCTCATTTTACTGGGAGTAGCAGTAGCAGTAGCTGCCGTAGCAGCTCTAGCGGTTGTCGCTCAGGGAGTATAAC
GGAGGAGGAAGGAGATGATGATGGCTTGGATGACTGGGAGGCTATTGCTGATGCTCTAGCCGCCACTGACAAGCAACATGATCAATGTTCCGAGTCTTCCCCTAGGGGTG
ATGCTGTTTCTCAATTGGATTCTTGCGGAGATAGCAGAAATGAGTTGGGTGTTGGAGATGGTGACTCGAAATTAGAATGTGGGAGAATTGCGCAGAGGGCTCCAATGAGT
TGCCGAGCATGGAGGCCTGATGATGCCTTTCGTCCTCAGAGTTTGCCCACTTTATCGAAGCAGCTTAGTTTACCAAATACAGACCGGCGTTATGGATGTGGAGGCATCCC
TTGGGCCTGTGGTGGTGTTATGTCTGTGCCTACATCTTGTCCAATTTGCTTTGAAGACTTGGACCTTACAGACTCAAGTTTTTTACCTTGTTTCTGTGGATTCCGGCTGT
GCCTTTTCTGTCACAAGAGGATTCTTGAAGAGGATGGGCGCTGTCCTGGGTGCAGGAAGCCATATGAGTGTGATCCTGCCGACAGTGAGACAAATGTACATGGTGGCAGC
CCGACGTTACCATTAGCTCGTTCTTGTAGCATGATTTCAAGGTCTTAG
mRNA sequenceShow/hide mRNA sequence
AAAGGCCCAATCATATGGGTCTCCATACGACATCGTTTCGTTCTTCTCTTCGGGCAATACAGGAATGCACGATACGATACCAAATTACCAATGCCCGTGGGCCGTGGCTG
GGTAACCCCGTAGATTTAGCAAAAGTGAGGGGTATTTTGGTAACTTGGACGATATTTAAATAAAATCTCGGCCCTCTGTTTTTGCTAGTCTGTGGAAAAGAGCGAGCGAA
GCAGAGGAGTTTCCGTTATCCATTCTCTCTCTCTCTCCCCCTCGAAGATCTTCTCTCTCCCGGAAAAATATCCACACAAATTAGGGTTTCTTTGCGACCTTCGTTCCTCC
AATTCGATTTCCAAGTCTTCAATCATGGTAACTGATTCGATCACCAATGCTTCAATTTCCTTGGCCCCAAGCGCCCGGGATTTACCCTTGCCCAAGAAGAAGAGGGTCAA
GAATTCCAAATTGAAGCAGTCCAAGCTTGATGTTCGTCGGGAACAATGGCTTTCTCGAGGTGCTGTAAAGAACAAAAAATGGAACGAAGAAGACAATCGCCTTGATTCAG
TGGTCATTAAAACGAGCAAAGACGATAATCTTTCCTCCAAGAAAATAAATATGAGGGAAATAGGAGGAGACAATGGTGGATCGGTTCACCACGACAGCGATTTTTCGGAG
TCGCCGTCTAACAGCCCCCCTAGTCTTGCCAGTAGTATCTTGGGTGGTAATGATTCCGGTCCTCATTTTACTGGGAGTAGCAGTAGCAGTAGCTGCCGTAGCAGCTCTAG
CGGTTGTCGCTCAGGGAGTATAACGGAGGAGGAAGGAGATGATGATGGCTTGGATGACTGGGAGGCTATTGCTGATGCTCTAGCCGCCACTGACAAGCAACATGATCAAT
GTTCCGAGTCTTCCCCTAGGGGTGATGCTGTTTCTCAATTGGATTCTTGCGGAGATAGCAGAAATGAGTTGGGTGTTGGAGATGGTGACTCGAAATTAGAATGTGGGAGA
ATTGCGCAGAGGGCTCCAATGAGTTGCCGAGCATGGAGGCCTGATGATGCCTTTCGTCCTCAGAGTTTGCCCACTTTATCGAAGCAGCTTAGTTTACCAAATACAGACCG
GCGTTATGGATGTGGAGGCATCCCTTGGGCCTGTGGTGGTGTTATGTCTGTGCCTACATCTTGTCCAATTTGCTTTGAAGACTTGGACCTTACAGACTCAAGTTTTTTAC
CTTGTTTCTGTGGATTCCGGCTGTGCCTTTTCTGTCACAAGAGGATTCTTGAAGAGGATGGGCGCTGTCCTGGGTGCAGGAAGCCATATGAGTGTGATCCTGCCGACAGT
GAGACAAATGTACATGGTGGCAGCCCGACGTTACCATTAGCTCGTTCTTGTAGCATGATTTCAAGGTCTTAGGTAGAGTTCAGTTTCTGCGAGTTAATGTACGTCTGGTC
TCACTTCTTTTTGTGTCTCTTGGGTATTATGGTAGTCTTGGGCTCTGAGTTTCATAGAAATAGATGTGTGGCATTTGGGCATTGAAGCATGGTTATGAGATTGAGAACCT
TTAAACTGATCTGCATCGTACATGAGTGTGGTTCTTATCTTATCGTACATTTGCCAATGTAGAGGACTTATTTCGTACTTGGTATGTGAATAGACGAGGTGGTATAGAAA
TGAATACATGGAATAATGATAAACATTTATTTTTTCCACTTTTTATTAATTTGTATTTGTATACTTCGTAGGAATGCTGAAGTTTTCTATTTGTCGCTTTCCTCCTTAGT
CCTGCTGATTTTATGAGATTTTAGTGTCATGATTGACTATTTGCATGGGGTTTATAGCATGTCAAGGACTCAAGAAAGACACTCAAGTCCCAGGAGAATATACGCCAAAA
TGAAATGTGGAAATATTTTTAAAAACTCGCATGTTTCACTACTAATTATTGTATACTTTTGCTGTTCATAGCCACCAGCAGCAGTTGAATAGTTGAGGTCAGTCAAAGTG
GCAGGATGATTAATTGCTGGAAGATGGTTTCTCGTAGTTTCTGTAATTGCTTGATCTTCACGAGGACGAATTTCACACTGTGTAAGCTTGTAAGTCATGTATTCATCAGG
GCATTGGTGAATCTGCCTTTTAATGAAGTTCTATCATCATTTTGATCATGGAGTGAATAATGATATGATTGCTGAATAACTTCAGTTGTGAGATTTTCTAATTCTATTGA
ACGAGATCGAGGTCACCAGCCTAAACATTTTTATATTTCATTTTGATCATGTATTTTTGGCTCATTTAACTGTCACACTTATGAAATTTTGAAGGATAGATTGGTTAATC
AGCTACCCATGGAATGGAGGTATTAACTCCTTCTATACATACTGTAAACTAGACTATTACAATTCCTGCGAACGAACAGAATGCGGTTGTCAAGAAGAAAGTATCAGCTT
ATTAACGTTTTCATTAGAATCTGAGTTTCGTTGGGTGTGTGATGTATCCTTGTGCAGTTCATTGAAACCAAATTTTGTGTCCTCTGAAAATTATATCTTGAGCTCTATTA
TTATAAAATGGTCGCTCTGATGTGAATGTTTTGACATCATTTGAACTCCTTGTTTCTTCTCTATATTTGTTTATGCTGTGTTTTTGAAATTTAAGTTGGAGCTTACATTG
TTTTCTAAACATCCCGATATCTTCTAGAAGACGGACCAGCGGAGGACCCTCTTCCTCAGCATTATAGGTGGTCGGCTGTTTCTGTCGGAGATTCCAAGGACACGTTCGAA
CCAAGTTTCCAGTTCTTAATTCATTCAAACAGAAGGGAACCTGTTGGAACATCACAGTCTTTTACGTGTACAACACCTTATAATGTTAGACTACACTGTGTCAATAGCGT
CCACTCCGTTCTCCATCCTCTCGACATCGTCCTTAATACAAGACTGTGATCTATGGTGCTCAGGAAGTGTCTTCCACCACTCCATAAAGGTAGATTTCTTCAACAACATA
TCATCTTTGATCTCGTTGATCCATCGCTTTATTTTCTTCTCCAACTCTGTAATTTCCTTCTCATTTTCAGCAAATGTTCTGGTTTTAGTTTGGATACGAAGCTCCTCAAC
CTCTGCCCAAAGACTTGATTCTTCACTGGATTCTATCATCTTTTGCTTGTGTTCAAGCCATCTTTGTGTAAATCTGTAACGTTTCGGCCTTCCTTTTATCAGATAAGGTC
CAGTGTCATCATTCTTCGAGTGTCTATAGTAATTAGCAATATCTAGAGGCTCAACAAGACGGCGAAAGCGGGTTGCAAGTTGTATCCATTCGTCTTTGCCCTCGAGTTCA
TCTGGGAGCTCATATCTTTTGAGCATTTCTATGATCTCATCCCACACACCAGCTAGCTCAAGCCTCCAGATATTAGCGTGGAAGTCTCTTGCATCCTTTTGGAGTTTAAA
GGCATCATAATATCCCAAACCATCAACCTTACATACAGCTCTGTATTCTTCTTCTAGCCGAGACAATCGCTCCTCGGTATATTGTTTCTTCTCTTCCATCCTTTCTTGAT
TTTTCATCTTCTGCTCTTCACACGCACCAGCAGCTCGAAGGTTCATTAATGCTCTCGTACTCTGTAATTTCACATTCACAAAATTAGGTAGAAAGCAACAGCATCTTAGT
TATCGAGGTACCAACTCTTTTCCATGTCATATAAAACACTTCAAACTATTCTGAGGAAAAGACGAACTTTTGAAGAAGATACACATGGAAATACATGATTGTTAGAGACT
GGAGACGAAAAAGAAAGCAAGAAGAGAAGAACTTACTAGGCCAAGTTCATTCAAGGCTTCCGTTAATGGCGTGTTCCTGCCAGCCAAAGATAACGGAAGCTTAACCAATT
CATCCAGACGAATAGAGTGTAGGAGCTCCCAGTTTCGTTGTATTTTAGATTCATAGCCCCAATGGTCCATTAAACTCTGATGAGAGATCCGACCACATTCCCCCACGGAG
CTCAACTGACAAGAGTAGAACAGTATTTGTAAAACAGCATCTGGGTTGGTCACCACAACTGCTTTCCCACCCTCAGTAAAGAAAACGTAGGTTCCAAAAGGCCGGTAAGG
GCTCAATTTGACGAAGTTTTTCAATGTGTCCAGTAGAAGGTTTGTGCTTCCCATGAGATGGCAAGCCACGTTGCTTGCCACAGCCGATGCGTTCCTCATCACCGTCATGA
AGAAGGTGGTAGCCACATTCCCGATCAAGGCGGGCCCGGGCGATCTCGAGCTCAAGGAGTCAAGGATGGTCTGCAGTTGTGATTGCAGCGAAGGTAGAGGAGCAAGATGG
ATGCGTGGGACGATGTCGTACCTCGTGATGAAGTGTACAAAATGGGTGGACCATTTCTCCCTTTTGAGAGCATGGGAGAAAATGAAGTTACCAACAAGGGGAGACCCAAA
TGTGATACATTTTGGAGGAGTGAAGTTGGGGTTTGAGTTTGAGTTTCTTTGCTGTTCTAACAGCCAGATGGTTGCAAGAATGGCTATCGGACCTCCGGCTGAGTGGCCCG
AAAACACTACCGCCTTATTCACTTTCACTACCTGAGAAATAAAAATTCCATTACCAACAGTTCTTTAAAAAAGCAGTTATACAAAAAAAAATTAGTATCTATCTAGATTA
TAGTAATTTAACGGAAACTTAAAAAAAAAGAAAGCCAATATCTTAATTACCGGGTTTAAATAATCATATAATCCCCTATTTTTGTGAAAATAAGAATTGCTAACATATTT
ATAATGTTATTCTACTTTCATTTCATAGTCAAAAAACCTTTTATTTTTTTTACTCTGGTTAACAAATTAATTAAGATTAACGAGTAGGTAGAAATCTCCTCTCTAGTCGG
GTATCAACAAAATAACATTTATTTCAGTGGTTGTTGGGGTGAAATTCGAAGAATTGGGGGACAATTAGGGCGACGTACCTCCTTCAGCTTCCCCAGGATGCCTTCAAATC
TCTGAAAGAACGCAGAATTGACGATGGCAAAATCGCCGACGCCGATGCTCCTGATGGACGGAAAGAGCCGCCTGCTGATCTCCGTTTCACCGAAAGACGTACTGGAACCG
GAAAACCAGCCATCCGCCGACCAAGATCCGGCGAAACTGATGACAGAGAAGTCGCGAGTTTTCTCGACAAGAAATGGCTTGTCTGGTTGCCTGTGAGCTTTAATGGCGGA
AGCCAAGGCGTTTCTGACGAGCTCTTCTTTGAAACCAGTAGCATCTTCCAGCCTCGCGCCAACCATCTTCCTGAGCTCCGAATCCGGTAATGGTCTCTGCTTGAGCTTGA
CGAAGACTCTCAAGTCTTCGGCCTATATGATAAGTTATTGGATGCCTGTTGCTGTTCTCTCGTTGTTGTTTGTCCATTCCCAAAAAGTAGTTTTGGTTTTTTGTTTGGGA
ATTTCTCGGAAACTAAGGTGGGCTGGTTAGACTGGTTTGATAATAAAATAAATTTATGTTGTAGAATAAATTTAAAATATTAAAAAGAGAATTTGTTTTGATCAGTTTTT
CAAATTTTTATACGATTTTGAAAATGTTTACCTATTTCAAACAGACAAAAGGGATCAATTAAACCACCTAGGAACTTTCCATTTCATAGTTTTAAATCTTGCTTACCTTC
AACATTTCTTTTCTTTTTCTAATTCTAATTTCTGGAAATTTTTTTTAATATTTTAATAAAATAATTACTCTG
Protein sequenceShow/hide protein sequence
MVTDSITNASISLAPSARDLPLPKKKRVKNSKLKQSKLDVRREQWLSRGAVKNKKWNEEDNRLDSVVIKTSKDDNLSSKKINMREIGGDNGGSVHHDSDFSESPSNSPPS
LASSILGGNDSGPHFTGSSSSSSCRSSSSGCRSGSITEEEGDDDGLDDWEAIADALAATDKQHDQCSESSPRGDAVSQLDSCGDSRNELGVGDGDSKLECGRIAQRAPMS
CRAWRPDDAFRPQSLPTLSKQLSLPNTDRRYGCGGIPWACGGVMSVPTSCPICFEDLDLTDSSFLPCFCGFRLCLFCHKRILEEDGRCPGCRKPYECDPADSETNVHGGS
PTLPLARSCSMISRS