; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015768 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015768
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionCRC domain-containing protein TSO1-like
Genome locationtig00005930:100758..108340
RNA-Seq ExpressionSgr015768
SyntenySgr015768
Gene Ontology termsGO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR005172 - CRC domain
IPR033467 - Tesmin/TSO1-like CXC domain
IPR044522 - CRC domain-containing protein TSO1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8656297.1 hypothetical protein MANES_04G117200v8 [Manihot esculenta]2.0e-4839.88Show/hide
Query:  RVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEV
        + S KR RK++  +++ E  KRCNCK+S+CLKLYCECF+A  YC DSC C +C NR EYEDTV    QQ E++NPLAFAPKV+K+ ++SP NI+   N  
Subjt:  RVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEV

Query:  PPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGSSSR---AWGNTSDEKTDAQGISSEPGYQQLANNFSSTY-----ASNLS-
         PS ARHK+GCNCKKS C KKYCECYQA VGC++ C+CEGC+N+FG +  S  R    W N S E+ D      +      A+  SST+      S+L+ 
Subjt:  PPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGSSSR---AWGNTSDEKTDAQGISSEPGYQQLANNFSSTY-----ASNLS-

Query:  -SSNNHIPEALTSNLTNRNDCPEATLAPFPPSV---SGPPHWSRLSTVIPAPQPFESMASSKPSFNYHLYG--------------TMTENVRLTPTNQQS
         S ++    A +++L+ R+    +   P+  S    S     S  S + P P+ + S   S  S + HLY               T T+ V+++  NQ+ 
Subjt:  -SSNNHIPEALTSNLTNRNDCPEATLAPFPPSV---SGPPHWSRLSTVIPAPQPFESMASSKPSFNYHLYG--------------TMTENVRLTPTNQQS

Query:  ASTSTSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPY
         S   +  Q  R S++    L+S+++     +P++P L PY
Subjt:  ASTSTSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPY

TXG69228.1 hypothetical protein EZV62_004163 [Acer yangbiense]1.2e-5339Show/hide
Query:  NRVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENE
        N++S K+KRK++   S++E  KRCNC++S+CLKLYCECF+A  YC DSC C+ C N+ EYEDTV    QQ ES+NPLAFAPK++K  ++SP N++   N 
Subjt:  NRVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENE

Query:  VPPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGS---SSRAWGNTSDEKTDAQGISSEPGYQQLANNFSSTYASNLSSSN--
          P+ ARHK+GCNCKKS C KKYCECYQA+VGC+ AC+CEGC+N+FG +AGS    +  W N S  K DA     E         FS  +      SN  
Subjt:  VPPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGS---SSRAWGNTSDEKTDAQGISSEPGYQQLANNFSSTYASNLSSSN--

Query:  --NHIPEALTSNLT-NRNDCPEATLAPFPPSVSGPP---------HWSRLS---TVIPAPQPFESMASSKPSFNYHLYGTMTENVRLTPTNQQSAST---
          +H   AL  +++ +  DCP+ + A   P  S            H+S +S    +I   +    + S         Y  M E ++ T T  +   T   
Subjt:  --NHIPEALTSNLT-NRNDCPEATLAPFPPSVSGPP---------HWSRLS---TVIPAPQPFESMASSKPSFNYHLYGTMTENVRLTPTNQQSAST---

Query:  ------STSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPYMNFNGSSAQNEATLE
              S   F  + L S+    ++S ++  WQT+P++P L PY   N    Q E+ L+
Subjt:  ------STSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPYMNFNGSSAQNEATLE

XP_021609346.1 protein tesmin/TSO1-like CXC 3 [Manihot esculenta]2.0e-4839.88Show/hide
Query:  RVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEV
        + S KR RK++  +++ E  KRCNCK+S+CLKLYCECF+A  YC DSC C +C NR EYEDTV    QQ E++NPLAFAPKV+K+ ++SP NI+   N  
Subjt:  RVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEV

Query:  PPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGSSSR---AWGNTSDEKTDAQGISSEPGYQQLANNFSSTY-----ASNLS-
         PS ARHK+GCNCKKS C KKYCECYQA VGC++ C+CEGC+N+FG +  S  R    W N S E+ D      +      A+  SST+      S+L+ 
Subjt:  PPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGSSSR---AWGNTSDEKTDAQGISSEPGYQQLANNFSSTY-----ASNLS-

Query:  -SSNNHIPEALTSNLTNRNDCPEATLAPFPPSV---SGPPHWSRLSTVIPAPQPFESMASSKPSFNYHLYG--------------TMTENVRLTPTNQQS
         S ++    A +++L+ R+    +   P+  S    S     S  S + P P+ + S   S  S + HLY               T T+ V+++  NQ+ 
Subjt:  -SSNNHIPEALTSNLTNRNDCPEATLAPFPPSV---SGPPHWSRLSTVIPAPQPFESMASSKPSFNYHLYG--------------TMTENVRLTPTNQQS

Query:  ASTSTSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPY
         S   +  Q  R S++    L+S+++     +P++P L PY
Subjt:  ASTSTSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPY

XP_022153578.1 CRC domain-containing protein TSO1-like [Momordica charantia]1.8e-8139.8Show/hide
Query:  PSVLGPSTSDPQDWSYFDFVSNLHPIDELALESFSAMVTERASPPPPPPLSTSPLQDMQKETSFVESNHRDEVPIDDDQFNTHSGDPFNSVMSLLQPGEL
        P   GPS+S  QD SY DF ++L PI+ L LE  S     R              +DM+ E+SFVE   R  +P  D + NTHS DP        +  ++
Subjt:  PSVLGPSTSDPQDWSYFDFVSNLHPIDELALESFSAMVTERASPPPPPPLSTSPLQDMQKETSFVESNHRDEVPIDDDQFNTHSGDPFNSVMSLLQPGEL

Query:  PELVESDFDMKTKDVPEEDDKNEKMRLSEVSVEVDVSLEVEGNRSVEEFLLGLTVAGQTDDGILNELSGFIPTSNMNPFFGSEAPPGLEFAEN-VGVQYH
        P L         +D  E  DKNE     +    V+ +L +EGN++VE  L   TV   + + +  EL+  I  S+ +           E  EN  GV+Y 
Subjt:  PELVESDFDMKTKDVPEEDDKNEKMRLSEVSVEVDVSLEVEGNRSVEEFLLGLTVAGQTDDGILNELSGFIPTSNMNPFFGSEAPPGLEFAEN-VGVQYH

Query:  KSDISNLNLDDSLQPIKTNESSAASSSIMSEECHKQQ-VSHGKMVG-SNRVS-TKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCC
         SDIS LN +++L+        AASS  +S E  K      G M+   N+V  T RKRKKS  M +N+  KRCNCKKS CLKLYCECFSA  YC +SC C
Subjt:  KSDISNLNLDDSLQPIKTNESSAASSSIMSEECHKQQ-VSHGKMVG-SNRVS-TKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCC

Query:  KDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEVPPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAG
        KDC+N+IEYED V  VS+Q +SKN  AFAPKV +++ + PT +MV EN    SL RHKKGC CKKSMC K+YCEC+QAEVGCT+AC CE C NTFGTR  
Subjt:  KDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEVPPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAG

Query:  SSSRAWGNTSDEKTDAQGISSEPGYQQLANNFSSTYASNLSSSNNHIPEALTSNLTNR-NDCPEATLAPFPPSVSGPPHWSRLSTVIPAPQPFESMASSK
               NT DE+TDA   +    Y QLANNFSSTYASN S +      A TSNLTN  N CPE+    F  +VS P  W    + +      +      
Subjt:  SSSRAWGNTSDEKTDAQGISSEPGYQQLANNFSSTYASNLSSSNNHIPEALTSNLTNR-NDCPEATLAPFPPSVSGPPHWSRLSTVIPAPQPFESMASSK

Query:  PSFNYHLYGTMTEN--------------------------------VRLTPTNQQSASTSTSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPYMNF
         S N  L+  M  N                                + + P NQ   S S   +  + L+ NP  +L+ SQQ   Q +PT+ S    MN+
Subjt:  PSFNYHLYGTMTEN--------------------------------VRLTPTNQQSASTSTSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPYMNF

Query:  NGSSAQNEATLEN
        N  S+ N +   N
Subjt:  NGSSAQNEATLEN

XP_031265182.1 CRC domain-containing protein TSO1-like [Pistacia vera]3.6e-5040.62Show/hide
Query:  NRVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENE
        N+VS K KRKK+    + +  KRCNC++S+CLKLYCECF+A  +C DSC CKDC N+ EYEDTV  + QQ ES++PLAFAPK++K  +NSP NI+   N 
Subjt:  NRVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENE

Query:  VPPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGSSSR---AWGNTSDEKTDAQGISS-EPGYQQLAN-----NFSSTYASNL
          P+ ARHK+GCNCKKS C KKYCECYQA VGC+  C+CE C N+FG +A S  R    W N S + T +  +    P ++ L +       S  YA  L
Subjt:  VPPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGSSSR---AWGNTSDEKTDAQGISS-EPGYQQLAN-----NFSSTYASNL

Query:  SSSNNHIPEALTSNLTNRNDCPEATL--APFPPSVSGPPHWSRLSTVIPAPQPFESMA----SSKPSFNYHL-----------YGTMTENVRLTPTNQQS
        +SS         S +++  + P A L       S SG  HW   S++   P   ES A    SS  +F+  L             T T+ ++++  NQ+ 
Subjt:  SSSNNHIPEALTSNLTNRNDCPEATL--APFPPSVSGPPHWSRLSTVIPAPQPFESMA----SSKPSFNYHL-----------YGTMTENVRLTPTNQQS

Query:  ASTSTSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPYMNFNGSSAQNE
         S   S  Q  R SS+P   L+S ++   Q MP++P L PYM     + +++
Subjt:  ASTSTSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPYMNFNGSSAQNE

TrEMBL top hitse value%identityAlignment
A0A2C9W1V0 CRC domain-containing protein9.5e-4939.88Show/hide
Query:  RVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEV
        + S KR RK++  +++ E  KRCNCK+S+CLKLYCECF+A  YC DSC C +C NR EYEDTV    QQ E++NPLAFAPKV+K+ ++SP NI+   N  
Subjt:  RVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEV

Query:  PPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGSSSR---AWGNTSDEKTDAQGISSEPGYQQLANNFSSTY-----ASNLS-
         PS ARHK+GCNCKKS C KKYCECYQA VGC++ C+CEGC+N+FG +  S  R    W N S E+ D      +      A+  SST+      S+L+ 
Subjt:  PPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGSSSR---AWGNTSDEKTDAQGISSEPGYQQLANNFSSTY-----ASNLS-

Query:  -SSNNHIPEALTSNLTNRNDCPEATLAPFPPSV---SGPPHWSRLSTVIPAPQPFESMASSKPSFNYHLYG--------------TMTENVRLTPTNQQS
         S ++    A +++L+ R+    +   P+  S    S     S  S + P P+ + S   S  S + HLY               T T+ V+++  NQ+ 
Subjt:  -SSNNHIPEALTSNLTNRNDCPEATLAPFPPSV---SGPPHWSRLSTVIPAPQPFESMASSKPSFNYHLYG--------------TMTENVRLTPTNQQS

Query:  ASTSTSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPY
         S   +  Q  R S++    L+S+++     +P++P L PY
Subjt:  ASTSTSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPY

A0A2N9HR44 CRC domain-containing protein2.3e-4740.88Show/hide
Query:  RKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEVPPSLARH
        RK+++  S ++  KRCNCK+S+CLKLYCECF+A  YC DSC C+ C N+ EYE  V    QQ ES+NPLAFAPKV+K  +NSP  IM   N   PS ARH
Subjt:  RKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEVPPSLARH

Query:  KKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGSSSRAWGNTSDEKTDAQGISSEPGYQQLANNFSSTYASNLSSSNNHIP--EALTSNL
        K+GCNCKKS C KKYCEC+QA VGC+  C+CEGC N+FGT    + R   N S EK D      E G       FS T+   L+   N  P   ALTS+ 
Subjt:  KKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGSSSRAWGNTSDEKTDAQGISSEPGYQQLANNFSSTYASNLSSSNNHIP--EALTSNL

Query:  TNRNDCPEATLAPF-------PPSVSGPPHWSRLSTVIPAPQPFESMASSKPSFNYHLYGTM-----TENVRLTPTNQQSASTST--------SYFQPER
        +    C + + A         PP  +G  HW   S V    Q +ES    + + +  L+  M      E ++ T T  ++   S+           Q ER
Subjt:  TNRNDCPEATLAPF-------PPSVSGPPHWSRLSTVIPAPQPFESMASSKPSFNYHLYGTM-----TENVRLTPTNQQSASTST--------SYFQPER

Query:  LSSNPWLDLKSSQQLTWQTMPTYPSLLPYMNFNGSSAQNE
         SS+     +S ++   Q MP++P L PY NF G   + E
Subjt:  LSSNPWLDLKSSQQLTWQTMPTYPSLLPYMNFNGSSAQNE

A0A2P6RWF0 Putative transcription factor Tesmin family1.2e-4638.75Show/hide
Query:  SIMSEECHKQQVSHGKMVGSNRVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFA
        S+  EE +  Q ++ K+   N +S KRKR++   +S  E  KRCNCKKS+CLKLYCECF+A  +C DSC C++C N+ E+EDTV+   QQ ES+NPLAFA
Subjt:  SIMSEECHKQQVSHGKMVGSNRVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFA

Query:  PKVIKETSNSPTNIMVAENEVPPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAG---SSSRAWGNTSDEKTDA----------
        PKV+K   NS  NIM   +   PS ARHK+GCNCKKS C KKYCEC+QA VGC+ AC+C+ C N  GT+A    + +  W     EK D           
Subjt:  PKVIKETSNSPTNIMVAENEVPPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAG---SSSRAWGNTSDEKTDA----------

Query:  ---------QGISSEPGYQQLANNFSSTYASNLSSSNNHIPEALTSNLTNRNDCPEATLAPFPPSVSGPPHWSR-LSTVIPAPQPFESMASSKPSFNYHL
                 +G+S       L+N  SST  S+ SSS           + NR    +A L       SG  H  R  S VI  PQ FES   S+ S +   
Subjt:  ---------QGISSEPGYQQLANNFSSTYASNLSSSNNHIPEALTSNLTNRNDCPEATLAPFPPSVSGPPHWSR-LSTVIPAPQPFESMASSKPSFNYHL

Query:  YGTMTENV-----------RLTPTNQQSASTSTSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPY
        Y  M +++           ++   +      S    + E+L S     L+S ++   Q MP++P L PY
Subjt:  YGTMTENV-----------RLTPTNQQSASTSTSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPY

A0A5C7IJA4 CRC domain-containing protein5.7e-5439Show/hide
Query:  NRVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENE
        N++S K+KRK++   S++E  KRCNC++S+CLKLYCECF+A  YC DSC C+ C N+ EYEDTV    QQ ES+NPLAFAPK++K  ++SP N++   N 
Subjt:  NRVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENE

Query:  VPPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGS---SSRAWGNTSDEKTDAQGISSEPGYQQLANNFSSTYASNLSSSN--
          P+ ARHK+GCNCKKS C KKYCECYQA+VGC+ AC+CEGC+N+FG +AGS    +  W N S  K DA     E         FS  +      SN  
Subjt:  VPPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGS---SSRAWGNTSDEKTDAQGISSEPGYQQLANNFSSTYASNLSSSN--

Query:  --NHIPEALTSNLT-NRNDCPEATLAPFPPSVSGPP---------HWSRLS---TVIPAPQPFESMASSKPSFNYHLYGTMTENVRLTPTNQQSAST---
          +H   AL  +++ +  DCP+ + A   P  S            H+S +S    +I   +    + S         Y  M E ++ T T  +   T   
Subjt:  --NHIPEALTSNLT-NRNDCPEATLAPFPPSVSGPP---------HWSRLS---TVIPAPQPFESMASSKPSFNYHLYGTMTENVRLTPTNQQSAST---

Query:  ------STSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPYMNFNGSSAQNEATLE
              S   F  + L S+    ++S ++  WQT+P++P L PY   N    Q E+ L+
Subjt:  ------STSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPYMNFNGSSAQNEATLE

A0A6J1DHU8 CRC domain-containing protein TSO1-like8.5e-8239.8Show/hide
Query:  PSVLGPSTSDPQDWSYFDFVSNLHPIDELALESFSAMVTERASPPPPPPLSTSPLQDMQKETSFVESNHRDEVPIDDDQFNTHSGDPFNSVMSLLQPGEL
        P   GPS+S  QD SY DF ++L PI+ L LE  S     R              +DM+ E+SFVE   R  +P  D + NTHS DP        +  ++
Subjt:  PSVLGPSTSDPQDWSYFDFVSNLHPIDELALESFSAMVTERASPPPPPPLSTSPLQDMQKETSFVESNHRDEVPIDDDQFNTHSGDPFNSVMSLLQPGEL

Query:  PELVESDFDMKTKDVPEEDDKNEKMRLSEVSVEVDVSLEVEGNRSVEEFLLGLTVAGQTDDGILNELSGFIPTSNMNPFFGSEAPPGLEFAEN-VGVQYH
        P L         +D  E  DKNE     +    V+ +L +EGN++VE  L   TV   + + +  EL+  I  S+ +           E  EN  GV+Y 
Subjt:  PELVESDFDMKTKDVPEEDDKNEKMRLSEVSVEVDVSLEVEGNRSVEEFLLGLTVAGQTDDGILNELSGFIPTSNMNPFFGSEAPPGLEFAEN-VGVQYH

Query:  KSDISNLNLDDSLQPIKTNESSAASSSIMSEECHKQQ-VSHGKMVG-SNRVS-TKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCC
         SDIS LN +++L+        AASS  +S E  K      G M+   N+V  T RKRKKS  M +N+  KRCNCKKS CLKLYCECFSA  YC +SC C
Subjt:  KSDISNLNLDDSLQPIKTNESSAASSSIMSEECHKQQ-VSHGKMVG-SNRVS-TKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCC

Query:  KDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEVPPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAG
        KDC+N+IEYED V  VS+Q +SKN  AFAPKV +++ + PT +MV EN    SL RHKKGC CKKSMC K+YCEC+QAEVGCT+AC CE C NTFGTR  
Subjt:  KDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEVPPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAG

Query:  SSSRAWGNTSDEKTDAQGISSEPGYQQLANNFSSTYASNLSSSNNHIPEALTSNLTNR-NDCPEATLAPFPPSVSGPPHWSRLSTVIPAPQPFESMASSK
               NT DE+TDA   +    Y QLANNFSSTYASN S +      A TSNLTN  N CPE+    F  +VS P  W    + +      +      
Subjt:  SSSRAWGNTSDEKTDAQGISSEPGYQQLANNFSSTYASNLSSSNNHIPEALTSNLTNR-NDCPEATLAPFPPSVSGPPHWSRLSTVIPAPQPFESMASSK

Query:  PSFNYHLYGTMTEN--------------------------------VRLTPTNQQSASTSTSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPYMNF
         S N  L+  M  N                                + + P NQ   S S   +  + L+ NP  +L+ SQQ   Q +PT+ S    MN+
Subjt:  PSFNYHLYGTMTEN--------------------------------VRLTPTNQQSASTSTSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPYMNF

Query:  NGSSAQNEATLEN
        N  S+ N +   N
Subjt:  NGSSAQNEATLEN

SwissProt top hitse value%identityAlignment
A1Z9E2 Protein lin-54 homolog3.2e-2544.35Show/hide
Query:  RKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEVPPSLARHKKGCNCKKSMCQ
        RK CNC KS+CLKLYC+CF+  ++C D C CKDC N ++YE       + C  +NP AF PK+    S                +  H KGCNCK+S C 
Subjt:  RKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEVPPSLARHKKGCNCKKSMCQ

Query:  KKYCECYQAEVGCTTACQCEGCHN
        K YCECY+A++ C++ C+C GC N
Subjt:  KKYCECYQAEVGCTTACQCEGCHN

F4JIF5 Protein tesmin/TSO1-like CXC 21.4e-3333.95Show/hide
Query:  VSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEVP
        +S+ +K++      + E  KRCNCKKS+CLKLYCECF+A  YC + C C DC N+  +ED V    +Q ES+NPLAFAPKVI+ + +       A     
Subjt:  VSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEVP

Query:  PSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGSSSRAWGNTSDEKTDAQGISSEPGYQQLANNFSSTYASNLSSSNNHI--PE
        P+ ARHK+GCNCKKS C KKYCECYQ  VGC+  C+CEGC N FG + GSS        +E   ++   +    Q          +S L ++   I  PE
Subjt:  PSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGSSSRAWGNTSDEKTDAQGISSEPGYQQLANNFSSTYASNLSSSNNHI--PE

Query:  ALTSNLTNRNDCPEATLAPFPPSVSGPPHWSRLSTVIPAPQPFESMASSKPSFNYHLYGT-----MTENVRLTP-TNQQSASTSTSYFQPERLSSNPWLD
         +    ++  +       P P S+ G    S +       +P  S++ S+   ++          M E +  +P  N +S S +     P  + S+    
Subjt:  ALTSNLTNRNDCPEATLAPFPPSVSGPPHWSRLSTVIPAPQPFESMASSKPSFNYHLYGT-----MTENVRLTP-TNQQSASTSTSYFQPERLSSNPWLD

Query:  L----KSSQQLTWQTMPTYPSLLP
        +       ++L  Q++P++PSL P
Subjt:  L----KSSQQLTWQTMPTYPSLLP

Q84JZ8 Protein tesmin/TSO1-like CXC 41.7e-3137.9Show/hide
Query:  NMNPFFGSEAPPGLEFAENVGVQYHKSDISNLNLDDSLQ-PIKTNESSAASSSIMSEECHKQQVSHGKMVGSNRVSTKRKRKKSTRMSQNEYRKRCNCKK
        ++N F       G +  +    Q   S   + N++D    P+ T       S +   E  ++ V  G+    +++     R+ S  + +    KRC C+K
Subjt:  NMNPFFGSEAPPGLEFAENVGVQYHKSDISNLNLDDSLQ-PIKTNESSAASSSIMSEECHKQQVSHGKMVGSNRVSTKRKRKKSTRMSQNEYRKRCNCKK

Query:  SECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEVPPSLARHKKGCNCKKSMCQKKYCECYQ
        S+CLKLYCECFSA  +C + C C++C N+  +ED V    +  +++NPLAFAPKV+  TS++  ++ V EN   P+ ARHK+GCNC+KS C KKYCEC+ 
Subjt:  SECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEVPPSLARHKKGCNCKKSMCQKKYCECYQ

Query:  AEVGCTTACQCEGCHNTFG
          VGC++ C+C GC NTFG
Subjt:  AEVGCTTACQCEGCHNTFG

Q8L548 Protein tesmin/TSO1-like CXC 31.5e-3535.43Show/hide
Query:  ENVGVQYHKSDISNLNLDDSLQP--------IKTNESSAASSSIMSEECHKQQVSHGKMVGSNRVSTKRKRKKSTRMSQNEYR-KRCNCKKSECLKLYCE
        +NV  +Y  S    + +  SL P        ++ NES   S  I+ E   K   S    V    +S K+KR+KS +  + +   KRCNCKKS+CLKLYCE
Subjt:  ENVGVQYHKSDISNLNLDDSLQP--------IKTNESSAASSSIMSEECHKQQVSHGKMVGSNRVSTKRKRKKSTRMSQNEYR-KRCNCKKSECLKLYCE

Query:  CFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAEN-EVPPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTA
        CF+A  YC + C C +C N+  ++D V    +Q ES+NPLAFAPKVI+   NS + I V E+    P+ ARHK+GCNCKKS C KKYCECYQ  VGC+  
Subjt:  CFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAEN-EVPPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTA

Query:  CQCEGCHNTFGTRAGSSSRAWGNTSDEKTDAQGISSEPGYQQLANNFSSTYASNLSSSNNHIPEALTSNLTNRNDCPEATLAPFPPSVSGPPHWSRLSTV
        C+CEGC N FG + GS         DE+ +  G       QQ    F      +           L  +  NR   P+   + F     G    S  S +
Subjt:  CQCEGCHNTFGTRAGSSSRAWGNTSDEKTDAQGISSEPGYQQLANNFSSTYASNLSSSNNHIPEALTSNLTNRNDCPEATLAPFPPSVSGPPHWSRLSTV

Query:  IPAPQPFESMAS-SKPSFNYHLYGTMTENVRLTP-TNQQSASTSTSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPY
            +P  S+ S S+          M+EN+  +P T     S   S    +   S PW      + L  ++ PT+PSL P+
Subjt:  IPAPQPFESMAS-SKPSFNYHLYGTMTENVRLTP-TNQQSASTSTSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPY

Q9LUI3 CRC domain-containing protein TSO13.6e-3736.45Show/hide
Query:  KDVPEEDDKNEKMRLSEVSVEVDVSLEVEGNRSVEEFLLGLTVAGQTDDGILNELSGFIPTSNMNPFFGSEAPPGLEFAEN-VGVQYHKSDISNL-----
        KD+P     NE   L+ +     V     G   +    L   + G+    I+++       +       S   PG+    N V +    S+IS +     
Subjt:  KDVPEEDDKNEKMRLSEVSVEVDVSLEVEGNRSVEEFLLGLTVAGQTDDGILNELSGFIPTSNMNPFFGSEAPPGLEFAEN-VGVQYHKSDISNL-----

Query:  ------NLDDSLQPIKTNESSAASSSIMSEECHKQQVSHGKMVGSNRVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCL
              +   S  PI++ ++   +S     E  ++          N  S K+K +KS +  + E  KRCNCKKS+CLKLYCECF+A  YC + C C DC 
Subjt:  ------NLDDSLQPIKTNESSAASSSIMSEECHKQQVSHGKMVGSNRVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCL

Query:  NRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEV--PPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGS
        N+  +E+TV    +Q ES+NPLAFAPKVI+       +IM A ++    P+ ARHK+GCNCKKS C KKYCECYQ  VGC+  C+CEGC N FG + GS
Subjt:  NRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEV--PPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGS

Arabidopsis top hitse value%identityAlignment
AT2G20110.2 Tesmin/TSO1-like CXC domain-containing protein2.7e-2443.26Show/hide
Query:  TRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEVPPSLARHKKGC
        TR    + +K+CNCK S CLKLYCECF++  YC D C C +C N +E E       +    +NP AF PK+         N     + V   LARH KGC
Subjt:  TRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEVPPSLARHKKGC

Query:  NCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGSS
        +CKKS C KKYCEC+QA + C+  C+C  C N  G+    S
Subjt:  NCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGSS

AT3G04850.1 Tesmin/TSO1-like CXC domain-containing protein1.2e-3237.9Show/hide
Query:  NMNPFFGSEAPPGLEFAENVGVQYHKSDISNLNLDDSLQ-PIKTNESSAASSSIMSEECHKQQVSHGKMVGSNRVSTKRKRKKSTRMSQNEYRKRCNCKK
        ++N F       G +  +    Q   S   + N++D    P+ T       S +   E  ++ V  G+    +++     R+ S  + +    KRC C+K
Subjt:  NMNPFFGSEAPPGLEFAENVGVQYHKSDISNLNLDDSLQ-PIKTNESSAASSSIMSEECHKQQVSHGKMVGSNRVSTKRKRKKSTRMSQNEYRKRCNCKK

Query:  SECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEVPPSLARHKKGCNCKKSMCQKKYCECYQ
        S+CLKLYCECFSA  +C + C C++C N+  +ED V    +  +++NPLAFAPKV+  TS++  ++ V EN   P+ ARHK+GCNC+KS C KKYCEC+ 
Subjt:  SECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEVPPSLARHKKGCNCKKSMCQKKYCECYQ

Query:  AEVGCTTACQCEGCHNTFG
          VGC++ C+C GC NTFG
Subjt:  AEVGCTTACQCEGCHNTFG

AT3G22760.1 Tesmin/TSO1-like CXC domain-containing protein1.1e-3635.43Show/hide
Query:  ENVGVQYHKSDISNLNLDDSLQP--------IKTNESSAASSSIMSEECHKQQVSHGKMVGSNRVSTKRKRKKSTRMSQNEYR-KRCNCKKSECLKLYCE
        +NV  +Y  S    + +  SL P        ++ NES   S  I+ E   K   S    V    +S K+KR+KS +  + +   KRCNCKKS+CLKLYCE
Subjt:  ENVGVQYHKSDISNLNLDDSLQP--------IKTNESSAASSSIMSEECHKQQVSHGKMVGSNRVSTKRKRKKSTRMSQNEYR-KRCNCKKSECLKLYCE

Query:  CFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAEN-EVPPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTA
        CF+A  YC + C C +C N+  ++D V    +Q ES+NPLAFAPKVI+   NS + I V E+    P+ ARHK+GCNCKKS C KKYCECYQ  VGC+  
Subjt:  CFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAEN-EVPPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTA

Query:  CQCEGCHNTFGTRAGSSSRAWGNTSDEKTDAQGISSEPGYQQLANNFSSTYASNLSSSNNHIPEALTSNLTNRNDCPEATLAPFPPSVSGPPHWSRLSTV
        C+CEGC N FG + GS         DE+ +  G       QQ    F      +           L  +  NR   P+   + F     G    S  S +
Subjt:  CQCEGCHNTFGTRAGSSSRAWGNTSDEKTDAQGISSEPGYQQLANNFSSTYASNLSSSNNHIPEALTSNLTNRNDCPEATLAPFPPSVSGPPHWSRLSTV

Query:  IPAPQPFESMAS-SKPSFNYHLYGTMTENVRLTP-TNQQSASTSTSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPY
            +P  S+ S S+          M+EN+  +P T     S   S    +   S PW      + L  ++ PT+PSL P+
Subjt:  IPAPQPFESMAS-SKPSFNYHLYGTMTENVRLTP-TNQQSASTSTSYFQPERLSSNPWLDLKSSQQLTWQTMPTYPSLLPY

AT3G22780.1 Tesmin/TSO1-like CXC domain-containing protein2.6e-3836.45Show/hide
Query:  KDVPEEDDKNEKMRLSEVSVEVDVSLEVEGNRSVEEFLLGLTVAGQTDDGILNELSGFIPTSNMNPFFGSEAPPGLEFAEN-VGVQYHKSDISNL-----
        KD+P     NE   L+ +     V     G   +    L   + G+    I+++       +       S   PG+    N V +    S+IS +     
Subjt:  KDVPEEDDKNEKMRLSEVSVEVDVSLEVEGNRSVEEFLLGLTVAGQTDDGILNELSGFIPTSNMNPFFGSEAPPGLEFAEN-VGVQYHKSDISNL-----

Query:  ------NLDDSLQPIKTNESSAASSSIMSEECHKQQVSHGKMVGSNRVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCL
              +   S  PI++ ++   +S     E  ++          N  S K+K +KS +  + E  KRCNCKKS+CLKLYCECF+A  YC + C C DC 
Subjt:  ------NLDDSLQPIKTNESSAASSSIMSEECHKQQVSHGKMVGSNRVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCL

Query:  NRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEV--PPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGS
        N+  +E+TV    +Q ES+NPLAFAPKVI+       +IM A ++    P+ ARHK+GCNCKKS C KKYCECYQ  VGC+  C+CEGC N FG + GS
Subjt:  NRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEV--PPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGS

AT4G14770.1 TESMIN/TSO1-like CXC 21.0e-3433.95Show/hide
Query:  VSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEVP
        +S+ +K++      + E  KRCNCKKS+CLKLYCECF+A  YC + C C DC N+  +ED V    +Q ES+NPLAFAPKVI+ + +       A     
Subjt:  VSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKNPLAFAPKVIKETSNSPTNIMVAENEVP

Query:  PSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGSSSRAWGNTSDEKTDAQGISSEPGYQQLANNFSSTYASNLSSSNNHI--PE
        P+ ARHK+GCNCKKS C KKYCECYQ  VGC+  C+CEGC N FG + GSS        +E   ++   +    Q          +S L ++   I  PE
Subjt:  PSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGSSSRAWGNTSDEKTDAQGISSEPGYQQLANNFSSTYASNLSSSNNHI--PE

Query:  ALTSNLTNRNDCPEATLAPFPPSVSGPPHWSRLSTVIPAPQPFESMASSKPSFNYHLYGT-----MTENVRLTP-TNQQSASTSTSYFQPERLSSNPWLD
         +    ++  +       P P S+ G    S +       +P  S++ S+   ++          M E +  +P  N +S S +     P  + S+    
Subjt:  ALTSNLTNRNDCPEATLAPFPPSVSGPPHWSRLSTVIPAPQPFESMASSKPSFNYHLYGT-----MTENVRLTP-TNQQSASTSTSYFQPERLSSNPWLD

Query:  L----KSSQQLTWQTMPTYPSLLP
        +       ++L  Q++P++PSL P
Subjt:  L----KSSQQLTWQTMPTYPSLLP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATCTTCTGAAGACATCATCACTTCACCGTCTGTTCTGGGGCCATCGACTTCGGATCCGCAGGACTGGAGTTACTTCGATTTCGTTAGTAACCTACATCCTATAGA
CGAGCTTGCGTTAGAGTCGTTCTCTGCTATGGTAACGGAAAGGGCTTCGCCTCCTCCACCTCCTCCTCTGTCTACGTCTCCCCTTCAAGATATGCAAAAAGAAACCAGTT
TTGTTGAAAGCAACCATAGAGATGAAGTTCCAATTGATGACGACCAATTCAACACCCATTCGGGTGATCCTTTCAACTCTGTTATGTCCTTGCTTCAACCCGGTGAATTG
CCGGAGTTGGTAGAGAGCGATTTTGACATGAAAACTAAAGACGTTCCTGAGGAAGATGATAAGAATGAAAAAATGCGATTGTCAGAAGTCTCAGTCGAAGTGGATGTCTC
CCTCGAAGTGGAAGGTAATAGAAGTGTAGAAGAATTTCTGTTGGGGTTAACCGTAGCTGGGCAGACAGACGATGGAATTTTGAATGAGTTAAGCGGATTTATTCCCACAT
CTAATATGAACCCGTTCTTTGGTTCTGAAGCACCACCGGGGCTTGAGTTTGCTGAGAATGTTGGTGTTCAATACCACAAGTCAGATATTTCGAACTTGAACCTCGACGAT
AGTTTGCAGCCTATAAAGACAAATGAAAGTTCTGCTGCAAGTTCAAGTATAATGTCAGAAGAATGTCATAAGCAGCAGGTGTCTCACGGAAAAATGGTTGGGTCGAACAG
GGTCAGCACCAAAAGAAAGAGGAAGAAATCAACACGCATGAGTCAAAATGAATACCGCAAACGTTGCAACTGCAAGAAATCCGAGTGCTTAAAACTTTATTGTGAATGCT
TTTCGGCTGAGCAATATTGTTCGGATTCTTGTTGCTGTAAAGATTGCCTTAACAGGATTGAATATGAGGACACAGTTTATCTTGTAAGTCAGCAATGTGAATCCAAAAAT
CCCCTTGCTTTTGCTCCAAAGGTCATCAAGGAAACGTCTAATTCTCCAACAAATATTATGGTAGCAGAGAATGAGGTACCTCCATCTTTGGCCAGACATAAAAAAGGATG
CAATTGCAAGAAGTCAATGTGTCAGAAAAAGTATTGCGAATGCTATCAGGCAGAAGTTGGGTGTACTACAGCATGTCAATGTGAAGGTTGTCATAATACTTTCGGCACTA
GAGCAGGATCCAGTAGTAGAGCATGGGGAAACACTTCTGATGAAAAGACAGATGCTCAAGGAATATCATCTGAACCTGGTTACCAACAACTTGCGAATAATTTCTCATCA
ACATATGCAAGTAACCTTTCCTCATCAAATAATCATATTCCTGAAGCTCTCACTTCAAATTTAACCAATAGAAATGACTGTCCTGAAGCTACCTTAGCCCCTTTTCCACC
ATCTGTTTCTGGACCTCCTCACTGGAGCCGCCTCTCCACTGTGATTCCTGCACCACAACCATTCGAGAGTATGGCCTCTTCTAAGCCATCTTTTAATTATCACCTCTATG
GGACGATGACAGAGAATGTGAGGCTTACTCCAACAAATCAGCAATCGGCATCAACTTCGACATCGTACTTTCAACCAGAACGATTGAGCTCAAATCCTTGGTTAGATTTA
AAAAGTAGCCAACAATTGACCTGGCAAACCATGCCTACTTATCCATCTCTTCTTCCATACATGAATTTTAATGGCAGTTCTGCTCAAAATGAAGCTACTCTTGAAAATGG
TGTTTGGAGAATGTTCAGCGATAGCCCTCAAGCCCCAGCCTTGAGTTCAGCCAACTGCAGACTTCGTTCATCTCCTTTGGGACCGTGTAATGCCCGAGCCTGCGCACAAA
TAGGAATTGATGCCGCTCGCCTTGTAGCCTCGTGTGATGCTTCAAATTTGTTCCTTAAGCTCCTGAATAAAGAGTTGAGAAAAGTAGAAGAGAAGGTTGTTGATCTGACA
TGCAGTGCAAGGAACCCTCCGAGTAGAGATACTGGTCGAGTTGGAGCAGTTGGACAAATCCATTTGATCTGTATATTTGAAAGAAATGATGATCAGTATGGGAAGAAAGC
CTGGAGAGTAAGGCTACACACAAGAGGGACAAAGAAGGAGACTTTGTTGGTTAACAAACAAAAATCAGACTCAAGCAATGTGATTGTGGATGGAAAAAGAAATGAGAAGG
AATATGAAAACTGCACATTGCACCAAATGGCCAAGAAACTCAGAAAGGGAGAGGCATCTGTGAAAAGGAGAGGTAAAGACAAGGACAAAAGCAGGAAATCTTTTTATGAA
CACAAAGAATCCCAGCAGGCAGAGCACATTCATCCATCCGCAAGCCTCCCAGCTAGGAGAGAGAGCTCTGGATATGCAAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAATCTTCTGAAGACATCATCACTTCACCGTCTGTTCTGGGGCCATCGACTTCGGATCCGCAGGACTGGAGTTACTTCGATTTCGTTAGTAACCTACATCCTATAGA
CGAGCTTGCGTTAGAGTCGTTCTCTGCTATGGTAACGGAAAGGGCTTCGCCTCCTCCACCTCCTCCTCTGTCTACGTCTCCCCTTCAAGATATGCAAAAAGAAACCAGTT
TTGTTGAAAGCAACCATAGAGATGAAGTTCCAATTGATGACGACCAATTCAACACCCATTCGGGTGATCCTTTCAACTCTGTTATGTCCTTGCTTCAACCCGGTGAATTG
CCGGAGTTGGTAGAGAGCGATTTTGACATGAAAACTAAAGACGTTCCTGAGGAAGATGATAAGAATGAAAAAATGCGATTGTCAGAAGTCTCAGTCGAAGTGGATGTCTC
CCTCGAAGTGGAAGGTAATAGAAGTGTAGAAGAATTTCTGTTGGGGTTAACCGTAGCTGGGCAGACAGACGATGGAATTTTGAATGAGTTAAGCGGATTTATTCCCACAT
CTAATATGAACCCGTTCTTTGGTTCTGAAGCACCACCGGGGCTTGAGTTTGCTGAGAATGTTGGTGTTCAATACCACAAGTCAGATATTTCGAACTTGAACCTCGACGAT
AGTTTGCAGCCTATAAAGACAAATGAAAGTTCTGCTGCAAGTTCAAGTATAATGTCAGAAGAATGTCATAAGCAGCAGGTGTCTCACGGAAAAATGGTTGGGTCGAACAG
GGTCAGCACCAAAAGAAAGAGGAAGAAATCAACACGCATGAGTCAAAATGAATACCGCAAACGTTGCAACTGCAAGAAATCCGAGTGCTTAAAACTTTATTGTGAATGCT
TTTCGGCTGAGCAATATTGTTCGGATTCTTGTTGCTGTAAAGATTGCCTTAACAGGATTGAATATGAGGACACAGTTTATCTTGTAAGTCAGCAATGTGAATCCAAAAAT
CCCCTTGCTTTTGCTCCAAAGGTCATCAAGGAAACGTCTAATTCTCCAACAAATATTATGGTAGCAGAGAATGAGGTACCTCCATCTTTGGCCAGACATAAAAAAGGATG
CAATTGCAAGAAGTCAATGTGTCAGAAAAAGTATTGCGAATGCTATCAGGCAGAAGTTGGGTGTACTACAGCATGTCAATGTGAAGGTTGTCATAATACTTTCGGCACTA
GAGCAGGATCCAGTAGTAGAGCATGGGGAAACACTTCTGATGAAAAGACAGATGCTCAAGGAATATCATCTGAACCTGGTTACCAACAACTTGCGAATAATTTCTCATCA
ACATATGCAAGTAACCTTTCCTCATCAAATAATCATATTCCTGAAGCTCTCACTTCAAATTTAACCAATAGAAATGACTGTCCTGAAGCTACCTTAGCCCCTTTTCCACC
ATCTGTTTCTGGACCTCCTCACTGGAGCCGCCTCTCCACTGTGATTCCTGCACCACAACCATTCGAGAGTATGGCCTCTTCTAAGCCATCTTTTAATTATCACCTCTATG
GGACGATGACAGAGAATGTGAGGCTTACTCCAACAAATCAGCAATCGGCATCAACTTCGACATCGTACTTTCAACCAGAACGATTGAGCTCAAATCCTTGGTTAGATTTA
AAAAGTAGCCAACAATTGACCTGGCAAACCATGCCTACTTATCCATCTCTTCTTCCATACATGAATTTTAATGGCAGTTCTGCTCAAAATGAAGCTACTCTTGAAAATGG
TGTTTGGAGAATGTTCAGCGATAGCCCTCAAGCCCCAGCCTTGAGTTCAGCCAACTGCAGACTTCGTTCATCTCCTTTGGGACCGTGTAATGCCCGAGCCTGCGCACAAA
TAGGAATTGATGCCGCTCGCCTTGTAGCCTCGTGTGATGCTTCAAATTTGTTCCTTAAGCTCCTGAATAAAGAGTTGAGAAAAGTAGAAGAGAAGGTTGTTGATCTGACA
TGCAGTGCAAGGAACCCTCCGAGTAGAGATACTGGTCGAGTTGGAGCAGTTGGACAAATCCATTTGATCTGTATATTTGAAAGAAATGATGATCAGTATGGGAAGAAAGC
CTGGAGAGTAAGGCTACACACAAGAGGGACAAAGAAGGAGACTTTGTTGGTTAACAAACAAAAATCAGACTCAAGCAATGTGATTGTGGATGGAAAAAGAAATGAGAAGG
AATATGAAAACTGCACATTGCACCAAATGGCCAAGAAACTCAGAAAGGGAGAGGCATCTGTGAAAAGGAGAGGTAAAGACAAGGACAAAAGCAGGAAATCTTTTTATGAA
CACAAAGAATCCCAGCAGGCAGAGCACATTCATCCATCCGCAAGCCTCCCAGCTAGGAGAGAGAGCTCTGGATATGCAAAATAG
Protein sequenceShow/hide protein sequence
MKSSEDIITSPSVLGPSTSDPQDWSYFDFVSNLHPIDELALESFSAMVTERASPPPPPPLSTSPLQDMQKETSFVESNHRDEVPIDDDQFNTHSGDPFNSVMSLLQPGEL
PELVESDFDMKTKDVPEEDDKNEKMRLSEVSVEVDVSLEVEGNRSVEEFLLGLTVAGQTDDGILNELSGFIPTSNMNPFFGSEAPPGLEFAENVGVQYHKSDISNLNLDD
SLQPIKTNESSAASSSIMSEECHKQQVSHGKMVGSNRVSTKRKRKKSTRMSQNEYRKRCNCKKSECLKLYCECFSAEQYCSDSCCCKDCLNRIEYEDTVYLVSQQCESKN
PLAFAPKVIKETSNSPTNIMVAENEVPPSLARHKKGCNCKKSMCQKKYCECYQAEVGCTTACQCEGCHNTFGTRAGSSSRAWGNTSDEKTDAQGISSEPGYQQLANNFSS
TYASNLSSSNNHIPEALTSNLTNRNDCPEATLAPFPPSVSGPPHWSRLSTVIPAPQPFESMASSKPSFNYHLYGTMTENVRLTPTNQQSASTSTSYFQPERLSSNPWLDL
KSSQQLTWQTMPTYPSLLPYMNFNGSSAQNEATLENGVWRMFSDSPQAPALSSANCRLRSSPLGPCNARACAQIGIDAARLVASCDASNLFLKLLNKELRKVEEKVVDLT
CSARNPPSRDTGRVGAVGQIHLICIFERNDDQYGKKAWRVRLHTRGTKKETLLVNKQKSDSSNVIVDGKRNEKEYENCTLHQMAKKLRKGEASVKRRGKDKDKSRKSFYE
HKESQQAEHIHPSASLPARRESSGYAK