; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027396 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027396
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionC2H2-type domain-containing protein
Genome locationtig00153054:1085759..1087519
RNA-Seq ExpressionSgr027396
SyntenySgr027396
Gene Ontology termsGO:0009630 - gravitropism (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily
IPR039288 - Zinc finger protein SHOOT GRAVITROPISM 5-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019893.1 Protein SHOOT GRAVITROPISM 5 [Cucurbita argyrosperma subsp. argyrosperma]4.2e-14479.1Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR-ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC
        PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR E    VRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR-ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC

Query:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLS--RQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLN-HHTVTTH--ANK
        EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQD+CKL    QQSE    Q   CLSRTASSPTP+PSSDTNFSF N +HT+        
Subjt:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLS--RQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLN-HHTVTTH--ANK

Query:  ATQNLDLQLSTASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELA
          QNL+LQLS  STSIHVSVSPKR EN+STHLQLSIGSCNY  EKN+D       DEK  + L   ++KEEA+EQ+RVAMEEK +AEEAR EAK+QIE+A
Subjt:  ATQNLDLQLSTASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELA

Query:  ERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPKI
        E+ELA+AKRMRQ+AQ ELQRALALKEHA+K+INSTILQITCQVCRQ F+  P I
Subjt:  ERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPKI

XP_022924154.1 protein SHOOT GRAVITROPISM 5-like [Cucurbita moschata]2.4e-14479.1Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR-ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC
        PDAEVVSLSPKTLLESD YVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR E    VRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR-ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC

Query:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLS--RQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLN-HHTVTTH--ANK
        EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQD+CKL    QQSE    Q   CLSRTASSPTP+PSSDTNFSF N +HT+        
Subjt:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLS--RQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLN-HHTVTTH--ANK

Query:  ATQNLDLQLSTASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELA
          QNL+LQLS  STSIHVSVSPKRHEN+STHLQLSIGSCNY  EKN+D       DEK  + L   ++KEEA+EQ+RVAMEEK +AEEAR EAK+QIE+A
Subjt:  ATQNLDLQLSTASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELA

Query:  ERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPKI
        E+ELA+AKRMRQ+AQ ELQRALALKEHA+K+INSTILQITCQVCRQ F+  P I
Subjt:  ERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPKI

XP_023001792.1 protein SHOOT GRAVITROPISM 5-like [Cucurbita maxima]4.2e-14478.81Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR-ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC
        PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR E    VRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR-ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC

Query:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLS--RQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLN-HHTVTTH--ANK
        EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQD+CKL    QQSE    Q   CLSRTASSPTP+PSSDTNFSF N +HT+        
Subjt:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLS--RQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLN-HHTVTTH--ANK

Query:  ATQNLDLQLSTASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELA
          QNL+LQLS  STSIH+SVSPKR EN+STHLQLSIGSCNY  EKN+D       DEK  + L  +++KEEA+EQ+RVAMEEK +AEEAR EAK+QIE+A
Subjt:  ATQNLDLQLSTASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELA

Query:  ERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPKI
        E+ELA+AKRMRQ+AQ ELQRALALKEHA+K+INSTILQITCQVCRQ F+  P I
Subjt:  ERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPKI

XP_023520059.1 protein SHOOT GRAVITROPISM 5-like [Cucurbita pepo subsp. pepo]4.2e-14479.1Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR-ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC
        PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR E    VRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR-ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC

Query:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLS--RQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLN-HHTVTTH--ANK
        EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQD+CKL    QQSE    Q   CLSRTASSPTP+PSSDTNFSF N +HT+        
Subjt:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLS--RQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLN-HHTVTTH--ANK

Query:  ATQNLDLQLSTASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELA
          QNL+LQLS  STSIHVSVSPKR EN+STHLQLSIGSCNY  EKN+D       DEK  + L   ++KEEA+EQ+RVAMEEK +AEEAR EAK+QIE+A
Subjt:  ATQNLDLQLSTASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELA

Query:  ERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPKI
        E+ELA+AKRMRQ+AQ ELQRALALKEHA+K+INSTILQITCQVCRQ F+  P I
Subjt:  ERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPKI

XP_038895030.1 zinc finger protein SHOOT GRAVITROPISM 5-like [Benincasa hispida]1.9e-14978.28Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR-ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC
        PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR ET  VVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR-ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC

Query:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLN------------HHT
        EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQD CKL   Q +  S Q   CLSRTASSPTP+PSSDTNFSFLN            HH 
Subjt:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLN------------HHT

Query:  VTTHANKAT------------QNLDLQLSTASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAME
           ++ K T            QNLDLQLST STSIHVSVSPKRHEN+STHLQLSIGSCNY  EKN D  +  + ++ +   L   RLKEEA+EQ+RVAME
Subjt:  VTTHANKAT------------QNLDLQLSTASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAME

Query:  EKTVAEEARTEAKRQIELAERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPKI
        EK VAEEAR EAK+QIELAE+EL +AKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQ F  LPKI
Subjt:  EKTVAEEARTEAKRQIELAERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPKI

TrEMBL top hitse value%identityAlignment
A0A0A0LQY7 C2H2-type domain-containing protein5.9e-14473.06Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR-ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC
        PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR ET  VVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR-ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC

Query:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLN---------------
        EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQD CKL     +  S Q   CLSRTASSPTP+PSSDTNFSFLN               
Subjt:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLN---------------

Query:  --HHTVTTHANKAT-----------QNLDLQLST-ASTSIHVSVSPKRHENFSTHLQLSIGSCNYS--------CEKNNDQTTMISADEKNSSALALWRL
          +H    ++ K T           QNLDLQLST ++TSIHVSVSPKRHEN+STHLQLSIGSCNY            NN        D+ +   L   +L
Subjt:  --HHTVTTHANKAT-----------QNLDLQLST-ASTSIHVSVSPKRHENFSTHLQLSIGSCNYS--------CEKNNDQTTMISADEKNSSALALWRL

Query:  KEEAREQLRVAMEEKTVAEEARTEAKRQIELAERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPKI
        KEEA+EQ+R+AMEEK +AEEAR EAK+QIE+AE+EL +AKRMRQQAQAELQRAL LKEHAIKKINSTILQITCQVCRQ F   PKI
Subjt:  KEEAREQLRVAMEEKTVAEEARTEAKRQIELAERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPKI

A0A6J1EE04 protein SHOOT GRAVITROPISM 5-like1.2e-14479.1Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR-ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC
        PDAEVVSLSPKTLLESD YVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR E    VRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR-ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC

Query:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLS--RQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLN-HHTVTTH--ANK
        EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQD+CKL    QQSE    Q   CLSRTASSPTP+PSSDTNFSF N +HT+        
Subjt:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLS--RQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLN-HHTVTTH--ANK

Query:  ATQNLDLQLSTASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELA
          QNL+LQLS  STSIHVSVSPKRHEN+STHLQLSIGSCNY  EKN+D       DEK  + L   ++KEEA+EQ+RVAMEEK +AEEAR EAK+QIE+A
Subjt:  ATQNLDLQLSTASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELA

Query:  ERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPKI
        E+ELA+AKRMRQ+AQ ELQRALALKEHA+K+INSTILQITCQVCRQ F+  P I
Subjt:  ERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPKI

A0A6J1GTT9 protein SHOOT GRAVITROPISM 5-like1.7e-14377.99Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR---ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQW
        PDAEVVSLSPKTLLESD YVCEIC+QGFQRDQNLQMHRRRHKVPWKLLKR    T    RKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQW
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR---ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQW

Query:  VCEKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQ------SELPSLQPPACLSRTASSPTPTPSSDTNFSFLNH-HTVTT
        VCEKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQD+CKL   Q      S  P   PP CLSRTASSPTPTPSSDTNFSFLN  HT  T
Subjt:  VCEKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQ------SELPSLQPPACLSRTASSPTPTPSSDTNFSFLNH-HTVTT

Query:  HANKAT--QNLDLQLSTASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAK
             T  QNLDLQLST+S SI VSVSPK+HE +STHLQLSIGSCNY  +        ++ DE+  + L L +LKEEA E++RVA EEK VAEEAR EAK
Subjt:  HANKAT--QNLDLQLSTASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAK

Query:  RQIELAERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPK
        +QIELAE+ELA+AKRMRQQAQAELQRALALKEHAI KINSTILQITCQVCRQ F  L K
Subjt:  RQIELAERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPK

A0A6J1K323 protein SHOOT GRAVITROPISM 5-like3.4e-14477.22Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR----ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQ
        PDAEVVSLSPKTLLESD YVCEIC+QGFQRDQNLQMHRRRHKVPWKLLKR     T    RKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQ
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR----ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQ

Query:  WVCEKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQ------SELPSLQPPACLSRTASSPTPTPSSDTNFSFLNH-HTVT
        WVCEKC+KGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQD+CKL   Q      S  P   PP CLSRTASSPTPTPSSDTNFSFLN  HT  
Subjt:  WVCEKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQ------SELPSLQPPACLSRTASSPTPTPSSDTNFSFLNH-HTVT

Query:  THANKAT--QNLDLQLSTASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEA
        T     T  QNLDLQLST+S SI VSVSPK+HEN++THLQLSIGSCNY  +        ++ DE+  + L+L +LKEEA++++RVA EEK VAEEAR EA
Subjt:  THANKAT--QNLDLQLSTASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEA

Query:  KRQIELAERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPK
        K+QIELAE+ELA+AKRMRQQAQAELQRALALKEHAI KINSTILQITCQVCRQ F  L K
Subjt:  KRQIELAERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPK

A0A6J1KHM5 protein SHOOT GRAVITROPISM 5-like2.0e-14478.81Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR-ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC
        PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR E    VRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR-ETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC

Query:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLS--RQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLN-HHTVTTH--ANK
        EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQD+CKL    QQSE    Q   CLSRTASSPTP+PSSDTNFSF N +HT+        
Subjt:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLS--RQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLN-HHTVTTH--ANK

Query:  ATQNLDLQLSTASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELA
          QNL+LQLS  STSIH+SVSPKR EN+STHLQLSIGSCNY  EKN+D       DEK  + L  +++KEEA+EQ+RVAMEEK +AEEAR EAK+QIE+A
Subjt:  ATQNLDLQLSTASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELA

Query:  ERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPKI
        E+ELA+AKRMRQ+AQ ELQRALALKEHA+K+INSTILQITCQVCRQ F+  P I
Subjt:  ERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLPKI

SwissProt top hitse value%identityAlignment
F4IPE3 Zinc finger protein SHOOT GRAVITROPISM 51.3e-10357.29Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVCE
        PDAEVVSLSP+TLLESDRY+CEICNQGFQRDQNLQMHRRRHKVPWKLLKR+    V+KRV+VCPEP+CLHHNP HALGDLVGIKKHFRRKHSNHKQWVCE
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVCE

Query:  KCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELP-----SLQPPACLSRTASSPTPTPSSDTNF---------------
        +CSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQD C   R   E P     ++  PAC SRTAS+   TPSS+TN+               
Subjt:  KCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELP-----SLQPPACLSRTASSPTPTPSSDTNF---------------

Query:  ---SFLNHHTVTTHANKATQNLDLQLSTASTSIHVS--------VSPKRHENF---STHLQLSIG-SCNYSCEKNNDQTTMISADEKNSSALALWRLKEE
             ++   +T  +N    NL+LQL   S++ + +          P  H N    +T+L LSI  S +Y    N D+   I A E              
Subjt:  ---SFLNHHTVTTHANKATQNLDLQLSTASTSIHVS--------VSPKRHENF---STHLQLSIG-SCNYSCEKNNDQTTMISADEKNSSALALWRLKEE

Query:  AREQLRVAMEEKTVAEEARTEAKRQIELAERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRF
          + +++AM+EK  AEEA+ EAKRQ E+AE E A+AK++RQ+AQAEL+RA  LKE ++KKI+STI+Q+TCQ C+ +F
Subjt:  AREQLRVAMEEKTVAEEARTEAKRQIELAERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRF

O22759 Protein indeterminate-domain 122.4e-5456.45Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVCE
        PDAEV++LSPKTLL ++R+VCEICN+GFQRDQNLQ+HRR H +PWKL ++ T    +K+V+VCPE +C HH+P+ ALGDL GIKKHF RKH   K+W CE
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVCE

Query:  KCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLNHH
        KCSK YAVQSD+KAH K CGTR + CDCG +FSR ++FI H+  C    ++S        A L  T+SS    P    N +F  HH
Subjt:  KCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLNHH

Q8H1F5 Protein indeterminate-domain 73.8e-5557.65Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVCE
        P+AEV++LSPKTL+ ++R++CE+CN+GFQRDQNLQ+H+R H +PWKL +R    VVRK+V+VCPEP C+HH+P+ ALGDL GIKKHF RKH   K+W CE
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVCE

Query:  KCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELPSLQPPACLSRTASSP
        KCSK YAVQSD+KAH KTCGT+ + CDCG +FSR +SFI H+  C    ++S      P   + + ++SP
Subjt:  KCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELPSLQPPACLSRTASSP

Q9C9X7 Protein indeterminate-domain 143.1e-9456.57Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVCE
        P+AEVVSLSP+TLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRET   VRKRV+VCPEP+CLHHNP HALGDLVGIKKHFRRKHSNHKQW+CE
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVCE

Query:  KCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELPSL----QPPACLSRTASSPTPTPSSDTNFS-FLNHHTVTTHANKA
        +CSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTC + R Q     L    Q     ++TAS+     + D +    L  H +    +  
Subjt:  KCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELPSL----QPPACLSRTASSPTPTPSSDTNFS-FLNHHTVTTHANKA

Query:  TQNLDLQL---STASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIE
        ++     L      + SI + + P R+    T L LSIG+         DQ TM   ++K             + E+   ++E     EEAR E KRQIE
Subjt:  TQNLDLQL---STASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIE

Query:  LAERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRF
        +AE E A AKR+RQ A+AEL +A   +E A ++I++T++QITC  C+Q F
Subjt:  LAERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRF

Q9FRH4 Protein indeterminate-domain 163.3e-9156.9Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETA-AVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC
        PDAEVVSLSP+TLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR+     VRKRV+VCPEP+CLHH+P HALGDLVGIKKHFRRKHS HKQWVC
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETA-AVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC

Query:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLNHHTVTTHANKATQNL
        E+CSKGYAVQSDYKAHLKTCG+RGHSCDCGRVFSRVESFIEHQDTC + + Q           +   A S T   +S  +F  L H        + +   
Subjt:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLNHHTVTTHANKATQNL

Query:  DLQLSTA-STSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELAEREL
          Q S A +   + S +P      S  LQLSIG    S +  +        +EK  ++L   R  EEAR+           AEE R EAKRQIE+AE++ 
Subjt:  DLQLSTA-STSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELAEREL

Query:  ASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLP
          AKR+R++A+ EL++A  ++E AIK+IN+T+++ITC  C+Q F QLP
Subjt:  ASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLP

Arabidopsis top hitse value%identityAlignment
AT1G25250.1 indeterminate(ID)-domain 162.3e-9256.9Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETA-AVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC
        PDAEVVSLSP+TLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR+     VRKRV+VCPEP+CLHH+P HALGDLVGIKKHFRRKHS HKQWVC
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETA-AVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVC

Query:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLNHHTVTTHANKATQNL
        E+CSKGYAVQSDYKAHLKTCG+RGHSCDCGRVFSRVESFIEHQDTC + + Q           +   A S T   +S  +F  L H        + +   
Subjt:  EKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLNHHTVTTHANKATQNL

Query:  DLQLSTA-STSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELAEREL
          Q S A +   + S +P      S  LQLSIG    S +  +        +EK  ++L   R  EEAR+           AEE R EAKRQIE+AE++ 
Subjt:  DLQLSTA-STSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELAEREL

Query:  ASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLP
          AKR+R++A+ EL++A  ++E AIK+IN+T+++ITC  C+Q F QLP
Subjt:  ASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRFDQLP

AT1G68130.1 indeterminate(ID)-domain 142.2e-9556.57Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVCE
        P+AEVVSLSP+TLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRET   VRKRV+VCPEP+CLHHNP HALGDLVGIKKHFRRKHSNHKQW+CE
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVCE

Query:  KCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELPSL----QPPACLSRTASSPTPTPSSDTNFS-FLNHHTVTTHANKA
        +CSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTC + R Q     L    Q     ++TAS+     + D +    L  H +    +  
Subjt:  KCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELPSL----QPPACLSRTASSPTPTPSSDTNFS-FLNHHTVTTHANKA

Query:  TQNLDLQL---STASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIE
        ++     L      + SI + + P R+    T L LSIG+         DQ TM   ++K             + E+   ++E     EEAR E KRQIE
Subjt:  TQNLDLQL---STASTSIHVSVSPKRHENFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIE

Query:  LAERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRF
        +AE E A AKR+RQ A+AEL +A   +E A ++I++T++QITC  C+Q F
Subjt:  LAERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRF

AT2G01940.1 C2H2-like zinc finger protein9.1e-10557.29Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVCE
        PDAEVVSLSP+TLLESDRY+CEICNQGFQRDQNLQMHRRRHKVPWKLLKR+    V+KRV+VCPEP+CLHHNP HALGDLVGIKKHFRRKHSNHKQWVCE
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVCE

Query:  KCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELP-----SLQPPACLSRTASSPTPTPSSDTNF---------------
        +CSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQD C   R   E P     ++  PAC SRTAS+   TPSS+TN+               
Subjt:  KCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELP-----SLQPPACLSRTASSPTPTPSSDTNF---------------

Query:  ---SFLNHHTVTTHANKATQNLDLQLSTASTSIHVS--------VSPKRHENF---STHLQLSIG-SCNYSCEKNNDQTTMISADEKNSSALALWRLKEE
             ++   +T  +N    NL+LQL   S++ + +          P  H N    +T+L LSI  S +Y    N D+   I A E              
Subjt:  ---SFLNHHTVTTHANKATQNLDLQLSTASTSIHVS--------VSPKRHENF---STHLQLSIG-SCNYSCEKNNDQTTMISADEKNSSALALWRLKEE

Query:  AREQLRVAMEEKTVAEEARTEAKRQIELAERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRF
          + +++AM+EK  AEEA+ EAKRQ E+AE E A+AK++RQ+AQAEL+RA  LKE ++KKI+STI+Q+TCQ C+ +F
Subjt:  AREQLRVAMEEKTVAEEARTEAKRQIELAERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRF

AT2G01940.2 C2H2-like zinc finger protein8.8e-8453.51Show/hide
Query:  MHRRRHKVPWKLLKRETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVCEKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRV
        MHRRRHKVPWKLLKR+    V+KRV+VCPEP+CLHHNP HALGDLVGIKKHFRRKHSNHKQWVCE+CSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRV
Subjt:  MHRRRHKVPWKLLKRETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVCEKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRV

Query:  ESFIEHQDTCKLSRQQSELP-----SLQPPACLSRTASSPTPTPSSDTNF------------------SFLNHHTVTTHANKATQNLDLQLSTASTSIHV
        ESFIEHQD C   R   E P     ++  PAC SRTAS+   TPSS+TN+                    ++   +T  +N    NL+LQL   S++ + 
Subjt:  ESFIEHQDTCKLSRQQSELP-----SLQPPACLSRTASSPTPTPSSDTNF------------------SFLNHHTVTTHANKATQNLDLQLSTASTSIHV

Query:  S--------VSPKRHENF---STHLQLSIG-SCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELAERELAS
        +          P  H N    +T+L LSI  S +Y    N D+   I A E                + +++AM+EK  AEEA+ EAKRQ E+AE E A+
Subjt:  S--------VSPKRHENF---STHLQLSIG-SCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELAERELAS

Query:  AKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRF
        AK++RQ+AQAEL+RA  LKE ++KKI+STI+Q+TCQ C+ +F
Subjt:  AKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRF

AT2G01940.3 C2H2-like zinc finger protein6.1e-10156.35Show/hide
Query:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVCE
        PDAEVVSLSP+TLLESDRY+CEICNQGFQRDQNLQMHRRRHKVPWKLLKR+    V+KRV+VCPEP+CLHHNP HALGDLVGIKKHFRRKHSNHKQWVCE
Subjt:  PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVCE

Query:  KCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFS-RVESFIEHQDTCKLSRQQSELP-----SLQPPACLSRTASSPTPTPSSDTNF--------------
        +CSKGYAVQSDYKAHLKTCGTRGHSCDCG   S RVESFIEHQD C   R   E P     ++  PAC SRTAS+   TPSS+TN+              
Subjt:  KCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFS-RVESFIEHQDTCKLSRQQSELP-----SLQPPACLSRTASSPTPTPSSDTNF--------------

Query:  ----SFLNHHTVTTHANKATQNLDLQLSTASTSIHVS--------VSPKRHENF---STHLQLSIG-SCNYSCEKNNDQTTMISADEKNSSALALWRLKE
              ++   +T  +N    NL+LQL   S++ + +          P  H N    +T+L LSI  S +Y    N D+   I A E             
Subjt:  ----SFLNHHTVTTHANKATQNLDLQLSTASTSIHVS--------VSPKRHENF---STHLQLSIG-SCNYSCEKNNDQTTMISADEKNSSALALWRLKE

Query:  EAREQLRVAMEEKTVAEEARTEAKRQIELAERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRF
           + +++AM+EK  AEEA+ EAKRQ E+AE E A+AK++RQ+AQAEL+RA  LKE ++KKI+STI+Q+TCQ C+ +F
Subjt:  EAREQLRVAMEEKTVAEEARTEAKRQIELAERELASAKRMRQQAQAELQRALALKEHAIKKINSTILQITCQVCRQRF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ACCCAGATGCAGAGGTGGTGTCTCTCTCTCCCAAAACCCTATTGGAATCGGATCGTTACGTTTGTGAGATCTGCAACCAGGGATTCCAGAGAGATCAGAATCTGCAGATG
CACCGCCGGCGGCACAAGGTGCCGTGGAAGCTGTTGAAGAGAGAGACGGCGGCGGTGGTGAGAAAAAGAGTGTTCGTTTGCCCGGAGCCGAGCTGCTTACACCACAACCC
TACGCATGCTCTCGGCGATTTGGTCGGAATAAAGAAGCACTTCAGAAGAAAACACAGTAACCACAAGCAATGGGTGTGTGAAAAATGCTCTAAAGGCTACGCAGTTCAGT
CTGATTATAAAGCCCATCTCAAAACCTGCGGAACCAGAGGCCATTCCTGCGACTGTGGTCGAGTTTTCTCCAGGGTGGAAAGTTTCATAGAACACCAAGATACTTGCAAA
CTGAGTCGTCAACAATCAGAACTGCCATCGTTGCAGCCACCAGCTTGCTTGTCTCGAACAGCTTCGAGTCCCACTCCGACCCCTTCTTCTGATACAAACTTCAGCTTCTT
AAATCACCACACTGTTACCACTCATGCAAACAAAGCAACTCAAAATTTGGACCTTCAACTTTCCACCGCTTCAACTTCCATTCATGTCTCGGTTTCTCCAAAACGACACG
AAAACTTCTCGACCCATTTGCAGCTTTCGATTGGTTCTTGTAATTACAGCTGCGAGAAAAATAATGATCAGACGACGATGATCAGTGCTGATGAGAAAAACAGTAGTGCG
TTGGCATTGTGGAGGCTGAAAGAGGAAGCTCGAGAGCAGCTGAGGGTGGCCATGGAGGAGAAGACAGTTGCAGAAGAAGCAAGAACAGAAGCGAAGCGGCAGATTGAACT
GGCAGAGAGAGAGCTGGCGAGTGCCAAGAGAATGAGACAGCAAGCACAAGCTGAGTTACAAAGAGCTCTTGCTTTGAAAGAACATGCCATCAAGAAAATCAACTCCACCA
TTCTCCAAATCACTTGTCAAGTCTGCAGACAGCGTTTTGATCAATTACCAAAAATAAATTTGAATTAG
mRNA sequenceShow/hide mRNA sequence
ACCCAGATGCAGAGGTGGTGTCTCTCTCTCCCAAAACCCTATTGGAATCGGATCGTTACGTTTGTGAGATCTGCAACCAGGGATTCCAGAGAGATCAGAATCTGCAGATG
CACCGCCGGCGGCACAAGGTGCCGTGGAAGCTGTTGAAGAGAGAGACGGCGGCGGTGGTGAGAAAAAGAGTGTTCGTTTGCCCGGAGCCGAGCTGCTTACACCACAACCC
TACGCATGCTCTCGGCGATTTGGTCGGAATAAAGAAGCACTTCAGAAGAAAACACAGTAACCACAAGCAATGGGTGTGTGAAAAATGCTCTAAAGGCTACGCAGTTCAGT
CTGATTATAAAGCCCATCTCAAAACCTGCGGAACCAGAGGCCATTCCTGCGACTGTGGTCGAGTTTTCTCCAGGGTGGAAAGTTTCATAGAACACCAAGATACTTGCAAA
CTGAGTCGTCAACAATCAGAACTGCCATCGTTGCAGCCACCAGCTTGCTTGTCTCGAACAGCTTCGAGTCCCACTCCGACCCCTTCTTCTGATACAAACTTCAGCTTCTT
AAATCACCACACTGTTACCACTCATGCAAACAAAGCAACTCAAAATTTGGACCTTCAACTTTCCACCGCTTCAACTTCCATTCATGTCTCGGTTTCTCCAAAACGACACG
AAAACTTCTCGACCCATTTGCAGCTTTCGATTGGTTCTTGTAATTACAGCTGCGAGAAAAATAATGATCAGACGACGATGATCAGTGCTGATGAGAAAAACAGTAGTGCG
TTGGCATTGTGGAGGCTGAAAGAGGAAGCTCGAGAGCAGCTGAGGGTGGCCATGGAGGAGAAGACAGTTGCAGAAGAAGCAAGAACAGAAGCGAAGCGGCAGATTGAACT
GGCAGAGAGAGAGCTGGCGAGTGCCAAGAGAATGAGACAGCAAGCACAAGCTGAGTTACAAAGAGCTCTTGCTTTGAAAGAACATGCCATCAAGAAAATCAACTCCACCA
TTCTCCAAATCACTTGTCAAGTCTGCAGACAGCGTTTTGATCAATTACCAAAAATAAATTTGAATTAG
Protein sequenceShow/hide protein sequence
PDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETAAVVRKRVFVCPEPSCLHHNPTHALGDLVGIKKHFRRKHSNHKQWVCEKCSKGYAVQS
DYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDTCKLSRQQSELPSLQPPACLSRTASSPTPTPSSDTNFSFLNHHTVTTHANKATQNLDLQLSTASTSIHVSVSPKRHE
NFSTHLQLSIGSCNYSCEKNNDQTTMISADEKNSSALALWRLKEEAREQLRVAMEEKTVAEEARTEAKRQIELAERELASAKRMRQQAQAELQRALALKEHAIKKINSTI
LQITCQVCRQRFDQLPKINLN