; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG01G016540 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG01G016540
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionB3 domain-containing transcription factor VRN1-like isoform X2
Genome locationCG_Chr01:30920979..30950824
RNA-Seq ExpressionClCG01G016540
SyntenyClCG01G016540
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
InterPro domainsIPR003340 - B3 DNA binding domain
IPR015300 - DNA-binding pseudobarrel domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8647490.1 hypothetical protein Csa_003352 [Cucumis sativus]7.1e-23963.51Show/hide
Query:  EMMIPKIFVKDYGKLLSSFVFLKLPDGMEWKVGLTTAANGAVWLQKGWDKFLEHYCLEFGSLLVFRLLDGRKSSSFDVTIFDPTGVETQYSCNVNNCTPD
        ++MIP++F+K YGKLLSS V LKLPDG EWK+GLTT+ NGAVWL+KGWDKF EHYCLE+G LLVF+LL+ R +SSF V IF+ T +ET+YS NV + T +
Subjt:  EMMIPKIFVKDYGKLLSSFVFLKLPDGMEWKVGLTTAANGAVWLQKGWDKFLEHYCLEFGSLLVFRLLDGRKSSSFDVTIFDPTGVETQYSCNVNNCTPD

Query:  SDCD-ESLEGFEGLLKKRKKASVPCFRSCKKTRKQDSFSIKVEPVEEEGCRNFSDIPNCKKKMPKEDEVWISNKGQVLKNKGESTEKPGFKVVMSQSNVG
         D D +  E F    KKR KASVP  R  KKTRK+D FSIK EP E E C  FSDIP        ++EV IS + + LKN+GES EK GFKVVMSQSNVG
Subjt:  SDCD-ESLEGFEGLLKKRKKASVPCFRSCKKTRKQDSFSIKVEPVEEEGCRNFSDIPNCKKKMPKEDEVWISNKGQVLKNKGESTEKPGFKVVMSQSNVG

Query:  GRFNMAIPEKFARKYLCEEFGSISLQTTNGRKWAVLYKWSRNKDEKVAYFCSGWRVFVQENLLKAGDVVFFEPIKKHRFLFTKLQDTAPFSSPSPNNKIA
        GRFN+ IP++FA KYL +E GSIS+QT NG+KW++LYKWS + DE VAY   GWR FV+ENLLK GDVVFFE IKK +FLFTKLQ+     S SP NK A
Subjt:  GRFNMAIPEKFARKYLCEEFGSISLQTTNGRKWAVLYKWSRNKDEKVAYFCSGWRVFVQENLLKAGDVVFFEPIKKHRFLFTKLQDTAPFSSPSPNNKIA

Query:  STRNPFFKVKIHLKSYGNAVLNIPMGFAKKHLSPEMYYAKLQVRNKEWNVTLKQYESHSRFSAGWSKFYHENGLRDGNTCLFEMMLPKKFITDYGKFLSN
        ST NPFF+V+IH KSYGN VLNIP+GFA KH SPEM++AKLQV NKEW V LKQY +H RFSAGWSKFY EN LRDG TCLFEMM+PKKFITDYGKFLSN
Subjt:  STRNPFFKVKIHLKSYGNAVLNIPMGFAKKHLSPEMYYAKLQVRNKEWNVTLKQYESHSRFSAGWSKFYHENGLRDGNTCLFEMMLPKKFITDYGKFLSN

Query:  SICLKLPDGLEWKLGSKTA-NDTVWLQNGWQQFSNHYRLKPGSLLVFRFDGNSTFQTCIFDQTCLEIQYPSNNIGKTKPDDEEFNGYRYEEVETNNHEIN
         ICLK PDGLEWK+ S T  N TVWLQNGWQ+FSNHY LK GSLLVFR DGNSTF T IF+Q C EIQY SN IG  + + EE+   R +  ET      
Subjt:  SICLKLPDGLEWKLGSKTA-NDTVWLQNGWQQFSNHYRLKPGSLLVFRFDGNSTFQTCIFDQTCLEIQYPSNNIGKTKPDDEEFNGYRYEEVETNNHEIN

Query:  DLKPEKIGFKIVVKKSTVEGRYNMLIPKHFASKHLKEEFGRIEIENSDGESWAMSYKWSQSRNVAEYVYIS-----------------------------
        + KPEKIGFKIVVKKS +EGRYNML+PK+FA +HL EEFG+IEIENSDG  W M YKWSQSR V  + YIS                             
Subjt:  DLKPEKIGFKIVVKKSTVEGRYNMLIPKHFASKHLKEEFGRIEIENSDGESWAMSYKWSQSRNVAEYVYIS-----------------------------

Query:  ---RTTFSP--PISAKNTNVKITTPNNNLFFKVNIHKKSYKNSVLNIPLTFAENHLSSKMNIAKLIVGKKQWKVKMKHYERCISSNEAASSLEFFK
               SP  PI A N NV+ TTPN + FFKVNIH KSYKN VLNIPLTFA NHLSS M+ AKL+VGKKQW VK+KHYERCI    +    EFFK
Subjt:  ---RTTFSP--PISAKNTNVKITTPNNNLFFKVNIHKKSYKNSVLNIPLTFAENHLSSKMNIAKLIVGKKQWKVKMKHYERCISSNEAASSLEFFK

TYK12921.1 B3 domain-containing transcription factor VRN1-like isoform X2 [Cucumis melo var. makuwa]7.0e-24767.53Show/hide
Query:  EMMIPKIFVKDYGKLLSSFVFLKLPDGMEWKVGLTTAANGAVWLQKGWDKFLEHYCLEFGSLLVFRLLDGRKSSSFDVTIFDPTGVETQYSCNVNNCTP-
        ++MIP++F+KDYGK LS+FV L+LPDG EWKVGLT   NG+VWLQ+GWDKF EHYCLE+G LLVF+LLD R +SSF V IFD T +ETQYS NVN+ T  
Subjt:  EMMIPKIFVKDYGKLLSSFVFLKLPDGMEWKVGLTTAANGAVWLQKGWDKFLEHYCLEFGSLLVFRLLDGRKSSSFDVTIFDPTGVETQYSCNVNNCTP-

Query:  -DSDCDESLEGFEGLLKKRKKASVPCFRSCKKTRKQDSFSIKVEPVEEEGCRNFSDIPNCKKKMPKEDEVWISNKGQVLKNKGESTEKPGFKVVMSQSNV
         DSD D S E      KKRKK S+P  R  KKTR++D F IK EP EEEGC  FSDIP        ++EV IS + + LKN+GES EK GFKVVMS+SNV
Subjt:  -DSDCDESLEGFEGLLKKRKKASVPCFRSCKKTRKQDSFSIKVEPVEEEGCRNFSDIPNCKKKMPKEDEVWISNKGQVLKNKGESTEKPGFKVVMSQSNV

Query:  GGRFNMAIPEKFARKYLCEEFGSISLQTTNGRKWAVLYKWSRNKDEKVAYFCSGWRVFVQENLLKAGDVVFFEPIKKHRFLFTKLQDTAPFSSPSPNNKI
        GGRFN+ IP++FARKYL +E GSIS+QT NG+KW++LY+W+   DE VAY   GWR FV+ENLLK GDVVFFE IKK +FLFTKL++     S SP  KI
Subjt:  GGRFNMAIPEKFARKYLCEEFGSISLQTTNGRKWAVLYKWSRNKDEKVAYFCSGWRVFVQENLLKAGDVVFFEPIKKHRFLFTKLQDTAPFSSPSPNNKI

Query:  ASTRNPFFKVKIHLKSYGNAVLNIPMGFAKKHLSPEMYYAKLQVRNKEWNVTLKQYESHSRFSAGWSKFYHENGLRDGNTCLFEMMLPKKFITDYGKFLS
        +ST NPFFKV+IH KSYGNA LNIP+GFAK+H SPEM+YAKLQVRNKEW VTLKQY +H R SAGWSKFYHEN LRDGNTCLFEMM+PKKFITDYGKFLS
Subjt:  ASTRNPFFKVKIHLKSYGNAVLNIPMGFAKKHLSPEMYYAKLQVRNKEWNVTLKQYESHSRFSAGWSKFYHENGLRDGNTCLFEMMLPKKFITDYGKFLS

Query:  NSICLKLPDGLEWKLGSKTA-NDTVWLQNGWQQFSNHYRLKPGSLLVFRFDGNSTFQTCIFDQTCLEIQYPSNNIGKTKPDDEEFNGYRYEEVETNNHEI
        N ICLK PDG+EWK+ S T  N TVWL+NGWQ+FSNHY LK GSLLVFR DGNSTF T IF+Q C EIQY SN++G  +P+ EE++            E 
Subjt:  NSICLKLPDGLEWKLGSKTA-NDTVWLQNGWQQFSNHYRLKPGSLLVFRFDGNSTFQTCIFDQTCLEIQYPSNNIGKTKPDDEEFNGYRYEEVETNNHEI

Query:  NDLKPEKIGFKIVVKKSTVEGRYNMLIPKHFASKHLKEEFGRIEIENSDGESWAMSYKWSQSRNVAEYVYISRTTFSP---PISAKNTNVKITTPNNNLF
         + KPEKIGFKI VKKS +EGRYNMLIPK FA +HL EEFG+IEIENSDG SW M YKWSQSR V E+ YIS     P   PI A N NV+ TTP+ + F
Subjt:  NDLKPEKIGFKIVVKKSTVEGRYNMLIPKHFASKHLKEEFGRIEIENSDGESWAMSYKWSQSRNVAEYVYISRTTFSP---PISAKNTNVKITTPNNNLF

Query:  FKVNIHKKSYKNSVLNIPLTFAENHLSSKMNIAKLIVGKKQWKVKMKHYERCI
        FKVNIH KSYKNSVLNIPLTFA+NHLSSKM+ AKL+VGKKQWKVK+KHYERCI
Subjt:  FKVNIHKKSYKNSVLNIPLTFAENHLSSKMNIAKLIVGKKQWKVKMKHYERCI

XP_038883715.1 uncharacterized protein LOC120074618 isoform X1 [Benincasa hispida]5.8e-23352.08Show/hide
Query:  SSNEAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFG
        SS EA SSLEFFKVFLPD  SLHMSIPPAFMKHLNGTFPEKATIQDHTGKSW ITLEKLDDLLYFK GWQ FVDYH LKYGDFLVFQYDGH TFDV IFG
Subjt:  SSNEAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFG

Query:  KNGCKKAVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNSVSNLEAVAADAGNSV
        KNGCKKAV AK  SSVPILEAEIAEAGNSVSNLEA+VADA              GNS+SN EAIV +AGNSVSNLEA+ A A NSVSN  A+ ADA  SV
Subjt:  KNGCKKAVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNSVSNLEAVAADAGNSV

Query:  SNLEVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGLFFERTMKRWSRQTIYISGRVVRD
        SNLEVVV DAG+SVPI KVK+EPVVEEEDVEPSI  KRKRLQ GS+ VRKSKSIVASN GR  NASNSV QV PRG FFER MK WS QTI         
Subjt:  SNLEVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGLFFERTMKRWSRQTIYISGRVVRD

Query:  ENISLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVLERGNVEEPFHYMPFFGIENFRIEPFEIIPVRINPEVRKYI-E
                                                                      V+E  HYMPFFGIENFRIEPFEIIPVRI P+V KYI E
Subjt:  ENISLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVLERGNVEEPFHYMPFFGIENFRIEPFEIIPVRINPEVRKYI-E

Query:  YSQFQEHSREMTCSQFHSSSANEEPKYIQFEHEGVDSQQHDQYFQDD-DLQEDILSEGVDMDITNELPISQSQEILYLEYQPPQTDKEDNWKSANTDTTE
        YSQFQEHSR M CSQFHS+SA++EPKYIQF+HE VDSQQ+DQYFQ+D D Q D  S GVDMD+ NELPISQSQEILYLEYQP QTDKEDN KSAN +TTE
Subjt:  YSQFQEHSREMTCSQFHSSSANEEPKYIQFEHEGVDSQQHDQYFQDD-DLQEDILSEGVDMDITNELPISQSQEILYLEYQPPQTDKEDNWKSANTDTTE

Query:  LGGNSFDIKENNMDTTTELRENSF-GIEENNMDITEQLGGNLFDIEENNIKQEKQSPVTVKATRKMKKRKSRETTSFEVQEQNEETSEIDTDQDSRRNV-
        +GG SF+I+E N ++TTEL  NS    EENNM  TE LGGN FDIEENNIK+EK+SP TV+ATRK KKRKSRETTSFEVQE NEETSEIDTDQDSRRNV 
Subjt:  LGGNSFDIKENNMDTTTELRENSF-GIEENNMDITEQLGGNLFDIEENNIKQEKQSPVTVKATRKMKKRKSRETTSFEVQEQNEETSEIDTDQDSRRNV-

Query:  -------------EDDGKRKTRSKRVKKSRISGTPSEHDDEVDVYK------------------------------------------------------
                     +  GKRK R+KR KKS IS TPSEHDD+VDVYK                                                      
Subjt:  -------------EDDGKRKTRSKRVKKSRISGTPSEHDDEVDVYK------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------FDIQPLIATKTEMEMPYMIPFGGVKPSKEKGKSPVDQEHNSDARTSYNNDYCNMKGPQSVSNDGVKNFLFTKIVNIEEI
                             FD+QPLIATKTEMEMPYMIPFGGVKPS+EK KS +DQEHNSDARTSYN  YCN KGP SV NDGV NFLFTKIVNIEEI
Subjt:  ---------------------FDIQPLIATKTEMEMPYMIPFGGVKPSKEKGKSPVDQEHNSDARTSYNNDYCNMKGPQSVSNDGVKNFLFTKIVNIEEI

Query:  LGSLVHDIDNLKNLFSKVCENVNEAADPEKMRE
        LGSLVHDIDNLKNLFSK+CENVNEA DPEKMR+
Subjt:  LGSLVHDIDNLKNLFSKVCENVNEAADPEKMRE

XP_038883716.1 uncharacterized protein LOC120074618 isoform X2 [Benincasa hispida]4.4e-23352.13Show/hide
Query:  SSNEAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFG
        SS EA SSLEFFKVFLPD  SLHMSIPPAFMKHLNGTFPEKATIQDHTGKSW ITLEKLDDLLYFK GWQ FVDYH LKYGDFLVFQYDGH TFDV IFG
Subjt:  SSNEAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFG

Query:  KNGCKKAVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNSVSNLEAVAADAGNSV
        KNGCKKAV AK  SSVPILEAEIAEAGNSVSNLEA+VADA              GNS+SN EAIV +AGNSVSNLEA+ A A NSVSN  A+ ADA  SV
Subjt:  KNGCKKAVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNSVSNLEAVAADAGNSV

Query:  SNLEVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGLFFERTMKRWSRQTIYISGRVVRD
        SNLEVVV DAG+SVPI KVK+EPVVEEEDVEPSI  KRKRLQ GS+ VRKSKSIVASN GR  NASNSV QV PRG FFER MK WS QTI         
Subjt:  SNLEVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGLFFERTMKRWSRQTIYISGRVVRD

Query:  ENISLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVLERGNVEEPFHYMPFFGIENFRIEPFEIIPVRINPEVRKYI-E
                                                                      V+E  HYMPFFGIENFRIEPFEIIPVRI P+V KYI E
Subjt:  ENISLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVLERGNVEEPFHYMPFFGIENFRIEPFEIIPVRINPEVRKYI-E

Query:  YSQFQEHSREMTCSQFHSSSANEEPKYIQFEHEGVDSQQHDQYFQDD-DLQEDILSEGVDMDITNELPISQSQEILYLEYQPPQTDKEDNWKSANTDTTE
        YSQFQEHSR M CSQFHS+SA++EPKYIQF+HE VDSQQ+DQYFQ+D D Q D  S GVDMD+ NELPISQSQEILYLEYQP QTDKEDN KSAN +TTE
Subjt:  YSQFQEHSREMTCSQFHSSSANEEPKYIQFEHEGVDSQQHDQYFQDD-DLQEDILSEGVDMDITNELPISQSQEILYLEYQPPQTDKEDNWKSANTDTTE

Query:  LGGNSFDIKENNMDTTTELRENSF-GIEENNMDITEQLGGNLFDIEENNIKQEKQSPVTVKATRKMKKRKSRETTSFEVQEQNEETSEIDTDQDSRRNV-
        +GG SF+I+E N ++TTEL  NS    EENNM  TE LGGN FDIEENNIK+EK+SP TV+ATRK KKRKSRETTSFEVQE NEETSEIDTDQDSRRNV 
Subjt:  LGGNSFDIKENNMDTTTELRENSF-GIEENNMDITEQLGGNLFDIEENNIKQEKQSPVTVKATRKMKKRKSRETTSFEVQEQNEETSEIDTDQDSRRNV-

Query:  ------------EDDGKRKTRSKRVKKSRISGTPSEHDDEVDVYK-------------------------------------------------------
                    +  GKRK R+KR KKS IS TPSEHDD+VDVYK                                                       
Subjt:  ------------EDDGKRKTRSKRVKKSRISGTPSEHDDEVDVYK-------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------FDIQPLIATKTEMEMPYMIPFGGVKPSKEKGKSPVDQEHNSDARTSYNNDYCNMKGPQSVSNDGVKNFLFTKIVNIEEIL
                            FD+QPLIATKTEMEMPYMIPFGGVKPS+EK KS +DQEHNSDARTSYN  YCN KGP SV NDGV NFLFTKIVNIEEIL
Subjt:  --------------------FDIQPLIATKTEMEMPYMIPFGGVKPSKEKGKSPVDQEHNSDARTSYNNDYCNMKGPQSVSNDGVKNFLFTKIVNIEEIL

Query:  GSLVHDIDNLKNLFSKVCENVNEAADPEKMRE
        GSLVHDIDNLKNLFSK+CENVNEA DPEKMR+
Subjt:  GSLVHDIDNLKNLFSKVCENVNEAADPEKMRE

XP_038883717.1 uncharacterized protein LOC120074618 isoform X3 [Benincasa hispida]3.8e-23251.98Show/hide
Query:  SSNEAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFG
        SS EA SSLEFFKVFLPD  SLHMSIPPAFMKHLNGTFPEKATIQDHTGKSW ITLEKLDDLLYFK GWQ FVDYH LKYGDFLVFQYDGH TFDV IFG
Subjt:  SSNEAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFG

Query:  KNGCKKAVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNSVSNLEAVAADAGNSV
        KNGCKKAV AK  SSVPILEAEIAEAGNSVSNLEA+VADA              GNS+SN EAIV +AGNSVSNLEA+ A A NSVSN  A+ ADA  SV
Subjt:  KNGCKKAVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNSVSNLEAVAADAGNSV

Query:  SNLEVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGLFFERTMKRWSRQTIYISGRVVRD
        SNLEVVV DAG+SVPI KVK+EPVVEEEDVEPSI  KRKRLQ GS+ VRKSKSIVASN GR  NASNSV QV PRG FFER MK WS QTI         
Subjt:  SNLEVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGLFFERTMKRWSRQTIYISGRVVRD

Query:  ENISLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVLERGNVEEPFHYMPFFGIENFRIEPFEIIPVRINPEVRKYI-E
                                                                        E  HYMPFFGIENFRIEPFEIIPVRI P+V KYI E
Subjt:  ENISLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVLERGNVEEPFHYMPFFGIENFRIEPFEIIPVRINPEVRKYI-E

Query:  YSQFQEHSREMTCSQFHSSSANEEPKYIQFEHEGVDSQQHDQYFQDD-DLQEDILSEGVDMDITNELPISQSQEILYLEYQPPQTDKEDNWKSANTDTTE
        YSQFQEHSR M CSQFHS+SA++EPKYIQF+HE VDSQQ+DQYFQ+D D Q D  S GVDMD+ NELPISQSQEILYLEYQP QTDKEDN KSAN +TTE
Subjt:  YSQFQEHSREMTCSQFHSSSANEEPKYIQFEHEGVDSQQHDQYFQDD-DLQEDILSEGVDMDITNELPISQSQEILYLEYQPPQTDKEDNWKSANTDTTE

Query:  LGGNSFDIKENNMDTTTELRENSF-GIEENNMDITEQLGGNLFDIEENNIKQEKQSPVTVKATRKMKKRKSRETTSFEVQEQNEETSEIDTDQDSRRNV-
        +GG SF+I+E N ++TTEL  NS    EENNM  TE LGGN FDIEENNIK+EK+SP TV+ATRK KKRKSRETTSFEVQE NEETSEIDTDQDSRRNV 
Subjt:  LGGNSFDIKENNMDTTTELRENSF-GIEENNMDITEQLGGNLFDIEENNIKQEKQSPVTVKATRKMKKRKSRETTSFEVQEQNEETSEIDTDQDSRRNV-

Query:  -------------EDDGKRKTRSKRVKKSRISGTPSEHDDEVDVYK------------------------------------------------------
                     +  GKRK R+KR KKS IS TPSEHDD+VDVYK                                                      
Subjt:  -------------EDDGKRKTRSKRVKKSRISGTPSEHDDEVDVYK------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------FDIQPLIATKTEMEMPYMIPFGGVKPSKEKGKSPVDQEHNSDARTSYNNDYCNMKGPQSVSNDGVKNFLFTKIVNIEEI
                             FD+QPLIATKTEMEMPYMIPFGGVKPS+EK KS +DQEHNSDARTSYN  YCN KGP SV NDGV NFLFTKIVNIEEI
Subjt:  ---------------------FDIQPLIATKTEMEMPYMIPFGGVKPSKEKGKSPVDQEHNSDARTSYNNDYCNMKGPQSVSNDGVKNFLFTKIVNIEEI

Query:  LGSLVHDIDNLKNLFSKVCENVNEAADPEKMRE
        LGSLVHDIDNLKNLFSK+CENVNEA DPEKMR+
Subjt:  LGSLVHDIDNLKNLFSKVCENVNEAADPEKMRE

TrEMBL top hitse value%identityAlignment
A0A0A0KI50 TF-B3 domain-containing protein3.8e-19044.41Show/hide
Query:  SSNEAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFG
        S  E AS+LEFFKVFLP   +LHMSIPPAFMKHLNGTFPEKAT+QDHTG SWCITLEKLDDLLYFKNGW+ FVDYHSLKYGDFLVFQY GHC FDV IFG
Subjt:  SSNEAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFG

Query:  KNGCKKAVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNSVSNLEAVAADAGNSV
        KNGCKKA AAK ASS+P+LE EIAEAGNSVS+ EA V                                          ADA NS++NLEA++ADAGNS 
Subjt:  KNGCKKAVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNSVSNLEAVAADAGNSV

Query:  SNLEVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGLFFERTMKRWSRQTIYISGRVVRD
        S LEVV A  G++VP LKVKEEPVVEEEDV+PSISHKRKRLQ GS+   +SKS+V  N GR  N SNSVEQ  PRG FFERTMKRWS Q +         
Subjt:  SNLEVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGLFFERTMKRWSRQTIYISGRVVRD

Query:  ENISLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVLERGNVEEPFHYMPFFGIENFRIEPFEIIPVRINPEVRKYI-E
                                                                      VEEP HYMPFFG +NFRIEP +I PVR NPEV KY  E
Subjt:  ENISLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVLERGNVEEPFHYMPFFGIENFRIEPFEIIPVRINPEVRKYI-E

Query:  YSQFQEHSREMTCSQFHSSSANEEPKYIQFEHEGVDSQQHDQYFQDDDLQEDILSEGVDMDITNELPISQSQEILYLEYQPPQTDKEDNWKSANTDTTEL
         +QFQE+S EM  S  HS SAN+E KYIQFEHE VDSQQ+ QYFQDDD+Q D  SEG+DM +T+E PISQS+EILYLEYQP QTD EDN KSAN D+TEL
Subjt:  YSQFQEHSREMTCSQFHSSSANEEPKYIQFEHEGVDSQQHDQYFQDDDLQEDILSEGVDMDITNELPISQSQEILYLEYQPPQTDKEDNWKSANTDTTEL

Query:  GGNSFDIKENNMD------------------TTTELRENSF-GIEENNMDITEQLGGNLFD-IEENNI--------------------KQEKQSPVTVKA
          NS+DI++NNMD                   TTEL  NS+  I+ENNMD TE LGGNL D I+ENN+                     +EKQSP +V+ 
Subjt:  GGNSFDIKENNMD------------------TTTELRENSF-GIEENNMDITEQLGGNLFD-IEENNI--------------------KQEKQSPVTVKA

Query:  TRKMKKRKSRETTSFEVQEQNEETSEIDTDQDSRRNVE---------------DDGKRKTRSKRVKKSRISGTPSEHDDEVDVYK---------------
        +RK KKRKS    SFEVQEQ EETSEIDTDQDSRR VE               +DGKRK R KR KKS ISGT SEHDDEVDV+K               
Subjt:  TRKMKKRKSRETTSFEVQEQNEETSEIDTDQDSRRNVE---------------DDGKRKTRSKRVKKSRISGTPSEHDDEVDVYK---------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------FDIQPLIATKTEMEMPYMIPFGGVKPSKEKGKSPVDQEHN
                                                                    FD+QPLIAT TEMEMPYMIPFGGVKPS EK  SPVDQEHN
Subjt:  ------------------------------------------------------------FDIQPLIATKTEMEMPYMIPFGGVKPSKEKGKSPVDQEHN

Query:  SDARTSYNNDYCNMKGPQSVSNDGVKNFLFTKIVNIEEILGSLVHDIDNLKNLFSKVCENVNEAADPEKMREVL
        SDARTSYN D+CN KG QSVS DGV+NFLFTKIVNIE ILGSLVHDIDNLK+ F K+C   NEAAD EKMR+ L
Subjt:  SDARTSYNNDYCNMKGPQSVSNDGVKNFLFTKIVNIEEILGSLVHDIDNLKNLFSKVCENVNEAADPEKMREVL

A0A1S3B176 uncharacterized protein LOC103484737 isoform X11.5e-18944.71Show/hide
Query:  EAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFGKNG
        EAAS+LEFFKVFLPD  +LHMSIPPAFMKHLNGTFPEKAT+QDHTG SW ITLEKLDDLLYFKNGW+ FVDYHSLKYGDFLVFQYDGHC FDV IFGKNG
Subjt:  EAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFGKNG

Query:  CKKAVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNSVSNLEAVAADAGNSVSNL
        CKKA AAK ASS+P+LE EI EAGNSVS+ EA V                                          ADA NS++NLEA++ADAGNS S L
Subjt:  CKKAVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNSVSNLEAVAADAGNSVSNL

Query:  EVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGLFFERTMKRWSRQTIYISGRVVRDENI
        EVV AD  ++VP L+VKEEPVVEEEDV PSISHKRKRLQ GS+   KSKSIV  N G   N SNSVEQ  PRG FFERTMKRWS Q +            
Subjt:  EVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGLFFERTMKRWSRQTIYISGRVVRDENI

Query:  SLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVLERGNVEEPFHYMPFFGIENFRIEPFEIIPVRINPEVRKYI-EYSQ
                                                                   VEEP HYMPFFG+ENFRIEPF+IIPVR NPEV KY  E +Q
Subjt:  SLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVLERGNVEEPFHYMPFFGIENFRIEPFEIIPVRINPEVRKYI-EYSQ

Query:  FQEHSREMTCSQFHSSSANEEPKYIQFEHEGVDSQQHDQYFQDDDLQEDILSEGVDMDITNELPISQSQEILYLEYQPPQTDKEDNWKSANTDTTELGGN
        F+E S EM CS  HS SAN+E KYIQFEHE VD QQ++QYFQ DDLQED  SEG D+ +T+ELPISQS+EILYLEYQP +TD EDN KSA  DTTEL GN
Subjt:  FQEHSREMTCSQFHSSSANEEPKYIQFEHEGVDSQQHDQYFQDDDLQEDILSEGVDMDITNELPISQSQEILYLEYQPPQTDKEDNWKSANTDTTELGGN

Query:  SFDIKENNMDTTTEL--------RENSF-----------GIEENNMDITE-----------------QLGGNLFDIEENNIKQEKQSPVTVKATRKMKKR
        S+DI+ENNMD TTEL        +EN+             IEENN+D TE                 +LGGN FDIEE NIKQEKQSP +VKATRK KKR
Subjt:  SFDIKENNMDTTTEL--------RENSF-----------GIEENNMDITE-----------------QLGGNLFDIEENNIKQEKQSPVTVKATRKMKKR

Query:  KSRETTSFEVQEQNEETSEIDTDQDSRRNVE---------------DDGKRKTRSKRVKKSRISGTPSEHDDEVDVYK----------------------
        KS    S EVQEQNEETSEIDTDQDSRR VE                 GKRK R K+  KS +S +PSEHDDEVDVYK                      
Subjt:  KSRETTSFEVQEQNEETSEIDTDQDSRRNVE---------------DDGKRKTRSKRVKKSRISGTPSEHDDEVDVYK----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------FDIQPLIATKTEMEMPYMIPFGGVKPSKEKGKSPVDQEHNSDARTSY
                                                             FD+QPLIAT+TEMEM YMIPFGG KPS EK K P+DQEHNSDARTSY
Subjt:  -----------------------------------------------------FDIQPLIATKTEMEMPYMIPFGGVKPSKEKGKSPVDQEHNSDARTSY

Query:  NNDYCNMKGPQSVSNDGVKNFLFTKIVNIEEILGSLVHDIDNLKNLFSK
        N D+CN KGPQSVS DG +NFLF KIVNIE ILG+LVHDIDN+KNLFSK
Subjt:  NNDYCNMKGPQSVSNDGVKNFLFTKIVNIEEILGSLVHDIDNLKNLFSK

A0A1S3B181 uncharacterized protein LOC103484737 isoform X71.4e-19246.06Show/hide
Query:  EAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFGKNG
        EAAS+LEFFKVFLPD  +LHMSIPPAFMKHLNGTFPEKAT+QDHTG SW ITLEKLDDLLYFKNGW+ FVDYHSLKYGDFLVFQYDGHC FDV IFGKNG
Subjt:  EAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFGKNG

Query:  CKKAVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNSVSNLEAVAADAGNSVSNL
        CKKA AAK ASS+P+LE EI EAGNSVS+ EA V                                          ADA NS++NLEA++ADAGNS S L
Subjt:  CKKAVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNSVSNLEAVAADAGNSVSNL

Query:  EVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGLFFERTMKRWSRQTIYISGRVVRDENI
        EVV AD  ++VP L+VKEEPVVEEEDV PSISHKRKRLQ GS+   KSKSIV  N G   N SNSVEQ  PRG FFERTMKRWS Q +            
Subjt:  EVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGLFFERTMKRWSRQTIYISGRVVRDENI

Query:  SLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVLERGNVEEPFHYMPFFGIENFRIEPFEIIPVRINPEVRKYI-EYSQ
                                                                   VEEP HYMPFFG+ENFRIEPF+IIPVR NPEV KY  E +Q
Subjt:  SLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVLERGNVEEPFHYMPFFGIENFRIEPFEIIPVRINPEVRKYI-EYSQ

Query:  FQEHSREMTCSQFHSSSANEEPKYIQFEHEGVDSQQHDQYFQDDDLQEDILSEGVDMDITNELPISQSQEILYLEYQPPQTDKEDNWKSANTDTTELGGN
        F+E S EM CS  HS SAN+E KYIQFEHE VD QQ++QYFQ DDLQED  SEG D+ +T+ELPISQS+EILYLEYQP +TD EDN KSA  DTTEL GN
Subjt:  FQEHSREMTCSQFHSSSANEEPKYIQFEHEGVDSQQHDQYFQDDDLQEDILSEGVDMDITNELPISQSQEILYLEYQPPQTDKEDNWKSANTDTTELGGN

Query:  SFDIKENNMDTTTELREN-SFGIEENNMDITEQLGGNLFDIEENNIKQEKQSPVTVKATRKMKKRKSRETTSFEVQEQNEETSEIDTDQDSRRNVE----
        S+DI+ENNMD TTEL  N    I+ENN+D TE LGGN FDIEE NIKQEKQSP +VKATRK KKRKS    S EVQEQNEETSEIDTDQDSRR VE    
Subjt:  SFDIKENNMDTTTELREN-SFGIEENNMDITEQLGGNLFDIEENNIKQEKQSPVTVKATRKMKKRKSRETTSFEVQEQNEETSEIDTDQDSRRNVE----

Query:  -----------DDGKRKTRSKRVKKSRISGTPSEHDDEVDVYK---------------------------------------------------------
                     GKRK R K+  KS +S +PSEHDDEVDVYK                                                         
Subjt:  -----------DDGKRKTRSKRVKKSRISGTPSEHDDEVDVYK---------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------FDIQPLIATKTEMEMPYMIPFGGVKPSKEKGKSPVDQEHNSDARTSYNNDYCNMKGPQSVSNDGVKNFLFTKIVNIEEILGS
                          FD+QPLIAT+TEMEM YMIPFGG KPS EK K P+DQEHNSDARTSYN D+CN KGPQSVS DG +NFLF KIVNIE ILG+
Subjt:  ------------------FDIQPLIATKTEMEMPYMIPFGGVKPSKEKGKSPVDQEHNSDARTSYNNDYCNMKGPQSVSNDGVKNFLFTKIVNIEEILGS

Query:  LVHDIDNLKNLFSK
        LVHDIDN+KNLFSK
Subjt:  LVHDIDNLKNLFSK

A0A1S3B1B6 uncharacterized protein LOC103484737 isoform X28.5e-19044.79Show/hide
Query:  EAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFGKNG
        EAAS+LEFFKVFLPD  +LHMSIPPAFMKHLNGTFPEKAT+QDHTG SW ITLEKLDDLLYFKNGW+ FVDYHSLKYGDFLVFQYDGHC FDV IFGKNG
Subjt:  EAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFGKNG

Query:  CKKAVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNSVSNLEAVAADAGNSVSNL
        CKKA AAK ASS+P+LE EI EAGNSVS+ EA V                                          ADA NS++NLEA++ADAGNS S L
Subjt:  CKKAVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNSVSNLEAVAADAGNSVSNL

Query:  EVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGLFFERTMKRWSRQTIYISGRVVRDENI
        EVV AD  ++VP L+VKEEPVVEEEDV PSISHKRKRLQ GS+   KSKSIV  N G   N SNSVEQ  PRG FFERTMKRWS Q +            
Subjt:  EVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGLFFERTMKRWSRQTIYISGRVVRDENI

Query:  SLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVLERGNVEEPFHYMPFFGIENFRIEPFEIIPVRINPEVRKYI-EYSQ
                                                                   VEEP HYMPFFG+ENFRIEPF+IIPVR NPEV KY  E +Q
Subjt:  SLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVLERGNVEEPFHYMPFFGIENFRIEPFEIIPVRINPEVRKYI-EYSQ

Query:  FQEHSREMTCSQFHSSSANEEPKYIQFEHEGVDSQQHDQYFQDDDLQEDILSEGVDMDITNELPISQSQEILYLEYQPPQTDKEDNWKSANTDTTELGGN
        F+E S EM CS  HS SAN+E KYIQFEHE VD QQ++QYFQ DDLQED  SEG D+ +T+ELPISQS+EILYLEYQP +TD EDN KSA  DTTEL GN
Subjt:  FQEHSREMTCSQFHSSSANEEPKYIQFEHEGVDSQQHDQYFQDDDLQEDILSEGVDMDITNELPISQSQEILYLEYQPPQTDKEDNWKSANTDTTELGGN

Query:  SFDIKENNMDTTTEL--------RENSF-----------GIEENNMDITE-----------------QLGGNLFDIEENNIKQEKQSPVTVKATRKMKKR
        S+DI+ENNMD TTEL        +EN+             IEENN+D TE                 +LGGN FDIEE NIKQEKQSP +VKATRK KKR
Subjt:  SFDIKENNMDTTTEL--------RENSF-----------GIEENNMDITE-----------------QLGGNLFDIEENNIKQEKQSPVTVKATRKMKKR

Query:  KSRETTSFEVQEQNEETSEIDTDQDSRRNVE-------------DDGKRKTRSKRVKKSRISGTPSEHDDEVDVYK------------------------
        KS    S EVQEQNEETSEIDTDQDSRR VE               GKRK R K+  KS +S +PSEHDDEVDVYK                        
Subjt:  KSRETTSFEVQEQNEETSEIDTDQDSRRNVE-------------DDGKRKTRSKRVKKSRISGTPSEHDDEVDVYK------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------FDIQPLIATKTEMEMPYMIPFGGVKPSKEKGKSPVDQEHNSDARTSYNN
                                                           FD+QPLIAT+TEMEM YMIPFGG KPS EK K P+DQEHNSDARTSYN 
Subjt:  ---------------------------------------------------FDIQPLIATKTEMEMPYMIPFGGVKPSKEKGKSPVDQEHNSDARTSYNN

Query:  DYCNMKGPQSVSNDGVKNFLFTKIVNIEEILGSLVHDIDNLKNLFSK
        D+CN KGPQSVS DG +NFLF KIVNIE ILG+LVHDIDN+KNLFSK
Subjt:  DYCNMKGPQSVSNDGVKNFLFTKIVNIEEILGSLVHDIDNLKNLFSK

A0A5D3CM75 B3 domain-containing transcription factor VRN1-like isoform X23.4e-24767.53Show/hide
Query:  EMMIPKIFVKDYGKLLSSFVFLKLPDGMEWKVGLTTAANGAVWLQKGWDKFLEHYCLEFGSLLVFRLLDGRKSSSFDVTIFDPTGVETQYSCNVNNCTP-
        ++MIP++F+KDYGK LS+FV L+LPDG EWKVGLT   NG+VWLQ+GWDKF EHYCLE+G LLVF+LLD R +SSF V IFD T +ETQYS NVN+ T  
Subjt:  EMMIPKIFVKDYGKLLSSFVFLKLPDGMEWKVGLTTAANGAVWLQKGWDKFLEHYCLEFGSLLVFRLLDGRKSSSFDVTIFDPTGVETQYSCNVNNCTP-

Query:  -DSDCDESLEGFEGLLKKRKKASVPCFRSCKKTRKQDSFSIKVEPVEEEGCRNFSDIPNCKKKMPKEDEVWISNKGQVLKNKGESTEKPGFKVVMSQSNV
         DSD D S E      KKRKK S+P  R  KKTR++D F IK EP EEEGC  FSDIP        ++EV IS + + LKN+GES EK GFKVVMS+SNV
Subjt:  -DSDCDESLEGFEGLLKKRKKASVPCFRSCKKTRKQDSFSIKVEPVEEEGCRNFSDIPNCKKKMPKEDEVWISNKGQVLKNKGESTEKPGFKVVMSQSNV

Query:  GGRFNMAIPEKFARKYLCEEFGSISLQTTNGRKWAVLYKWSRNKDEKVAYFCSGWRVFVQENLLKAGDVVFFEPIKKHRFLFTKLQDTAPFSSPSPNNKI
        GGRFN+ IP++FARKYL +E GSIS+QT NG+KW++LY+W+   DE VAY   GWR FV+ENLLK GDVVFFE IKK +FLFTKL++     S SP  KI
Subjt:  GGRFNMAIPEKFARKYLCEEFGSISLQTTNGRKWAVLYKWSRNKDEKVAYFCSGWRVFVQENLLKAGDVVFFEPIKKHRFLFTKLQDTAPFSSPSPNNKI

Query:  ASTRNPFFKVKIHLKSYGNAVLNIPMGFAKKHLSPEMYYAKLQVRNKEWNVTLKQYESHSRFSAGWSKFYHENGLRDGNTCLFEMMLPKKFITDYGKFLS
        +ST NPFFKV+IH KSYGNA LNIP+GFAK+H SPEM+YAKLQVRNKEW VTLKQY +H R SAGWSKFYHEN LRDGNTCLFEMM+PKKFITDYGKFLS
Subjt:  ASTRNPFFKVKIHLKSYGNAVLNIPMGFAKKHLSPEMYYAKLQVRNKEWNVTLKQYESHSRFSAGWSKFYHENGLRDGNTCLFEMMLPKKFITDYGKFLS

Query:  NSICLKLPDGLEWKLGSKTA-NDTVWLQNGWQQFSNHYRLKPGSLLVFRFDGNSTFQTCIFDQTCLEIQYPSNNIGKTKPDDEEFNGYRYEEVETNNHEI
        N ICLK PDG+EWK+ S T  N TVWL+NGWQ+FSNHY LK GSLLVFR DGNSTF T IF+Q C EIQY SN++G  +P+ EE++            E 
Subjt:  NSICLKLPDGLEWKLGSKTA-NDTVWLQNGWQQFSNHYRLKPGSLLVFRFDGNSTFQTCIFDQTCLEIQYPSNNIGKTKPDDEEFNGYRYEEVETNNHEI

Query:  NDLKPEKIGFKIVVKKSTVEGRYNMLIPKHFASKHLKEEFGRIEIENSDGESWAMSYKWSQSRNVAEYVYISRTTFSP---PISAKNTNVKITTPNNNLF
         + KPEKIGFKI VKKS +EGRYNMLIPK FA +HL EEFG+IEIENSDG SW M YKWSQSR V E+ YIS     P   PI A N NV+ TTP+ + F
Subjt:  NDLKPEKIGFKIVVKKSTVEGRYNMLIPKHFASKHLKEEFGRIEIENSDGESWAMSYKWSQSRNVAEYVYISRTTFSP---PISAKNTNVKITTPNNNLF

Query:  FKVNIHKKSYKNSVLNIPLTFAENHLSSKMNIAKLIVGKKQWKVKMKHYERCI
        FKVNIH KSYKNSVLNIPLTFA+NHLSSKM+ AKL+VGKKQWKVK+KHYERCI
Subjt:  FKVNIHKKSYKNSVLNIPLTFAENHLSSKMNIAKLIVGKKQWKVKMKHYERCI

SwissProt top hitse value%identityAlignment
Q7XS74 Putative B3 domain-containing protein Os04g03474002.8e-1234.21Show/hide
Query:  SNEAASSLEFFKVFLPDFASLHMSIPPAFMKHL---NGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTI
        + +A++  +F +V LP F    M IP  F++H           A+I    GK W I LEK +  ++FK GW  F+ +H +  GD ++ +++G+  F + +
Subjt:  SNEAASSLEFFKVFLPDFASLHMSIPPAFMKHL---NGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTI

Query:  FGKNGCKKAVAAKD
        FG NGCKK +  KD
Subjt:  FGKNGCKKAVAAKD

Q851V5 Putative B3 domain-containing protein Os03g06216007.6e-1822.3Show/hide
Query:  EMMLPKKFITDYGKFLSNSICLKLPDGLEWKLGSKTANDTVWLQNGWQQFSNHYRLKPGSLLVFRFDGNSTFQTCIFDQTCLEIQYPSNNIGKTKPDDEE
        +M +P KF  ++   +  +I LK  +G    +     ++ + L  GW +F+N + +K G  LVFR+ GNS F+  IFD +   ++  S+N        + 
Subjt:  EMMLPKKFITDYGKFLSNSICLKLPDGLEWKLGSKTANDTVWLQNGWQQFSNHYRLKPGSLLVFRFDGNSTFQTCIFDQTCLEIQYPSNNIGKTKPDDEE

Query:  FNGYRYEEVETNNHEI--NDLKPEKIGFKIVVKKSTVEGRYNMLIPKHFASKHLKEEFGRIEIE-----------NSDGESWAMSYKWSQSRNVAEYVYI
          G   E +  ++  +    L  E+   +   +K  ++     +  +H +S    +E    E+            +S+   +    K S       YV I
Subjt:  FNGYRYEEVETNNHEI--NDLKPEKIGFKIVVKKSTVEGRYNMLIPKHFASKHLKEEFGRIEIE-----------NSDGESWAMSYKWSQSRNVAEYVYI

Query:  SRTTFSPPISAKNTNVKITTPNNNLFFKVNIHKKSYKNSVLNIPLTFAENHLSSKMNIAKLIV-------------GKKQWKVKMKHYERCISSNEAASS
        SR   +      +  + +         K  I K+  +           +N L     +A  I              G+K  K+  +      S+      
Subjt:  SRTTFSPPISAKNTNVKITTPNNNLFFKVNIHKKSYKNSVLNIPLTFAENHLSSKMNIAKLIV-------------GKKQWKVKMKHYERCISSNEAASS

Query:  LEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFGKNGCKKAV
          FFKV + DF    M+IP  F +H  G   +   ++  +G ++ + + K  ++L   +GW++FV+ H L  GDFLVF+Y+G     V IF  +GC+K+ 
Subjt:  LEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFGKNGCKKAV

Query:  AAKDASSV
        +    +++
Subjt:  AAKDASSV

Q851V5 Putative B3 domain-containing protein Os03g06216002.7e-0729.17Show/hide
Query:  IPPAFVKYFNGRIPSEAVIRDQSRQSWHVTLEELKNVVFFKDGWQEFVESHLLKLGDFLVFQYDGSHMFDVKIFSKNGCKKERVSRTGCPCAVVKVKDEP
        IP  F++YF G+IP    ++ +   ++ V + +    +  + GW+ FV +H L++GDFLVF YDG     V IF  +GC+K   SR+    A    +   
Subjt:  IPPAFVKYFNGRIPSEAVIRDQSRQSWHVTLEELKNVVFFKDGWQEFVESHLLKLGDFLVFQYDGSHMFDVKIFSKNGCKKERVSRTGCPCAVVKVKDEP

Query:  QSEHNYSTSLTRCKRSDSEVRSTDSSGTAPKSRRRSTSNLEELS
        +  H  S S     +S   V  ++    +   +   T+N+EE++
Subjt:  QSEHNYSTSLTRCKRSDSEVRSTDSSGTAPKSRRRSTSNLEELS

Q8L3W1 B3 domain-containing transcription factor VRN18.4e-1726.06Show/hide
Query:  MMLPKKFITDYGKFLSNSICLKLPDGLEWKLGSKTANDTVWLQNGWQQFSNHYRLKPGSLLVFRFDGNSTFQTCIFDQTCLEIQYPSNNIGKTKPDDEEF
        + +P KF++ +   LS ++ L +PDG  W++G + A++ +W Q+GWQ+F + Y ++ G LL+FR++GNS F   IF+ +  EI Y S  +      D   
Subjt:  MMLPKKFITDYGKFLSNSICLKLPDGLEWKLGSKTANDTVWLQNGWQQFSNHYRLKPGSLLVFRFDGNSTFQTCIFDQTCLEIQYPSNNIGKTKPDDEEF

Query:  NGYR----YEEVETNNHEINDLKPEKIGFKIVVKKSTVEGRYNMLIPKHFASKHLKEEF-GRIEIE-----------------NSDGESWAMSYKWSQSR
        N ++    +E++E  + E+  + P  + +   + +STV         K +AS  ++  F G ++ E                 N+D E    S       
Subjt:  NGYR----YEEVETNNHEINDLKPEKIGFKIVVKKSTVEGRYNMLIPKHFASKHLKEEF-GRIEIE-----------------NSDGESWAMSYKWSQSR

Query:  NVAEYVYISRTTFSPPISAKN----TNVKITTPNNNLFFKVNIHKK-SYKNSVLNIPLTFAENHLSSKMNIAKLIVGKKQWKVK
              Y S +     ++A+      N   T    N FF+V +     Y+  ++ +P  FAE +LS      K+ + +KQW V+
Subjt:  NVAEYVYISRTTFSPPISAKN----TNVKITTPNNNLFFKVNIHKK-SYKNSVLNIPLTFAENHLSSKMNIAKLIVGKKQWKVK

Q8LAV5 B3 domain-containing protein REM203.5e-1534.09Show/hide
Query:  FFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFGKNGCKKAVAA
        FFKVFL + AS  + IP  FM  L    P+   +Q   GK W ++L+K+    Y   GW  F + H LK G+F+ F YDGH TF+V++F + G K+  A 
Subjt:  FFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFGKNGCKKAVAA

Query:  KDASSVPILEAEIAEAGNSVSNLEAIVADADN
         +  ++P+ +++         +   +V D D+
Subjt:  KDASSVPILEAEIAEAGNSVSNLEAIVADADN

Q9FGD2 Putative B3 domain-containing protein At5g669801.5e-2125.97Show/hide
Query:  EAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDD----LLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIF
        + +  L+FFKVFLP+F S  + IPPAF+  L    P++A + D  G+ WC+  +  D      ++F  GWQ+F +  SL++GDFLVF YDG   F VTIF
Subjt:  EAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDD----LLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIF

Query:  GKNGCKK---AVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNSVSNLEAVAADA
          +GCKK    V+  D S V + E E  +       +                                 D G S+           N     ++V  D 
Subjt:  GKNGCKK---AVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNSVSNLEAVAADA

Query:  GNSVSNLEVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGLFFERTMKRWSRQTIYISGR
                V+V D    V   K K E     E  + +++       +     +K      S                P+   F R + R S Q + +   
Subjt:  GNSVSNLEVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGLFFERTMKRWSRQTIYISGR

Query:  VVRDENISLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVL
         +R   I L+ +I L DE G  WP  +    +        W  F   H++ + +KC FEF++
Subjt:  VVRDENISLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVL

Arabidopsis top hitse value%identityAlignment
AT3G06160.1 AP2/B3-like transcriptional factor family protein8.3e-1228.5Show/hide
Query:  WKVKMKHYERCISSNEAASSL-EFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQ
        W  K +      SS  +   L  FF VFL   +S  M IP ++   L    P+ A +    G+ W + +    + +YF+ GW  FV  + LK G+FL F 
Subjt:  WKVKMKHYERCISSNEAASSL-EFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQ

Query:  YDGHCTFDVTIFGKNGCK--KAVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNS
        +DGH +++V+I+G+  CK  +AV   +  S    +  ++    S  +L+++  D+ +S SN+ ++ + + +S+     I +D+  S  NL +    A  S
Subjt:  YDGHCTFDVTIFGKNGCK--KAVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNS

Query:  VSNLEAV
        V ++E V
Subjt:  VSNLEAV

AT3G18990.1 AP2/B3-like transcriptional factor family protein5.9e-1826.06Show/hide
Query:  MMLPKKFITDYGKFLSNSICLKLPDGLEWKLGSKTANDTVWLQNGWQQFSNHYRLKPGSLLVFRFDGNSTFQTCIFDQTCLEIQYPSNNIGKTKPDDEEF
        + +P KF++ +   LS ++ L +PDG  W++G + A++ +W Q+GWQ+F + Y ++ G LL+FR++GNS F   IF+ +  EI Y S  +      D   
Subjt:  MMLPKKFITDYGKFLSNSICLKLPDGLEWKLGSKTANDTVWLQNGWQQFSNHYRLKPGSLLVFRFDGNSTFQTCIFDQTCLEIQYPSNNIGKTKPDDEEF

Query:  NGYR----YEEVETNNHEINDLKPEKIGFKIVVKKSTVEGRYNMLIPKHFASKHLKEEF-GRIEIE-----------------NSDGESWAMSYKWSQSR
        N ++    +E++E  + E+  + P  + +   + +STV         K +AS  ++  F G ++ E                 N+D E    S       
Subjt:  NGYR----YEEVETNNHEINDLKPEKIGFKIVVKKSTVEGRYNMLIPKHFASKHLKEEF-GRIEIE-----------------NSDGESWAMSYKWSQSR

Query:  NVAEYVYISRTTFSPPISAKN----TNVKITTPNNNLFFKVNIHKK-SYKNSVLNIPLTFAENHLSSKMNIAKLIVGKKQWKVK
              Y S +     ++A+      N   T    N FF+V +     Y+  ++ +P  FAE +LS      K+ + +KQW V+
Subjt:  NVAEYVYISRTTFSPPISAKN----TNVKITTPNNNLFFKVNIHKK-SYKNSVLNIPLTFAENHLSSKMNIAKLIVGKKQWKVK

AT3G53310.1 AP2/B3-like transcriptional factor family protein2.5e-1634.09Show/hide
Query:  FFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFGKNGCKKAVAA
        FFKVFL + AS  + IP  FM  L    P+   +Q   GK W ++L+K+    Y   GW  F + H LK G+F+ F YDGH TF+V++F + G K+  A 
Subjt:  FFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFGKNGCKKAVAA

Query:  KDASSVPILEAEIAEAGNSVSNLEAIVADADN
         +  ++P+ +++         +   +V D D+
Subjt:  KDASSVPILEAEIAEAGNSVSNLEAIVADADN

AT4G31690.1 Transcriptional factor B3 family protein2.9e-1224.9Show/hide
Query:  EHNHSIPPAFVKYFNGRIPSE------AVIRDQSRQSWHVTLEELKNVVFFKDGWQEFVESHLLKLGDFLVFQYDGSHMFDVKIFSKNGCKKERVSRTGC
        + N  IP   V YF+  I  +       +  D S ++W V +E  +      +GW+EFVE+H L++GDF+VF+++G  +F V     + C+ +       
Subjt:  EHNHSIPPAFVKYFNGRIPSE------AVIRDQSRQSWHVTLEELKNVVFFKDGWQEFVESHLLKLGDFLVFQYDGSHMFDVKIFSKNGCKKERVSRTGC

Query:  PCAVVKVKDEPQSEHNYSTSLTRCKRSDSEVRSTDSSGTAPKSRRRSTSNLEELSPSKTAEHISMESPTFELMVKRWSHNAIHIPKTVMVTHNISL-KPN
                  PQS  +        +  ++E+   +      K   +S+S+L   S S T  +I              S +A+ +P+  +     S  +  
Subjt:  PCAVVKVKDEPQSEHNYSTSLTRCKRSDSEVRSTDSSGTAPKSRRRSTSNLEELSPSKTAEHISMESPTFELMVKRWSHNAIHIPKTVMVTHNISL-KPN

Query:  LVIVNERGRSWLVTAKPISRGRFALTTGWPAFFRANSLREDDECIFEFV
        +V++NE G+SW    K    G   L  GW  F   N L   D C F+ +
Subjt:  LVIVNERGRSWLVTAKPISRGRFALTTGWPAFFRANSLREDDECIFEFV

AT5G66980.1 AP2/B3-like transcriptional factor family protein1.0e-2225.97Show/hide
Query:  EAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDD----LLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIF
        + +  L+FFKVFLP+F S  + IPPAF+  L    P++A + D  G+ WC+  +  D      ++F  GWQ+F +  SL++GDFLVF YDG   F VTIF
Subjt:  EAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLEKLDD----LLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIF

Query:  GKNGCKK---AVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNSVSNLEAVAADA
          +GCKK    V+  D S V + E E  +       +                                 D G S+           N     ++V  D 
Subjt:  GKNGCKK---AVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVADAGNSVSNLEAVAADADNSVSNLEAVAADA

Query:  GNSVSNLEVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGLFFERTMKRWSRQTIYISGR
                V+V D    V   K K E     E  + +++       +     +K      S                P+   F R + R S Q + +   
Subjt:  GNSVSNLEVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGLFFERTMKRWSRQTIYISGR

Query:  VVRDENISLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVL
         +R   I L+ +I L DE G  WP  +    +        W  F   H++ + +KC FEF++
Subjt:  VVRDENISLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTTCGTCTCCAAAGTTTTTCAGGATCGTCTTGCAAAGAAATCTTCAAGACCCAAAACTGATGATTCCGAAGACGTTCGTCAAAGATTATGGTAAGCTTTTGTCAAG
TTCCGTTAGTCTAAAGCTTTCAGACGGAAAGGAATGGAAAGTGGGTTTAATGACGGCCGGTAACGGCGCTGTTTGGTTGGAAAAGGGATGGGATAAATTTTCGGAATATT
ACTGTTTAGAGTTCGGGTCATTGTTGGTCTTTAGATTATTGGATGGAAGAAGTTCGAGTTTCGATGTGACCATATTTGATCCAACCGGAGTAGAAACCCAATATTCTTGC
AACTTGGATTTTTCTACAGCAGAACTTGAGGAGGATTCTGATTCTGATTCTGACAAAATTTTAGAGAGTTTTGATGTACATTTGAAGAAGAGGAAGAAGGCTTCAGTTCC
CTGTCGTCAATCTCGTAAGAAAATGAGAAAGGAAGATTCTTTTACCATCAAAACGGAACCTGAAGAAGAAGAAGGGTGCAATAATATATTTAGGAACATTCCAAGTTGTA
AAGAAAGAATGTCAAAAGATGTAGTTAAGTTTTCAAGGAAGAAGCAACAACTAGCAAACAAAGTTGAAGCAACTCAAAGATTCAGCTCAAAATCAGACCAGAAACCTTAC
TTCAAAGTTGTCATGCGACAGTCAAACGTACGAGGTAGATTTAACTTGGTTATTCCCTATGAATTTGCAGTGAAGTATTTACCCGAGGAATTTGGAACTATTAATCTTCA
AATTGTGAATGGTAAAAATTGGCAACTTTTGTACAAATGGTGTCGAACTATTCGAGCAAATTATGCATATATTTCAAGTGGGTGGAAGCGTTTTGCTGAGGAAAATCACT
TGAAAGAAGGGGATATTGGATTCTTTCAATTGATTAAGAACCACAATTTTATGTTCACCAAATTACAAGACACCCCTTCATCTTCCTTGTCATCGAAGAAAGGAACTGCA
ACAACAACAAACAATCACTTCTTTGAAATGGATAGAAGCTACAAGAATTCATATCTGGTGTACTCTGCAACGCTTCAAGTTGGGAATAAACAATGGAACGTGACATTAAA
GCAATATGATGGTTATGTCCGATTCTCAGCAGGTTGGAGCACATTTCGTGATGATAATGGTTTGGAGGATGGAGATACATGTTTGTTTGAAATGATGATCCCGAAGATAT
TTGTCAAAGACTATGGTAAGCTTTTATCAAGTTTCGTATTTCTTAAACTTCCAGATGGAATGGAATGGAAAGTGGGTTTGACGACCGCCGCCAACGGTGCGGTTTGGCTG
CAAAAGGGATGGGATAAATTTTTGGAACATTACTGTTTAGAGTTCGGGTCATTGTTGGTTTTTAGATTATTGGATGGAAGAAAAAGTTCGAGTTTCGATGTGACCATATT
CGATCCAACTGGAGTAGAAACCCAATATTCTTGCAATGTTAACAATTGCACACCAGATTCTGATTGTGATGAAAGTTTAGAGGGTTTTGAGGGACTTTTAAAGAAGAGGA
AGAAGGCTTCAGTTCCCTGTTTTCGATCTTGTAAAAAAACGAGAAAGCAGGATTCGTTTTCAATCAAAGTGGAACCTGTAGAAGAAGAAGGATGCAGAAATTTTAGTGAC
ATACCAAATTGTAAGAAAAAAATGCCAAAAGAAGATGAAGTTTGGATTTCAAATAAAGGGCAAGTATTAAAAAACAAAGGTGAATCAACTGAGAAACCTGGCTTCAAAGT
TGTGATGAGCCAATCAAATGTAGGAGGTAGATTTAATATGGCTATTCCTGAGAAATTTGCAAGAAAATACTTATGTGAGGAATTTGGAAGTATTAGCCTTCAAACAACAA
ATGGTAGGAAATGGGCAGTTCTGTACAAATGGAGTAGAAATAAAGATGAAAAAGTTGCCTACTTTTGTAGTGGGTGGAGGGTTTTTGTACAGGAAAATCTCTTGAAAGCA
GGGGATGTTGTATTCTTTGAACCCATTAAGAAACACAGATTTTTATTCACCAAATTACAAGACACAGCTCCCTTTTCTTCTCCTTCACCAAACAACAAAATTGCATCAAC
TAGAAACCCCTTCTTCAAAGTGAAGATTCACTTGAAAAGCTATGGGAATGCAGTTCTGAACATTCCCATGGGCTTTGCAAAGAAGCATTTGTCACCAGAAATGTATTATG
CAAAGCTTCAAGTTAGGAACAAAGAATGGAATGTAACATTGAAGCAATATGAGAGTCATTCCAGATTCTCGGCTGGTTGGAGTAAGTTTTATCATGAGAATGGCTTGAGA
GATGGGAACACATGTTTGTTTGAGATGATGCTTCCCAAGAAGTTCATTACTGACTATGGAAAATTCCTTTCAAATTCCATTTGCCTCAAGCTTCCTGACGGCTTGGAATG
GAAGCTTGGGTCTAAAACAGCTAACGACACCGTTTGGTTACAAAATGGGTGGCAGCAATTTTCAAATCATTACCGTTTGAAACCTGGCTCGCTTTTGGTTTTCAGATTTG
ATGGAAACTCTACGTTCCAAACTTGTATTTTTGACCAGACATGTTTAGAGATTCAATATCCTTCCAACAATATTGGAAAGACAAAACCCGACGATGAAGAATTCAATGGA
TATCGATACGAGGAGGTTGAAACAAACAATCATGAAATCAACGACTTGAAACCTGAAAAGATAGGTTTTAAAATTGTTGTGAAGAAATCAACCGTGGAAGGCCGCTATAA
CATGCTTATTCCTAAACATTTTGCAAGTAAACATTTGAAAGAGGAGTTTGGAAGGATAGAAATTGAAAATTCGGATGGAGAAAGTTGGGCAATGTCATACAAATGGAGCC
AAAGTCGTAATGTGGCGGAATATGTTTATATTTCAAGGACTACTTTCTCTCCGCCCATTTCAGCAAAGAACACAAATGTGAAAATTACAACTCCCAATAATAATCTCTTC
TTCAAAGTCAATATCCACAAGAAAAGCTACAAGAACTCTGTTTTGAACATTCCTCTTACATTTGCTGAGAATCATCTTTCTTCAAAAATGAACATTGCAAAACTTATAGT
GGGGAAGAAGCAATGGAAGGTGAAGATGAAACACTATGAAAGATGCATCAGCAGCAACGAAGCCGCTTCTAGCCTTGAGTTTTTCAAGGTTTTTCTTCCCGATTTCGCCT
CTCTGCATATGAGCATACCGCCAGCTTTTATGAAGCATTTAAATGGAACCTTTCCAGAAAAAGCTACCATCCAAGATCATACGGGAAAATCATGGTGTATTACATTGGAA
AAACTGGATGACCTTCTGTATTTCAAGAATGGCTGGCAGACTTTTGTAGATTACCATTCCCTGAAATATGGAGACTTCTTAGTTTTCCAATATGACGGCCACTGTACATT
TGATGTTACGATATTTGGTAAAAATGGATGTAAGAAGGCAGTGGCAGCAAAAGATGCTAGTTCTGTCCCAATTTTGGAGGCTGAGATAGCTGAAGCTGGTAATTCTGTTT
CAAATTTGGAGGCTATAGTAGCAGATGCTGATAATTCTGTTTCAAATTTGGAGGCTATAGTAGCAGATACTGGTAATTCTGTTTCAAATTTGGAGGCTATAGTAGCAGAT
GCTGGTAATTCTGTTTCAAATTTGGAGGCTGTGGCAGCAGATGCTGATAATTCTGTTTCAAATTTGGAGGCTGTGGCAGCAGATGCTGGTAATTCTGTTTCAAATTTGGA
GGTTGTGGTGGCAGATGCTGGTAGTTCTGTTCCAATTTTGAAGGTCAAAGAAGAGCCTGTGGTTGAGGAAGAAGATGTCGAACCTTCAATTTCTCACAAGAGGAAGCGAT
TACAAGTTGGATCAGATACAGTTCGTAAGTCAAAAAGCATTGTAGCTTCAAATTGTGGTAGAGCTGGCAATGCTTCGAACTCTGTAGAACAGGTTAGCCCAAGAGGGCTT
TTCTTTGAGCGGACGATGAAACGCTGGTCGCGTCAGACAATTTATATTTCTGGACGTGTGGTGAGGGATGAGAACATCTCGTTGAAGCCAAACATAGTTCTTAGGGATGA
AGAGGGTACATTGTGGCCAGCAACAGTCTCTTTCACTAGCCAGAATCGTATTTCTGTTACTGCTGGATGGTCTAAATTTTACACCGGCCATAAGTTGAGAATAAATGACA
AATGTGAGTTTGAGTTTGTTCTTGAAAGGGGAAATGTGGAGGAGCCCTTCCATTATATGCCTTTCTTTGGAATAGAGAATTTCAGGATTGAACCCTTCGAGATAATTCCA
GTACGAATAAATCCAGAAGTTAGAAAGTACATTGAGTATAGCCAATTTCAAGAACATAGCAGGGAGATGACTTGTAGCCAATTTCATTCATCGTCTGCAAATGAAGAACC
AAAATACATTCAATTTGAGCATGAGGGAGTTGATAGTCAACAACATGATCAATATTTCCAAGATGATGACCTTCAGGAGGATATTCTAAGTGAAGGAGTTGATATGGACA
TAACAAATGAACTACCTATTAGTCAATCACAAGAAATACTGTATTTGGAGTATCAACCCCCTCAAACAGATAAAGAAGACAATTGGAAATCAGCCAATACGGATACTACT
GAGCTTGGTGGGAATTCATTTGATATTAAAGAGAATAATATGGATACTACTACTGAGCTTCGTGAGAATTCATTTGGTATTGAAGAGAATAATATGGATATTACTGAGCA
GCTTGGTGGGAATTTATTTGATATTGAAGAGAATAATATCAAACAGGAAAAACAGTCCCCTGTAACTGTCAAGGCTACCAGAAAGATGAAAAAGAGAAAATCGAGAGAGA
CGACGAGCTTTGAAGTGCAAGAACAAAACGAAGAGACTTCCGAAATTGATACGGATCAGGACTCTAGGAGGAATGTGGAGGACGATGGTAAGAGAAAGACACGATCAAAG
AGGGTAAAGAAATCAAGAATTTCTGGTACGCCTTCTGAACATGATGATGAGGTGGATGTTTATAAGTTTGATATTCAGCCTCTTATAGCAACAAAAACAGAAATGGAAAT
GCCATATATGATACCATTTGGTGGGGTTAAGCCATCAAAAGAAAAGGGTAAATCGCCAGTGGATCAAGAACACAACAGTGATGCTAGAACTTCGTACAACAATGATTACT
GTAATATGAAAGGACCACAGTCTGTTTCAAATGACGGCGTCAAGAATTTTTTGTTCACTAAGATAGTCAACATAGAGGAAATTTTGGGCAGTTTGGTTCATGACATAGAT
AACCTAAAGAATCTCTTTTCAAAAGTGTGTGAAAATGTAAACGAAGCTGCAGATCCTGAAAAGATGAGGGAAGTGCTCTTCTCACAGAAAGTAATGGAAAGGGGAAAGCG
AGATGAACACAACCATAGTATACCACCTGCTTTTGTGAAGTATTTTAATGGAAGAATACCAAGTGAAGCTGTTATCAGAGATCAAAGCAGACAATCTTGGCATGTTACCT
TGGAAGAACTGAAGAATGTTGTGTTTTTCAAGGATGGCTGGCAAGAATTTGTTGAGAGCCACCTCTTGAAACTTGGAGACTTTTTAGTTTTCCAATACGATGGGAGTCAT
ATGTTTGATGTTAAGATATTTAGTAAAAATGGATGCAAGAAAGAGCGAGTATCAAGAACCGGATGTCCTTGCGCGGTTGTGAAGGTCAAAGACGAGCCCCAATCTGAGCA
CAATTATAGTACATCATTAACTCGTTGCAAAAGAAGTGATTCAGAAGTTAGATCCACTGATAGTTCAGGCACTGCCCCTAAATCAAGGAGAAGATCAACATCAAATCTTG
AAGAACTTAGTCCCTCTAAGACTGCAGAACATATTTCGATGGAATCACCAACCTTCGAGCTCATGGTTAAGCGTTGGTCACATAACGCTATTCATATTCCTAAAACGGTG
ATGGTTACCCATAACATCTCACTGAAGCCAAATTTGGTTATTGTAAATGAAAGGGGCAGGTCATGGCTGGTGACAGCAAAGCCAATTAGCCGTGGTCGGTTTGCATTGAC
CACTGGCTGGCCTGCTTTCTTCAGGGCAAATAGCTTGAGAGAAGATGACGAATGTATATTTGAGTTTGTTCTTGATTCAAATAACCTTTGCGGAGAACTAAAGGTGAAAA
TCACCCGCAGCCTGGAAATAACTCAAGAAAAACAGGCAACAATGTTTCTAAATTGGCTCTCGAGGATAGTTTGCAGAACTCTAGTAGAGGAGTTGAAATTTGAGAGGTTG
TATAAAAGTGATGCTCTTGATAAGAAAAGCTACAAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTTCGTCTCCAAAGTTTTTCAGGATCGTCTTGCAAAGAAATCTTCAAGACCCAAAACTGATGATTCCGAAGACGTTCGTCAAAGATTATGGTAAGCTTTTGTCAAG
TTCCGTTAGTCTAAAGCTTTCAGACGGAAAGGAATGGAAAGTGGGTTTAATGACGGCCGGTAACGGCGCTGTTTGGTTGGAAAAGGGATGGGATAAATTTTCGGAATATT
ACTGTTTAGAGTTCGGGTCATTGTTGGTCTTTAGATTATTGGATGGAAGAAGTTCGAGTTTCGATGTGACCATATTTGATCCAACCGGAGTAGAAACCCAATATTCTTGC
AACTTGGATTTTTCTACAGCAGAACTTGAGGAGGATTCTGATTCTGATTCTGACAAAATTTTAGAGAGTTTTGATGTACATTTGAAGAAGAGGAAGAAGGCTTCAGTTCC
CTGTCGTCAATCTCGTAAGAAAATGAGAAAGGAAGATTCTTTTACCATCAAAACGGAACCTGAAGAAGAAGAAGGGTGCAATAATATATTTAGGAACATTCCAAGTTGTA
AAGAAAGAATGTCAAAAGATGTAGTTAAGTTTTCAAGGAAGAAGCAACAACTAGCAAACAAAGTTGAAGCAACTCAAAGATTCAGCTCAAAATCAGACCAGAAACCTTAC
TTCAAAGTTGTCATGCGACAGTCAAACGTACGAGGTAGATTTAACTTGGTTATTCCCTATGAATTTGCAGTGAAGTATTTACCCGAGGAATTTGGAACTATTAATCTTCA
AATTGTGAATGGTAAAAATTGGCAACTTTTGTACAAATGGTGTCGAACTATTCGAGCAAATTATGCATATATTTCAAGTGGGTGGAAGCGTTTTGCTGAGGAAAATCACT
TGAAAGAAGGGGATATTGGATTCTTTCAATTGATTAAGAACCACAATTTTATGTTCACCAAATTACAAGACACCCCTTCATCTTCCTTGTCATCGAAGAAAGGAACTGCA
ACAACAACAAACAATCACTTCTTTGAAATGGATAGAAGCTACAAGAATTCATATCTGGTGTACTCTGCAACGCTTCAAGTTGGGAATAAACAATGGAACGTGACATTAAA
GCAATATGATGGTTATGTCCGATTCTCAGCAGGTTGGAGCACATTTCGTGATGATAATGGTTTGGAGGATGGAGATACATGTTTGTTTGAAATGATGATCCCGAAGATAT
TTGTCAAAGACTATGGTAAGCTTTTATCAAGTTTCGTATTTCTTAAACTTCCAGATGGAATGGAATGGAAAGTGGGTTTGACGACCGCCGCCAACGGTGCGGTTTGGCTG
CAAAAGGGATGGGATAAATTTTTGGAACATTACTGTTTAGAGTTCGGGTCATTGTTGGTTTTTAGATTATTGGATGGAAGAAAAAGTTCGAGTTTCGATGTGACCATATT
CGATCCAACTGGAGTAGAAACCCAATATTCTTGCAATGTTAACAATTGCACACCAGATTCTGATTGTGATGAAAGTTTAGAGGGTTTTGAGGGACTTTTAAAGAAGAGGA
AGAAGGCTTCAGTTCCCTGTTTTCGATCTTGTAAAAAAACGAGAAAGCAGGATTCGTTTTCAATCAAAGTGGAACCTGTAGAAGAAGAAGGATGCAGAAATTTTAGTGAC
ATACCAAATTGTAAGAAAAAAATGCCAAAAGAAGATGAAGTTTGGATTTCAAATAAAGGGCAAGTATTAAAAAACAAAGGTGAATCAACTGAGAAACCTGGCTTCAAAGT
TGTGATGAGCCAATCAAATGTAGGAGGTAGATTTAATATGGCTATTCCTGAGAAATTTGCAAGAAAATACTTATGTGAGGAATTTGGAAGTATTAGCCTTCAAACAACAA
ATGGTAGGAAATGGGCAGTTCTGTACAAATGGAGTAGAAATAAAGATGAAAAAGTTGCCTACTTTTGTAGTGGGTGGAGGGTTTTTGTACAGGAAAATCTCTTGAAAGCA
GGGGATGTTGTATTCTTTGAACCCATTAAGAAACACAGATTTTTATTCACCAAATTACAAGACACAGCTCCCTTTTCTTCTCCTTCACCAAACAACAAAATTGCATCAAC
TAGAAACCCCTTCTTCAAAGTGAAGATTCACTTGAAAAGCTATGGGAATGCAGTTCTGAACATTCCCATGGGCTTTGCAAAGAAGCATTTGTCACCAGAAATGTATTATG
CAAAGCTTCAAGTTAGGAACAAAGAATGGAATGTAACATTGAAGCAATATGAGAGTCATTCCAGATTCTCGGCTGGTTGGAGTAAGTTTTATCATGAGAATGGCTTGAGA
GATGGGAACACATGTTTGTTTGAGATGATGCTTCCCAAGAAGTTCATTACTGACTATGGAAAATTCCTTTCAAATTCCATTTGCCTCAAGCTTCCTGACGGCTTGGAATG
GAAGCTTGGGTCTAAAACAGCTAACGACACCGTTTGGTTACAAAATGGGTGGCAGCAATTTTCAAATCATTACCGTTTGAAACCTGGCTCGCTTTTGGTTTTCAGATTTG
ATGGAAACTCTACGTTCCAAACTTGTATTTTTGACCAGACATGTTTAGAGATTCAATATCCTTCCAACAATATTGGAAAGACAAAACCCGACGATGAAGAATTCAATGGA
TATCGATACGAGGAGGTTGAAACAAACAATCATGAAATCAACGACTTGAAACCTGAAAAGATAGGTTTTAAAATTGTTGTGAAGAAATCAACCGTGGAAGGCCGCTATAA
CATGCTTATTCCTAAACATTTTGCAAGTAAACATTTGAAAGAGGAGTTTGGAAGGATAGAAATTGAAAATTCGGATGGAGAAAGTTGGGCAATGTCATACAAATGGAGCC
AAAGTCGTAATGTGGCGGAATATGTTTATATTTCAAGGACTACTTTCTCTCCGCCCATTTCAGCAAAGAACACAAATGTGAAAATTACAACTCCCAATAATAATCTCTTC
TTCAAAGTCAATATCCACAAGAAAAGCTACAAGAACTCTGTTTTGAACATTCCTCTTACATTTGCTGAGAATCATCTTTCTTCAAAAATGAACATTGCAAAACTTATAGT
GGGGAAGAAGCAATGGAAGGTGAAGATGAAACACTATGAAAGATGCATCAGCAGCAACGAAGCCGCTTCTAGCCTTGAGTTTTTCAAGGTTTTTCTTCCCGATTTCGCCT
CTCTGCATATGAGCATACCGCCAGCTTTTATGAAGCATTTAAATGGAACCTTTCCAGAAAAAGCTACCATCCAAGATCATACGGGAAAATCATGGTGTATTACATTGGAA
AAACTGGATGACCTTCTGTATTTCAAGAATGGCTGGCAGACTTTTGTAGATTACCATTCCCTGAAATATGGAGACTTCTTAGTTTTCCAATATGACGGCCACTGTACATT
TGATGTTACGATATTTGGTAAAAATGGATGTAAGAAGGCAGTGGCAGCAAAAGATGCTAGTTCTGTCCCAATTTTGGAGGCTGAGATAGCTGAAGCTGGTAATTCTGTTT
CAAATTTGGAGGCTATAGTAGCAGATGCTGATAATTCTGTTTCAAATTTGGAGGCTATAGTAGCAGATACTGGTAATTCTGTTTCAAATTTGGAGGCTATAGTAGCAGAT
GCTGGTAATTCTGTTTCAAATTTGGAGGCTGTGGCAGCAGATGCTGATAATTCTGTTTCAAATTTGGAGGCTGTGGCAGCAGATGCTGGTAATTCTGTTTCAAATTTGGA
GGTTGTGGTGGCAGATGCTGGTAGTTCTGTTCCAATTTTGAAGGTCAAAGAAGAGCCTGTGGTTGAGGAAGAAGATGTCGAACCTTCAATTTCTCACAAGAGGAAGCGAT
TACAAGTTGGATCAGATACAGTTCGTAAGTCAAAAAGCATTGTAGCTTCAAATTGTGGTAGAGCTGGCAATGCTTCGAACTCTGTAGAACAGGTTAGCCCAAGAGGGCTT
TTCTTTGAGCGGACGATGAAACGCTGGTCGCGTCAGACAATTTATATTTCTGGACGTGTGGTGAGGGATGAGAACATCTCGTTGAAGCCAAACATAGTTCTTAGGGATGA
AGAGGGTACATTGTGGCCAGCAACAGTCTCTTTCACTAGCCAGAATCGTATTTCTGTTACTGCTGGATGGTCTAAATTTTACACCGGCCATAAGTTGAGAATAAATGACA
AATGTGAGTTTGAGTTTGTTCTTGAAAGGGGAAATGTGGAGGAGCCCTTCCATTATATGCCTTTCTTTGGAATAGAGAATTTCAGGATTGAACCCTTCGAGATAATTCCA
GTACGAATAAATCCAGAAGTTAGAAAGTACATTGAGTATAGCCAATTTCAAGAACATAGCAGGGAGATGACTTGTAGCCAATTTCATTCATCGTCTGCAAATGAAGAACC
AAAATACATTCAATTTGAGCATGAGGGAGTTGATAGTCAACAACATGATCAATATTTCCAAGATGATGACCTTCAGGAGGATATTCTAAGTGAAGGAGTTGATATGGACA
TAACAAATGAACTACCTATTAGTCAATCACAAGAAATACTGTATTTGGAGTATCAACCCCCTCAAACAGATAAAGAAGACAATTGGAAATCAGCCAATACGGATACTACT
GAGCTTGGTGGGAATTCATTTGATATTAAAGAGAATAATATGGATACTACTACTGAGCTTCGTGAGAATTCATTTGGTATTGAAGAGAATAATATGGATATTACTGAGCA
GCTTGGTGGGAATTTATTTGATATTGAAGAGAATAATATCAAACAGGAAAAACAGTCCCCTGTAACTGTCAAGGCTACCAGAAAGATGAAAAAGAGAAAATCGAGAGAGA
CGACGAGCTTTGAAGTGCAAGAACAAAACGAAGAGACTTCCGAAATTGATACGGATCAGGACTCTAGGAGGAATGTGGAGGACGATGGTAAGAGAAAGACACGATCAAAG
AGGGTAAAGAAATCAAGAATTTCTGGTACGCCTTCTGAACATGATGATGAGGTGGATGTTTATAAGTTTGATATTCAGCCTCTTATAGCAACAAAAACAGAAATGGAAAT
GCCATATATGATACCATTTGGTGGGGTTAAGCCATCAAAAGAAAAGGGTAAATCGCCAGTGGATCAAGAACACAACAGTGATGCTAGAACTTCGTACAACAATGATTACT
GTAATATGAAAGGACCACAGTCTGTTTCAAATGACGGCGTCAAGAATTTTTTGTTCACTAAGATAGTCAACATAGAGGAAATTTTGGGCAGTTTGGTTCATGACATAGAT
AACCTAAAGAATCTCTTTTCAAAAGTGTGTGAAAATGTAAACGAAGCTGCAGATCCTGAAAAGATGAGGGAAGTGCTCTTCTCACAGAAAGTAATGGAAAGGGGAAAGCG
AGATGAACACAACCATAGTATACCACCTGCTTTTGTGAAGTATTTTAATGGAAGAATACCAAGTGAAGCTGTTATCAGAGATCAAAGCAGACAATCTTGGCATGTTACCT
TGGAAGAACTGAAGAATGTTGTGTTTTTCAAGGATGGCTGGCAAGAATTTGTTGAGAGCCACCTCTTGAAACTTGGAGACTTTTTAGTTTTCCAATACGATGGGAGTCAT
ATGTTTGATGTTAAGATATTTAGTAAAAATGGATGCAAGAAAGAGCGAGTATCAAGAACCGGATGTCCTTGCGCGGTTGTGAAGGTCAAAGACGAGCCCCAATCTGAGCA
CAATTATAGTACATCATTAACTCGTTGCAAAAGAAGTGATTCAGAAGTTAGATCCACTGATAGTTCAGGCACTGCCCCTAAATCAAGGAGAAGATCAACATCAAATCTTG
AAGAACTTAGTCCCTCTAAGACTGCAGAACATATTTCGATGGAATCACCAACCTTCGAGCTCATGGTTAAGCGTTGGTCACATAACGCTATTCATATTCCTAAAACGGTG
ATGGTTACCCATAACATCTCACTGAAGCCAAATTTGGTTATTGTAAATGAAAGGGGCAGGTCATGGCTGGTGACAGCAAAGCCAATTAGCCGTGGTCGGTTTGCATTGAC
CACTGGCTGGCCTGCTTTCTTCAGGGCAAATAGCTTGAGAGAAGATGACGAATGTATATTTGAGTTTGTTCTTGATTCAAATAACCTTTGCGGAGAACTAAAGGTGAAAA
TCACCCGCAGCCTGGAAATAACTCAAGAAAAACAGGCAACAATGTTTCTAAATTGGCTCTCGAGGATAGTTTGCAGAACTCTAGTAGAGGAGTTGAAATTTGAGAGGTTG
TATAAAAGTGATGCTCTTGATAAGAAAAGCTACAAG
Protein sequenceShow/hide protein sequence
MPSSPKFFRIVLQRNLQDPKLMIPKTFVKDYGKLLSSSVSLKLSDGKEWKVGLMTAGNGAVWLEKGWDKFSEYYCLEFGSLLVFRLLDGRSSSFDVTIFDPTGVETQYSC
NLDFSTAELEEDSDSDSDKILESFDVHLKKRKKASVPCRQSRKKMRKEDSFTIKTEPEEEEGCNNIFRNIPSCKERMSKDVVKFSRKKQQLANKVEATQRFSSKSDQKPY
FKVVMRQSNVRGRFNLVIPYEFAVKYLPEEFGTINLQIVNGKNWQLLYKWCRTIRANYAYISSGWKRFAEENHLKEGDIGFFQLIKNHNFMFTKLQDTPSSSLSSKKGTA
TTTNNHFFEMDRSYKNSYLVYSATLQVGNKQWNVTLKQYDGYVRFSAGWSTFRDDNGLEDGDTCLFEMMIPKIFVKDYGKLLSSFVFLKLPDGMEWKVGLTTAANGAVWL
QKGWDKFLEHYCLEFGSLLVFRLLDGRKSSSFDVTIFDPTGVETQYSCNVNNCTPDSDCDESLEGFEGLLKKRKKASVPCFRSCKKTRKQDSFSIKVEPVEEEGCRNFSD
IPNCKKKMPKEDEVWISNKGQVLKNKGESTEKPGFKVVMSQSNVGGRFNMAIPEKFARKYLCEEFGSISLQTTNGRKWAVLYKWSRNKDEKVAYFCSGWRVFVQENLLKA
GDVVFFEPIKKHRFLFTKLQDTAPFSSPSPNNKIASTRNPFFKVKIHLKSYGNAVLNIPMGFAKKHLSPEMYYAKLQVRNKEWNVTLKQYESHSRFSAGWSKFYHENGLR
DGNTCLFEMMLPKKFITDYGKFLSNSICLKLPDGLEWKLGSKTANDTVWLQNGWQQFSNHYRLKPGSLLVFRFDGNSTFQTCIFDQTCLEIQYPSNNIGKTKPDDEEFNG
YRYEEVETNNHEINDLKPEKIGFKIVVKKSTVEGRYNMLIPKHFASKHLKEEFGRIEIENSDGESWAMSYKWSQSRNVAEYVYISRTTFSPPISAKNTNVKITTPNNNLF
FKVNIHKKSYKNSVLNIPLTFAENHLSSKMNIAKLIVGKKQWKVKMKHYERCISSNEAASSLEFFKVFLPDFASLHMSIPPAFMKHLNGTFPEKATIQDHTGKSWCITLE
KLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTFDVTIFGKNGCKKAVAAKDASSVPILEAEIAEAGNSVSNLEAIVADADNSVSNLEAIVADTGNSVSNLEAIVAD
AGNSVSNLEAVAADADNSVSNLEAVAADAGNSVSNLEVVVADAGSSVPILKVKEEPVVEEEDVEPSISHKRKRLQVGSDTVRKSKSIVASNCGRAGNASNSVEQVSPRGL
FFERTMKRWSRQTIYISGRVVRDENISLKPNIVLRDEEGTLWPATVSFTSQNRISVTAGWSKFYTGHKLRINDKCEFEFVLERGNVEEPFHYMPFFGIENFRIEPFEIIP
VRINPEVRKYIEYSQFQEHSREMTCSQFHSSSANEEPKYIQFEHEGVDSQQHDQYFQDDDLQEDILSEGVDMDITNELPISQSQEILYLEYQPPQTDKEDNWKSANTDTT
ELGGNSFDIKENNMDTTTELRENSFGIEENNMDITEQLGGNLFDIEENNIKQEKQSPVTVKATRKMKKRKSRETTSFEVQEQNEETSEIDTDQDSRRNVEDDGKRKTRSK
RVKKSRISGTPSEHDDEVDVYKFDIQPLIATKTEMEMPYMIPFGGVKPSKEKGKSPVDQEHNSDARTSYNNDYCNMKGPQSVSNDGVKNFLFTKIVNIEEILGSLVHDID
NLKNLFSKVCENVNEAADPEKMREVLFSQKVMERGKRDEHNHSIPPAFVKYFNGRIPSEAVIRDQSRQSWHVTLEELKNVVFFKDGWQEFVESHLLKLGDFLVFQYDGSH
MFDVKIFSKNGCKKERVSRTGCPCAVVKVKDEPQSEHNYSTSLTRCKRSDSEVRSTDSSGTAPKSRRRSTSNLEELSPSKTAEHISMESPTFELMVKRWSHNAIHIPKTV
MVTHNISLKPNLVIVNERGRSWLVTAKPISRGRFALTTGWPAFFRANSLREDDECIFEFVLDSNNLCGELKVKITRSLEITQEKQATMFLNWLSRIVCRTLVEELKFERL
YKSDALDKKSYK