; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0023052 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0023052
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptiontranscription factor MYB3R-2 isoform X3
Genome locationchr7:43434192..43440046
RNA-Seq ExpressionLag0023052
SyntenyLag0023052
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017884 - SANT domain
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589626.1 Myb-like protein L, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0078.1Show/hide
Query:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF
        S KDKRLSA+ YGPAENSHVACY+ ALTKF P VDRK+WSN ERENLGKGIRQQFQEMVLQ+SVDQIS +QGFSA+SDDLD+I+ASIKDLDITPEKIREF
Subjt:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF

Query:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL
        LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRN WTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKL
Subjt:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL

Query:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE
        RSAVAIFG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEE
Subjt:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE

Query:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG
        EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGP  F               +     +  
Subjt:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG

Query:  LRFLFFFFFLKEICSQALKLLLF---------LSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE
        +R +      K     A K              +    AN +SSVP EVKS KPQR+R+R+GA+TT R+KG  +   NSER  EQN DTQSLE+QLN KE
Subjt:  LRFLFFFFFLKEICSQALKLLLF---------LSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE

Query:  -SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDD-IIERNGESVADKDLD
         +ER +SDC ETVDENGME  ENK AE  S+  VCFSEQEENQNSTGSSGVSVLSEMT DMDEYNPS   P DT LLA  T DD IIE  G +VADKDLD
Subjt:  -SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDD-IIERNGESVADKDLD

Query:  D--SFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETF
        D  SFSLP SCLELR TDSEGV+SYSVDE T K HGVCK QGRRKKNSK SNKSQDSLVSC+Q E E S   ELHR NQSKKRKH+ TN TS LGT E  
Subjt:  D--SFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETF

Query:  EEVDDCTLLGFLQKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPTIASLLKDKLKRKR
        EEVDDCTL GFLQKR+KRT T HDKKVDGSSS+  EVDNDDNDPT+A LLKDKLKRK+
Subjt:  EEVDDCTLLGFLQKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPTIASLLKDKLKRKR

XP_022921860.1 uncharacterized protein LOC111430000 isoform X1 [Cucurbita moschata]0.0e+0077.84Show/hide
Query:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF
        S KDKRLSA+ YGPAENSHVACY+ ALTKF P VDRK+WSN ERENLGKGIRQQFQEMVLQ+SVDQIS +QGFSA+SDDLD+I+ASIK LDITPEKIREF
Subjt:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF

Query:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL
        LPKVNWDKLA MYL+GRSGAECEARWLNFEDPLINRN WTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKL
Subjt:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL

Query:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE
        RSAVAIFG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEE
Subjt:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE

Query:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG
        EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGP  F               +     +  
Subjt:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG

Query:  LRFLFFFFFLKEICSQALKLLLF---------LSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE
        +R +      K     A K +            +    AN +SSVP EVKS KPQR+R+R+GA+TT R+KG  +   NSER  EQN DT+SLE+QLN KE
Subjt:  LRFLFFFFFLKEICSQALKLLLF---------LSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE

Query:  -SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLDD
         +ER +SDC ETVDENGME  ENK AE  S+  VCFSEQEENQNSTGSSGVSVLSEMT DMDEYNPS   P DT LLA  T DDIIE  G +VADKDLDD
Subjt:  -SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLDD

Query:  --SFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETFE
          SFSLP SCLELR TDSEGV+SYSVDE T K HGVCK QGRRKKNSK SNKSQDSLVSC+Q ELE S   ELHR NQSKKRKH+ TN TS LGT E  E
Subjt:  --SFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETFE

Query:  EVDDCTLLGFLQKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPTIASLLKDKLKRKRH
        EVDDCTL GFLQKR+KRT T HDKKVDGSSS+  EVDNDDNDPT+A LLKDKLKRK+H
Subjt:  EVDDCTLLGFLQKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPTIASLLKDKLKRKRH

XP_022921861.1 uncharacterized protein LOC111430000 isoform X2 [Cucurbita moschata]0.0e+0077.7Show/hide
Query:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF
        S KDKRLSA+ YGPAENSHVACY+ ALTKF P VDRK+WSN ERENLGKGIRQQFQEMVLQ+SVDQIS +QGFSA+SDDLD+I+ASIK LDITPEKIREF
Subjt:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF

Query:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL
        LPKVNWDKLA MYL+GRSGAECEARWLNFEDPLINRN WTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKL
Subjt:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL

Query:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE
        RSAVAIFG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEE
Subjt:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE

Query:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG
        EDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGP  F               +     +  
Subjt:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG

Query:  LRFLFFFFFLKEICSQALKLLLF---------LSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE
        +R +      K     A K +            +    AN +SSVP EVKS KPQR+R+R+GA+TT R+KG  +   NSER  EQN DT+SLE+QLN KE
Subjt:  LRFLFFFFFLKEICSQALKLLLF---------LSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE

Query:  -SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLDD
         +ER +SDC ETVDENGME  ENK AE  S+  VCFSEQEENQNSTGSSGVSVLSEMT DMDEYNPS   P DT LLA  T DDIIE  G +VADKDLDD
Subjt:  -SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLDD

Query:  --SFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETFE
          SFSLP SCLELR TDSEGV+SYSVDE T K HGVCK QGRRKKNSK SNKSQDSLVSC+Q ELE S   ELHR NQSKKRKH+ TN TS LGT E  E
Subjt:  --SFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETFE

Query:  EVDDCTLLGFLQKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPTIASLLKDKLKRKRH
        EVDDCTL GFLQKR+KRT T HDKKVDGSSS+  EVDNDDNDPT+A LLKDKLKRK+H
Subjt:  EVDDCTLLGFLQKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPTIASLLKDKLKRKRH

XP_022987958.1 uncharacterized protein LOC111485355 isoform X1 [Cucurbita maxima]0.0e+0077.84Show/hide
Query:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF
        S KDKRLSA+ YGPAENSHVACY++ALTKF P VDRK+WSN ERENLGKGIRQQFQEMVLQ+SVDQIS +QGFSA+SDDLD+I+ASIKDLDITPEKIREF
Subjt:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF

Query:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL
        LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWID+AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDEDDKL
Subjt:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL

Query:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE
        RSAVAIFG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEE
Subjt:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE

Query:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG
        EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGP  F               +     +  
Subjt:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG

Query:  LRFLFFFFFLKEICSQALK---------LLLFLSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE
         R +      K     A K          +   +    A+ +SSVP EVKS KPQR+R+R+GA+TT R+KG  +   NSER  EQN DT++LE+QLN KE
Subjt:  LRFLFFFFFLKEICSQALK---------LLLFLSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE

Query:  -SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLD-
         +ER +SDC ETVDENGME  ENK AE  S+  VCFSEQEENQNSTGSSGVSVLSEMT DMDEYNPS +   DT LLA  T DDIIE  G +VADKDLD 
Subjt:  -SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLD-

Query:  -DSFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETFE
         +SFSLP SCLELR TDSEGV+SYSVDE T K H VCK QGRRKKNSK SNKSQDSLVSC+Q ELE S T ELHR NQ KKRKH+STN TS LGT E  E
Subjt:  -DSFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETFE

Query:  EVDDCTLLGFLQKRVKRTNTRHDKKVDGSSS-SLEVDNDDNDPTIASLLKDKLKRKRH
        EVDDCTLLGFLQKR+KRT T H KKVDGSSS S EVDNDDNDPT+A LLK+KLKRK+H
Subjt:  EVDDCTLLGFLQKRVKRTNTRHDKKVDGSSS-SLEVDNDDNDPTIASLLKDKLKRKRH

XP_023515735.1 uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0077.44Show/hide
Query:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF
        S KDKRLS++ YGPAENSHVACY+ A TKF P VDRK+WSN ERENLGKGIRQQFQEMVLQ+SVDQIS +QGFSA+SDDLD+I+ASIK LDITPEKIREF
Subjt:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF

Query:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL
        LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKL
Subjt:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL

Query:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE
        RSAVA+FG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEE
Subjt:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE

Query:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG
        EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGP  F               +     +  
Subjt:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG

Query:  LRFLFFFFFLKEICSQALKLLLF---------LSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE
         R +      K     A K              +     N +SSVP EVKS KPQR+R+R+GA+TT R+KG  +   NSER  EQN DT+S+E+QLN KE
Subjt:  LRFLFFFFFLKEICSQALKLLLF---------LSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE

Query:  -SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLDD
         +ER +SDC ETVDENGME  ENK AE  S+  VCFSEQEENQNSTGSSGVSVLSEMT DMDEYNPS +   DT LLA  T DDIIE  G +VADKDLDD
Subjt:  -SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLDD

Query:  --SFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETFE
          SFSLP SCLELR TDSEGV+SYSVDE T K HGVCK QGRRKKNSK SNKSQDSLVSC+Q ELE S T ELHR NQSKKRKH+ TN TS LGT E  E
Subjt:  --SFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETFE

Query:  EVDDCTLLGFLQKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPTIASLLKDKLKRKRH
        EVDDCTL GFLQKR+KRT T HDKKVDGSSS+  EVDNDDNDPT+A LL DKLKRK+H
Subjt:  EVDDCTLLGFLQKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPTIASLLKDKLKRKRH

TrEMBL top hitse value%identityAlignment
A0A0A0L2R2 Uncharacterized protein2.6e-29373.17Show/hide
Query:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF
        S KDKRLS +YYGP ENSHVACY+M L KFPP VDRKKWS VERENLGKGIRQQFQEMVLQ+SVDQISG QG S DSDDLD+I+ASIKDLDI P+KIREF
Subjt:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF

Query:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL
        LPKVNWDKLASMYL+GRSGAECEARWLNFEDPLINR+PWTTSEDK+LLFTIQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILKREWTK+EDD+L
Subjt:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL

Query:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE
        RSAVA FGV DWQAVASTLEGR G QCSNRWKKSLDPART++GYFTPDED RLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEE
Subjt:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE

Query:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG
        EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFP+EVPLLQEARKIQKAALISNFVDRE+ERPALGP  F             G       +  
Subjt:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG

Query:  LRFLFFFFFLKEICSQALKLLL---------FLSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE
         R +      K     A K              +  GIA  +S VPEEV+S KPQR+R+R GA+T KR  GV E RS+SE   +QNLDT+SL LQLNSKE
Subjt:  LRFLFFFFFLKEICSQALKLLL---------FLSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE

Query:  SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLDD-
        SER++S+CTETVDEN ME  ENKVAEKL++ + CFSE E+NQNSTGSSGVSVLSEMT D+ +YNPSI+   DT L A  T DDI E  G+S AD+DLDD 
Subjt:  SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLDD-

Query:  -SFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCK-YQGRRKKNSKTSNKSQDSLVSCR-QVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETF
         SFSL HSCLELR  DSEGV+SYSVDE T K +GVC   QGRRKKNSKTSN S D+L+  R Q+  E   TK+   HNQSKKRKH++T   S L TSE  
Subjt:  -SFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCK-YQGRRKKNSKTSNKSQDSLVSCR-QVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETF

Query:  EEVDDCTLLGFLQKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPTIASLLKDKLKRKRHEAASG
        EEVDDCTL+GFLQKR+KRT   H++ VD SS++ L+VDNDDN+PTIAS L +KLKRK+H+  SG
Subjt:  EEVDDCTLLGFLQKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPTIASLLKDKLKRKRHEAASG

A0A6J1E2J4 uncharacterized protein LOC111430000 isoform X20.0e+0077.7Show/hide
Query:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF
        S KDKRLSA+ YGPAENSHVACY+ ALTKF P VDRK+WSN ERENLGKGIRQQFQEMVLQ+SVDQIS +QGFSA+SDDLD+I+ASIK LDITPEKIREF
Subjt:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF

Query:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL
        LPKVNWDKLA MYL+GRSGAECEARWLNFEDPLINRN WTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKL
Subjt:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL

Query:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE
        RSAVAIFG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEE
Subjt:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE

Query:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG
        EDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGP  F               +     +  
Subjt:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG

Query:  LRFLFFFFFLKEICSQALKLLLF---------LSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE
        +R +      K     A K +            +    AN +SSVP EVKS KPQR+R+R+GA+TT R+KG  +   NSER  EQN DT+SLE+QLN KE
Subjt:  LRFLFFFFFLKEICSQALKLLLF---------LSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE

Query:  -SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLDD
         +ER +SDC ETVDENGME  ENK AE  S+  VCFSEQEENQNSTGSSGVSVLSEMT DMDEYNPS   P DT LLA  T DDIIE  G +VADKDLDD
Subjt:  -SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLDD

Query:  --SFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETFE
          SFSLP SCLELR TDSEGV+SYSVDE T K HGVCK QGRRKKNSK SNKSQDSLVSC+Q ELE S   ELHR NQSKKRKH+ TN TS LGT E  E
Subjt:  --SFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETFE

Query:  EVDDCTLLGFLQKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPTIASLLKDKLKRKRH
        EVDDCTL GFLQKR+KRT T HDKKVDGSSS+  EVDNDDNDPT+A LLKDKLKRK+H
Subjt:  EVDDCTLLGFLQKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPTIASLLKDKLKRKRH

A0A6J1E6Z7 uncharacterized protein LOC111430000 isoform X10.0e+0077.84Show/hide
Query:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF
        S KDKRLSA+ YGPAENSHVACY+ ALTKF P VDRK+WSN ERENLGKGIRQQFQEMVLQ+SVDQIS +QGFSA+SDDLD+I+ASIK LDITPEKIREF
Subjt:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF

Query:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL
        LPKVNWDKLA MYL+GRSGAECEARWLNFEDPLINRN WTTSEDKNLLFTIQQKGLNNWI++AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKL
Subjt:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL

Query:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE
        RSAVAIFG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEE
Subjt:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE

Query:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG
        EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGP  F               +     +  
Subjt:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG

Query:  LRFLFFFFFLKEICSQALKLLLF---------LSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE
        +R +      K     A K +            +    AN +SSVP EVKS KPQR+R+R+GA+TT R+KG  +   NSER  EQN DT+SLE+QLN KE
Subjt:  LRFLFFFFFLKEICSQALKLLLF---------LSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE

Query:  -SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLDD
         +ER +SDC ETVDENGME  ENK AE  S+  VCFSEQEENQNSTGSSGVSVLSEMT DMDEYNPS   P DT LLA  T DDIIE  G +VADKDLDD
Subjt:  -SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLDD

Query:  --SFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETFE
          SFSLP SCLELR TDSEGV+SYSVDE T K HGVCK QGRRKKNSK SNKSQDSLVSC+Q ELE S   ELHR NQSKKRKH+ TN TS LGT E  E
Subjt:  --SFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETFE

Query:  EVDDCTLLGFLQKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPTIASLLKDKLKRKRH
        EVDDCTL GFLQKR+KRT T HDKKVDGSSS+  EVDNDDNDPT+A LLKDKLKRK+H
Subjt:  EVDDCTLLGFLQKRVKRTNTRHDKKVDGSSSS-LEVDNDDNDPTIASLLKDKLKRKRH

A0A6J1JK98 uncharacterized protein LOC111485355 isoform X22.9e-31077.7Show/hide
Query:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF
        S KDKRLSA+ YGPAENSHVACY++ALTKF P VDRK+WSN ERENLGKGIRQQFQEMVLQ+SVDQIS +QGFSA+SDDLD+I+ASIKDLDITPEKIREF
Subjt:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF

Query:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL
        LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWID+AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDEDDKL
Subjt:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL

Query:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE
        RSAVAIFG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEE
Subjt:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE

Query:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG
        EDLRLEIAIQEHGYSWAKVAACVPSRTDNEC RRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGP  F               +     +  
Subjt:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG

Query:  LRFLFFFFFLKEICSQALK---------LLLFLSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE
         R +      K     A K          +   +    A+ +SSVP EVKS KPQR+R+R+GA+TT R+KG  +   NSER  EQN DT++LE+QLN KE
Subjt:  LRFLFFFFFLKEICSQALK---------LLLFLSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE

Query:  -SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLD-
         +ER +SDC ETVDENGME  ENK AE  S+  VCFSEQEENQNSTGSSGVSVLSEMT DMDEYNPS +   DT LLA  T DDIIE  G +VADKDLD 
Subjt:  -SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLD-

Query:  -DSFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETFE
         +SFSLP SCLELR TDSEGV+SYSVDE T K H VCK QGRRKKNSK SNKSQDSLVSC+Q ELE S T ELHR NQ KKRKH+STN TS LGT E  E
Subjt:  -DSFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETFE

Query:  EVDDCTLLGFLQKRVKRTNTRHDKKVDGSSS-SLEVDNDDNDPTIASLLKDKLKRKRH
        EVDDCTLLGFLQKR+KRT T H KKVDGSSS S EVDNDDNDPT+A LLK+KLKRK+H
Subjt:  EVDDCTLLGFLQKRVKRTNTRHDKKVDGSSS-SLEVDNDDNDPTIASLLKDKLKRKRH

A0A6J1JKV7 uncharacterized protein LOC111485355 isoform X10.0e+0077.84Show/hide
Query:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF
        S KDKRLSA+ YGPAENSHVACY++ALTKF P VDRK+WSN ERENLGKGIRQQFQEMVLQ+SVDQIS +QGFSA+SDDLD+I+ASIKDLDITPEKIREF
Subjt:  SYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREF

Query:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL
        LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWID+AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDEDDKL
Subjt:  LPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKL

Query:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE
        RSAVAIFG GDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRC+WTEE
Subjt:  RSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE

Query:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG
        EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPN+VPLLQEARKIQK ALISNFVDRESERPALGP  F               +     +  
Subjt:  EDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTF-------------GLDRIQIYHSG

Query:  LRFLFFFFFLKEICSQALK---------LLLFLSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE
         R +      K     A K          +   +    A+ +SSVP EVKS KPQR+R+R+GA+TT R+KG  +   NSER  EQN DT++LE+QLN KE
Subjt:  LRFLFFFFFLKEICSQALK---------LLLFLSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKE

Query:  -SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLD-
         +ER +SDC ETVDENGME  ENK AE  S+  VCFSEQEENQNSTGSSGVSVLSEMT DMDEYNPS +   DT LLA  T DDIIE  G +VADKDLD 
Subjt:  -SERTSSDCTETVDENGMESCENKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMT-DMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLD-

Query:  -DSFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETFE
         +SFSLP SCLELR TDSEGV+SYSVDE T K H VCK QGRRKKNSK SNKSQDSLVSC+Q ELE S T ELHR NQ KKRKH+STN TS LGT E  E
Subjt:  -DSFSLPHSCLELRATDSEGVNSYSVDEVTGKGHGVCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETFE

Query:  EVDDCTLLGFLQKRVKRTNTRHDKKVDGSSS-SLEVDNDDNDPTIASLLKDKLKRKRH
        EVDDCTLLGFLQKR+KRT T H KKVDGSSS S EVDNDDNDPT+A LLK+KLKRK+H
Subjt:  EVDDCTLLGFLQKRVKRTNTRHDKKVDGSSS-SLEVDNDDNDPTIASLLKDKLKRKRH

SwissProt top hitse value%identityAlignment
P46200 Transcriptional activator Myb7.7e-3239.77Show/hide
Query:  KREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT++ED+KL+  V   G  DW+ +A+ L  RT  QC +RW+K L+P   K G +T +ED R+   V  +GPK W+  A+ L GR   QCRERW N L
Subjt:  KREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVP---LLQEARKIQKAALISNF
        +P +++  WTEEED  +  A +  G  WA++A  +P RTDN  +  W      +V     LQE+ K  + A+ ++F
Subjt:  DPSLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVP---LLQEARKIQKAALISNF

P91868 snRNA-activating protein complex subunit 4 homolog2.0e-3233.08Show/hide
Query:  VNWDKLASMYLRG-RSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRS
        V W  +A+   +G R+    +++W N  +P  N+  W+  E + L +  +     +W  +A++LGTNRT +QC+ +Y+  ++     +EW++DED KL +
Subjt:  VNWDKLASMYLRG-RSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRS

Query:  AVAIFGVG---DWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLR-RCQWT
           I  +     W  VA  + GRT  Q   R+  +LD A  K G +T  ED  L  AV  +G K+W K A+ +  RN  QCRERW N L+ S     ++T
Subjt:  AVAIFGVG---DWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLR-RCQWT

Query:  EEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVD
          ED +L  A++  G  +WAK    +P +T  + RRR+ +L          A K++ AA   N VD
Subjt:  EEEDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVD

Q54NA6 Myb-like protein L2.1e-5835.83Show/hide
Query:  PAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQ-FQEMVLQMSVDQIS---------GLQGFSADSDDLDSIIASIKDLDITPEKIREFLPK
        PA+N      ++     P     ++W+  E E L KGI+++  Q+ + ++S D++S          +Q  S ++++ ++ I +  + +   +K     P 
Subjt:  PAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQ-FQEMVLQMSVDQIS---------GLQGFSADSDDLDSIIASIKDLDITPEKIREFLPK

Query:  VNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSA
        +       +    RS  E   RW N +DP IN+ P+T  EDK LL   ++   + W  I++ LGTNRTP  C+ RYQRSLN+ ++KREWTK+ED+ L   
Subjt:  VNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSA

Query:  VAIFGVG---DWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE
        + +   G   DWQ +   + GRTG QC +RW K+LDP+  K+G ++P+ED  L  AV  +G  NW      + GR  VQCRER+ N LDP L + +WT +
Subjt:  VAIFGVG---DWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEE

Query:  EDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKL--FPNEVPLLQEARKIQKAALISNFVDRESERPAL
        ED RL     + G   W+ VA  + +RTDN+C RRWK+L    N +   QE    +K   +SNF  R+ ER  L
Subjt:  EDLRLEIAIQEHGY-SWAKVAACVPSRTDNECRRRWKKL--FPNEVPLLQEARKIQKAALISNFVDRESERPAL

Q5SXM2 snRNA-activating protein complex subunit 41.4e-4132.02Show/hide
Query:  KWSNVERENLGKGIRQQFQEMVLQMSVDQISGL-QGFSADSDDLD---------SIIASIKDLDITPEK--IREFLPKVNWDKLASMYLRG-RSGAECEA
        KW N E+  L K +     + +LQ  + ++  L Q  S  S +L+              I+D++  PE+  +   L   +W+K++++   G RS  E   
Subjt:  KWSNVERENLGKGIRQQFQEMVLQMSVDQISGL-QGFSADSDDLD---------SIIASIKDLDITPEK--IREFLPKVNWDKLASMYLRG-RSGAECEA

Query:  RWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGD---WQAVASTLEG
         W N E P IN+  W+  E++ L       G   W  IA  LGT+R+ FQCL ++Q+  N ++ ++EWT++ED  L   V    VG    ++ +   +EG
Subjt:  RWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGD---WQAVASTLEG

Query:  RTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDLRLEIAIQEHGYS-WAKVA
        R   Q   RW KSLDP   K+GY+ P+ED++L  AV  +G ++W K  E +PGR+  QCR+R+   L  SL++ +W  +E+ +L   I+++G   WAK+A
Subjt:  RTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDLRLEIAIQEHGYS-WAKVA

Query:  ACVPSRTDNECRRRWKKLFPNEVPLLQEARK
        + +P R+ ++C  +WK +   +  L +  R+
Subjt:  ACVPSRTDNECRRRWKKLFPNEVPLLQEARK

Q8BP86 snRNA-activating protein complex subunit 42.9e-3930.82Show/hide
Query:  KWSNVERENLGKGIRQQFQEMVLQMSV----------DQISGLQGFSADSDDLDSIIASIKDLDITPEK--IREFLPKVNWDKLASMYLRG-RSGAECEA
        KW + E+  L K +     + +LQ  +           ++S      A    +      I+D++  PE+  +   L   +W+K++++   G RS  E   
Subjt:  KWSNVERENLGKGIRQQFQEMVLQMSV----------DQISGLQGFSADSDDLDSIIASIKDLDITPEK--IREFLPKVNWDKLASMYLRG-RSGAECEA

Query:  RWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGD---WQAVASTLEG
         W + E P I++  W+T E + L       G   W  +A  LGT+R+ FQCL ++Q+  N ++ ++EWT++ED  L   V    VG+   ++ +   +EG
Subjt:  RWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGD---WQAVASTLEG

Query:  RTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDLRLEIAIQEHGYS-WAKVA
        R   Q   RW KSLDP+  KRG++ P+ED++L  AV  +G ++W K  E +PGR+  QCR+R+   L  SL++ +W  +E+ +L   I+++G   WA++A
Subjt:  RTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDLRLEIAIQEHGYS-WAKVA

Query:  ACVPSRTDNECRRRWKKL
        + +P R+ ++C  +WK L
Subjt:  ACVPSRTDNECRRRWKKL

Arabidopsis top hitse value%identityAlignment
AT3G09370.1 myb domain protein 3r-34.8e-2941.5Show/hide
Query:  KREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT +ED+ LR AV  F    W+ +A +   RT  QC +RW+K L+P   K G +T +ED ++   V  +GP  W+  A+ LPGR   QCRERW N L
Subjt:  KREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW
        +P + +  WT EE++ L  A + HG  WA++A  +P RTDN  +  W
Subjt:  DPSLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW

AT3G09370.2 myb domain protein 3r-34.8e-2941.5Show/hide
Query:  KREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL
        K  WT +ED+ LR AV  F    W+ +A +   RT  QC +RW+K L+P   K G +T +ED ++   V  +GP  W+  A+ LPGR   QCRERW N L
Subjt:  KREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCL

Query:  DPSLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW
        +P + +  WT EE++ L  A + HG  WA++A  +P RTDN  +  W
Subjt:  DPSLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRW

AT3G18100.1 myb domain protein 4r15.5e-13460.6Show/hide
Query:  DKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREFLPK
        DK++S +  GPAEN  V  Y+MAL K+P SV R+KWS  E +NL KG++Q+ Q+++L  ++++ S L+G +    D+D+I  SI +L+ITPE IR+FLPK
Subjt:  DKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREFLPK

Query:  VNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSA
        +NWD   S+ ++ RS AECEARW++ EDPLIN  PWT +EDKNLL TI+Q  L +W+DIAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR+A
Subjt:  VNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSA

Query:  VAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDL
        V +FG  DWQ+VA+ L+GRTG QCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED 
Subjt:  VAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDL

Query:  RLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPAL
        +L  AI EHGYSW+KVA  +  RTDN+C RRWK+L+P++V LLQEAR++QK A + NFVDRESERPAL
Subjt:  RLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPAL

AT3G18100.2 myb domain protein 4r15.5e-13460.6Show/hide
Query:  DKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREFLPK
        DK++S +  GPAEN  V  Y+MAL K+P SV R+KWS  E +NL KG++Q+ Q+++L  ++++ S L+G +    D+D+I  SI +L+ITPE IR+FLPK
Subjt:  DKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREFLPK

Query:  VNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSA
        +NWD   S+ ++ RS AECEARW++ EDPLIN  PWT +EDKNLL TI+Q  L +W+DIAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR+A
Subjt:  VNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSA

Query:  VAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDL
        V +FG  DWQ+VA+ L+GRTG QCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED 
Subjt:  VAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDL

Query:  RLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPAL
        +L  AI EHGYSW+KVA  +  RTDN+C RRWK+L+P++V LLQEAR++QK A + NFVDRESERPAL
Subjt:  RLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPAL

AT3G18100.3 myb domain protein 4r15.5e-13460.6Show/hide
Query:  DKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREFLPK
        DK++S +  GPAEN  V  Y+MAL K+P SV R+KWS  E +NL KG++Q+ Q+++L  ++++ S L+G +    D+D+I  SI +L+ITPE IR+FLPK
Subjt:  DKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMVLQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREFLPK

Query:  VNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSA
        +NWD   S+ ++ RS AECEARW++ EDPLIN  PWT +EDKNLL TI+Q  L +W+DIAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR+A
Subjt:  VNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSA

Query:  VAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDL
        V +FG  DWQ+VA+ L+GRTG QCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++F+PGR Q QCRERW NCLDP + R +WTEEED 
Subjt:  VAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQCRERWFNCLDPSLRRCQWTEEEDL

Query:  RLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPAL
        +L  AI EHGYSW+KVA  +  RTDN+C RRWK+L+P++V LLQEAR++QK A + NFVDRESERPAL
Subjt:  RLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGTGGCTCATCTGCTTTTCCGATCTATTAGTTCCTTTTGTTTAGAGGTTATTGCATATTTGGTTACTGTTAAAAGACTAACAGTAATGCTGGCGAACCTGAGAGC
TCTATATTTTACAGCAACACTGATAAACTTGAGAGCTCTATGTTCTTATAAGGACAAACGACTTTCTGCAGTGTATTATGGCCCAGCTGAGAATTCTCATGTTGCATGCT
ACAAAATGGCATTGACAAAGTTTCCACCTTCTGTAGATCGAAAAAAATGGTCCAATGTAGAAAGGGAGAATCTAGGGAAGGGAATAAGACAGCAATTTCAAGAGATGGTG
CTTCAGATGTCAGTGGATCAAATTAGTGGGCTACAAGGATTTTCAGCAGATTCAGATGATCTGGATAGCATTATTGCATCAATAAAAGACCTTGATATTACCCCTGAAAA
GATTAGGGAATTTCTACCAAAAGTTAATTGGGACAAACTGGCTTCCATGTATCTTCGGGGTCGCTCAGGGGCAGAATGTGAAGCAAGGTGGTTGAATTTTGAAGACCCCC
TAATTAATCGGAATCCATGGACTACAAGTGAGGACAAGAATCTTTTGTTTACCATCCAACAGAAAGGGTTGAATAACTGGATTGACATAGCAGTTTCATTGGGTACCAAC
AGAACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGTTTAAATGCTTCCATATTAAAGAGGGAGTGGACCAAAGATGAGGATGATAAACTTCGATCTGCTGTTGCTAT
ATTTGGTGTGGGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAACCGGTCCACAGTGCTCTAATAGGTGGAAAAAATCCCTTGACCCAGCTAGGACAAAAAGAG
GCTATTTCACTCCAGATGAAGACAGTCGCTTGAAAATTGCGGTACTGCTTTTTGGGCCTAAAAATTGGAACAAGAAAGCAGAATTTTTACCTGGTCGAAATCAAGTTCAA
TGCAGAGAAAGATGGTTCAATTGTTTAGATCCTTCCTTGAGAAGATGTCAATGGACAGAAGAGGAGGATTTAAGGCTAGAGATAGCAATTCAGGAACATGGATATAGCTG
GGCTAAGGTAGCTGCATGTGTGCCGTCACGTACAGATAATGAGTGCCGGAGGAGATGGAAGAAGTTATTTCCCAATGAAGTTCCTTTGCTCCAGGAAGCTAGAAAGATTC
AGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAATCAGAGCGTCCTGCTCTTGGTCCACTGACTTTCGGCCTCGACCGGATACAGATTTATCATTCAGGTCTTCGT
TTTCTTTTCTTTTTCTTTTTCCTGAAAGAAATCTGTAGTCAGGCATTGAAGCTCTTGTTATTTCTTTCTTCATGTGGTATTGCAAATATTTCGTCTTCTGTCCCGGAGGA
GGTTAAATCTATAAAACCTCAAAGACAACGAAGTAGAAATGGAGCGCATACCACTAAGAGGAAGAAGGGAGTTCTGGAGCCACGTTCTAACAGTGAGAGACGTGTCGAAC
AGAATTTGGATACTCAGAGCCTCGAGTTGCAGCTGAATAGTAAGGAATCAGAGAGGACCAGCAGTGACTGCACTGAGACTGTTGATGAAAATGGTATGGAGTCGTGTGAG
AACAAAGTTGCAGAGAAGCTTTCTAAAAGAGATGTATGCTTTTCAGAGCAAGAAGAAAATCAAAACTCTACTGGATCTTCTGGAGTATCAGTATTGTCAGAAATGACCGA
CATGGACGAGTACAATCCCTCTATTATCCTTCCAGCAGATACAGCATTGCTGGCCATCCCTACTGGGGATGATATTATAGAAAGGAATGGGGAGAGTGTTGCAGACAAAG
ATCTGGATGACAGTTTCTCGTTACCGCACAGTTGCTTAGAACTCAGGGCAACTGACAGTGAAGGTGTCAACAGCTATTCTGTGGATGAAGTCACAGGTAAAGGCCATGGG
GTTTGCAAGTACCAAGGCAGAAGGAAGAAAAATAGTAAAACATCAAATAAGAGCCAGGATTCATTGGTTTCTTGTAGACAAGTGGAGCTGGAGAGGTCGGAGACGAAGGA
GCTTCATCGTCATAATCAATCGAAGAAGAGAAAGCATAACAGTACAAATACGACGAGTTTATTAGGAACATCGGAGACATTCGAAGAGGTCGATGACTGCACTCTTCTGG
GCTTTTTGCAGAAAAGAGTGAAGAGGACGAACACGAGACATGACAAGAAAGTTGATGGCAGCTCTAGTTCATTAGAAGTTGATAATGATGATAATGATCCTACCATTGCC
TCGCTTCTCAAGGATAAATTGAAGAGAAAAAGGCATGAAGCGGCCAGTGGTGGAGGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTAGTGGCTCATCTGCTTTTCCGATCTATTAGTTCCTTTTGTTTAGAGGTTATTGCATATTTGGTTACTGTTAAAAGACTAACAGTAATGCTGGCGAACCTGAGAGC
TCTATATTTTACAGCAACACTGATAAACTTGAGAGCTCTATGTTCTTATAAGGACAAACGACTTTCTGCAGTGTATTATGGCCCAGCTGAGAATTCTCATGTTGCATGCT
ACAAAATGGCATTGACAAAGTTTCCACCTTCTGTAGATCGAAAAAAATGGTCCAATGTAGAAAGGGAGAATCTAGGGAAGGGAATAAGACAGCAATTTCAAGAGATGGTG
CTTCAGATGTCAGTGGATCAAATTAGTGGGCTACAAGGATTTTCAGCAGATTCAGATGATCTGGATAGCATTATTGCATCAATAAAAGACCTTGATATTACCCCTGAAAA
GATTAGGGAATTTCTACCAAAAGTTAATTGGGACAAACTGGCTTCCATGTATCTTCGGGGTCGCTCAGGGGCAGAATGTGAAGCAAGGTGGTTGAATTTTGAAGACCCCC
TAATTAATCGGAATCCATGGACTACAAGTGAGGACAAGAATCTTTTGTTTACCATCCAACAGAAAGGGTTGAATAACTGGATTGACATAGCAGTTTCATTGGGTACCAAC
AGAACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGTTTAAATGCTTCCATATTAAAGAGGGAGTGGACCAAAGATGAGGATGATAAACTTCGATCTGCTGTTGCTAT
ATTTGGTGTGGGAGATTGGCAGGCAGTAGCTTCTACTTTGGAAGGACGAACCGGTCCACAGTGCTCTAATAGGTGGAAAAAATCCCTTGACCCAGCTAGGACAAAAAGAG
GCTATTTCACTCCAGATGAAGACAGTCGCTTGAAAATTGCGGTACTGCTTTTTGGGCCTAAAAATTGGAACAAGAAAGCAGAATTTTTACCTGGTCGAAATCAAGTTCAA
TGCAGAGAAAGATGGTTCAATTGTTTAGATCCTTCCTTGAGAAGATGTCAATGGACAGAAGAGGAGGATTTAAGGCTAGAGATAGCAATTCAGGAACATGGATATAGCTG
GGCTAAGGTAGCTGCATGTGTGCCGTCACGTACAGATAATGAGTGCCGGAGGAGATGGAAGAAGTTATTTCCCAATGAAGTTCCTTTGCTCCAGGAAGCTAGAAAGATTC
AGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGGGAATCAGAGCGTCCTGCTCTTGGTCCACTGACTTTCGGCCTCGACCGGATACAGATTTATCATTCAGGTCTTCGT
TTTCTTTTCTTTTTCTTTTTCCTGAAAGAAATCTGTAGTCAGGCATTGAAGCTCTTGTTATTTCTTTCTTCATGTGGTATTGCAAATATTTCGTCTTCTGTCCCGGAGGA
GGTTAAATCTATAAAACCTCAAAGACAACGAAGTAGAAATGGAGCGCATACCACTAAGAGGAAGAAGGGAGTTCTGGAGCCACGTTCTAACAGTGAGAGACGTGTCGAAC
AGAATTTGGATACTCAGAGCCTCGAGTTGCAGCTGAATAGTAAGGAATCAGAGAGGACCAGCAGTGACTGCACTGAGACTGTTGATGAAAATGGTATGGAGTCGTGTGAG
AACAAAGTTGCAGAGAAGCTTTCTAAAAGAGATGTATGCTTTTCAGAGCAAGAAGAAAATCAAAACTCTACTGGATCTTCTGGAGTATCAGTATTGTCAGAAATGACCGA
CATGGACGAGTACAATCCCTCTATTATCCTTCCAGCAGATACAGCATTGCTGGCCATCCCTACTGGGGATGATATTATAGAAAGGAATGGGGAGAGTGTTGCAGACAAAG
ATCTGGATGACAGTTTCTCGTTACCGCACAGTTGCTTAGAACTCAGGGCAACTGACAGTGAAGGTGTCAACAGCTATTCTGTGGATGAAGTCACAGGTAAAGGCCATGGG
GTTTGCAAGTACCAAGGCAGAAGGAAGAAAAATAGTAAAACATCAAATAAGAGCCAGGATTCATTGGTTTCTTGTAGACAAGTGGAGCTGGAGAGGTCGGAGACGAAGGA
GCTTCATCGTCATAATCAATCGAAGAAGAGAAAGCATAACAGTACAAATACGACGAGTTTATTAGGAACATCGGAGACATTCGAAGAGGTCGATGACTGCACTCTTCTGG
GCTTTTTGCAGAAAAGAGTGAAGAGGACGAACACGAGACATGACAAGAAAGTTGATGGCAGCTCTAGTTCATTAGAAGTTGATAATGATGATAATGATCCTACCATTGCC
TCGCTTCTCAAGGATAAATTGAAGAGAAAAAGGCATGAAGCGGCCAGTGGTGGAGGCTAA
Protein sequenceShow/hide protein sequence
MVVAHLLFRSISSFCLEVIAYLVTVKRLTVMLANLRALYFTATLINLRALCSYKDKRLSAVYYGPAENSHVACYKMALTKFPPSVDRKKWSNVERENLGKGIRQQFQEMV
LQMSVDQISGLQGFSADSDDLDSIIASIKDLDITPEKIREFLPKVNWDKLASMYLRGRSGAECEARWLNFEDPLINRNPWTTSEDKNLLFTIQQKGLNNWIDIAVSLGTN
RTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAIFGVGDWQAVASTLEGRTGPQCSNRWKKSLDPARTKRGYFTPDEDSRLKIAVLLFGPKNWNKKAEFLPGRNQVQ
CRERWFNCLDPSLRRCQWTEEEDLRLEIAIQEHGYSWAKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARKIQKAALISNFVDRESERPALGPLTFGLDRIQIYHSGLR
FLFFFFFLKEICSQALKLLLFLSSCGIANISSSVPEEVKSIKPQRQRSRNGAHTTKRKKGVLEPRSNSERRVEQNLDTQSLELQLNSKESERTSSDCTETVDENGMESCE
NKVAEKLSKRDVCFSEQEENQNSTGSSGVSVLSEMTDMDEYNPSIILPADTALLAIPTGDDIIERNGESVADKDLDDSFSLPHSCLELRATDSEGVNSYSVDEVTGKGHG
VCKYQGRRKKNSKTSNKSQDSLVSCRQVELERSETKELHRHNQSKKRKHNSTNTTSLLGTSETFEEVDDCTLLGFLQKRVKRTNTRHDKKVDGSSSSLEVDNDDNDPTIA
SLLKDKLKRKRHEAASGGG