; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C10G200610 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C10G200610
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionMic1 domain-containing protein
Genome locationCla97Chr10:30635248..30644000
RNA-Seq ExpressionCla97C10G200610
SyntenyCla97C10G200610
Gene Ontology termsGO:0010506 - regulation of autophagy (biological process)
GO:0031902 - late endosome membrane (cellular component)
GO:0035658 - Mon1-Ccz1 complex (cellular component)
InterPro domainsIPR009755 - Regulator of MON1-CCZ1 complex, C-terminal
IPR040371 - Regulator of MON1-CCZ1 complex


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580792.1 Regulator of MON1-CCZ1 complex, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0089.41Show/hide
Query:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH
        MSG+PSRLQP+AGLSKS ALSH YIQYPPLRC +PGP GLFFDDGNKLLICPTVDQ+FSWKTVPFNPAV YT DA+ EGPILSIRYSLDLKIIAIQRSSH
Subjt:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF
        EIQFLIRETGETFSQ CRPESESILGFFWTDCP CNIVFVKTSG+DLFAY SDSKS HLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAG 
Subjt:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF

Query:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA
        VRLPKFEMAMAKSDANSKPVLA+EDIFIITVYGRIYCLQVDRI+MLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIF DSRA
Subjt:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA

Query:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC
        PISAPLPLLLRGFP PN DVRSSKQ +ASLEAD  PDEAIVYGDGWKFLVPDLICDHVNKLVWKIH+DLEAIASSSSEV SLLEFLQRRKLEVSKAKQLC
Subjt:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC

Query:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP
        LTLTRT ILEHRPVATVAKAIDVLVSSYT SSKVGP+VKE KTDR QSVVPQVSGSGPVPG NNR+ST G+ESEA HRTSIFPSSDSE NAD++QLNT  
Subjt:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP

Query:  GNHQSIV------------EAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNI
        GNHQSIV            + QASSSQY HLGPGCNRLND+VSDEGSL+ SPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRR+NMEKIKVNPNI
Subjt:  GNHQSIV------------EAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNI

Query:  YVLTVQILARSERYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFAT
        YVLT+QIL+R+ERYTEIGLFVQQKILEPSKEVALQLLESGRHN  TRKLGLDMLRQLSLHHDYVSLLVQDGYY EALRY RKF VDTVRP+LFLQAAFAT
Subjt:  YVLTVQILARSERYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFAT

Query:  NDSQHLSAVLRFLSDSTPGFKDTSDYT
        ND+QHL+AVLRFLSD TPGFK+TSDY+
Subjt:  NDSQHLSAVLRFLSDSTPGFKDTSDYT

XP_004136556.1 uncharacterized protein LOC101218836 [Cucumis sativus]0.0e+0092.02Show/hide
Query:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH
        MSG+PSRLQP AGLSKS ALSHVYIQYPPLRCRIPG RGLFFDDGNKLLICP +DQ+FSWKTVPFNPAVAYT+D ITEGPILS+RYSLDLKIIAIQRSSH
Subjt:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF
        EIQFLIRETG+TFSQKCR ESESILGFFWTDCP CNIVFVKTSG+DLFAYSSDSKS HLVESKKLNVS YAYTHESRLVLMASG+QCKTFHGFQLSAAG 
Subjt:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF

Query:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA
        VRLPKFEM MAKSDANSKPVLAIED+FIITVYGRIYCLQVDR+AMLLHTYRFYRDAVVQQGSLPIYSS IAVSVVDNVLLVHQVDAKVVILYDIFTDSRA
Subjt:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA

Query:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC
        PISAPLPLL RGFPGPN DVRSSKQ +A+LE DAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIH+DLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC
Subjt:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC

Query:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP
        LTLTRTTILEHRPVA+VAKAI+VL+SSY R++KVGPN KE KTDR QSVVPQ SGSGPVPG+NNR+S  GVESEALHRTSIFPSSDSEENADI+QLNTVP
Subjt:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP

Query:  GNHQSIVEAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARSE
        GNHQSIVEAQASSSQY HLGPGC RLND+VSDEGS+ISSP+ISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILAR+E
Subjt:  GNHQSIVEAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARSE

Query:  RYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFATNDSQHLSAVLRF
        RYTEIGLFV QKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKF VDTVRPALFLQAAFATND Q LSAVLRF
Subjt:  RYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFATNDSQHLSAVLRF

Query:  LSDSTPGFKDTSDY
        LSD TPG K TSDY
Subjt:  LSDSTPGFKDTSDY

XP_016899713.1 PREDICTED: uncharacterized protein LOC103486744 [Cucumis melo]0.0e+0092.72Show/hide
Query:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH
        MSG+PSRLQP AGLSKS ALSHVYIQYPPLRCRIPG RGLFFDDGNKLLICP +DQ+FSWKTVPFNP VAYT+DAITEGPILS+RYSLDLKIIAIQRSSH
Subjt:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF
        EIQFLIRETG+TFSQKCR ESESILGFFWTDCP CNIVFVKTSG+DLFAYSSDSKS HLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAG 
Subjt:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF

Query:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA
        VRLPKFEMAMAKSDANSKPVLA ED+FI+TVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA
Subjt:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA

Query:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC
        PISAPLPLL RGFPGPN DVRSSKQ SASLE DAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIH+DLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC
Subjt:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC

Query:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP
        LTLTRT ILEHRPVA+VAKAIDVL+SSYTRSSK+GPN+KE KTD  QSVVPQ SGSGPVPG+NNR+ST GVESEALHRTSIFPSSDSEENADIEQL+TVP
Subjt:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP

Query:  GNHQSIVEAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARSE
        GNHQSIVEAQASSS Y HLGPGC RLNDNVSDEGS+ISSP+ISPDEMYSFVFAPIEEEIVGD SYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILAR+E
Subjt:  GNHQSIVEAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARSE

Query:  RYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFATNDSQHLSAVLRF
        RYTEIGLFVQQKILEPSKEVALQLLESGR+NFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKF VDTVRPALFLQAAFATNDSQ L+AVLRF
Subjt:  RYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFATNDSQHLSAVLRF

Query:  LSDSTPGFKDTSDY
        LSD TPG K++SDY
Subjt:  LSDSTPGFKDTSDY

XP_022934291.1 uncharacterized protein LOC111441498 isoform X1 [Cucurbita moschata]0.0e+0089.41Show/hide
Query:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH
        MSG+PSRLQP+AGLSKS ALSH YIQYPPLRC +PGP GLFFDDGNKLLICPTVDQ+FSWKTVPFNPAV YT DA+ EGPILSIRYSLDLKIIAIQRSSH
Subjt:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF
        EIQFLIRETGETF Q CRPESESILGFFWTDCP CNIVFVKTSG+DLFAY SDSKS HLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAG 
Subjt:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF

Query:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA
        VRLPKFEMAMAKSDANSKPVLA+EDIFIITVYGRIYCLQVDRI+MLLHTYRFYRDAVVQQGSLPIYSSWIAVS VDNVLLVHQVDAKVVILYDIF+DSRA
Subjt:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA

Query:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC
        PISAPLPLLLRGFP PN DVRSSKQ +ASLEAD  PDEAIVYGDGWKFLVPDLICDHVNKLVWKIH+DLEAIASSSSEV SLLEFLQRRKLEVSKAKQLC
Subjt:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC

Query:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP
        LTLTRT ILEHRPVATVAKAIDVLVSSYT SSKVGP+VKE KTDR QSVVPQVSGSGPVPG NNR+ST G+ESEA HRTSIFPSSDSE NAD++QLNT  
Subjt:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP

Query:  GNHQSIV------------EAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNI
        GNHQSIV            + QASSSQY HLGPGCNRLND+VSDEGSL+ SPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRR+NMEKIKVNPNI
Subjt:  GNHQSIV------------EAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNI

Query:  YVLTVQILARSERYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFAT
        YVLT+QILAR+ERYTEIGLFVQQKILEPSKEVALQLLESGRHN  TRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRY RKF VDTVRP+LFLQAAFAT
Subjt:  YVLTVQILARSERYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFAT

Query:  NDSQHLSAVLRFLSDSTPGFKDTSDYT
        ND+QHL+AVLRFLSD TPGFK+TSDY+
Subjt:  NDSQHLSAVLRFLSDSTPGFKDTSDYT

XP_038903891.1 regulator of MON1-CCZ1 complex isoform X3 [Benincasa hispida]0.0e+0092.98Show/hide
Query:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH
        MSG+PSRLQPSAGLSKS ALSH YIQYPPLRCRIPG RGLFFDDGNKLLIC T DQ+FSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRS+H
Subjt:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF
        EIQFLIRETGETFSQKCRPE ESILGFFWTDCP CNIVFVKTSG+DLFAYSSDSKS HLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAG 
Subjt:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF

Query:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA
        VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWI+V VVDNVLLVHQVDAKVVILYDIFTDSRA
Subjt:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA

Query:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC
        PISAPLPLLLRGFPGPN DVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIH+DLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC
Subjt:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC

Query:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP
        LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKE KTDR QSV+PQV GSGPVPG NNR+STT VESEALHRTSIFPSSDSEENADIEQLNTVP
Subjt:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP

Query:  GNHQSIV------------EAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNI
        GNHQSIV            E QASSSQY HLGPGCNRLND+VSDE SLISSP+ISPDEMYSFVFAP+EEEIVGDPSYLLAIIIEFLRRVN EKIKVNPNI
Subjt:  GNHQSIV------------EAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNI

Query:  YVLTVQILARSERYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFAT
        YVLTVQILAR+ERYTEIGLFVQQKI+EPSKEVALQLLESGRHNFPTRKLGLDMLRQL LH+DYVSLLVQDGYYLEALRYTRKF VDTVRPALFLQAAFAT
Subjt:  YVLTVQILARSERYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFAT

Query:  NDSQHLSAVLRFLSDSTPGFKDTSDYT
        NDSQHLSAVLRFLSD TPGFK+TSDY+
Subjt:  NDSQHLSAVLRFLSDSTPGFKDTSDYT

TrEMBL top hitse value%identityAlignment
A0A0A0LEC9 Mic1 domain-containing protein0.0e+0092.02Show/hide
Query:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH
        MSG+PSRLQP AGLSKS ALSHVYIQYPPLRCRIPG RGLFFDDGNKLLICP +DQ+FSWKTVPFNPAVAYT+D ITEGPILS+RYSLDLKIIAIQRSSH
Subjt:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF
        EIQFLIRETG+TFSQKCR ESESILGFFWTDCP CNIVFVKTSG+DLFAYSSDSKS HLVESKKLNVS YAYTHESRLVLMASG+QCKTFHGFQLSAAG 
Subjt:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF

Query:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA
        VRLPKFEM MAKSDANSKPVLAIED+FIITVYGRIYCLQVDR+AMLLHTYRFYRDAVVQQGSLPIYSS IAVSVVDNVLLVHQVDAKVVILYDIFTDSRA
Subjt:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA

Query:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC
        PISAPLPLL RGFPGPN DVRSSKQ +A+LE DAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIH+DLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC
Subjt:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC

Query:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP
        LTLTRTTILEHRPVA+VAKAI+VL+SSY R++KVGPN KE KTDR QSVVPQ SGSGPVPG+NNR+S  GVESEALHRTSIFPSSDSEENADI+QLNTVP
Subjt:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP

Query:  GNHQSIVEAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARSE
        GNHQSIVEAQASSSQY HLGPGC RLND+VSDEGS+ISSP+ISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILAR+E
Subjt:  GNHQSIVEAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARSE

Query:  RYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFATNDSQHLSAVLRF
        RYTEIGLFV QKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKF VDTVRPALFLQAAFATND Q LSAVLRF
Subjt:  RYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFATNDSQHLSAVLRF

Query:  LSDSTPGFKDTSDY
        LSD TPG K TSDY
Subjt:  LSDSTPGFKDTSDY

A0A1S4DUS0 uncharacterized protein LOC1034867440.0e+0092.72Show/hide
Query:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH
        MSG+PSRLQP AGLSKS ALSHVYIQYPPLRCRIPG RGLFFDDGNKLLICP +DQ+FSWKTVPFNP VAYT+DAITEGPILS+RYSLDLKIIAIQRSSH
Subjt:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF
        EIQFLIRETG+TFSQKCR ESESILGFFWTDCP CNIVFVKTSG+DLFAYSSDSKS HLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAG 
Subjt:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF

Query:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA
        VRLPKFEMAMAKSDANSKPVLA ED+FI+TVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA
Subjt:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA

Query:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC
        PISAPLPLL RGFPGPN DVRSSKQ SASLE DAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIH+DLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC
Subjt:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC

Query:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP
        LTLTRT ILEHRPVA+VAKAIDVL+SSYTRSSK+GPN+KE KTD  QSVVPQ SGSGPVPG+NNR+ST GVESEALHRTSIFPSSDSEENADIEQL+TVP
Subjt:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP

Query:  GNHQSIVEAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARSE
        GNHQSIVEAQASSS Y HLGPGC RLNDNVSDEGS+ISSP+ISPDEMYSFVFAPIEEEIVGD SYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILAR+E
Subjt:  GNHQSIVEAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARSE

Query:  RYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFATNDSQHLSAVLRF
        RYTEIGLFVQQKILEPSKEVALQLLESGR+NFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKF VDTVRPALFLQAAFATNDSQ L+AVLRF
Subjt:  RYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFATNDSQHLSAVLRF

Query:  LSDSTPGFKDTSDY
        LSD TPG K++SDY
Subjt:  LSDSTPGFKDTSDY

A0A5A7TKD9 Mic1 domain-containing protein0.0e+0092.72Show/hide
Query:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH
        MSG+PSRLQP AGLSKS ALSHVYIQYPPLRCRIPG RGLFFDDGNKLLICP +DQ+FSWKTVPFNP VAYT+DAITEGPILS+RYSLDLKIIAIQRSSH
Subjt:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF
        EIQFLIRETG+TFSQKCR ESESILGFFWTDCP CNIVFVKTSG+DLFAYSSDSKS HLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAG 
Subjt:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF

Query:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA
        VRLPKFEMAMAKSDANSKPVLA ED+FI+TVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA
Subjt:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA

Query:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC
        PISAPLPLL RGFPGPN DVRSSKQ SASLE DAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIH+DLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC
Subjt:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC

Query:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP
        LTLTRT ILEHRPVA+VAKAIDVL+SSYTRSSK+GPN+KE KTD  QSVVPQ SGSGPVPG+NNR+ST GVESEALHRTSIFPSSDSEENADIEQL+TVP
Subjt:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP

Query:  GNHQSIVEAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARSE
        GNHQSIVEAQASSS Y HLGPGC RLNDNVSDEGS+ISSP+ISPDEMYSFVFAPIEEEIVGD SYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILAR+E
Subjt:  GNHQSIVEAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARSE

Query:  RYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFATNDSQHLSAVLRF
        RYTEIGLFVQQKILEPSKEVALQLLESGR+NFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKF VDTVRPALFLQAAFATNDSQ L+AVLRF
Subjt:  RYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFATNDSQHLSAVLRF

Query:  LSDSTPGFKDTSDY
        LSD TPG K++SDY
Subjt:  LSDSTPGFKDTSDY

A0A6J1F1E9 uncharacterized protein LOC111441498 isoform X10.0e+0089.41Show/hide
Query:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH
        MSG+PSRLQP+AGLSKS ALSH YIQYPPLRC +PGP GLFFDDGNKLLICPTVDQ+FSWKTVPFNPAV YT DA+ EGPILSIRYSLDLKIIAIQRSSH
Subjt:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF
        EIQFLIRETGETF Q CRPESESILGFFWTDCP CNIVFVKTSG+DLFAY SDSKS HLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAG 
Subjt:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF

Query:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA
        VRLPKFEMAMAKSDANSKPVLA+EDIFIITVYGRIYCLQVDRI+MLLHTYRFYRDAVVQQGSLPIYSSWIAVS VDNVLLVHQVDAKVVILYDIF+DSRA
Subjt:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA

Query:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC
        PISAPLPLLLRGFP PN DVRSSKQ +ASLEAD  PDEAIVYGDGWKFLVPDLICDHVNKLVWKIH+DLEAIASSSSEV SLLEFLQRRKLEVSKAKQLC
Subjt:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC

Query:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP
        LTLTRT ILEHRPVATVAKAIDVLVSSYT SSKVGP+VKE KTDR QSVVPQVSGSGPVPG NNR+ST G+ESEA HRTSIFPSSDSE NAD++QLNT  
Subjt:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP

Query:  GNHQSIV------------EAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNI
        GNHQSIV            + QASSSQY HLGPGCNRLND+VSDEGSL+ SPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRR+NMEKIKVNPNI
Subjt:  GNHQSIV------------EAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNI

Query:  YVLTVQILARSERYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFAT
        YVLT+QILAR+ERYTEIGLFVQQKILEPSKEVALQLLESGRHN  TRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRY RKF VDTVRP+LFLQAAFAT
Subjt:  YVLTVQILARSERYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFAT

Query:  NDSQHLSAVLRFLSDSTPGFKDTSDYT
        ND+QHL+AVLRFLSD TPGFK+TSDY+
Subjt:  NDSQHLSAVLRFLSDSTPGFKDTSDYT

A0A6J1J5P9 uncharacterized protein LOC111481584 isoform X10.0e+0089.55Show/hide
Query:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH
        MSG+P RLQP+AGLSKS ALSH YIQYPPLRC IPGP GLFFDDGNKLLICPTVDQ+FSWKTVPFNPAV YT DA+TEGPILSIRYSLDLKIIAIQRSSH
Subjt:  MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF
        EIQFLIRETGETFSQ CRPESESILGFFWTDCP CNIVFVKTSG+DLFAY SDSKS HLVESKKLNVSW+AYTHESRLVLMASGMQCKTFHGFQLSAAG 
Subjt:  EIQFLIRETGETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGF

Query:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA
        VRLPKFEMAMAKSDANSKPVLA+EDIFIITVYGRIYCLQVDRI+MLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIF DSRA
Subjt:  VRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRA

Query:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC
        PISAPLP LLRGFP PN DVRSSKQ SASLEAD  PDEAIVYGDGWKFLVPDLICDHVNKLVWKIH+DLEAIASSSSEV SLLEFLQRRKLEVSKAKQLC
Subjt:  PISAPLPLLLRGFPGPNTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLC

Query:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP
        LTLTRT ILEHRPVA VAKAIDVLVSSYT SSKVGP+VKE KTDR QSVVPQVSGSGPVPG NNR+ST G+ESEA HRTSIFPSSDSE NAD++QLNT  
Subjt:  LTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVP

Query:  GNHQSIV------------EAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNI
        GNHQSIV            + QASSSQY HLGPGCNRLND+VSDEGSL+ SPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFL R+NMEKIKVNPNI
Subjt:  GNHQSIV------------EAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNI

Query:  YVLTVQILARSERYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFAT
        YVLT+QILAR+ERYTEIGLFVQQKILEPSKEVALQLLESGRHN  TRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRY RKF VDTVRP+LFLQAAFAT
Subjt:  YVLTVQILARSERYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFAT

Query:  NDSQHLSAVLRFLSDSTPGFKDTSDYT
        NDSQHL+AVLRFLSD TPGFK+TSDY+
Subjt:  NDSQHLSAVLRFLSDSTPGFKDTSDYT

SwissProt top hitse value%identityAlignment
Q54LC7 Regulator of MON1-CCZ1 complex homolog1.7e-0920.22Show/hide
Query:  TEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQKCRPESE--SILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTH
        ++ PI+  ++S DLK  AIQ S ++I+ L  E G  + Q C+ +S   +ILG++WT     NI+ V  + ++L+A   D  S  LV+  K+ ++   Y+ 
Subjt:  TEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQKCRPESE--SILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTH

Query:  ESRLVLMASGMQCKTFHGFQLSAAGFVRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPI-YSSWIAVS
             ++       +   +      F +LPKF +    +  N      I+++++  ++ + +C+  D+    ++ Y    + + +   + I  S   ++ 
Subjt:  ESRLVLMASGMQCKTFHGFQLSAAGFVRLPKFEMAMAKSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPI-YSSWIAVS

Query:  VVDNVLLVHQVDAKVVILYDIFTDSR-------------APISAPLPLLL------------------------------------RGFPGPNTDVRSSK
         VDN+++VH  +  + I+YD+ T  R              PISA +P+ L                                       P  ++   SS 
Subjt:  VVDNVLLVHQVDAKVVILYDIFTDSR-------------APISAPLPLLL------------------------------------RGFPGPNTDVRSSK

Query:  QGSAS----------------LEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIAS-SSSEVPSLLEFLQRRKLEVSKAKQLCLTLTRTT
           +S                 E         +Y   W+F+ P+ I D  + + +++ ++ E I++    +    + FLQ R L    AK   L++ +T 
Subjt:  QGSAS----------------LEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIAS-SSSEVPSLLEFLQRRKLEVSKAKQLCLTLTRTT

Query:  ILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNR---NSTTGVESEALHRTSIFPSSDSEENADIEQLNTVPGNHQ
        I                               E KTD L        G G +    N+    +T    +E+LH++   P++++  N +    N    N  
Subjt:  ILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNR---NSTTGVESEALHRTSIFPSSDSEENADIEQLNTVPGNHQ

Query:  SIVEAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMY
          + +  + S  + +G G N   DN++   +  SS + SP   +
Subjt:  SIVEAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMY

Q8VC42 Regulator of MON1-CCZ1 complex1.0e-3322.38Show/hide
Query:  LFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTD--------AITEGPILSIRYSLDLKIIAIQRSSHEIQF--LIRETGE-TFSQKCRPESESILGFF
        +FFD+ NK        QVF+ ++      V    D            G +  I++SL+ KI+A+QR++  + F   I +  +  ++Q+C+ ++ +ILGF 
Subjt:  LFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTD--------AITEGPILSIRYSLDLKIIAIQRSSHEIQF--LIRETGE-TFSQKCRPESESILGFF

Query:  WTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGFVRLPKFEMAMAKSDANSKPVLAIEDIFI
        WT   S  IVF+   G++ +    + +S  L++S  +NV+WY Y  ES ++L+++ +       F   A    +LPKFE+ +  +  ++K  L+  DI +
Subjt:  WTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGFVRLPKFEMAMAKSDANSKPVLAIEDIFI

Query:  ITVYGRIYCLQVDRIAMLLHT-------YRFYRDAVVQQGSLPIY--SSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRAPISAPLPLLLRGFPGPNTD
         T+YG++Y L +   +   ++       Y   R+   ++  +     +   A++VVDN+++VH  D +  +++DI    R      +      F  P   
Subjt:  ITVYGRIYCLQVDRIAMLLHT-------YRFYRDAVVQQGSLPIY--SSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRAPISAPLPLLLRGFPGPNTD

Query:  VRSSKQGSASLEADAV-----PDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLCLTLTRTTILEHRPV
         RS +     L   A      P    +Y   W    PD+I       +W + V L+ I +   +   L++FL +RK    KA  L +     +  +   +
Subjt:  VRSSKQGSASLEADAV-----PDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLCLTLTRTTILEHRPV

Query:  ATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVPGNHQSIVEAQASSS
          +A   D L   Y          K L  D                    ++ T  VE           +  S  N  +++    P   Q++V+      
Subjt:  ATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVPGNHQSIVEAQASSS

Query:  QYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARSERYTEIGLFVQQKIL
                                       ++Y+ V +P  E       +++A+++E++R +N  +I V   ++ L ++ L +   +  +  F+Q  +L
Subjt:  QYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARSERYTEIGLFVQQKIL

Query:  EPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHD-YVSLLVQDGYYLEALRYTRKF-NVDTVRPALFLQAAFATNDSQHLSAVLRFLSDSTPGFKDTS
          SK +A  LL       P  +L LDML++LS  +D  V +L+     L ALR+ R     D +    FL AA  T+D      + RF        +   
Subjt:  EPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHD-YVSLLVQDGYYLEALRYTRKF-NVDTVRPALFLQAAFATNDSQHLSAVLRFLSDSTPGFKDTS

Query:  DYT---NVKEAIGLLKMEISDVAI
        ++T   + +E +   K    + A+
Subjt:  DYT---NVKEAIGLLKMEISDVAI

Q96DM3 Regulator of MON1-CCZ1 complex2.2e-3622.7Show/hide
Query:  LFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTD--------AITEGPILSIRYSLDLKIIAIQRSSHEIQF--LIRETGE-TFSQKCRPESESILGFF
        +FFD+ NK        QVF+ ++      V    D           +G +  I++SL+ KI+A+QR+S  + F   I +  +  ++Q+C+ ++ +ILGF 
Subjt:  LFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTD--------AITEGPILSIRYSLDLKIIAIQRSSHEIQF--LIRETGE-TFSQKCRPESESILGFF

Query:  WTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGFVRLPKFEMAMAKSDANSKPVLAIEDIFI
        WT   S  IVF+   G++ +    + +S  L++S  LNV+WY Y  ES ++L+++ +       F   A    +LPKFE+ +  +  ++KP L+  DI +
Subjt:  WTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGFVRLPKFEMAMAKSDANSKPVLAIEDIFI

Query:  ITVYGRIYCLQVDRIAMLLHT-------YRFYRDAVVQQGSLPIY--SSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRAPISAPLPLLLRGFPGPNTD
         T+YG++Y L +   +   ++       Y   R+   ++  +     +   A++VVDN+++VH  D +  +++DI        S         F  P   
Subjt:  ITVYGRIYCLQVDRIAMLLHT-------YRFYRDAVVQQGSLPIY--SSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRAPISAPLPLLLRGFPGPNTD

Query:  VRSSK------QGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLCLTLTRTTILE--H
         RS +       G A++ + + P    +Y   W    PD+I       +W + V LE I +   +   L++FL +RK    + K + L++    + E   
Subjt:  VRSSK------QGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLCLTLTRTTILE--H

Query:  RPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVPGNHQSIVEAQA
          +  +A   D L   Y                                                           ++  D EQ      ++   VEA  
Subjt:  RPVATVAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVPGNHQSIVEAQA

Query:  SSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARSERYTEIGLFVQQ
        S S  L   P              + +   +   ++Y+ V +   E+      +++A+++E++R +N  +I V   ++ L ++ L +   +  +  F+Q 
Subjt:  SSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARSERYTEIGLFVQQ

Query:  KILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHD-YVSLLVQDGYYLEALRYTRKF-NVDTVRPALFLQAAFATNDSQHLSAVLRFLSDSTPGFK
         +L  SK +A  LL       P  +L LDML++LS  +D  V +L+     L ALR+ R     D +    FL AA  T D+     + RF        +
Subjt:  KILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHD-YVSLLVQDGYYLEALRYTRKF-NVDTVRPALFLQAAFATNDSQHLSAVLRFLSDSTPGFK

Query:  DTSDYT---NVKEAIGLLKMEISDVAI
         + ++T   + +E +   K    D A+
Subjt:  DTSDYT---NVKEAIGLLKMEISDVAI

Arabidopsis top hitse value%identityAlignment
AT3G12010.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: sperm cell, cultured cell; CONTAINS InterPro DOMAIN/s: Colon cancer-associated Mic1-like (InterPro:IPR009755); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).8.5e-22257.38Show/hide
Query:  SGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQK
        SGALSHVYI +P LRC IP   GLF+DD N+LLIC T  QVFSW+T PFNP V  + D+I+EGPILSIR+SLD K IA+QRS  EIQ   RET +  + K
Subjt:  SGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETGETFSQK

Query:  CRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGFVRLPKFEMAMAKSDAN
        C+  SESILGFFW+D P C++  VKTSGMDLFA  S   S  LVE+KK NV+WY YTHE+RLVL+ASG+QCKTF+GFQLS AG VRLP+FEM MA+S++N
Subjt:  CRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGFVRLPKFEMAMAKSDAN

Query:  SKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRAPISAPLPLLLRGFPGP
        SKP+L+  D+ ++TVYGRIYCLQVDR AMLLH YRFYRDAVVQQGSLPIYSS ++V+VVDN+LLVHQ+DAKVVI+YD+F DSRAP+SAPLPLL RG+   
Subjt:  SKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRAPISAPLPLLLRGFPGP

Query:  NTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLCLTLTRTTILEHRPVAT
         T  ++  +   S E+    +  ++Y DGW FLVPDLI D  NK++WKIH+DLEAI++SSS+  SLLEFLQRRKLE +KAKQLCL + R  ILE RP   
Subjt:  NTDVRSSKQGSASLEADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLCLTLTRTTILEHRPVAT

Query:  VAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIE-QLNTVPGNHQSIVEAQASSSQ
        V +AIDVLV++Y+ S K G   KE+K +         + S P PGA+        +SE  HR        S  N D E ++N   G+ +++  A      
Subjt:  VAKAIDVLVSSYTRSSKVGPNVKELKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIE-QLNTVPGNHQSIVEAQASSSQ

Query:  YLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARSERYTEIGLFVQQKILE
                         + S +SSPAISPDE+Y FVF  +EE +V +  YL+AII EFLR ++ EK+KV+ NIYV+T+++LA S+R+ E+ LF   KI+E
Subjt:  YLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARSERYTEIGLFVQQKILE

Query:  PSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFATNDSQHLSAVLRFLSDSTPGFKDTSDYT
        PSKEVA QLL+SGR N   RKLGLDMLRQLSLHHDY+S LVQDGYYLEALRY +K  V +VR ++FL+AAFA+ND QHL+A+LR LS+  PGFK+TS+Y 
Subjt:  PSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKFNVDTVRPALFLQAAFATNDSQHLSAVLRFLSDSTPGFKDTSDYT

Query:  NVKEAIGLLKMEISDVAI
              GLL    S VA+
Subjt:  NVKEAIGLLKMEISDVAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGGAAAACCATCGAGATTACAGCCCAGTGCTGGTCTAAGCAAGTCTGGTGCTCTCTCACATGTTTATATACAATATCCACCCTTACGATGTAGAATTCCTGGACC
AAGGGGATTATTTTTTGATGATGGAAATAAGTTATTGATCTGCCCAACCGTGGATCAGGTCTTCTCATGGAAAACTGTTCCGTTTAATCCTGCTGTTGCTTATACCACTG
ATGCAATTACAGAAGGGCCCATTTTATCTATTCGATATTCCTTAGACTTAAAGATTATTGCAATACAAAGATCAAGTCATGAAATACAGTTTTTGATTAGAGAAACAGGT
GAAACTTTTAGTCAGAAATGTAGACCAGAGTCGGAGAGCATTCTGGGATTTTTTTGGACAGATTGTCCCTCATGCAATATTGTATTTGTGAAGACCAGCGGGATGGACTT
GTTTGCTTATAGTTCTGATTCAAAGTCCTTTCATTTGGTGGAGTCAAAGAAATTGAATGTGAGCTGGTATGCCTACACGCATGAAAGTCGATTGGTGCTTATGGCTTCTG
GAATGCAGTGCAAAACTTTTCATGGATTCCAGCTTTCAGCAGCAGGGTTTGTTCGCTTGCCAAAGTTTGAAATGGCGATGGCAAAATCTGATGCTAATAGCAAGCCTGTC
CTAGCTATAGAGGATATCTTTATTATCACTGTCTATGGAAGAATATATTGCTTGCAAGTGGATAGAATTGCGATGCTACTTCATACCTACAGGTTCTATCGTGATGCGGT
TGTGCAGCAGGGTTCTTTACCAATCTACTCGAGCTGGATTGCTGTGAGCGTGGTTGACAATGTGCTGCTTGTTCATCAAGTAGATGCAAAAGTAGTTATTCTTTATGATA
TTTTTACTGATTCGAGGGCACCCATATCTGCCCCACTTCCTTTGTTGTTGAGAGGTTTTCCTGGACCAAATACTGATGTCCGAAGTAGTAAACAAGGTAGTGCCAGTTTA
GAAGCTGACGCGGTACCTGATGAAGCAATTGTCTACGGGGATGGTTGGAAATTTCTTGTCCCAGACCTGATTTGTGATCATGTCAACAAATTGGTGTGGAAGATACATGT
AGACTTGGAGGCAATCGCTTCAAGTAGCTCTGAAGTGCCATCACTTCTAGAATTCTTGCAGCGACGGAAATTGGAAGTTAGCAAGGCTAAACAGTTGTGCTTGACCTTGA
CAAGAACTACCATTCTGGAGCACAGGCCGGTGGCGACTGTTGCTAAGGCTATAGATGTTCTAGTCTCATCATATACCCGCTCAAGCAAAGTAGGTCCTAATGTCAAGGAA
TTAAAAACTGACAGGCTGCAGTCAGTTGTGCCTCAAGTTAGTGGCTCGGGCCCTGTACCTGGTGCTAATAACCGTAATTCAACTACTGGAGTGGAAAGTGAAGCTCTTCA
CAGAACTTCAATATTTCCATCTTCAGATTCTGAGGAGAATGCTGACATTGAACAACTAAATACAGTTCCAGGCAACCATCAGTCTATAGTTGAAGCTCAGGCATCATCTT
CACAGTATCTACATCTTGGACCTGGATGTAACCGGTTGAATGACAATGTCTCTGATGAGGGATCTCTGATTTCGTCACCAGCTATCTCACCTGATGAGATGTACAGCTTT
GTGTTCGCCCCCATTGAGGAAGAGATAGTTGGAGACCCTTCTTACTTGCTGGCTATAATTATCGAGTTCCTTCGCAGAGTTAATATGGAAAAGATCAAAGTAAATCCAAA
CATCTATGTCTTGACTGTCCAAATATTAGCTCGCAGTGAACGATACACAGAAATTGGATTATTTGTGCAGCAAAAGATTCTCGAACCTTCTAAAGAGGTTGCTTTGCAAC
TACTGGAGTCTGGTCGCCATAATTTCCCGACAAGGAAACTGGGTCTAGATATGCTCCGACAGCTTTCTCTACATCATGATTATGTGTCTCTGCTCGTTCAAGATGGATAT
TACCTTGAAGCATTGCGCTACACACGGAAGTTTAATGTTGACACAGTCCGGCCAGCCTTGTTTCTTCAAGCCGCTTTCGCGACCAACGACTCGCAACATTTGTCAGCAGT
TTTGAGATTCCTGTCAGATTCAACTCCTGGATTCAAAGATACCTCAGATTACACAAATGTCAAGGAAGCTATTGGCCTTTTGAAGATGGAAATATCTGATGTTGCTATCA
ACATCCTGAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTGGAAAACCATCGAGATTACAGCCCAGTGCTGGTCTAAGCAAGTCTGGTGCTCTCTCACATGTTTATATACAATATCCACCCTTACGATGTAGAATTCCTGGACC
AAGGGGATTATTTTTTGATGATGGAAATAAGTTATTGATCTGCCCAACCGTGGATCAGGTCTTCTCATGGAAAACTGTTCCGTTTAATCCTGCTGTTGCTTATACCACTG
ATGCAATTACAGAAGGGCCCATTTTATCTATTCGATATTCCTTAGACTTAAAGATTATTGCAATACAAAGATCAAGTCATGAAATACAGTTTTTGATTAGAGAAACAGGT
GAAACTTTTAGTCAGAAATGTAGACCAGAGTCGGAGAGCATTCTGGGATTTTTTTGGACAGATTGTCCCTCATGCAATATTGTATTTGTGAAGACCAGCGGGATGGACTT
GTTTGCTTATAGTTCTGATTCAAAGTCCTTTCATTTGGTGGAGTCAAAGAAATTGAATGTGAGCTGGTATGCCTACACGCATGAAAGTCGATTGGTGCTTATGGCTTCTG
GAATGCAGTGCAAAACTTTTCATGGATTCCAGCTTTCAGCAGCAGGGTTTGTTCGCTTGCCAAAGTTTGAAATGGCGATGGCAAAATCTGATGCTAATAGCAAGCCTGTC
CTAGCTATAGAGGATATCTTTATTATCACTGTCTATGGAAGAATATATTGCTTGCAAGTGGATAGAATTGCGATGCTACTTCATACCTACAGGTTCTATCGTGATGCGGT
TGTGCAGCAGGGTTCTTTACCAATCTACTCGAGCTGGATTGCTGTGAGCGTGGTTGACAATGTGCTGCTTGTTCATCAAGTAGATGCAAAAGTAGTTATTCTTTATGATA
TTTTTACTGATTCGAGGGCACCCATATCTGCCCCACTTCCTTTGTTGTTGAGAGGTTTTCCTGGACCAAATACTGATGTCCGAAGTAGTAAACAAGGTAGTGCCAGTTTA
GAAGCTGACGCGGTACCTGATGAAGCAATTGTCTACGGGGATGGTTGGAAATTTCTTGTCCCAGACCTGATTTGTGATCATGTCAACAAATTGGTGTGGAAGATACATGT
AGACTTGGAGGCAATCGCTTCAAGTAGCTCTGAAGTGCCATCACTTCTAGAATTCTTGCAGCGACGGAAATTGGAAGTTAGCAAGGCTAAACAGTTGTGCTTGACCTTGA
CAAGAACTACCATTCTGGAGCACAGGCCGGTGGCGACTGTTGCTAAGGCTATAGATGTTCTAGTCTCATCATATACCCGCTCAAGCAAAGTAGGTCCTAATGTCAAGGAA
TTAAAAACTGACAGGCTGCAGTCAGTTGTGCCTCAAGTTAGTGGCTCGGGCCCTGTACCTGGTGCTAATAACCGTAATTCAACTACTGGAGTGGAAAGTGAAGCTCTTCA
CAGAACTTCAATATTTCCATCTTCAGATTCTGAGGAGAATGCTGACATTGAACAACTAAATACAGTTCCAGGCAACCATCAGTCTATAGTTGAAGCTCAGGCATCATCTT
CACAGTATCTACATCTTGGACCTGGATGTAACCGGTTGAATGACAATGTCTCTGATGAGGGATCTCTGATTTCGTCACCAGCTATCTCACCTGATGAGATGTACAGCTTT
GTGTTCGCCCCCATTGAGGAAGAGATAGTTGGAGACCCTTCTTACTTGCTGGCTATAATTATCGAGTTCCTTCGCAGAGTTAATATGGAAAAGATCAAAGTAAATCCAAA
CATCTATGTCTTGACTGTCCAAATATTAGCTCGCAGTGAACGATACACAGAAATTGGATTATTTGTGCAGCAAAAGATTCTCGAACCTTCTAAAGAGGTTGCTTTGCAAC
TACTGGAGTCTGGTCGCCATAATTTCCCGACAAGGAAACTGGGTCTAGATATGCTCCGACAGCTTTCTCTACATCATGATTATGTGTCTCTGCTCGTTCAAGATGGATAT
TACCTTGAAGCATTGCGCTACACACGGAAGTTTAATGTTGACACAGTCCGGCCAGCCTTGTTTCTTCAAGCCGCTTTCGCGACCAACGACTCGCAACATTTGTCAGCAGT
TTTGAGATTCCTGTCAGATTCAACTCCTGGATTCAAAGATACCTCAGATTACACAAATGTCAAGGAAGCTATTGGCCTTTTGAAGATGGAAATATCTGATGTTGCTATCA
ACATCCTGAATTAG
Protein sequenceShow/hide protein sequence
MSGKPSRLQPSAGLSKSGALSHVYIQYPPLRCRIPGPRGLFFDDGNKLLICPTVDQVFSWKTVPFNPAVAYTTDAITEGPILSIRYSLDLKIIAIQRSSHEIQFLIRETG
ETFSQKCRPESESILGFFWTDCPSCNIVFVKTSGMDLFAYSSDSKSFHLVESKKLNVSWYAYTHESRLVLMASGMQCKTFHGFQLSAAGFVRLPKFEMAMAKSDANSKPV
LAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFTDSRAPISAPLPLLLRGFPGPNTDVRSSKQGSASL
EADAVPDEAIVYGDGWKFLVPDLICDHVNKLVWKIHVDLEAIASSSSEVPSLLEFLQRRKLEVSKAKQLCLTLTRTTILEHRPVATVAKAIDVLVSSYTRSSKVGPNVKE
LKTDRLQSVVPQVSGSGPVPGANNRNSTTGVESEALHRTSIFPSSDSEENADIEQLNTVPGNHQSIVEAQASSSQYLHLGPGCNRLNDNVSDEGSLISSPAISPDEMYSF
VFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARSERYTEIGLFVQQKILEPSKEVALQLLESGRHNFPTRKLGLDMLRQLSLHHDYVSLLVQDGY
YLEALRYTRKFNVDTVRPALFLQAAFATNDSQHLSAVLRFLSDSTPGFKDTSDYTNVKEAIGLLKMEISDVAINILN