; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000261 (gene) of Snake gourd v1 genome

Gene IDTan0000261
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionMic1 domain-containing protein
Genome locationLG08:63585432..63614823
RNA-Seq ExpressionTan0000261
SyntenyTan0000261
Gene Ontology termsGO:0010506 - regulation of autophagy (biological process)
GO:0031902 - late endosome membrane (cellular component)
GO:0035658 - Mon1-Ccz1 complex (cellular component)
InterPro domainsIPR009755 - Regulator of MON1-CCZ1 complex, C-terminal
IPR040371 - Regulator of MON1-CCZ1 complex


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580792.1 Regulator of MON1-CCZ1 complex, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0090.58Show/hide
Query:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH
        MSGRPSRLQP+AGLSKSSALSH YIQYPPLRC +PGP GLF+DDGNKLLICPTVDQIFSWKTVPFNPAVTYT DA+ EGPILS+RYSLDLKIIAIQRSSH
Subjt:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI
        EIQFLIRETGETFSQ CR ESESILGFFWTDCPLCNIVFVKTSGLDLFAY SDSKSLHLVESKKLNVSW+AYTHESRLVLMASGMQCKTFHGFQLS+AGI
Subjt:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI

Query:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA
        VRLPKFEMAMA+SDANSKPVLA+EDIFIITVYGRIYCLQVDRI+MLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA
Subjt:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA

Query:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL
        PISAPLPLLLRGFP P IDVRSSKQD+ S EAD  PD EAIVYGDGW FLVPDL+CDHVNKLVWKIHIDLEAIASSSSEV SLLEFLQRRKLEVS+AKQL
Subjt:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL

Query:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTD
        CLTL RT ILEHR VATVAKA+DVLVSSY  SSKVGP+VKESKTDRSQSVVPQVSG GPV G  NRDSTAG+ESE  HRT IFPSSDSE N DV+QLNT 
Subjt:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTD

Query:  PGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPN
         GNHQS VDQERWR TISST++QASSSQYQ LGPGCNRLNDDVSDEGSLV SPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRR+NMEKIKVNPN
Subjt:  PGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPN

Query:  IYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFA
        IYVLT+QIL+RNERYTEIGLFVQQKILEPSKEVALQLLESGRHN QTRKLGLDMLRQLSLHHDYVSLLVQDGYY EALRY RK+KVDTVRPSLFLQAAFA
Subjt:  IYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFA

Query:  TNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA
        TN++QHLAAVLRFLSDLTP FK+TSDYSRYHHILTEM+S A A
Subjt:  TNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA

KAG7017543.1 putative protein C18orf8, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0090.58Show/hide
Query:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH
        MSGRPSRLQP+AGLSKSSALSH YIQYPPLRC +PGP GLF+DDGNKLLICPTVDQIFSWKTVPFNPAVTYT DA+ EGPILS+RYSLDLKIIAIQRSSH
Subjt:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI
        EIQFLIRETGETFSQ CR ESESILGFFWTDCPLCNIVFVKTSGLDLFAY SDSKSLHLVESKKLNVSW+AYTHESRLVLMASGMQCKTFHGFQLS+AGI
Subjt:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI

Query:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA
        VRLPKFEMAMA+SDANSKPVLA+EDIFIITVYGRIYCLQVDRI+MLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA
Subjt:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA

Query:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL
        PISAPLPLLLRGFP P IDVRSSKQD+ S EAD  PD EAIVYGDGW FLVPDL+CDHVNKLVWKIHIDLEAIASSSSEV SLLEFLQRRKLEVS+AKQL
Subjt:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL

Query:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTD
        CLTL RT ILEHR VATVAKA+DVLVSSY  SSKVGP+VKESKTDRSQSVVPQVSG GPV G  NRDSTAG+ESE  HRT IFPSSDSE N DV+QLNT 
Subjt:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTD

Query:  PGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPN
         GNHQS VDQERWR TISST++QASSSQYQ LGPGCNRLNDDVSDEGSLV SPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRR+NMEKIKVNPN
Subjt:  PGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPN

Query:  IYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFA
        IYVLT+QIL+RNERYTEIGLFVQQKILEPSKEVALQLLESGRHN QTRKLGLDMLRQLSLHHDYVSLLVQDGYY EALRY RK+KVDTVRPSLFLQAAFA
Subjt:  IYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFA

Query:  TNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA
        TN++QHLAAVLRFLSDLTP FK+TSDYSRYHHILTEM+S A A
Subjt:  TNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA

XP_022934291.1 uncharacterized protein LOC111441498 isoform X1 [Cucurbita moschata]0.0e+0090.58Show/hide
Query:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH
        MSGRPSRLQP+AGLSKSSALSH YIQYPPLRC +PGP GLF+DDGNKLLICPTVDQIFSWKTVPFNPAVTYT DA+ EGPILS+RYSLDLKIIAIQRSSH
Subjt:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI
        EIQFLIRETGETF Q CR ESESILGFFWTDCPLCNIVFVKTSGLDLFAY SDSKSLHLVESKKLNVSW+AYTHESRLVLMASGMQCKTFHGFQLS+AGI
Subjt:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI

Query:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA
        VRLPKFEMAMA+SDANSKPVLA+EDIFIITVYGRIYCLQVDRI+MLLHTYRFYRDAVVQQGSLPIYSSWIAVS VDNVLLVHQVDAKVVILYDIF+DSRA
Subjt:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA

Query:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL
        PISAPLPLLLRGFP P IDVRSSKQD+ S EAD  PD EAIVYGDGW FLVPDL+CDHVNKLVWKIHIDLEAIASSSSEV SLLEFLQRRKLEVS+AKQL
Subjt:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL

Query:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTD
        CLTL RT ILEHR VATVAKA+DVLVSSY  SSKVGP+VKESKTDRSQSVVPQVSG GPV G  NRDSTAG+ESE  HRT IFPSSDSE N DV+QLNT 
Subjt:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTD

Query:  PGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPN
         GNHQS VDQERWR TISST++QASSSQYQ LGPGCNRLNDDVSDEGSLV SPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRR+NMEKIKVNPN
Subjt:  PGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPN

Query:  IYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFA
        IYVLT+QILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHN QTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRY RK+KVDTVRPSLFLQAAFA
Subjt:  IYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFA

Query:  TNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA
        TN++QHLAAVLRFLSDLTP FK+TSDYSRYHHILTEM+S ASA
Subjt:  TNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA

XP_022982858.1 uncharacterized protein LOC111481584 isoform X1 [Cucurbita maxima]0.0e+0091.12Show/hide
Query:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH
        MSGRP RLQP+AGLSKSSALSH YIQYPPLRC IPGP GLF+DDGNKLLICPTVDQIFSWKTVPFNPAVTYT DA+TEGPILS+RYSLDLKIIAIQRSSH
Subjt:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI
        EIQFLIRETGETFSQ CR ESESILGFFWTDCPLCNIVFVKTSGLDLFAY SDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLS+AGI
Subjt:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI

Query:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA
        VRLPKFEMAMA+SDANSKPVLA+EDIFIITVYGRIYCLQVDRI+MLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA
Subjt:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA

Query:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL
        PISAPLP LLRGFP P IDVRSSKQDS S EAD  PD EAIVYGDGW FLVPDL+CDHVNKLVWKIHIDLEAIASSSSEV SLLEFLQRRKLEVS+AKQL
Subjt:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL

Query:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTD
        CLTL RT ILEHR VA VAKA+DVLVSSY  SSKVGP+VKESKTDRSQSVVPQVSG GPV G  NRDSTAG+ESE  HRT IFPSSDSE N DV+QLNT 
Subjt:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTD

Query:  PGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPN
         GNHQS VDQERWRGTI+ST++QASSSQYQ LGPGCNRLNDDVSDEGSLV SPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFL R+NMEKIKVNPN
Subjt:  PGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPN

Query:  IYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFA
        IYVLT+QILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHN QTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRY RK+KVDTVRPSLFLQAAFA
Subjt:  IYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFA

Query:  TNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA
        TN+SQHLAAVLRFLSDLTP FK+TSDYSRYHHILTEM+S ASA
Subjt:  TNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA

XP_038903891.1 regulator of MON1-CCZ1 complex isoform X3 [Benincasa hispida]0.0e+0090.71Show/hide
Query:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH
        MSGRPSRLQPSAGLSKSSALSH YIQYPPLRCRIPG RGLF+DDGNKLLIC T DQIFSWKTVPFNPAV YTTDAITEGPILS+RYSLDLKIIAIQRS+H
Subjt:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI
        EIQFLIRETGETFSQ+CR E ESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKS+HLVESKKLNVSW+AYTHESRLVLMASGMQCKTFHGFQLS+AGI
Subjt:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI

Query:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA
        VRLPKFEMAMA+SDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWI+V VVDNVLLVHQVDAKVVILYDIF DSRA
Subjt:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA

Query:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL
        PISAPLPLLLRGFPGP IDVRSSKQ S S EAD VPD EAIVYGDGW FLVPDL+CDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVS+AKQL
Subjt:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL

Query:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTD
        CLTL RTTILEHR VATVAKA+DVLVSSY  SSKVGPNVKESKTDR+QSV+PQV G GPV G  NRDST  VESE LHRT IFPSSDSEEN D+EQLNT 
Subjt:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTD

Query:  PGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPN
        PGNHQS VD+ERWRG ISSTEVQASSSQYQ LGPGCNRLNDDVSDE SL+SSP+ISPDEMYSFVFAP+EEEIVGDPSYLLAIIIEFLRRVN EKIKVNPN
Subjt:  PGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPN

Query:  IYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFA
        IYVLTVQILARNERYTEIGLFVQQKI+EPSKEVALQLLESGRHNF TRKLGLDMLRQL LH+DYVSLLVQDGYYLEALRYTRK+KVDTVRP+LFLQAAFA
Subjt:  IYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFA

Query:  TNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA
        TN+SQHL+AVLRFLSDLTP FK+TSDYSRYH+IL EMNSC +A
Subjt:  TNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA

TrEMBL top hitse value%identityAlignment
A0A6J1DB11 uncharacterized protein LOC1110193350.0e+0089.38Show/hide
Query:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH
        MSG PSRLQPSAGLSKSSALSHVYIQYPPLRCRIP PRGLFYDDGNKLLIC T  QIFSWKTVPFNPAVTYT +AITEGPILS+RYSLDLKIIAIQRSSH
Subjt:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI
        EIQFLIRETGETFSQ+CR ESESILGFFWTDCPLCNIVFVKTSGLDLFAY+SDSKSLHLVE+KKLNVSW+AYTHESRLVLMASGMQCKTFHGFQLSSAG+
Subjt:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI

Query:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA
        VRLPKFEM MA+SDA +KPVLA+ED+FI+T YGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWI+VSVVDNVLLVHQVDAKVVILYDIFADSRA
Subjt:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA

Query:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL
        PISAPLPLLLRGFPG  +D+R SKQDS S +AD+V DYEAIVYGDGW FLVPDL+CD+VNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVS+AKQL
Subjt:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL

Query:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDR-SQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNT
        CLTL RT ILEH  V  VA+A+DVLVSSY HSSK GPN+KESKTDR S SV+ QVSG GPV GAINRDSTAGVESE LHRT IFPSSDSEEN DVEQLNT
Subjt:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDR-SQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNT

Query:  DPGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNP
          G+HQSTV QE+WRGT SST+VQASSSQYQLLGPGCNR NDDVSD+GSLVSSP ISPDE+YSFVFAPIEEEIVG PSYLLAIIIEFLRRVNMEKIKVNP
Subjt:  DPGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNP

Query:  NIYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAF
        NIYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDML+QLSLHHDYVSLLVQDGYYLEALRY RK+KVDTVRPSLFLQAAF
Subjt:  NIYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAF

Query:  ATNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA
         TN+SQHLAAVLRFL D TP FK+TSDYSRYH ILTEMNSCASA
Subjt:  ATNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA

A0A6J1F1E9 uncharacterized protein LOC111441498 isoform X10.0e+0090.58Show/hide
Query:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH
        MSGRPSRLQP+AGLSKSSALSH YIQYPPLRC +PGP GLF+DDGNKLLICPTVDQIFSWKTVPFNPAVTYT DA+ EGPILS+RYSLDLKIIAIQRSSH
Subjt:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI
        EIQFLIRETGETF Q CR ESESILGFFWTDCPLCNIVFVKTSGLDLFAY SDSKSLHLVESKKLNVSW+AYTHESRLVLMASGMQCKTFHGFQLS+AGI
Subjt:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI

Query:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA
        VRLPKFEMAMA+SDANSKPVLA+EDIFIITVYGRIYCLQVDRI+MLLHTYRFYRDAVVQQGSLPIYSSWIAVS VDNVLLVHQVDAKVVILYDIF+DSRA
Subjt:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA

Query:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL
        PISAPLPLLLRGFP P IDVRSSKQD+ S EAD  PD EAIVYGDGW FLVPDL+CDHVNKLVWKIHIDLEAIASSSSEV SLLEFLQRRKLEVS+AKQL
Subjt:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL

Query:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTD
        CLTL RT ILEHR VATVAKA+DVLVSSY  SSKVGP+VKESKTDRSQSVVPQVSG GPV G  NRDSTAG+ESE  HRT IFPSSDSE N DV+QLNT 
Subjt:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTD

Query:  PGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPN
         GNHQS VDQERWR TISST++QASSSQYQ LGPGCNRLNDDVSDEGSLV SPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRR+NMEKIKVNPN
Subjt:  PGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPN

Query:  IYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFA
        IYVLT+QILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHN QTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRY RK+KVDTVRPSLFLQAAFA
Subjt:  IYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFA

Query:  TNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA
        TN++QHLAAVLRFLSDLTP FK+TSDYSRYHHILTEM+S ASA
Subjt:  TNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA

A0A6J1F2A9 uncharacterized protein LOC111441498 isoform X20.0e+0090.31Show/hide
Query:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH
        MSGRPSRLQP+AGLSKSSALSH YIQYPPLRC +PGP GLF+DDGNKLLICPTVDQIFSWKTVPFNPAVTYT DA+ EGPILS+RYSLDLKIIAIQRSSH
Subjt:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI
        EIQFLIRETGETF Q CR ESESILGFFWTDCPLCNIVFVKTSGLDLFAY SDSKSLHLVESKKLNVSW+AYTHESRLVLMASGMQCKTFHGFQLS+AGI
Subjt:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI

Query:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA
        VRLPKFEMAMA+SDANSKPVLA+EDIFIITVYGRIYCLQVDRI+MLLHTYRFYRDAVVQQGSLPIYSSWIAVS VDNVLLVHQVDAKVVILYDIF+DSRA
Subjt:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA

Query:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL
        PISAPLPLLLRGFP P IDVRSSKQD+ S EAD  PD EAIVYGDGW FLVPDL+CDHVNKLVWKIHIDLEAIASSSSEV SLLEFLQRRKLE   AKQL
Subjt:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL

Query:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTD
        CLTL RT ILEHR VATVAKA+DVLVSSY  SSKVGP+VKESKTDRSQSVVPQVSG GPV G  NRDSTAG+ESE  HRT IFPSSDSE N DV+QLNT 
Subjt:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTD

Query:  PGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPN
         GNHQS VDQERWR TISST++QASSSQYQ LGPGCNRLNDDVSDEGSLV SPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRR+NMEKIKVNPN
Subjt:  PGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPN

Query:  IYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFA
        IYVLT+QILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHN QTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRY RK+KVDTVRPSLFLQAAFA
Subjt:  IYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFA

Query:  TNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA
        TN++QHLAAVLRFLSDLTP FK+TSDYSRYHHILTEM+S ASA
Subjt:  TNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA

A0A6J1J411 uncharacterized protein LOC111481584 isoform X20.0e+0090.85Show/hide
Query:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH
        MSGRP RLQP+AGLSKSSALSH YIQYPPLRC IPGP GLF+DDGNKLLICPTVDQIFSWKTVPFNPAVTYT DA+TEGPILS+RYSLDLKIIAIQRSSH
Subjt:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI
        EIQFLIRETGETFSQ CR ESESILGFFWTDCPLCNIVFVKTSGLDLFAY SDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLS+AGI
Subjt:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI

Query:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA
        VRLPKFEMAMA+SDANSKPVLA+EDIFIITVYGRIYCLQVDRI+MLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA
Subjt:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA

Query:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL
        PISAPLP LLRGFP P IDVRSSKQDS S EAD  PD EAIVYGDGW FLVPDL+CDHVNKLVWKIHIDLEAIASSSSEV SLLEFLQRRKLE   AKQL
Subjt:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL

Query:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTD
        CLTL RT ILEHR VA VAKA+DVLVSSY  SSKVGP+VKESKTDRSQSVVPQVSG GPV G  NRDSTAG+ESE  HRT IFPSSDSE N DV+QLNT 
Subjt:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTD

Query:  PGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPN
         GNHQS VDQERWRGTI+ST++QASSSQYQ LGPGCNRLNDDVSDEGSLV SPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFL R+NMEKIKVNPN
Subjt:  PGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPN

Query:  IYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFA
        IYVLT+QILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHN QTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRY RK+KVDTVRPSLFLQAAFA
Subjt:  IYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFA

Query:  TNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA
        TN+SQHLAAVLRFLSDLTP FK+TSDYSRYHHILTEM+S ASA
Subjt:  TNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA

A0A6J1J5P9 uncharacterized protein LOC111481584 isoform X10.0e+0091.12Show/hide
Query:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH
        MSGRP RLQP+AGLSKSSALSH YIQYPPLRC IPGP GLF+DDGNKLLICPTVDQIFSWKTVPFNPAVTYT DA+TEGPILS+RYSLDLKIIAIQRSSH
Subjt:  MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSH

Query:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI
        EIQFLIRETGETFSQ CR ESESILGFFWTDCPLCNIVFVKTSGLDLFAY SDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLS+AGI
Subjt:  EIQFLIRETGETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGI

Query:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA
        VRLPKFEMAMA+SDANSKPVLA+EDIFIITVYGRIYCLQVDRI+MLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA
Subjt:  VRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRA

Query:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL
        PISAPLP LLRGFP P IDVRSSKQDS S EAD  PD EAIVYGDGW FLVPDL+CDHVNKLVWKIHIDLEAIASSSSEV SLLEFLQRRKLEVS+AKQL
Subjt:  PISAPLPLLLRGFPGPIIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQL

Query:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTD
        CLTL RT ILEHR VA VAKA+DVLVSSY  SSKVGP+VKESKTDRSQSVVPQVSG GPV G  NRDSTAG+ESE  HRT IFPSSDSE N DV+QLNT 
Subjt:  CLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTD

Query:  PGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPN
         GNHQS VDQERWRGTI+ST++QASSSQYQ LGPGCNRLNDDVSDEGSLV SPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFL R+NMEKIKVNPN
Subjt:  PGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPN

Query:  IYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFA
        IYVLT+QILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHN QTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRY RK+KVDTVRPSLFLQAAFA
Subjt:  IYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFA

Query:  TNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA
        TN+SQHLAAVLRFLSDLTP FK+TSDYSRYHHILTEM+S ASA
Subjt:  TNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA

SwissProt top hitse value%identityAlignment
Q54LC7 Regulator of MON1-CCZ1 complex homolog3.9e-0920.51Show/hide
Query:  TEGPILSVRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRQESE--SILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTH
        ++ PI+  ++S DLK  AIQ S ++I+ L  E G  + Q C+ +S   +ILG++WT     NI+ V  + L+L+A   D  S  LV+  K+ ++   Y+ 
Subjt:  TEGPILSVRYSLDLKIIAIQRSSHEIQFLIRETGETFSQRCRQESE--SILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTH

Query:  ESRLVLMASGMQCKTFHGFQLSSAGIVRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPI-YSSWIAVS
             ++       +   +   +    +LPKF +    +  N      I+++++  ++ + +C+  D+    ++ Y    + + +   + I  S   ++ 
Subjt:  ESRLVLMASGMQCKTFHGFQLSSAGIVRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPI-YSSWIAVS

Query:  VVDNVLLVHQVDAKVVILYDIFADSR-------------APISAPLPLLLRGFPG-------------------PIID----------------VRSSKQ
         VDN+++VH  +  + I+YD+    R              PISA +P+ L                        PII+                  SS  
Subjt:  VVDNVLLVHQVDAKVVILYDIFADSR-------------APISAPLPLLLRGFPG-------------------PIID----------------VRSSKQ

Query:  DSGSFEADTVPDYE----------------AIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIAS-SSSEVPSLLEFLQRRKLEVSQAKQLCLTLIRTT
        +S S     +  YE                  +Y   W F+ P+ + D  + + +++ ++ E I++    +    + FLQ R L    AK   L++I+T 
Subjt:  DSGSFEADTVPDYE----------------AIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIAS-SSSEVPSLLEFLQRRKLEVSQAKQLCLTLIRTT

Query:  ILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINR---DSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTDPGNHQ
        I                               E KTD        + G G ++  +N+    +T    +E LH++   P++++  N      N +  N+ 
Subjt:  ILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINR---DSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTDPGNHQ

Query:  STVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAIS
        +   Q     T++ST   ++S     +    + L    +   S+ SSP+ S
Subjt:  STVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAIS

Q8VC42 Regulator of MON1-CCZ1 complex4.0e-3020.98Show/hide
Query:  GPILSVRYSLDLKIIAIQRSSHEIQF--LIRETGE-TFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHE
        G +  +++SL+ KI+A+QR++  + F   I +  +  ++Q C+ ++ +ILGF WT      IVF+   G++ +    + +SL L++S  +NV+W+ Y  E
Subjt:  GPILSVRYSLDLKIIAIQRSSHEIQF--LIRETGE-TFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHE

Query:  SRLVLMASGMQCKTFHGFQLSSAGIVRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHT-------YRFYRDAVVQQGSLPIY--
        S ++L+++ +       F   +  + +LPKFE+ +  +  ++K  L+  DI + T+YG++Y L +   +   ++       Y   R+   ++  +     
Subjt:  SRLVLMASGMQCKTFHGFQLSSAGIVRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHT-------YRFYRDAVVQQGSLPIY--

Query:  SSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRAPISAPLPLLLRGFPGPIIDVRSSKQD----SGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKL
        +   A++VVDN+++VH  D +  +++DI    R      +      F  P++  RS +      +G     +       +Y   W    PD++       
Subjt:  SSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRAPISAPLPLLLRGFPGPIIDVRSSKQD----SGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKL

Query:  VWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQLCLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHG
        +W + + L+ I +   +   L++FL +RK   +    +C  ++  +  +  ++  +A   D L   Y           +   D  QS             
Subjt:  VWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQLCLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHG

Query:  AINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTDPGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYS
              T  VE           +  S  NP +++    P   Q+ VDQ                                                ++Y+
Subjt:  AINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTDPGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYS

Query:  FVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHH
         V +P  E       +++A+++E++R +N  +I V   ++ L ++ L ++  +  +  F+Q  +L  SK +A  LL          +L LDML++LS  +
Subjt:  FVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHH

Query:  D-YVSLLVQDGYYLEALRYTRKY-KVDTVRPSLFLQAAFATNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHH
        D  V +L+     L ALR+ R     D +    FL AA  T++      + RF        +   +++   H
Subjt:  D-YVSLLVQDGYYLEALRYTRKY-KVDTVRPSLFLQAAFATNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHH

Q96DM3 Regulator of MON1-CCZ1 complex7.0e-2724.32Show/hide
Query:  EGPILSVRYSLDLKIIAIQRSSHEIQF--LIRETGE-TFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTH
        +G +  +++SL+ KI+A+QR+S  + F   I +  +  ++Q C+ ++ +ILGF WT      IVF+   G++ +    + +SL L++S  LNV+W+ Y  
Subjt:  EGPILSVRYSLDLKIIAIQRSSHEIQF--LIRETGE-TFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTH

Query:  ESRLVLMASGMQCKTFHGFQLSSAGIVRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHT-------YRFYRDAVVQQGSLPIY-
        ES ++L+++ +       F   +  + +LPKFE+ +  +  ++KP L+  DI + T+YG++Y L +   +   ++       Y   R+   ++  +    
Subjt:  ESRLVLMASGMQCKTFHGFQLSSAGIVRLPKFEMAMARSDANSKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHT-------YRFYRDAVVQQGSLPIY-

Query:  -SSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRAPISAPLPLLLRGFPGPIIDVRSSK----QDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNK
         +   A++VVDN+++VH  D +  +++DI        S         F  P++  RS +      +G     +       +Y   W    PD++      
Subjt:  -SSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRAPISAPLPLLLRGFPGPIIDVRSSK----QDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNK

Query:  LVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQLCLTLIRTTILE--HRSVATVAKALDVLVSSY
         +W + + LE I +   +   L++FL +RK    + K + L++    + E    S+  +A   D L   Y
Subjt:  LVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQLCLTLIRTTILE--HRSVATVAKALDVLVSSY

Arabidopsis top hitse value%identityAlignment
AT3G12010.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: sperm cell, cultured cell; CONTAINS InterPro DOMAIN/s: Colon cancer-associated Mic1-like (InterPro:IPR009755); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).1.3e-21755.74Show/hide
Query:  SSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSHEIQFLIRETGETFSQR
        S ALSHVYI +P LRC IP   GLFYDD N+LLIC T  Q+FSW+T PFNP V  + D+I+EGPILS+R+SLD K IA+QRS  EIQ   RET +  + +
Subjt:  SSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSHEIQFLIRETGETFSQR

Query:  CRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGIVRLPKFEMAMARSDAN
        C+  SESILGFFW+D PLC++  VKTSG+DLFA  S   SL LVE+KK NV+W+ YTHE+RLVL+ASG+QCKTF+GFQLS+AG+VRLP+FEM MARS++N
Subjt:  CRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGIVRLPKFEMAMARSDAN

Query:  SKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRAPISAPLPLLLRGFPGP
        SKP+L+  D+ ++TVYGRIYCLQVDR AMLLH YRFYRDAVVQQGSLPIYSS ++V+VVDN+LLVHQ+DAKVVI+YD+F DSRAP+SAPLPLL RG+   
Subjt:  SKPVLAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRAPISAPLPLLLRGFPGP

Query:  IIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQLCLTLIRTTILEHRSVA
            ++  ++  S E+ T  +   ++Y DGW FLVPDL+ D  NK++WKIH+DLEAI++SSS+  SLLEFLQRRKLE ++AKQLCL + R  ILE R   
Subjt:  IIDVRSSKQDSGSFEADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQLCLTLIRTTILEHRSVA

Query:  TVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTDPGNHQSTVDQERWRGT
         V +A+DVLV++Y +S K G   KE K + + +  P    PG                          S DSE +              S +D+E     
Subjt:  TVAKALDVLVSSYIHSSKVGPNVKESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTDPGNHQSTVDQERWRGT

Query:  ISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARNERYT
            E+  SS   +      N    D   E  L SSPAISPDE+Y FVF  +EE +V +  YL+AII EFLR ++ EK+KV+ NIYV+T+++LA ++R+ 
Subjt:  ISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLVSSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARNERYT

Query:  EIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFATNNSQHLAAVLRFLSD
        E+ LF   KI+EPSKEVA QLL+SGR N + RKLGLDMLRQLSLHHDY+S LVQDGYYLEALRY +K+KV +VR S+FL+AAFA+N+ QHLAA+LR LS+
Subjt:  EIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSLHHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFATNNSQHLAAVLRFLSD

Query:  LTPDFKDTSDYSRYHHILTEMNS
        L P FK+TS+Y  ++ +L + +S
Subjt:  LTPDFKDTSDYSRYHHILTEMNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGGAAGACCATCGAGATTACAACCCAGTGCTGGTCTAAGCAAATCAAGTGCTCTCTCACATGTTTATATACAATATCCACCCTTACGGTGTAGAATTCCTGGACC
AAGGGGCTTATTTTATGATGATGGAAATAAGTTATTGATCTGCCCAACAGTGGATCAGATCTTCTCATGGAAAACAGTTCCGTTTAATCCTGCTGTCACTTATACCACTG
ATGCAATTACCGAAGGACCCATTTTATCTGTTCGATATTCCTTAGACTTAAAGATTATTGCAATACAAAGATCAAGTCATGAGATACAGTTCTTGATTAGAGAAACAGGT
GAAACTTTTAGTCAGAGATGTAGACAAGAGTCAGAGAGCATTCTGGGGTTTTTTTGGACGGATTGTCCCTTGTGCAATATTGTATTTGTGAAGACCAGCGGGCTGGACTT
GTTTGCTTATAGTTCTGATTCGAAGTCTCTCCATTTGGTGGAGTCAAAGAAATTGAATGTGAGCTGGCATGCCTACACGCATGAAAGTCGATTGGTGCTTATGGCTTCTG
GGATGCAGTGCAAAACTTTTCATGGTTTTCAGCTTTCATCAGCAGGGATTGTTCGCTTGCCAAAGTTTGAAATGGCGATGGCAAGATCTGATGCTAATAGCAAGCCTGTC
CTTGCTATAGAGGATATCTTTATTATCACTGTCTATGGAAGAATATATTGCTTGCAAGTGGATAGAATTGCAATGCTACTTCATACCTACAGGTTCTACCGTGATGCTGT
TGTGCAGCAGGGCTCTTTACCAATCTACTCGAGCTGGATTGCTGTGAGCGTGGTTGACAATGTGTTGCTTGTTCATCAAGTAGATGCAAAAGTAGTTATTCTTTATGATA
TTTTTGCTGATTCGAGGGCACCCATATCTGCCCCACTTCCTTTGTTGTTGAGAGGTTTTCCTGGGCCCATTATTGATGTTCGAAGTAGTAAGCAAGATAGCGGCAGTTTC
GAAGCTGATACTGTACCTGATTATGAAGCAATTGTCTATGGGGATGGTTGGAATTTTCTTGTCCCAGACCTTGTTTGTGATCACGTCAACAAATTAGTTTGGAAGATACA
TATAGACTTAGAGGCGATTGCTTCAAGTAGCTCTGAAGTGCCCTCACTCCTTGAATTCTTGCAGCGACGGAAATTGGAAGTTAGCCAGGCTAAACAGTTGTGCCTGACCT
TGATAAGGACTACGATTCTGGAGCACAGGTCGGTGGCCACTGTTGCTAAGGCTTTAGATGTTCTAGTCTCATCATATATCCACTCAAGCAAAGTAGGCCCCAACGTTAAG
GAATCGAAAACTGACAGGTCGCAGTCGGTTGTGCCTCAAGTTAGTGGCCCTGGCCCTGTACATGGTGCTATTAACCGTGATTCAACTGCTGGAGTGGAAAGTGAAGTTCT
TCATAGAACGCCAATATTTCCATCTTCAGATTCTGAGGAGAATCCTGACGTTGAACAACTAAATACAGATCCAGGCAACCATCAGTCTACAGTTGACCAGGAAAGATGGA
GAGGCACAATAAGTTCTACTGAAGTTCAGGCATCATCTTCACAGTATCAGCTTCTTGGGCCAGGATGTAACCGGTTGAATGATGATGTCTCTGATGAGGGATCTCTGGTT
TCATCTCCAGCTATCTCACCTGATGAGATGTACAGCTTTGTGTTTGCCCCCATTGAGGAAGAGATAGTTGGAGACCCTTCTTACTTGCTGGCTATAATTATTGAGTTCCT
CCGCAGGGTTAATATGGAAAAGATCAAAGTAAATCCAAACATTTATGTTTTGACTGTCCAAATATTAGCTCGGAATGAACGATACACAGAAATTGGATTATTTGTGCAGC
AAAAGATTCTAGAACCTTCTAAAGAGGTTGCTTTGCAACTACTGGAGTCTGGTCGCCATAATTTCCAGACAAGGAAACTGGGTCTGGATATGCTTAGGCAGCTTTCTCTG
CATCATGATTATGTGTCTCTGCTTGTGCAAGATGGATATTACCTCGAAGCATTGCGTTACACACGGAAGTATAAGGTTGACACTGTCCGTCCTTCGTTGTTTCTTCAAGC
AGCCTTCGCGACCAATAACTCGCAACATTTGGCAGCGGTTTTGAGATTCTTGTCAGATTTAACCCCTGACTTCAAAGACACCTCAGATTACAGTAGGTACCATCACATAC
TCACTGAAATGAACTCTTGTGCTTCTGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGGAAGACCATCGAGATTACAACCCAGTGCTGGTCTAAGCAAATCAAGTGCTCTCTCACATGTTTATATACAATATCCACCCTTACGGTGTAGAATTCCTGGACC
AAGGGGCTTATTTTATGATGATGGAAATAAGTTATTGATCTGCCCAACAGTGGATCAGATCTTCTCATGGAAAACAGTTCCGTTTAATCCTGCTGTCACTTATACCACTG
ATGCAATTACCGAAGGACCCATTTTATCTGTTCGATATTCCTTAGACTTAAAGATTATTGCAATACAAAGATCAAGTCATGAGATACAGTTCTTGATTAGAGAAACAGGT
GAAACTTTTAGTCAGAGATGTAGACAAGAGTCAGAGAGCATTCTGGGGTTTTTTTGGACGGATTGTCCCTTGTGCAATATTGTATTTGTGAAGACCAGCGGGCTGGACTT
GTTTGCTTATAGTTCTGATTCGAAGTCTCTCCATTTGGTGGAGTCAAAGAAATTGAATGTGAGCTGGCATGCCTACACGCATGAAAGTCGATTGGTGCTTATGGCTTCTG
GGATGCAGTGCAAAACTTTTCATGGTTTTCAGCTTTCATCAGCAGGGATTGTTCGCTTGCCAAAGTTTGAAATGGCGATGGCAAGATCTGATGCTAATAGCAAGCCTGTC
CTTGCTATAGAGGATATCTTTATTATCACTGTCTATGGAAGAATATATTGCTTGCAAGTGGATAGAATTGCAATGCTACTTCATACCTACAGGTTCTACCGTGATGCTGT
TGTGCAGCAGGGCTCTTTACCAATCTACTCGAGCTGGATTGCTGTGAGCGTGGTTGACAATGTGTTGCTTGTTCATCAAGTAGATGCAAAAGTAGTTATTCTTTATGATA
TTTTTGCTGATTCGAGGGCACCCATATCTGCCCCACTTCCTTTGTTGTTGAGAGGTTTTCCTGGGCCCATTATTGATGTTCGAAGTAGTAAGCAAGATAGCGGCAGTTTC
GAAGCTGATACTGTACCTGATTATGAAGCAATTGTCTATGGGGATGGTTGGAATTTTCTTGTCCCAGACCTTGTTTGTGATCACGTCAACAAATTAGTTTGGAAGATACA
TATAGACTTAGAGGCGATTGCTTCAAGTAGCTCTGAAGTGCCCTCACTCCTTGAATTCTTGCAGCGACGGAAATTGGAAGTTAGCCAGGCTAAACAGTTGTGCCTGACCT
TGATAAGGACTACGATTCTGGAGCACAGGTCGGTGGCCACTGTTGCTAAGGCTTTAGATGTTCTAGTCTCATCATATATCCACTCAAGCAAAGTAGGCCCCAACGTTAAG
GAATCGAAAACTGACAGGTCGCAGTCGGTTGTGCCTCAAGTTAGTGGCCCTGGCCCTGTACATGGTGCTATTAACCGTGATTCAACTGCTGGAGTGGAAAGTGAAGTTCT
TCATAGAACGCCAATATTTCCATCTTCAGATTCTGAGGAGAATCCTGACGTTGAACAACTAAATACAGATCCAGGCAACCATCAGTCTACAGTTGACCAGGAAAGATGGA
GAGGCACAATAAGTTCTACTGAAGTTCAGGCATCATCTTCACAGTATCAGCTTCTTGGGCCAGGATGTAACCGGTTGAATGATGATGTCTCTGATGAGGGATCTCTGGTT
TCATCTCCAGCTATCTCACCTGATGAGATGTACAGCTTTGTGTTTGCCCCCATTGAGGAAGAGATAGTTGGAGACCCTTCTTACTTGCTGGCTATAATTATTGAGTTCCT
CCGCAGGGTTAATATGGAAAAGATCAAAGTAAATCCAAACATTTATGTTTTGACTGTCCAAATATTAGCTCGGAATGAACGATACACAGAAATTGGATTATTTGTGCAGC
AAAAGATTCTAGAACCTTCTAAAGAGGTTGCTTTGCAACTACTGGAGTCTGGTCGCCATAATTTCCAGACAAGGAAACTGGGTCTGGATATGCTTAGGCAGCTTTCTCTG
CATCATGATTATGTGTCTCTGCTTGTGCAAGATGGATATTACCTCGAAGCATTGCGTTACACACGGAAGTATAAGGTTGACACTGTCCGTCCTTCGTTGTTTCTTCAAGC
AGCCTTCGCGACCAATAACTCGCAACATTTGGCAGCGGTTTTGAGATTCTTGTCAGATTTAACCCCTGACTTCAAAGACACCTCAGATTACAGTAGGTACCATCACATAC
TCACTGAAATGAACTCTTGTGCTTCTGCTTGA
Protein sequenceShow/hide protein sequence
MSGRPSRLQPSAGLSKSSALSHVYIQYPPLRCRIPGPRGLFYDDGNKLLICPTVDQIFSWKTVPFNPAVTYTTDAITEGPILSVRYSLDLKIIAIQRSSHEIQFLIRETG
ETFSQRCRQESESILGFFWTDCPLCNIVFVKTSGLDLFAYSSDSKSLHLVESKKLNVSWHAYTHESRLVLMASGMQCKTFHGFQLSSAGIVRLPKFEMAMARSDANSKPV
LAIEDIFIITVYGRIYCLQVDRIAMLLHTYRFYRDAVVQQGSLPIYSSWIAVSVVDNVLLVHQVDAKVVILYDIFADSRAPISAPLPLLLRGFPGPIIDVRSSKQDSGSF
EADTVPDYEAIVYGDGWNFLVPDLVCDHVNKLVWKIHIDLEAIASSSSEVPSLLEFLQRRKLEVSQAKQLCLTLIRTTILEHRSVATVAKALDVLVSSYIHSSKVGPNVK
ESKTDRSQSVVPQVSGPGPVHGAINRDSTAGVESEVLHRTPIFPSSDSEENPDVEQLNTDPGNHQSTVDQERWRGTISSTEVQASSSQYQLLGPGCNRLNDDVSDEGSLV
SSPAISPDEMYSFVFAPIEEEIVGDPSYLLAIIIEFLRRVNMEKIKVNPNIYVLTVQILARNERYTEIGLFVQQKILEPSKEVALQLLESGRHNFQTRKLGLDMLRQLSL
HHDYVSLLVQDGYYLEALRYTRKYKVDTVRPSLFLQAAFATNNSQHLAAVLRFLSDLTPDFKDTSDYSRYHHILTEMNSCASA