; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS007856 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS007856
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein SCAI
Genome locationscaffold13:1839393..1844472
RNA-Seq ExpressionMS007856
SyntenyMS007856
Gene Ontology termsGO:0045892 - negative regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003714 - transcription corepressor activity (molecular function)
InterPro domainsIPR022709 - Protein SCAI


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138930.1 protein SCAI [Cucumis sativus]5.8e-30388.04Show/hide
Query:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR
        MSQPGNPANS++ N+SIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR
Subjt:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR

Query:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF
        TSEASYLSESYVFYEAILTREYFK+GLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF
Subjt:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF

Query:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL
        MNIRPFRYSVVLEPHPDSLTPV P LT RYL+LQDAILSSY+HNEVKF+ELTLDTFRMIQSLEWEPSGSFYRPN NRSGQNGG+GPSRSNF+QDIVDPTL
Subjt:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL

Query:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS
        PSNPRKSILYRPSVTHFLAVLATICEEM SDGVLLIYLSA+G  KN LSSPA T++GCESINNA++IDKT+SPC QVEGG  G Q+GCLSF TRGKGGLS
Subjt:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS

Query:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS
        CIYPSDLVPFTRRPFLLVIDSD SEAFE IHGAEKGEPAAMLLS +  +H VAT+YSRH  GSLFTLFLTAPLHAFCLLLGISGS+VEMDTFSKAE++LS
Subjt:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS

Query:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP
        SSL+EWGQ L TSE+L+QVWAQ+LNDPFIRRLLLRFIFCR VL  YAPT  KKE++P+CVP+LPS +DPT+AT QSVVMKIAN+ GVS+SF+FS+NL+L 
Subjt:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP

Query:  ES
        E+
Subjt:  ES

XP_008457173.1 PREDICTED: protein SCAI isoform X1 [Cucumis melo]2.4e-30188.04Show/hide
Query:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR
        MSQPGNPANSS+SN+SIPVSEAYWSLVDKADRKFSKIRDLPYYERN YDAYFHKAFKVYTQLWKFQQENRQKLVE GLKRWEIGEIASRIAQLYFGQYMR
Subjt:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR

Query:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF
        TSEASYLSESYVFYEAI+TREYFK+GLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF
Subjt:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF

Query:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL
        MNIRPFRYSVVLEPHPDSLTPV   +T R L+LQDAILSSY+HNEVKF+ELTLDTFRMIQSLEWEPSGSFYRPN NRSGQNGG+GPSRSNF+QDIVDPTL
Subjt:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL

Query:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS
        PSNPRKSILYRPSVTHFLAVLATICEEM SDGVLLIYLSA+G   N LSSPA T++GCESINNAEDIDKT+SPC QVEGG  G Q+ CLSF TRGKGGLS
Subjt:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS

Query:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS
        CIYPSDLVPFTRRPFLLVIDSD SEAFE IHGAEKGEPAAMLLS S TSH VAT+YSRH  GSLFTLFLTAPLHAFCLLLGISGS+VEMDTFSKAE++LS
Subjt:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS

Query:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP
        SSL+EWGQ L TSE+L+QVWAQ+LNDPFIRRLLLRFIFCR+VL LY PT  KKE+VP+CVP+LPS +DPT+AT QSVVMKIAN+ GVS+SF+FS+NL+L 
Subjt:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP

Query:  ES
        E+
Subjt:  ES

XP_008457174.1 PREDICTED: protein SCAI isoform X2 [Cucumis melo]8.9e-29687.04Show/hide
Query:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR
        MSQPGNPANSS+SN+SIPVSEAYWSLVDKADRKFSKIRDLPYYERN YDAYFHKAFKVYTQLWKFQQENRQKLVE GLKRWEIGEIASRIAQLYFGQYMR
Subjt:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR

Query:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF
        TSEASYLSESYVFYEAI+TREYFK+GLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF
Subjt:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF

Query:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL
        MNIRPFRYSVVLEPHPDSLTPV   +T R L+LQDAILSSY+HNEVKF+ELTLDTFRMIQSLEWEPSGSFYRPN NRSGQNGG+GPSRSNF+QDIVDPTL
Subjt:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL

Query:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS
        PSNPRKSILYRPSVTHFLAVLATICEEM SDGVLLIYLSA+ G          T++GCESINNAEDIDKT+SPC QVEGG  G Q+ CLSF TRGKGGLS
Subjt:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS

Query:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS
        CIYPSDLVPFTRRPFLLVIDSD SEAFE IHGAEKGEPAAMLLS S TSH VAT+YSRH  GSLFTLFLTAPLHAFCLLLGISGS+VEMDTFSKAE++LS
Subjt:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS

Query:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP
        SSL+EWGQ L TSE+L+QVWAQ+LNDPFIRRLLLRFIFCR+VL LY PT  KKE+VP+CVP+LPS +DPT+AT QSVVMKIAN+ GVS+SF+FS+NL+L 
Subjt:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP

Query:  ES
        E+
Subjt:  ES

XP_022155765.1 protein SCAI [Momordica charantia]0.0e+0099.17Show/hide
Query:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR
        MSQPGNPANSSSSN SIPVSEAYWSLVDKADRKFSKIRDLP YERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR
Subjt:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR

Query:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF
        TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKF KAD AF
Subjt:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF

Query:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL
        MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYL+LQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL
Subjt:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL

Query:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS
        PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS
Subjt:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS

Query:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS
        CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS
Subjt:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS

Query:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP
        SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP
Subjt:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP

Query:  ES
        ES
Subjt:  ES

XP_038875244.1 protein SCAI [Benincasa hispida]0.0e+0090.37Show/hide
Query:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR
        MSQPGNPANSSSSN+SIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR
Subjt:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR

Query:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF
        TSEASYLSESYVFYEAILTREYFKEG FQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF
Subjt:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF

Query:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL
        MNIRPFRYSVVLEPHPDSLTPV P LTKRYL+LQDAILSSY+HNEVKF+ELTLDTFRMIQSLEWEPSGSFYRPN NRSGQNGGSGPSRSNF+QDIVDPTL
Subjt:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL

Query:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS
        PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSA+G  KNV SSPA T++GCESINNAEDIDKTRSPC QVEG  IGPQ+GCLSF TRGKGGLS
Subjt:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS

Query:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS
        CIYPSDLVPFTRRPFLL+IDSD SEAFE IHGAEKGEP AMLLSSS TSH+VAT+YSRH  GSLFTLFLTAPLHAFCLLLGISGS+V+MDTFSKAE++LS
Subjt:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS

Query:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP
        SSL+EWGQ L TSE+++QVWAQ+LNDPFIRRLLLRFIFCRTVLALYAPT  KKE+VP+CVP+LPS +DPT+ATSQSVVMKIA + G S+SF+F++NLVLP
Subjt:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP

Query:  ES
        E+
Subjt:  ES

TrEMBL top hitse value%identityAlignment
A0A0A0LKY8 Uncharacterized protein2.8e-30388.04Show/hide
Query:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR
        MSQPGNPANS++ N+SIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR
Subjt:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR

Query:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF
        TSEASYLSESYVFYEAILTREYFK+GLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF
Subjt:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF

Query:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL
        MNIRPFRYSVVLEPHPDSLTPV P LT RYL+LQDAILSSY+HNEVKF+ELTLDTFRMIQSLEWEPSGSFYRPN NRSGQNGG+GPSRSNF+QDIVDPTL
Subjt:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL

Query:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS
        PSNPRKSILYRPSVTHFLAVLATICEEM SDGVLLIYLSA+G  KN LSSPA T++GCESINNA++IDKT+SPC QVEGG  G Q+GCLSF TRGKGGLS
Subjt:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS

Query:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS
        CIYPSDLVPFTRRPFLLVIDSD SEAFE IHGAEKGEPAAMLLS +  +H VAT+YSRH  GSLFTLFLTAPLHAFCLLLGISGS+VEMDTFSKAE++LS
Subjt:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS

Query:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP
        SSL+EWGQ L TSE+L+QVWAQ+LNDPFIRRLLLRFIFCR VL  YAPT  KKE++P+CVP+LPS +DPT+AT QSVVMKIAN+ GVS+SF+FS+NL+L 
Subjt:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP

Query:  ES
        E+
Subjt:  ES

A0A1S3C4H1 protein SCAI isoform X24.3e-29687.04Show/hide
Query:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR
        MSQPGNPANSS+SN+SIPVSEAYWSLVDKADRKFSKIRDLPYYERN YDAYFHKAFKVYTQLWKFQQENRQKLVE GLKRWEIGEIASRIAQLYFGQYMR
Subjt:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR

Query:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF
        TSEASYLSESYVFYEAI+TREYFK+GLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF
Subjt:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF

Query:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL
        MNIRPFRYSVVLEPHPDSLTPV   +T R L+LQDAILSSY+HNEVKF+ELTLDTFRMIQSLEWEPSGSFYRPN NRSGQNGG+GPSRSNF+QDIVDPTL
Subjt:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL

Query:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS
        PSNPRKSILYRPSVTHFLAVLATICEEM SDGVLLIYLSA+ G          T++GCESINNAEDIDKT+SPC QVEGG  G Q+ CLSF TRGKGGLS
Subjt:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS

Query:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS
        CIYPSDLVPFTRRPFLLVIDSD SEAFE IHGAEKGEPAAMLLS S TSH VAT+YSRH  GSLFTLFLTAPLHAFCLLLGISGS+VEMDTFSKAE++LS
Subjt:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS

Query:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP
        SSL+EWGQ L TSE+L+QVWAQ+LNDPFIRRLLLRFIFCR+VL LY PT  KKE+VP+CVP+LPS +DPT+AT QSVVMKIAN+ GVS+SF+FS+NL+L 
Subjt:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP

Query:  ES
        E+
Subjt:  ES

A0A1S3C668 protein SCAI isoform X11.2e-30188.04Show/hide
Query:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR
        MSQPGNPANSS+SN+SIPVSEAYWSLVDKADRKFSKIRDLPYYERN YDAYFHKAFKVYTQLWKFQQENRQKLVE GLKRWEIGEIASRIAQLYFGQYMR
Subjt:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR

Query:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF
        TSEASYLSESYVFYEAI+TREYFK+GLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF
Subjt:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF

Query:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL
        MNIRPFRYSVVLEPHPDSLTPV   +T R L+LQDAILSSY+HNEVKF+ELTLDTFRMIQSLEWEPSGSFYRPN NRSGQNGG+GPSRSNF+QDIVDPTL
Subjt:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL

Query:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS
        PSNPRKSILYRPSVTHFLAVLATICEEM SDGVLLIYLSA+G   N LSSPA T++GCESINNAEDIDKT+SPC QVEGG  G Q+ CLSF TRGKGGLS
Subjt:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS

Query:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS
        CIYPSDLVPFTRRPFLLVIDSD SEAFE IHGAEKGEPAAMLLS S TSH VAT+YSRH  GSLFTLFLTAPLHAFCLLLGISGS+VEMDTFSKAE++LS
Subjt:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS

Query:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP
        SSL+EWGQ L TSE+L+QVWAQ+LNDPFIRRLLLRFIFCR+VL LY PT  KKE+VP+CVP+LPS +DPT+AT QSVVMKIAN+ GVS+SF+FS+NL+L 
Subjt:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP

Query:  ES
        E+
Subjt:  ES

A0A5A7T841 Protein SCAI isoform X11.2e-30188.04Show/hide
Query:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR
        MSQPGNPANSS+SN+SIPVSEAYWSLVDKADRKFSKIRDLPYYERN YDAYFHKAFKVYTQLWKFQQENRQKLVE GLKRWEIGEIASRIAQLYFGQYMR
Subjt:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR

Query:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF
        TSEASYLSESYVFYEAI+TREYFK+GLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF
Subjt:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF

Query:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL
        MNIRPFRYSVVLEPHPDSLTPV   +T R L+LQDAILSSY+HNEVKF+ELTLDTFRMIQSLEWEPSGSFYRPN NRSGQNGG+GPSRSNF+QDIVDPTL
Subjt:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL

Query:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS
        PSNPRKSILYRPSVTHFLAVLATICEEM SDGVLLIYLSA+G   N LSSPA T++GCESINNAEDIDKT+SPC QVEGG  G Q+ CLSF TRGKGGLS
Subjt:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS

Query:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS
        CIYPSDLVPFTRRPFLLVIDSD SEAFE IHGAEKGEPAAMLLS S TSH VAT+YSRH  GSLFTLFLTAPLHAFCLLLGISGS+VEMDTFSKAE++LS
Subjt:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS

Query:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP
        SSL+EWGQ L TSE+L+QVWAQ+LNDPFIRRLLLRFIFCR+VL LY PT  KKE+VP+CVP+LPS +DPT+AT QSVVMKIAN+ GVS+SF+FS+NL+L 
Subjt:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP

Query:  ES
        E+
Subjt:  ES

A0A6J1DNC3 protein SCAI0.0e+0099.17Show/hide
Query:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR
        MSQPGNPANSSSSN SIPVSEAYWSLVDKADRKFSKIRDLP YERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR
Subjt:  MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMR

Query:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF
        TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKF KAD AF
Subjt:  TSEASYLSESYVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAF

Query:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL
        MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYL+LQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL
Subjt:  MNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTL

Query:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS
        PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS
Subjt:  PSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLS

Query:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS
        CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS
Subjt:  CIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLS

Query:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP
        SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP
Subjt:  SSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLP

Query:  ES
        ES
Subjt:  ES

SwissProt top hitse value%identityAlignment
Q54YY1 Protein SCAI homolog5.8e-5627.41Show/hide
Query:  SQPGNPANSSSSNTS--------IPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEA---GLKRWEIGEIASRI
        S P N   +++S+TS          + + +  L+ K+ R F  +RDLP + R ++  +F K F++YT+LWKFQQ+ R  L +    GLKR EIGEIAS+I
Subjt:  SQPGNPANSSSSNTS--------IPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEA---GLKRWEIGEIASRI

Query:  AQLYFGQYMRTSEASYLSESYVFYEAILTREYFKE-GLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQE
         QLY+  Y+RTS+ +YL+ESY+FYEAI  R YFK+  L +   +  KQLR+ +RF++VCL+LN++++V  L+ +L   +++  + ++ +D +EW LV+QE
Subjt:  AQLYFGQYMRTSEASYLSESYVFYEAILTREYFKE-GLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQE

Query:  IMKFLKAD-------------------------TAFMNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLK-LQDAILSSYHHNEVKFTELTLDTFRMIQSLE
        I  FL+AD                          +  N      + V     + L   SPP      + LQ AIL     N++KF+E+TLD FRM QSLE
Subjt:  IMKFLKAD-------------------------TAFMNIRPFRYSVVLEPHPDSLTPVSPPLTKRYLK-LQDAILSSYHHNEVKFTELTLDTFRMIQSLE

Query:  WEPSGSFYRPNV----------NRSGQNGGSGPSRSNFTQDIVDPTLPS-------------------------NPRKSILYRPSVTHFLAVLATICEEM
        +EP       ++           +  Q       ++N T D  + T+PS                         NP K +LYRP+++  L  L+   +E+
Subjt:  WEPSGSFYRPNV----------NRSGQNGGSGPSRSNFTQDIVDPTLPS-------------------------NPRKSILYRPSVTHFLAVLATICEEM

Query:  PSDGVLLIYLSASG----------------------------------------GAKNVLSSPASTE------------IGCESINNAED-------IDK
          +  +L+Y+ A G                                         A N L +   T+            I   ++NN  +       +  
Subjt:  PSDGVLLIYLSASG----------------------------------------GAKNVLSSPASTE------------IGCESINNAED-------IDK

Query:  TRSPCRQVEGGCIGPQAGCLSFSTRGKGGL--------------SCIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATE
        T +    V            S ST     +                +YP DL+PF R+PF L+++S +S+ F  +      +P   LLS  +    +   
Subjt:  TRSPCRQVEGGCIGPQAGCLSFSTRGKGGL--------------SCIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATE

Query:  YSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLSSSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEF
         S    G+LFT FL  P+ AFC +    G+++   TF+    +  +SL    + L    +L+  ++  L D F+R  ++RFIFC     L+   F    +
Subjt:  YSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLSSSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEF

Query:  VPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFI
          K  P LP  +    +  +S + ++ +   VS  F+
Subjt:  VPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFI

Q8C8N2 Protein SCAI6.2e-7433.39Show/hide
Query:  NSSSSNTSIPVSEA-----YWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEA-GLKRWEIGEIASRIAQLYFGQYMRTS
        +S  +   IP  E      +  L+DK+ + F+ +RDLP Y + ++ +YF + F VYT+LWKFQQ++RQ L    GLKRW+IGEIAS+I QLY+  Y+RTS
Subjt:  NSSSSNTSIPVSEA-----YWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEA-GLKRWEIGEIASRIAQLYFGQYMRTS

Query:  EASYLSESYVFYEAILTREYFKEGLFQD-VSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAFM
        E SYL+E++ FY AI  R Y+ +   +D   L  K+LR+ +RF++VCL+LN+ ++V  LV +L   +++    F   D  EW LV+QE+  F++AD   M
Subjt:  EASYLSESYVFYEAILTREYFKEGLFQD-VSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAFM

Query:  NIRPFRYSVVLEPHPDSLTPVSPPLTKR-----YLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIV
         +      V+     + L     PL ++      L L DA++    +N+VKF+ELT+D FRM+Q+LE EP              N  S  ++    +   
Subjt:  NIRPFRYSVVLEPHPDSLTPVSPPLTKR-----YLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIV

Query:  DPTLPSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLS-SPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRG
         PT   NP K +LY+P+ +     LA   +E+P++ VLLIYLSA+G      S      + G    N+  DI         + G  I  +        + 
Subjt:  DPTLPSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLS-SPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRG

Query:  KGGLSCIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKA
           + C++P DL PFTR+P  +V+DS  S A++       G+P   LLS +     +  +  R   GSLFTLFL  PL AF  + G+  S +    + K 
Subjt:  KGGLSCIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKA

Query:  ESMLSSSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSF
        +  L     +  Q L  S +++Q + Q   D F+R LL RF+FC   + ++   F +    P+  P LP      +   Q  ++++A++  V   F
Subjt:  ESMLSSSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSF

Q8N9R8 Protein SCAI2.1e-7433.56Show/hide
Query:  NSSSSNTSIPVSEA-----YWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEA-GLKRWEIGEIASRIAQLYFGQYMRTS
        +S  +   IP  E      +  L+DK+ + F+ +RDLP Y + ++ +YF + F VYT+LWKFQQ++RQ L    GLKRW+IGEIAS+I QLY+  Y+RTS
Subjt:  NSSSSNTSIPVSEA-----YWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEA-GLKRWEIGEIASRIAQLYFGQYMRTS

Query:  EASYLSESYVFYEAILTREYFKEGLFQD-VSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAFM
        E SYL+E++ FY AI  R Y+ +   +D   L  K+LR+ +RF++VCL+LN+ ++V  LV +L   +++    F   D  EW LV+QE+  F++AD   M
Subjt:  EASYLSESYVFYEAILTREYFKEGLFQD-VSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAFM

Query:  NIRPFRYSVVLEPHPDSLTPVSPPLTKR-----YLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIV
         +      V+     + L     PL ++      L L DA++    +N+VKF+ELT+D FRM+Q+LE EP              N  S  ++    +   
Subjt:  NIRPFRYSVVLEPHPDSLTPVSPPLTKR-----YLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIV

Query:  DPTLPSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLS-SPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRG
         PT   NP K +LY+P+ +     LA   +E+P++ VLLIYLSA+G      S S    + G    N+  DI         + G  I  +        + 
Subjt:  DPTLPSNPRKSILYRPSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLS-SPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRG

Query:  KGGLSCIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKA
           + C++P DL PFTR+P  +++DS  S A++       G+P   LLS +     +  +  R   GSLFTLFL  PL AF  + G+  S +    + K 
Subjt:  KGGLSCIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKA

Query:  ESMLSSSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSF
        +  L     +  Q L  S +++Q + Q   D F+R LL RFIFC   + ++   F +    P+  P LP      +   Q  ++++A++  V   F
Subjt:  ESMLSSSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSF

Arabidopsis top hitse value%identityAlignment
AT3G03570.1 Protein of unknown function (DUF3550/UPF0682)6.1e-21062.11Show/hide
Query:  SSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMRTSEASYLSESY
        S N +IP+SE YWSLV+KAD+KFSKIRDLP+YER+RY+ YF K FKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLY+G YMRTS+A YLSESY
Subjt:  SSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMRTSEASYLSESY

Query:  VFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAFMNIRPFRYSVV
        VFYEAILTREYFK+GLFQD+++ANKQLRFL+RFLMVCLVL RREMVHQLV+Q K L+DECKRTFQETDF+EWK+V QEI++FLK+DTAFMNIRP RYS+V
Subjt:  VFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAFMNIRPFRYSVV

Query:  LEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTLPSNPRKSILYR
        L+P+ D+ T    P   R L+L DAILSSY+ NEVK++ELTLD+FRM+Q LEWEPSGS Y+    + GQN   G +R N +Q + DPTLP NPRK++LYR
Subjt:  LEPHPDSLTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTLPSNPRKSILYR

Query:  PSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINN----------------AEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRG
        PS+THFLAVLATICEE+PS G+LL+YLSASG    + SSP S         N                +  I  +    RQ+    +    G LSF + G
Subjt:  PSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINN----------------AEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRG

Query:  KGGLSCIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKA
          G S IYPSDLVPFTR+P  ++IDSD+S  F+ I GAEKGEPAA+LLS S T   ++ ++SR  +GSLFT+FLT+P+ AFCLL  IS S++E D F+KA
Subjt:  KGGLSCIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKA

Query:  ESMLSSSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSK
        E +LSSS+NEW  +LATS+ L+ VW+Q+L DPF+RRLLLRFIFCR VLALY P FN K+  P+C P+LP  + PT+   QS V ++ANVFG +  F   +
Subjt:  ESMLSSSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSK

Query:  NLVLPES
        ++ + ES
Subjt:  NLVLPES

AT4G40050.1 Protein of unknown function (DUF3550/UPF0682)3.4e-14446.31Show/hide
Query:  VSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMRTSEASYLSESYVFYEAIL
        VS  + +LV+ ADRKF+++RDLP + R +   YF K FK Y +LW +QQ +R KLVE+GL RWEIGEIASRI QLYF QYMRTSEA +L E++VFYEAIL
Subjt:  VSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMRTSEASYLSESYVFYEAIL

Query:  TREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAFMNIRPFRYSVVLEPHPDS
         R YF E   +D+    K+LRF +RFL+V L+++R++M+  L ++L++L+D     F+ET+F+EW+LVVQEI +F+++DT    +RP RY  +L+ +P S
Subjt:  TREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAFMNIRPFRYSVVLEPHPDS

Query:  LTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNG-------GSGPSRSNFTQDIVDPTLPSNPRKSILYR
         T ++    K+  K +DA+L+SYH NEVK+ E+TLDT+RM+Q LEWEPSGSFY+     + +NG        SG    N   D+ DP+LP NPRK+ILYR
Subjt:  LTPVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNG-------GSGPSRSNFTQDIVDPTLPSNPRKSILYR

Query:  PSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTR------------GKGGL
        P+V+H LAVLA IC+E+  + V+L+YLSASGG      +     +G    + ++ + +     +  +     P +     S              G  G 
Subjt:  PSVTHFLAVLATICEEMPSDGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTR------------GKGGL

Query:  SCIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTS-HTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESM
        + +YP DL+PFTR+P  L+IDSD S AF+ + GAE+GEP AMLLS    S    +T+ +   NGS FT FLTAPL AFC +LG+S ++ + +   +AES+
Subjt:  SCIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAAMLLSSSTTS-HTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESM

Query:  LSSSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFS
        LS+S +EW   L TS+ LN VWAQVL DPF+RRL+LRFIFCR+VL  ++ T +   ++P+C P LP  +   S   QS V ++A   GV++SF F+
Subjt:  LSSSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTFNKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGCAGCCGGGAAATCCGGCGAATAGCAGTAGCAGCAACACTAGCATTCCAGTGAGCGAAGCGTACTGGTCTCTGGTTGACAAAGCTGACCGGAAGTTCTCCAAGAT
CAGGGACCTGCCTTATTACGAGCGCAATAGGTATGATGCGTATTTTCACAAGGCGTTCAAAGTGTACACACAGTTGTGGAAATTTCAGCAAGAGAATCGTCAGAAATTGG
TAGAGGCAGGGCTGAAGAGATGGGAGATTGGCGAGATTGCGTCACGGATTGCTCAGCTTTACTTTGGGCAATACATGAGGACCAGTGAGGCGAGCTACTTGTCCGAGTCT
TACGTATTTTATGAAGCAATATTGACTCGAGAGTACTTCAAGGAGGGATTGTTTCAAGACGTCAGTCTTGCGAACAAGCAGTTGAGATTTCTGTCCAGATTTCTAATGGT
GTGCCTCGTTTTGAACCGAAGAGAAATGGTGCATCAATTGGTTAATCAGCTTAAAATGTTGCTCGATGAGTGCAAGAGGACATTCCAGGAAACGGACTTTAGAGAATGGA
AGCTGGTAGTTCAGGAAATAATGAAATTTTTGAAAGCAGATACCGCCTTTATGAATATCAGGCCTTTCAGATATAGTGTTGTGTTAGAGCCTCATCCAGATTCTCTGACG
CCTGTTTCTCCACCACTTACTAAGAGATACTTGAAGTTACAGGATGCTATTCTGAGTAGCTACCATCATAATGAGGTCAAGTTTACAGAGCTCACGCTAGACACTTTCAG
GATGATTCAAAGCCTGGAATGGGAACCTAGTGGTTCATTTTATCGGCCAAATGTTAATAGAAGTGGGCAGAATGGTGGCTCTGGGCCAAGTCGCAGTAACTTCACACAGG
ATATTGTGGATCCTACTCTCCCTTCTAATCCTCGGAAGTCTATTTTATACCGTCCTTCAGTGACACATTTTTTAGCAGTTCTAGCTACAATTTGTGAGGAGATGCCAAGT
GATGGGGTTCTCTTGATATATTTGTCAGCTTCAGGTGGTGCAAAGAATGTTTTATCTTCCCCAGCTAGCACAGAAATAGGCTGTGAGTCTATCAACAATGCAGAGGATAT
TGATAAAACCAGGTCTCCTTGTAGACAAGTTGAAGGAGGCTGCATAGGTCCCCAAGCTGGTTGCTTATCATTCAGTACTCGTGGAAAAGGAGGTCTAAGCTGCATTTATC
CAAGCGACTTAGTTCCATTTACCAGAAGGCCTTTTCTTTTAGTTATTGATAGTGATGCCAGTGAAGCATTTGAGGCTATCCATGGAGCAGAAAAGGGAGAGCCAGCTGCA
ATGCTGTTGTCTTCCAGCACTACTAGTCATACTGTTGCTACCGAATATTCTCGACATGCTAATGGAAGCCTTTTCACCCTTTTTCTCACAGCTCCATTGCATGCTTTCTG
TCTATTGCTTGGTATCTCGGGCTCTGAAGTTGAAATGGATACATTTAGTAAAGCTGAGAGCATGCTGTCCTCGTCATTAAATGAGTGGGGGCAGTCTTTGGCGACATCAG
AGAACCTTAATCAAGTTTGGGCACAGGTTTTGAATGATCCATTCATAAGGCGGCTTCTCCTGAGATTTATATTCTGTCGTACCGTGCTAGCACTTTACGCACCAACCTTC
AATAAGAAGGAATTCGTCCCCAAGTGCGTGCCAACCTTGCCCTCATTTATCGATCCGACATCTGCAACATCGCAATCTGTCGTTATGAAGATAGCAAACGTCTTTGGTGT
AAGCCAAAGTTTCATCTTCTCCAAAAATCTGGTACTTCCTGAAAGC
mRNA sequenceShow/hide mRNA sequence
ATGTCGCAGCCGGGAAATCCGGCGAATAGCAGTAGCAGCAACACTAGCATTCCAGTGAGCGAAGCGTACTGGTCTCTGGTTGACAAAGCTGACCGGAAGTTCTCCAAGAT
CAGGGACCTGCCTTATTACGAGCGCAATAGGTATGATGCGTATTTTCACAAGGCGTTCAAAGTGTACACACAGTTGTGGAAATTTCAGCAAGAGAATCGTCAGAAATTGG
TAGAGGCAGGGCTGAAGAGATGGGAGATTGGCGAGATTGCGTCACGGATTGCTCAGCTTTACTTTGGGCAATACATGAGGACCAGTGAGGCGAGCTACTTGTCCGAGTCT
TACGTATTTTATGAAGCAATATTGACTCGAGAGTACTTCAAGGAGGGATTGTTTCAAGACGTCAGTCTTGCGAACAAGCAGTTGAGATTTCTGTCCAGATTTCTAATGGT
GTGCCTCGTTTTGAACCGAAGAGAAATGGTGCATCAATTGGTTAATCAGCTTAAAATGTTGCTCGATGAGTGCAAGAGGACATTCCAGGAAACGGACTTTAGAGAATGGA
AGCTGGTAGTTCAGGAAATAATGAAATTTTTGAAAGCAGATACCGCCTTTATGAATATCAGGCCTTTCAGATATAGTGTTGTGTTAGAGCCTCATCCAGATTCTCTGACG
CCTGTTTCTCCACCACTTACTAAGAGATACTTGAAGTTACAGGATGCTATTCTGAGTAGCTACCATCATAATGAGGTCAAGTTTACAGAGCTCACGCTAGACACTTTCAG
GATGATTCAAAGCCTGGAATGGGAACCTAGTGGTTCATTTTATCGGCCAAATGTTAATAGAAGTGGGCAGAATGGTGGCTCTGGGCCAAGTCGCAGTAACTTCACACAGG
ATATTGTGGATCCTACTCTCCCTTCTAATCCTCGGAAGTCTATTTTATACCGTCCTTCAGTGACACATTTTTTAGCAGTTCTAGCTACAATTTGTGAGGAGATGCCAAGT
GATGGGGTTCTCTTGATATATTTGTCAGCTTCAGGTGGTGCAAAGAATGTTTTATCTTCCCCAGCTAGCACAGAAATAGGCTGTGAGTCTATCAACAATGCAGAGGATAT
TGATAAAACCAGGTCTCCTTGTAGACAAGTTGAAGGAGGCTGCATAGGTCCCCAAGCTGGTTGCTTATCATTCAGTACTCGTGGAAAAGGAGGTCTAAGCTGCATTTATC
CAAGCGACTTAGTTCCATTTACCAGAAGGCCTTTTCTTTTAGTTATTGATAGTGATGCCAGTGAAGCATTTGAGGCTATCCATGGAGCAGAAAAGGGAGAGCCAGCTGCA
ATGCTGTTGTCTTCCAGCACTACTAGTCATACTGTTGCTACCGAATATTCTCGACATGCTAATGGAAGCCTTTTCACCCTTTTTCTCACAGCTCCATTGCATGCTTTCTG
TCTATTGCTTGGTATCTCGGGCTCTGAAGTTGAAATGGATACATTTAGTAAAGCTGAGAGCATGCTGTCCTCGTCATTAAATGAGTGGGGGCAGTCTTTGGCGACATCAG
AGAACCTTAATCAAGTTTGGGCACAGGTTTTGAATGATCCATTCATAAGGCGGCTTCTCCTGAGATTTATATTCTGTCGTACCGTGCTAGCACTTTACGCACCAACCTTC
AATAAGAAGGAATTCGTCCCCAAGTGCGTGCCAACCTTGCCCTCATTTATCGATCCGACATCTGCAACATCGCAATCTGTCGTTATGAAGATAGCAAACGTCTTTGGTGT
AAGCCAAAGTTTCATCTTCTCCAAAAATCTGGTACTTCCTGAAAGC
Protein sequenceShow/hide protein sequence
MSQPGNPANSSSSNTSIPVSEAYWSLVDKADRKFSKIRDLPYYERNRYDAYFHKAFKVYTQLWKFQQENRQKLVEAGLKRWEIGEIASRIAQLYFGQYMRTSEASYLSES
YVFYEAILTREYFKEGLFQDVSLANKQLRFLSRFLMVCLVLNRREMVHQLVNQLKMLLDECKRTFQETDFREWKLVVQEIMKFLKADTAFMNIRPFRYSVVLEPHPDSLT
PVSPPLTKRYLKLQDAILSSYHHNEVKFTELTLDTFRMIQSLEWEPSGSFYRPNVNRSGQNGGSGPSRSNFTQDIVDPTLPSNPRKSILYRPSVTHFLAVLATICEEMPS
DGVLLIYLSASGGAKNVLSSPASTEIGCESINNAEDIDKTRSPCRQVEGGCIGPQAGCLSFSTRGKGGLSCIYPSDLVPFTRRPFLLVIDSDASEAFEAIHGAEKGEPAA
MLLSSSTTSHTVATEYSRHANGSLFTLFLTAPLHAFCLLLGISGSEVEMDTFSKAESMLSSSLNEWGQSLATSENLNQVWAQVLNDPFIRRLLLRFIFCRTVLALYAPTF
NKKEFVPKCVPTLPSFIDPTSATSQSVVMKIANVFGVSQSFIFSKNLVLPES