; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh18G012760 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh18G012760
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionINVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: 6 growth stages;
Genome locationCmo_Chr18:12388723..12393746
RNA-Seq ExpressionCmoCh18G012760
SyntenyCmoCh18G012760
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR001119 - S-layer homology domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6574185.1 60S ribosomal protein L36-2, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0082.77Show/hide
Query:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW
        MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRL+VI RPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW
Subjt:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW

Query:  LAGLVGIGITGFILVSGITFAAWSINKQNCSK--------------------------------------------------NDATLNKNRIGAVADVEE
        LAGLVGIGITGFILVSGITFAAWSINKQNCSK                                                  NDATLNKNR+GAVADVEE
Subjt:  LAGLVGIGITGFILVSGITFAAWSINKQNCSK--------------------------------------------------NDATLNKNRIGAVADVEE

Query:  LSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPETKYDFSSEKLPVYDDSSSNYNSGY
        LSGNDVESSSNNDN+NNVAFLQEDIQSDSSLAVTSVAYG SEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNS ETKYDFSSEKLPVYDDSSSNYNSGY
Subjt:  LSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPETKYDFSSEKLPVYDDSSSNYNSGY

Query:  QDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTMNS-----------
        QDETADPPVNE DDSSLHELGLVDKETVTESLEGVLNPGKT+QLLSEETASTIEQQIGRGLS+AAFVSVTAYPLADDQEELNHETTMNS           
Subjt:  QDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTMNS-----------

Query:  -----VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITPED
             VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITPED
Subjt:  -----VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITPED

Query:  PDFASIQGLAEAGLISSKLSRHDILSSLDDDE--------------------------------------------------------------------
        PDFASIQGLAEAGLISSKLSRHDILSSLDDDE                                                                    
Subjt:  PDFASIQGLAEAGLISSKLSRHDILSSLDDDE--------------------------------------------------------------------

Query:  ----GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLRSE
            GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLRSE
Subjt:  ----GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLRSE

Query:  RERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM-----------AREQAKALE
        RERDNIALMRER AIESEMEVLSRLRNELE+QLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM           AREQAKALE
Subjt:  RERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM-----------AREQAKALE

Query:  EARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNVGI
        EARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETV RAE LMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNVGI
Subjt:  EARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNVGI

Query:  ARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTS
        ARASRSATELQ+SSAEMGLALKEGAKRVVGDCREGVEKISQKFRTS
Subjt:  ARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTS

XP_022945850.1 uncharacterized protein LOC111449970 isoform X1 [Cucurbita moschata]0.0e+0084.28Show/hide
Query:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW
        MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW
Subjt:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW

Query:  LAGLVGIGITGFILVSGITFAAWSINKQNCSK--------------------------------------------------NDATLNKNRIGAVADVEE
        LAGLVGIGITGFILVSGITFAAWSINKQNCSK                                                  NDATLNKNRIGAVADVEE
Subjt:  LAGLVGIGITGFILVSGITFAAWSINKQNCSK--------------------------------------------------NDATLNKNRIGAVADVEE

Query:  LSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPETKYDFSSEKLPVYDDSSSNYNSGY
        LSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPETKYDFSSEKLPVYDDSSSNYNSGY
Subjt:  LSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPETKYDFSSEKLPVYDDSSSNYNSGY

Query:  QDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTMNS-----------
        QDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTMNS           
Subjt:  QDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTMNS-----------

Query:  -----VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITPED
             VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITPED
Subjt:  -----VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITPED

Query:  PDFASIQGLAEAGLISSKLSRHDILSSLDDDE--------------------------------------------------------------------
        PDFASIQGLAEAGLISSKLSRHDILSSLDDDE                                                                    
Subjt:  PDFASIQGLAEAGLISSKLSRHDILSSLDDDE--------------------------------------------------------------------

Query:  ----GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLRSE
            GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLRSE
Subjt:  ----GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLRSE

Query:  RERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM-----------AREQAKALE
        RERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM           AREQAKALE
Subjt:  RERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM-----------AREQAKALE

Query:  EARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNVGI
        EARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNVGI
Subjt:  EARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNVGI

Query:  ARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG
        ARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG
Subjt:  ARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG

XP_022968011.1 uncharacterized protein LOC111467385 isoform X1 [Cucurbita maxima]0.0e+0080.02Show/hide
Query:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW
        MASTPSTCSPNSLQLRLALNCKN  KFP +PVRATVRKLDPRL+VICRPIVHNSAKIARTNGLRR+GICFAGSDSKADGFSGWSESDSGEEDLNLR KNW
Subjt:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW

Query:  LAGLVGIGITGFILVSGITFAAWSINKQNCSK--------------------------------------------------NDATLNKNRIGAVADVEE
        LAGLVGIGITGFILVSGITFAAWSINKQNCSK                                                  ND TLNKNR+GAVADVEE
Subjt:  LAGLVGIGITGFILVSGITFAAWSINKQNCSK--------------------------------------------------NDATLNKNRIGAVADVEE

Query:  LSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPET--KYDFSSEKLPVYDDSSSNYNS
         SG+DVE SSNNDNVNNVAFLQEDIQSDSSLAVT VAYG SEID D+DSG KDVNSGTEVLTSEPEMNDEPDNSPET  KYDFSSEKLPVYDDSSSNYNS
Subjt:  LSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPET--KYDFSSEKLPVYDDSSSNYNS

Query:  GYQDETADPPVNEIDDSSLH-----------ELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTM
        GYQDETADPPVNEIDDSSLH           ELGLV+KE VTESLEGVLNPGKT++LLSEETASTIEQ+IGRGLSKAAFVSVTAYPLADDQE+ NHETTM
Subjt:  GYQDETADPPVNEIDDSSLH-----------ELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTM

Query:  NS----------------VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIEN
        NS                VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIEN
Subjt:  NS----------------VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIEN

Query:  VTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE-------------------------------------------------------
        VTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE                                                       
Subjt:  VTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE-------------------------------------------------------

Query:  -----------------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMA
                         GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMA
Subjt:  -----------------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMA

Query:  EEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM-------
        EEAKQELERLR E+ERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM       
Subjt:  EEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM-------

Query:  ----AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRN
            AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETV RAENLMDKLKGM  EVRGKSKDIIEMIIQKIALLISNLRQWVRN
Subjt:  ----AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRN

Query:  AGEKAEDVKNVGIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG
        AGEKAEDVKNV IARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG
Subjt:  AGEKAEDVKNVGIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG

XP_022968012.1 uncharacterized protein LOC111467385 isoform X2 [Cucurbita maxima]0.0e+0080.95Show/hide
Query:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW
        MASTPSTCSPNSLQLRLALNCKN  KFP +PVRATVRKLDPRL+VICRPIVHNSAKIARTNGLRR+GICFAGSDSKADGFSGWSESDSGEEDLNLR KNW
Subjt:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW

Query:  LAGLVGIGITGFILVSGITFAAWSINKQNCSK--------------------------------------------------NDATLNKNRIGAVADVEE
        LAGLVGIGITGFILVSGITFAAWSINKQNCSK                                                  ND TLNKNR+GAVADVEE
Subjt:  LAGLVGIGITGFILVSGITFAAWSINKQNCSK--------------------------------------------------NDATLNKNRIGAVADVEE

Query:  LSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPET--KYDFSSEKLPVYDDSSSNYNS
         SG+DVE SSNNDNVNNVAFLQEDIQSDSSLAVT VAYG SEID D+DSG KDVNSGTEVLTSEPEMNDEPDNSPET  KYDFSSEKLPVYDDSSSNYNS
Subjt:  LSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPET--KYDFSSEKLPVYDDSSSNYNS

Query:  GYQDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTMNS---------
        GYQDETADPPVNEIDDSSLHELGLV+KE VTESLEGVLNPGKT++LLSEETASTIEQ+IGRGLSKAAFVSVTAYPLADDQE+ NHETTMNS         
Subjt:  GYQDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTMNS---------

Query:  -------VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITP
               VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITP
Subjt:  -------VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITP

Query:  EDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE------------------------------------------------------------------
        EDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE                                                                  
Subjt:  EDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE------------------------------------------------------------------

Query:  ------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLR
              GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLR
Subjt:  ------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLR

Query:  SERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM-----------AREQAKA
         E+ERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM           AREQAKA
Subjt:  SERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM-----------AREQAKA

Query:  LEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNV
        LEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETV RAENLMDKLKGM  EVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNV
Subjt:  LEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNV

Query:  GIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG
         IARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG
Subjt:  GIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG

XP_023541683.1 uncharacterized protein LOC111801768 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0081.33Show/hide
Query:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW
        MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRL+VI RPIVHNSAKIARTNGLRRN ICFAGSDSKADGFSGWSESDSGEEDLNLR KNW
Subjt:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW

Query:  LAGLVGIGITGFILVSGITFAAWSINKQNCSK--------------------------------------------------NDATLNKNRIGAVADVEE
        LAGLVGIGITGFILVSGITFAAWSINKQNCS+                                                  ND  LNKNR+G VA VEE
Subjt:  LAGLVGIGITGFILVSGITFAAWSINKQNCSK--------------------------------------------------NDATLNKNRIGAVADVEE

Query:  LSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPETKYDFSSEKLPVYDDSSSNYNSGY
        LSGNDVESSSNNDNVNNVAFLQEDIQSDS LAVTSVAYG SEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPETKYDFSSEK PVYDDSSSNYNSGY
Subjt:  LSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPETKYDFSSEKLPVYDDSSSNYNSGY

Query:  QDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTMNS-----------
        QDETADPPVN IDDSSLHELGLVD+ETVTESLEGVLNPGKT+QLLSEET STIEQQIGRGLS+AAFVSVTAYPLADDQE+ NHETT+NS           
Subjt:  QDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTMNS-----------

Query:  -----VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITPED
             VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITPED
Subjt:  -----VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITPED

Query:  PDFASIQGLAEAGLISSKLSRHDILSSLDDDE--------------------------------------------------------------------
        PDFASIQGLAEAGLISSKLSRHDILSSLDDDE                                                                    
Subjt:  PDFASIQGLAEAGLISSKLSRHDILSSLDDDE--------------------------------------------------------------------

Query:  ----GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLRSE
            GYTRLFQPDKPVTKAQAAIALA GEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLR E
Subjt:  ----GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLRSE

Query:  RERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM-----------AREQAKALE
        RERDNIALM+ERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM           AREQAKALE
Subjt:  RERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM-----------AREQAKALE

Query:  EARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNVGI
        EARDRWEK+GIKVVVDSDLRE+ESAGDTWLDSSKQFAVEETV RAENLMDKLKGMGREVRGKSKDIIEMIIQKIA+LISNLRQWVRNAGEKAEDVKNVGI
Subjt:  EARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNVGI

Query:  ARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG
        ARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG
Subjt:  ARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG

TrEMBL top hitse value%identityAlignment
A0A6J1G212 uncharacterized protein LOC111449970 isoform X20.0e+0086.85Show/hide
Query:  SKNDATLNKNRIGAVADVEELSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPETKYD
        ++NDATLNKNRIGAVADVEELSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPETKYD
Subjt:  SKNDATLNKNRIGAVADVEELSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPETKYD

Query:  FSSEKLPVYDDSSSNYNSGYQDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEE
        FSSEKLPVYDDSSSNYNSGYQDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEE
Subjt:  FSSEKLPVYDDSSSNYNSGYQDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEE

Query:  LNHETTMNS----------------VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVY
        LNHETTMNS                VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVY
Subjt:  LNHETTMNS----------------VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVY

Query:  PAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE------------------------------------------------
        PAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE                                                
Subjt:  PAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE------------------------------------------------

Query:  ------------------------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKA
                                GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKA
Subjt:  ------------------------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKA

Query:  DAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM
        DAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM
Subjt:  DAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM

Query:  -----------AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISN
                   AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISN
Subjt:  -----------AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISN

Query:  LRQWVRNAGEKAEDVKNVGIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG
        LRQWVRNAGEKAEDVKNVGIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG
Subjt:  LRQWVRNAGEKAEDVKNVGIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG

A0A6J1G255 uncharacterized protein LOC111449970 isoform X10.0e+0084.28Show/hide
Query:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW
        MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW
Subjt:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW

Query:  LAGLVGIGITGFILVSGITFAAWSINKQNCSK--------------------------------------------------NDATLNKNRIGAVADVEE
        LAGLVGIGITGFILVSGITFAAWSINKQNCSK                                                  NDATLNKNRIGAVADVEE
Subjt:  LAGLVGIGITGFILVSGITFAAWSINKQNCSK--------------------------------------------------NDATLNKNRIGAVADVEE

Query:  LSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPETKYDFSSEKLPVYDDSSSNYNSGY
        LSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPETKYDFSSEKLPVYDDSSSNYNSGY
Subjt:  LSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPETKYDFSSEKLPVYDDSSSNYNSGY

Query:  QDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTMNS-----------
        QDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTMNS           
Subjt:  QDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTMNS-----------

Query:  -----VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITPED
             VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITPED
Subjt:  -----VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITPED

Query:  PDFASIQGLAEAGLISSKLSRHDILSSLDDDE--------------------------------------------------------------------
        PDFASIQGLAEAGLISSKLSRHDILSSLDDDE                                                                    
Subjt:  PDFASIQGLAEAGLISSKLSRHDILSSLDDDE--------------------------------------------------------------------

Query:  ----GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLRSE
            GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLRSE
Subjt:  ----GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLRSE

Query:  RERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM-----------AREQAKALE
        RERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM           AREQAKALE
Subjt:  RERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM-----------AREQAKALE

Query:  EARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNVGI
        EARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNVGI
Subjt:  EARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNVGI

Query:  ARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG
        ARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG
Subjt:  ARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG

A0A6J1HSB9 uncharacterized protein LOC111467385 isoform X20.0e+0080.95Show/hide
Query:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW
        MASTPSTCSPNSLQLRLALNCKN  KFP +PVRATVRKLDPRL+VICRPIVHNSAKIARTNGLRR+GICFAGSDSKADGFSGWSESDSGEEDLNLR KNW
Subjt:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW

Query:  LAGLVGIGITGFILVSGITFAAWSINKQNCSK--------------------------------------------------NDATLNKNRIGAVADVEE
        LAGLVGIGITGFILVSGITFAAWSINKQNCSK                                                  ND TLNKNR+GAVADVEE
Subjt:  LAGLVGIGITGFILVSGITFAAWSINKQNCSK--------------------------------------------------NDATLNKNRIGAVADVEE

Query:  LSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPET--KYDFSSEKLPVYDDSSSNYNS
         SG+DVE SSNNDNVNNVAFLQEDIQSDSSLAVT VAYG SEID D+DSG KDVNSGTEVLTSEPEMNDEPDNSPET  KYDFSSEKLPVYDDSSSNYNS
Subjt:  LSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPET--KYDFSSEKLPVYDDSSSNYNS

Query:  GYQDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTMNS---------
        GYQDETADPPVNEIDDSSLHELGLV+KE VTESLEGVLNPGKT++LLSEETASTIEQ+IGRGLSKAAFVSVTAYPLADDQE+ NHETTMNS         
Subjt:  GYQDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTMNS---------

Query:  -------VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITP
               VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITP
Subjt:  -------VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITP

Query:  EDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE------------------------------------------------------------------
        EDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE                                                                  
Subjt:  EDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE------------------------------------------------------------------

Query:  ------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLR
              GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLR
Subjt:  ------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLR

Query:  SERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM-----------AREQAKA
         E+ERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM           AREQAKA
Subjt:  SERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM-----------AREQAKA

Query:  LEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNV
        LEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETV RAENLMDKLKGM  EVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNV
Subjt:  LEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNV

Query:  GIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG
         IARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG
Subjt:  GIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG

A0A6J1HTP0 uncharacterized protein LOC111467385 isoform X10.0e+0080.02Show/hide
Query:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW
        MASTPSTCSPNSLQLRLALNCKN  KFP +PVRATVRKLDPRL+VICRPIVHNSAKIARTNGLRR+GICFAGSDSKADGFSGWSESDSGEEDLNLR KNW
Subjt:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW

Query:  LAGLVGIGITGFILVSGITFAAWSINKQNCSK--------------------------------------------------NDATLNKNRIGAVADVEE
        LAGLVGIGITGFILVSGITFAAWSINKQNCSK                                                  ND TLNKNR+GAVADVEE
Subjt:  LAGLVGIGITGFILVSGITFAAWSINKQNCSK--------------------------------------------------NDATLNKNRIGAVADVEE

Query:  LSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPET--KYDFSSEKLPVYDDSSSNYNS
         SG+DVE SSNNDNVNNVAFLQEDIQSDSSLAVT VAYG SEID D+DSG KDVNSGTEVLTSEPEMNDEPDNSPET  KYDFSSEKLPVYDDSSSNYNS
Subjt:  LSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPET--KYDFSSEKLPVYDDSSSNYNS

Query:  GYQDETADPPVNEIDDSSLH-----------ELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTM
        GYQDETADPPVNEIDDSSLH           ELGLV+KE VTESLEGVLNPGKT++LLSEETASTIEQ+IGRGLSKAAFVSVTAYPLADDQE+ NHETTM
Subjt:  GYQDETADPPVNEIDDSSLH-----------ELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTM

Query:  NS----------------VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIEN
        NS                VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIEN
Subjt:  NS----------------VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIEN

Query:  VTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE-------------------------------------------------------
        VTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE                                                       
Subjt:  VTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE-------------------------------------------------------

Query:  -----------------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMA
                         GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMA
Subjt:  -----------------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMA

Query:  EEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM-------
        EEAKQELERLR E+ERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM       
Subjt:  EEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM-------

Query:  ----AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRN
            AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETV RAENLMDKLKGM  EVRGKSKDIIEMIIQKIALLISNLRQWVRN
Subjt:  ----AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRN

Query:  AGEKAEDVKNVGIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG
        AGEKAEDVKNV IARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG
Subjt:  AGEKAEDVKNVGIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG

A0A6J1HWU2 uncharacterized protein LOC111467385 isoform X30.0e+0082.46Show/hide
Query:  SKNDATLNKNRIGAVADVEELSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPET--K
        ++ND TLNKNR+GAVADVEE SG+DVE SSNNDNVNNVAFLQEDIQSDSSLAVT VAYG SEID D+DSG KDVNSGTEVLTSEPEMNDEPDNSPET  K
Subjt:  SKNDATLNKNRIGAVADVEELSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDEPDNSPET--K

Query:  YDFSSEKLPVYDDSSSNYNSGYQDETADPPVNEIDDSSLH-----------ELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFV
        YDFSSEKLPVYDDSSSNYNSGYQDETADPPVNEIDDSSLH           ELGLV+KE VTESLEGVLNPGKT++LLSEETASTIEQ+IGRGLSKAAFV
Subjt:  YDFSSEKLPVYDDSSSNYNSGYQDETADPPVNEIDDSSLH-----------ELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFV

Query:  SVTAYPLADDQEELNHETTMNS----------------VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSA
        SVTAYPLADDQE+ NHETTMNS                VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSA
Subjt:  SVTAYPLADDQEELNHETTMNS----------------VPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSA

Query:  SCALSRNTASKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE-----------------------------------
        SCALSRNTASKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE                                   
Subjt:  SCALSRNTASKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE-----------------------------------

Query:  -------------------------------------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINA
                                             GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINA
Subjt:  -------------------------------------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINA

Query:  SFEKELSIEREKADAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRL
        SFEKELSIEREKADAVEKMAEEAKQELERLR E+ERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRL
Subjt:  SFEKELSIEREKADAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRL

Query:  QYELEVERKALSM-----------AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDII
        QYELEVERKALSM           AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETV RAENLMDKLKGM  EVRGKSKDII
Subjt:  QYELEVERKALSM-----------AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDII

Query:  EMIIQKIALLISNLRQWVRNAGEKAEDVKNVGIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG
        EMIIQKIALLISNLRQWVRNAGEKAEDVKNV IARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG
Subjt:  EMIIQKIALLISNLRQWVRNAGEKAEDVKNVGIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRTSYG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G25680.1 FUNCTIONS IN: molecular_function unknown1.7e-2928.33Show/hide
Query:  EELNHETTMNSVPAPLVSAAV-KTLPGK---VLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTE
        +E N  +  NS     VS  V  T  GK   V  P  VD  Q +A+A L+ LK+ E ++    LCT+REYARWLV ++  L RN    + PA+ +   + 
Subjt:  EELNHETTMNSVPAPLVSAAV-KTLPGK---VLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTE

Query:  LAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE----------------------------------------------------------
         AFDDI   DPDF  IQ LAEAG+ SSKLS  D  + L +                                                            
Subjt:  LAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSLDDDE----------------------------------------------------------

Query:  ----------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQEL
                  G  + FQP++PVTKAQAA+AL +G+    ++ EL+R+EAES+++ A           + +I   +++++  ER +   +E++      E+
Subjt:  ----------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQEL

Query:  ERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARE-----------
        E  ++ +E+ +   ++E+AAI+ + ++L+ L  E++E  Q L+S+K     E  ++ ++  + + + + +   +  LE E +AL + R            
Subjt:  ERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMARE-----------

Query:  QAKALEEARDRWE
        +AK LEEA  RW+
Subjt:  QAKALEEARDRWE

AT5G23890.1 LOCATED IN: mitochondrion, chloroplast thylakoid membrane, chloroplast, plastid, chloroplast envelope1.3e-14339.59Show/hide
Query:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGE-EDLNLRTKN
        MAS  +T +P SLQLRLAL+     K P + +R +         ++C   V    ++    G          S S AD  +GW +SD+ + +   ++ K+
Subjt:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGE-EDLNLRTKN

Query:  WLAGLVGIGITGFILVSGITFAAWSINKQNCSKNDATLNKNRIGAVADVEELSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEI----DS
         + G+VG G+ G IL  G+++AA S +K+   +   +L   +   +   +E+S ++++ +  N   +N+    + I+S+     +    G  ++     S
Subjt:  WLAGLVGIGITGFILVSGITFAAWSINKQNCSKNDATLNKNRIGAVADVEELSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEI----DS

Query:  DVDSGFKDVNSGTEVL---TSEPEMNDEPDNSPET----------KYDFSSEKLPVYDDSSSNYNSGYQDETADPP-----------------VNEIDDS
          D    D    TE +   T E ++    +  PET          K    S   P+  D+ S+   G ++  ++ P                 VN   + 
Subjt:  DVDSGFKDVNSGTEVL---TSEPEMNDEPDNSPET----------KYDFSSEKLPVYDDSSSNYNSGYQDETADPP-----------------VNEIDDS

Query:  SLHELGLVD----KETVTE------SLEGVLNPGKTKQLLSEETASTIEQQIGR-------------------GLSKAAFVSVTAYPLADD---------
        SL  L  +D      TVTE       L+    P +   L   ETA    +++                       +K    S +  P + D         
Subjt:  SLHELGLVD----KETVTE------SLEGVLNPGKTKQLLSEETASTIEQQIGR-------------------GLSKAAFVSVTAYPLADD---------

Query:  -QEELNHETTM------------NSVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVY
         Q+EL+    +              +PAP +S  V   PGK+LVP   DQ+Q QA AALQVLKVIE++ +PS LCTRREYARWL+SAS ALSRNT SKVY
Subjt:  -QEELNHETTM------------NSVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVY

Query:  PAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSS--------------------------------------------LDDDE----
        PAMYIENVTELAFDDITPEDPDF+SIQGLAEAGLI+SKLS  D+L                                              +D D+    
Subjt:  PAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSS--------------------------------------------LDDDE----

Query:  ---------------------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAV
                             G TRLFQP KPVTK QAAIAL++GEASDIVSEELARIEAESMAE AV+AH+ALVA+VEKD+NASFEKELS+EREK +AV
Subjt:  ---------------------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAV

Query:  EKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM---
        EKMAE AK ELE+LR +RE +N+AL++ERAA+ESEMEVLSRLR + EE+L+ LMSNK E+++EKER+  LRKEAE E+Q IS+LQYELEVERKALSM   
Subjt:  EKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM---

Query:  --------AREQAKALEEARDRWEKRGIKVVVDSDLRE---QESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISN
                AREQ +ALEEAR RWE  G++VVVD DL+E   +E+     L+  ++ +VEET  RA+ LMDKLK M   V GKS+++I  +++KI L I+ 
Subjt:  --------AREQAKALEEARDRWEKRGIKVVVDSDLRE---QESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISN

Query:  LRQWVRNAGEKAEDVKNVGIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRT
        L+++  N G++A ++++  I RA  +A +++Q + +    + +  K++  +CR+GV KISQ+F+T
Subjt:  LRQWVRNAGEKAEDVKNVGIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRT

AT5G52410.1 CONTAINS InterPro DOMAIN/s: S-layer homology domain (InterPro:IPR001119)4.4e-11553.25Show/hide
Query:  AALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLS----------------
        AALQ LKVIES+  P  LCTRRE+ARW+VSAS  LSRN+ASKVYPAMYIENVTELAFDDITPEDPDF  IQGLAEAGLISSKLS                
Subjt:  AALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLS----------------

Query:  -----RHDILS-----------------------SLDDDE-------------------------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELA
             R D+LS                        LD D+                         G TRLFQP K VTKAQ A++LA G+A ++V EELA
Subjt:  -----RHDILS-----------------------SLDDDE-------------------------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELA

Query:  RIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSN
        RIEAE+MAEN V AH+ LVAQVEKDINASFEKEL  E+E  DAVEK+AEEAK EL RLR E+E + +AL RER +IE+EME L+R+RNELEEQLQ L SN
Subjt:  RIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSN

Query:  KVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM-----------AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGD-TWLDSSKQFAV
        K E+SYEKER ++L+K+ E ENQEI RLQ ELEVER ALS+           AREQAK LEEAR RWEK G+KV+VDSDL EQ +  + TWL++ KQ  V
Subjt:  KVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSM-----------AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGD-TWLDSSKQFAV

Query:  EETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNVGIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEK
        E T+ RA NL+ KLK M ++V  KS+++I +II+KI+LLIS L+Q V     KA+D+K    ++A     +    + E+       AK  V + ++ V K
Subjt:  EETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNVGIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEK

Query:  ISQKFRT
        + +KF++
Subjt:  ISQKFRT

AT5G52410.2 INVOLVED IN: biological_process unknown7.3e-12641Show/hide
Query:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW
        MAST +T +P+SLQLR+ALN          P RA + KL  RL++ C   V  +A+  R +G   NG         +D F GW  +DSG+++ N R  +W
Subjt:  MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNW

Query:  LAGLVGIGITGFILVSGITFAAWSINKQNCSKNDATLNKNRIGAVADVEELSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSG
          G +  G+ G +L  G+T+AA S  +          N  R      V  ++ + ++  S ++N  N+   Q++ +S  +                    
Subjt:  LAGLVGIGITGFILVSGITFAAWSINKQNCSKNDATLNKNRIGAVADVEELSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSG

Query:  FKDVNSGTEVLTSEPEMNDEPDNSPETKYD-FSSEKLPVYDDSSSNYNSGYQDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEET
        F DV+  ++VL+  P+  D    S  T+ D   +EK  V   S+  Y S            E+D    H                      T Q+ +E+ 
Subjt:  FKDVNSGTEVLTSEPEMNDEPDNSPETKYD-FSSEKLPVYDDSSSNYNSGYQDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEET

Query:  ASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTMNSVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLV
         +                                      +PAP     V +L  K + P VVD VQ Q  AALQ LKVIES+  P  LCTRRE+ARW+V
Subjt:  ASTIEQQIGRGLSKAAFVSVTAYPLADDQEELNHETTMNSVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLV

Query:  SASCALSRNTASKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLS---------------------RHDILS------------------
        SAS  LSRN+ASKVYPAMYIENVTELAFDDITPEDPDF  IQGLAEAGLISSKLS                     R D+LS                  
Subjt:  SASCALSRNTASKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLS---------------------RHDILS------------------

Query:  -----SLDDDE-------------------------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINAS
              LD D+                         G TRLFQP K VTKAQ A++LA G+A ++V EELARIEAE+MAEN V AH+ LVAQVEKDINAS
Subjt:  -----SLDDDE-------------------------GYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINAS

Query:  FEKELSIEREKADAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQ
        FEKEL  E+E  DAVEK+AEEAK EL RLR E+E + +AL RER +IE+EME L+R+RNELEEQLQ L SNK E+SYEKER ++L+K+ E ENQEI RLQ
Subjt:  FEKELSIEREKADAVEKMAEEAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQ

Query:  YELEVERKALSM-----------AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGD-TWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDII
         ELEVER ALS+           AREQAK LEEAR RWEK G+KV+VDSDL EQ +  + TWL++ KQ  VE T+ RA NL+ KLK M ++V  KS+++I
Subjt:  YELEVERKALSM-----------AREQAKALEEARDRWEKRGIKVVVDSDLREQESAGD-TWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDII

Query:  EMIIQKIALLISNLRQWVRNAGEKAEDVKNVGIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRT
         +II+KI+LLIS L+Q V     KA+D+K    ++A     +    + E+       AK  V + ++ V K+ +KF++
Subjt:  EMIIQKIALLISNLRQWVRNAGEKAEDVKNVGIARASRSATELQQSSAEMGLALKEGAKRVVGDCREGVEKISQKFRT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCACTCCTTCCACATGTTCGCCCAACTCTCTCCAGCTTCGTCTCGCTCTCAATTGCAAGAATTCCCCCAAATTCCCTTTCCTTCCTGTTCGCGCCACAGTGCG
GAAGTTGGATCCTCGCCTCCAGGTGATTTGCCGCCCTATTGTTCATAATAGCGCGAAAATCGCGCGAACAAATGGACTGCGTCGGAATGGAATTTGTTTTGCTGGATCGG
ATTCCAAGGCTGATGGATTCTCCGGGTGGTCGGAATCGGATTCCGGCGAGGAGGATTTGAACTTGCGGACAAAGAATTGGTTGGCAGGGTTGGTGGGGATTGGAATTACT
GGATTCATCCTTGTCTCTGGAATCACCTTTGCAGCATGGTCGATAAACAAGCAGAATTGTTCCAAAAATGATGCAACTCTCAACAAAAATCGAATTGGTGCCGTTGCTGA
TGTCGAGGAGTTATCAGGAAATGATGTTGAATCTTCATCTAATAATGATAATGTCAATAATGTTGCTTTCTTGCAAGAGGATATCCAATCTGATTCCTCGTTAGCTGTTA
CGTCGGTCGCTTATGGAAGATCTGAGATCGATTCGGACGTTGATTCTGGTTTTAAAGATGTAAATTCTGGTACAGAAGTGTTAACTTCTGAACCTGAAATGAATGATGAA
CCGGATAACTCGCCTGAAACGAAGTATGATTTCAGCTCTGAAAAGTTACCAGTCTATGATGATAGTTCATCAAATTATAATTCTGGCTACCAGGATGAGACAGCTGATCC
TCCTGTAAATGAAATCGACGATTCTTCATTGCATGAATTGGGATTAGTTGACAAAGAGACCGTGACCGAATCATTGGAAGGAGTTCTGAACCCTGGCAAAACCAAACAGC
TCTTATCTGAGGAAACTGCATCGACCATAGAACAACAAATAGGAAGAGGATTATCCAAAGCTGCGTTTGTCTCTGTCACAGCTTATCCACTGGCAGATGATCAAGAAGAA
CTAAATCATGAAACAACCATGAATAGTGTTCCTGCTCCTCTGGTTTCTGCAGCTGTTAAGACACTTCCTGGCAAGGTTCTAGTGCCTGCAGTTGTGGATCAGGTTCAGGG
GCAGGCCTTGGCAGCACTGCAAGTTTTAAAGGTGATCGAGTCTGAAGTTGAGCCGAGTGGTCTATGTACTCGTCGGGAATATGCTCGTTGGCTAGTGTCTGCAAGCTGTG
CGCTTTCAAGGAACACTGCATCTAAAGTATACCCAGCAATGTATATAGAGAATGTTACCGAGCTTGCTTTTGATGATATTACTCCTGAAGACCCTGATTTTGCATCTATT
CAAGGTTTGGCAGAAGCTGGACTGATTTCGAGCAAGCTTTCGAGACATGATATTCTTTCTTCATTGGACGACGACGAGGGATACACAAGGCTATTCCAGCCGGATAAGCC
TGTAACAAAAGCACAAGCTGCCATTGCTCTTGCAACTGGGGAGGCTTCTGATATCGTAAGCGAGGAGCTTGCAAGGATTGAAGCCGAGTCAATGGCGGAGAACGCTGTGG
CTGCACACAGTGCTTTGGTAGCTCAAGTTGAGAAAGATATTAACGCCAGCTTCGAGAAGGAACTTTCCATCGAAAGGGAAAAGGCTGACGCTGTGGAGAAAATGGCAGAA
GAGGCAAAGCAAGAATTGGAAAGATTAAGATCTGAAAGAGAGAGAGATAATATCGCCTTGATGAGGGAACGTGCTGCTATCGAATCGGAAATGGAGGTTCTTTCGAGGTT
AAGAAATGAGTTGGAGGAGCAGTTGCAAGGCCTGATGAGTAATAAAGTAGAGGTATCTTATGAAAAGGAAAGAATCAACAAACTCAGGAAAGAAGCTGAAATTGAAAATC
AGGAGATTTCCCGCTTGCAATATGAGCTTGAGGTTGAGAGAAAGGCGTTGTCCATGGCCAGAGAACAAGCAAAAGCACTAGAGGAGGCAAGAGATCGCTGGGAAAAGCGG
GGCATCAAAGTAGTCGTCGACAGCGATCTCCGGGAACAGGAATCTGCTGGAGATACTTGGCTCGATTCCAGCAAACAGTTTGCAGTGGAGGAAACGGTTGGTAGGGCCGA
GAACTTAATGGACAAGCTTAAAGGAATGGGTAGAGAAGTAAGAGGGAAATCCAAAGACATAATTGAGATGATCATCCAGAAGATAGCTCTGCTGATATCAAACTTGAGAC
AATGGGTCCGGAATGCTGGTGAGAAGGCTGAAGATGTAAAGAATGTGGGCATCGCAAGGGCAAGTAGATCGGCAACTGAACTGCAACAGAGCAGTGCAGAGATGGGGTTG
GCCCTGAAGGAGGGAGCTAAGCGAGTTGTGGGAGATTGTAGGGAAGGAGTAGAGAAAATTTCCCAAAAGTTCAGAACATCGTACGGTTAA
mRNA sequenceShow/hide mRNA sequence
AGTAAAAAAGACCCCCTTTTCTTCCGCCCTTTTATCCACCAAAATCCCTACACCTTTTTCTCATTCAATTCCTTCTTTTTTTACAAATTCAAGCTTGAATTTTCTTCCTC
AATCCAATCTTCCTCAATTTACAGTAAATTACTCAATCTCCGTTTGGAGTAGGCTTTTTCATGGCTTCCACTCCTTCCACATGTTCGCCCAACTCTCTCCAGCTTCGTCT
CGCTCTCAATTGCAAGAATTCCCCCAAATTCCCTTTCCTTCCTGTTCGCGCCACAGTGCGGAAGTTGGATCCTCGCCTCCAGGTGATTTGCCGCCCTATTGTTCATAATA
GCGCGAAAATCGCGCGAACAAATGGACTGCGTCGGAATGGAATTTGTTTTGCTGGATCGGATTCCAAGGCTGATGGATTCTCCGGGTGGTCGGAATCGGATTCCGGCGAG
GAGGATTTGAACTTGCGGACAAAGAATTGGTTGGCAGGGTTGGTGGGGATTGGAATTACTGGATTCATCCTTGTCTCTGGAATCACCTTTGCAGCATGGTCGATAAACAA
GCAGAATTGTTCCAAAAATGATGCAACTCTCAACAAAAATCGAATTGGTGCCGTTGCTGATGTCGAGGAGTTATCAGGAAATGATGTTGAATCTTCATCTAATAATGATA
ATGTCAATAATGTTGCTTTCTTGCAAGAGGATATCCAATCTGATTCCTCGTTAGCTGTTACGTCGGTCGCTTATGGAAGATCTGAGATCGATTCGGACGTTGATTCTGGT
TTTAAAGATGTAAATTCTGGTACAGAAGTGTTAACTTCTGAACCTGAAATGAATGATGAACCGGATAACTCGCCTGAAACGAAGTATGATTTCAGCTCTGAAAAGTTACC
AGTCTATGATGATAGTTCATCAAATTATAATTCTGGCTACCAGGATGAGACAGCTGATCCTCCTGTAAATGAAATCGACGATTCTTCATTGCATGAATTGGGATTAGTTG
ACAAAGAGACCGTGACCGAATCATTGGAAGGAGTTCTGAACCCTGGCAAAACCAAACAGCTCTTATCTGAGGAAACTGCATCGACCATAGAACAACAAATAGGAAGAGGA
TTATCCAAAGCTGCGTTTGTCTCTGTCACAGCTTATCCACTGGCAGATGATCAAGAAGAACTAAATCATGAAACAACCATGAATAGTGTTCCTGCTCCTCTGGTTTCTGC
AGCTGTTAAGACACTTCCTGGCAAGGTTCTAGTGCCTGCAGTTGTGGATCAGGTTCAGGGGCAGGCCTTGGCAGCACTGCAAGTTTTAAAGGTGATCGAGTCTGAAGTTG
AGCCGAGTGGTCTATGTACTCGTCGGGAATATGCTCGTTGGCTAGTGTCTGCAAGCTGTGCGCTTTCAAGGAACACTGCATCTAAAGTATACCCAGCAATGTATATAGAG
AATGTTACCGAGCTTGCTTTTGATGATATTACTCCTGAAGACCCTGATTTTGCATCTATTCAAGGTTTGGCAGAAGCTGGACTGATTTCGAGCAAGCTTTCGAGACATGA
TATTCTTTCTTCATTGGACGACGACGAGGGATACACAAGGCTATTCCAGCCGGATAAGCCTGTAACAAAAGCACAAGCTGCCATTGCTCTTGCAACTGGGGAGGCTTCTG
ATATCGTAAGCGAGGAGCTTGCAAGGATTGAAGCCGAGTCAATGGCGGAGAACGCTGTGGCTGCACACAGTGCTTTGGTAGCTCAAGTTGAGAAAGATATTAACGCCAGC
TTCGAGAAGGAACTTTCCATCGAAAGGGAAAAGGCTGACGCTGTGGAGAAAATGGCAGAAGAGGCAAAGCAAGAATTGGAAAGATTAAGATCTGAAAGAGAGAGAGATAA
TATCGCCTTGATGAGGGAACGTGCTGCTATCGAATCGGAAATGGAGGTTCTTTCGAGGTTAAGAAATGAGTTGGAGGAGCAGTTGCAAGGCCTGATGAGTAATAAAGTAG
AGGTATCTTATGAAAAGGAAAGAATCAACAAACTCAGGAAAGAAGCTGAAATTGAAAATCAGGAGATTTCCCGCTTGCAATATGAGCTTGAGGTTGAGAGAAAGGCGTTG
TCCATGGCCAGAGAACAAGCAAAAGCACTAGAGGAGGCAAGAGATCGCTGGGAAAAGCGGGGCATCAAAGTAGTCGTCGACAGCGATCTCCGGGAACAGGAATCTGCTGG
AGATACTTGGCTCGATTCCAGCAAACAGTTTGCAGTGGAGGAAACGGTTGGTAGGGCCGAGAACTTAATGGACAAGCTTAAAGGAATGGGTAGAGAAGTAAGAGGGAAAT
CCAAAGACATAATTGAGATGATCATCCAGAAGATAGCTCTGCTGATATCAAACTTGAGACAATGGGTCCGGAATGCTGGTGAGAAGGCTGAAGATGTAAAGAATGTGGGC
ATCGCAAGGGCAAGTAGATCGGCAACTGAACTGCAACAGAGCAGTGCAGAGATGGGGTTGGCCCTGAAGGAGGGAGCTAAGCGAGTTGTGGGAGATTGTAGGGAAGGAGT
AGAGAAAATTTCCCAAAAGTTCAGAACATCGTACGGTTAA
Protein sequenceShow/hide protein sequence
MASTPSTCSPNSLQLRLALNCKNSPKFPFLPVRATVRKLDPRLQVICRPIVHNSAKIARTNGLRRNGICFAGSDSKADGFSGWSESDSGEEDLNLRTKNWLAGLVGIGIT
GFILVSGITFAAWSINKQNCSKNDATLNKNRIGAVADVEELSGNDVESSSNNDNVNNVAFLQEDIQSDSSLAVTSVAYGRSEIDSDVDSGFKDVNSGTEVLTSEPEMNDE
PDNSPETKYDFSSEKLPVYDDSSSNYNSGYQDETADPPVNEIDDSSLHELGLVDKETVTESLEGVLNPGKTKQLLSEETASTIEQQIGRGLSKAAFVSVTAYPLADDQEE
LNHETTMNSVPAPLVSAAVKTLPGKVLVPAVVDQVQGQALAALQVLKVIESEVEPSGLCTRREYARWLVSASCALSRNTASKVYPAMYIENVTELAFDDITPEDPDFASI
QGLAEAGLISSKLSRHDILSSLDDDEGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKADAVEKMAE
EAKQELERLRSERERDNIALMRERAAIESEMEVLSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEISRLQYELEVERKALSMAREQAKALEEARDRWEKR
GIKVVVDSDLREQESAGDTWLDSSKQFAVEETVGRAENLMDKLKGMGREVRGKSKDIIEMIIQKIALLISNLRQWVRNAGEKAEDVKNVGIARASRSATELQQSSAEMGL
ALKEGAKRVVGDCREGVEKISQKFRTSYG