; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr030446 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr030446
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionnuclear envelope integral membrane protein 1 isoform X1
Genome locationtig00153654:1344201..1351138
RNA-Seq ExpressionSgr030446
SyntenySgr030446
Gene Ontology termsGO:0005637 - nuclear inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR019358 - NEMP family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605219.1 hypothetical protein SDJN03_02536, partial [Cucurbita argyrosperma subsp. sororia]2.9e-22886.49Show/hide
Query:  MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPN
        M PSM FRAS C+LFLALFFA AYSIP+P+  RLVVSESTTLQLSRGLPV+NSPGSKPGTVVVCERVYIQGLLRIKNL KLAHTVKVK+S+ NSSARIPN
Subjt:  MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPN

Query:  AEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILL
         EVCFHRN SLGIGMCPQ QWEKVAKGSWAQSMSPFD KL+DIRTSGLSLESFEVS EEEFF+YRI+F ILG+VLMSSASIL KSLVFYYGSAMTIG+LL
Subjt:  AEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILL

Query:  VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFV
        VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNP+A FLLAFIFL GAWLGFWVVHKFVLDEDGSIDTSTSLFV
Subjt:  VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFV

Query:  TWSIRILATLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFA
        TW IRILA LLILQCSLDPLLATGVLICG+MASS+LRR FKLRFLRRQYKN FK PK+  KRS++SD P+FDDS DE  +KSP SYEDP  YRS+DRNFA
Subjt:  TWSIRILATLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFA

Query:  LQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSS-RAEKRRKWLHW
        LQSCS S  D YPS FHSTPGRR+FSKDEW+RFTKDSTEKALE LVSSPDF +WLVDNADRI+ITPQSS RAEK RKWL W
Subjt:  LQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSS-RAEKRRKWLHW

KAG7035192.1 hypothetical protein SDJN02_01987 [Cucurbita argyrosperma subsp. argyrosperma]6.4e-22886.49Show/hide
Query:  MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPN
        M PSM FRAS C+LFLALFFA AYSIP+P+  RLVVSESTTLQLSRGLPV+NSPGSKPGTVVVCERVYIQGLLRIKNL KLAHTVKVK+S+ NSSARIPN
Subjt:  MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPN

Query:  AEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILL
         EVCFHRN SLGIGMCPQ QWEKVAKGSWAQSMSPFD KL+DIRTSGLSLESFEVS EEEFF+YRIIF ILG+VLMSSASIL KSLVFYYGSAMTIG+LL
Subjt:  AEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILL

Query:  VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFV
        VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNP+A FLLAFIFL GAWLGFWVVHKFVLDEDGSIDTSTSLFV
Subjt:  VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFV

Query:  TWSIRILATLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFA
        TW IRILA LLILQCSLDPLLATGVLICG+MASS+LRR FKLRFLRRQYKN FK PK+  KRS++SD P+FDDS DE  +KSP SYEDP  YRS+DRNFA
Subjt:  TWSIRILATLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFA

Query:  LQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSS-RAEKRRKWLHW
        L+SCS S  D YPS FHSTPGRR+FSKDEW+RFTKDSTEKALE LVSSPDF +WLVDNADRI+ITPQSS RAEK RKWL W
Subjt:  LQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSS-RAEKRRKWLHW

XP_022149433.1 uncharacterized protein LOC111017862 [Momordica charantia]8.9e-23889.38Show/hide
Query:  MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPN
        M PS  FRASA +LFLALFFA A SIPEPEGHRLVVSEST LQLSRGLPVK+SPG+KPG VVVCERVYIQGLLRIKNLRKLAHTVKVKVS+++S++RIPN
Subjt:  MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPN

Query:  AEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILL
         EVCFHRN SLGIGMCPQ QWEKVAKGSW QSMSPFD KLLDIRTSGLSLESFEVS EEEFFLYRIIF ILG VLMSSASIL KSLVFYYGSAM IG+LL
Subjt:  AEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILL

Query:  VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFV
        +VLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFL GAWLGFWVVHKFVLDEDGSI TSTSLFV
Subjt:  VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFV

Query:  TWSIRILATLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFA
        TWSIRILATLLILQCSLDPLLATGVLICGVMASS+LR+IFKLRFLRRQYKNFFK PKKMHKRS++SD P+ DDS DEFTL+SPPSYEDPR YRS+DR FA
Subjt:  TWSIRILATLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFA

Query:  LQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSSRAEKRRKWLHW
        LQSCSSS+ DVYPS FHSTPGR+KFSK EWE+FTKDSTEKALEELVSSPDFS WLVDNADRISITPQSSRAEKRRKWL W
Subjt:  LQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSSRAEKRRKWLHW

XP_022948207.1 uncharacterized protein LOC111451852 [Cucurbita moschata]1.3e-22886.69Show/hide
Query:  MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPN
        M PSM FRAS C+LFLALFFA AYSIP+P+  RLVVSESTTLQLSRGLPV+NSPGSKPGTVVVCERVYIQGLLRIKNL KLAHTVKVK+S+ NSSARIPN
Subjt:  MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPN

Query:  AEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILL
         EVCFHRN SLGIGMCPQ QWEKVAKGSWAQSMSPFD KL+DIRTSGLSLESFEVS EEEFF+YRI+F ILG+VLMSSASIL KSLVFYYGSAMTIG+LL
Subjt:  AEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILL

Query:  VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFV
        VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNP+A FLLAFIFL GAWLGFWVVHKFVLDEDGSIDTSTSLFV
Subjt:  VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFV

Query:  TWSIRILATLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFA
        TW IRILA LLILQCSLDPLLATGVLICG+MASS+LRR FKLRFLRRQYKN FK PK+  KRS++SD P+FDDS DE  +KSP SYEDP  YRS+DRNFA
Subjt:  TWSIRILATLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFA

Query:  LQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSS-RAEKRRKWLHW
        LQSCS S  D YPS FHSTPGRR+FSKDEW+RFTKDSTEKALE LVSSPDF +WLVDNADRISITPQSS RAEK RKWL W
Subjt:  LQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSS-RAEKRRKWLHW

XP_023006880.1 uncharacterized protein LOC111499541 [Cucurbita maxima]1.9e-22786.28Show/hide
Query:  MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPN
        M PS  FRAS C+LFLALFFA AYSIP+P+  RLVVSESTTLQLSRGLPV+NSPGSKPGTVVVCERVYIQGLLRIKNL KLAHTVKVK+S+ NSSARIPN
Subjt:  MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPN

Query:  AEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILL
         EVCFHRN SLGIGMCPQ QWEKVAKGSWAQSMSPFD KL+DIRTSGLSLESFEVS EEEFF+YRIIF ILG+VLMSSASIL KSLVFYYGSAMTIG+LL
Subjt:  AEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILL

Query:  VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFV
        VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNP+A FLLAFIFL GAWLGFWVVHKFVLDEDGSIDTSTSLFV
Subjt:  VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFV

Query:  TWSIRILATLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFA
        TW IRILA LLILQCSLDPLLATGVLICG+MASS+LRR FKLRFLRRQYKN FK PK+  KRS++SD P+FDDS DE  +KSP SYEDP+ Y S+DRNFA
Subjt:  TWSIRILATLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFA

Query:  LQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSS-RAEKRRKWLHW
        LQSCS S  D YPS FHSTPGRR+FSKDEW+RFTKDSTEKALE LVSSPDF +WLVDNADRI+ITPQSS RAEK RKWL W
Subjt:  LQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSS-RAEKRRKWLHW

TrEMBL top hitse value%identityAlignment
A0A1S4E3G6 nuclear envelope integral membrane protein 1 isoform X12.6e-21982.16Show/hide
Query:  MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPN
        M  S  FRAS C+LFL++FFAS Y   +PE HRL+VSESTT+QLS GLPVKNSPGSKPGTVV CERVYIQGL R KNL+K AHTVKVKVS  NSS  + N
Subjt:  MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPN

Query:  AEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILL
         EVCFHRN SLGIGMCPQ QWEKV +GSW QSMSPFD KLLDIRT GLSLESFEVSTE+EFFLYRIIF ILG++LMSSASILSKSLVFYYGS M IGILL
Subjt:  AEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILL

Query:  VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFV
        +VLMILFQGMKLLPTGRKSSL IFLYASAVGLGSFF+RYIPGLL+QIL+EMGISEDMYNPLAAFLLAFIFL GAWLGFWVVHKFVLDEDGSIDTSTSLFV
Subjt:  VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFV

Query:  TWSIRILATLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFA
        TWSIRILA+LLILQCSLDPLLATGVLICG++ASSMLR+IFK RFLRR +KN FK PKK+ KRS++SD P++DDS DE TLK+ P Y++PR YRS++R F 
Subjt:  TWSIRILATLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFA

Query:  LQSCSSSRH-DVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSSRAEKRRKWLHWF
        LQSC SS+H DVYPS FHSTP RRKFSKDEWE+FTKDST+KALE LVSSPDFS WLVD ADRISITPQSSRAEKRRKWLHWF
Subjt:  LQSCSSSRH-DVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSSRAEKRRKWLHWF

A0A5D3CHE8 Nuclear envelope integral membrane protein 1 isoform X11.0e-17571.98Show/hide
Query:  LFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPNAEVCFHRNTSLGIGMCP
        +FFAS Y   +PE HRL+VSESTT+QLS GLPVKNSPGSKPGTVV CERVYIQGL                                  RN SLGIGMCP
Subjt:  LFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPNAEVCFHRNTSLGIGMCP

Query:  QRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILLVVLMILFQGMKLLPTGR
        Q QWEKV +GSW QSMSPFD KLLDIRTS                                 SILSKSLVFYYGS M IGILL+VLMILFQGMKLLPTGR
Subjt:  QRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILLVVLMILFQGMKLLPTGR

Query:  KSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFVTWSIRILATLLILQCSL
        KSSL IFLYASAVGLGSFF+RYIPGLL+QIL+EMGISEDMYNPLAAFLLAFIFL GAWLGFWVVHKFVLDEDGSIDTSTSLFVTWSIRILA+LLILQCSL
Subjt:  KSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFVTWSIRILATLLILQCSL

Query:  DPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFALQSCSSSRH-DVYPSAF
        DPLLATGVLICG++ASSMLR+IFK RFLRR +KN FK PKK+ KRS++SD P++DDS DE TLK+ P Y++PR YRS++R F LQSC SS+H DVYPS F
Subjt:  DPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFALQSCSSSRH-DVYPSAF

Query:  HSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSSRAEKRRKWLHW
        HSTP RRKFSKDEWE+FTKDST+KALE LVSSPDFS WLVD ADRISITPQSSRAEKRRKWLHW
Subjt:  HSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSSRAEKRRKWLHW

A0A6J1D6S5 uncharacterized protein LOC1110178624.3e-23889.38Show/hide
Query:  MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPN
        M PS  FRASA +LFLALFFA A SIPEPEGHRLVVSEST LQLSRGLPVK+SPG+KPG VVVCERVYIQGLLRIKNLRKLAHTVKVKVS+++S++RIPN
Subjt:  MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPN

Query:  AEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILL
         EVCFHRN SLGIGMCPQ QWEKVAKGSW QSMSPFD KLLDIRTSGLSLESFEVS EEEFFLYRIIF ILG VLMSSASIL KSLVFYYGSAM IG+LL
Subjt:  AEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILL

Query:  VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFV
        +VLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFL GAWLGFWVVHKFVLDEDGSI TSTSLFV
Subjt:  VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFV

Query:  TWSIRILATLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFA
        TWSIRILATLLILQCSLDPLLATGVLICGVMASS+LR+IFKLRFLRRQYKNFFK PKKMHKRS++SD P+ DDS DEFTL+SPPSYEDPR YRS+DR FA
Subjt:  TWSIRILATLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFA

Query:  LQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSSRAEKRRKWLHW
        LQSCSSS+ DVYPS FHSTPGR+KFSK EWE+FTKDSTEKALEELVSSPDFS WLVDNADRISITPQSSRAEKRRKWL W
Subjt:  LQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSSRAEKRRKWLHW

A0A6J1G8K0 uncharacterized protein LOC1114518526.3e-22986.69Show/hide
Query:  MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPN
        M PSM FRAS C+LFLALFFA AYSIP+P+  RLVVSESTTLQLSRGLPV+NSPGSKPGTVVVCERVYIQGLLRIKNL KLAHTVKVK+S+ NSSARIPN
Subjt:  MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPN

Query:  AEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILL
         EVCFHRN SLGIGMCPQ QWEKVAKGSWAQSMSPFD KL+DIRTSGLSLESFEVS EEEFF+YRI+F ILG+VLMSSASIL KSLVFYYGSAMTIG+LL
Subjt:  AEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILL

Query:  VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFV
        VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNP+A FLLAFIFL GAWLGFWVVHKFVLDEDGSIDTSTSLFV
Subjt:  VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFV

Query:  TWSIRILATLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFA
        TW IRILA LLILQCSLDPLLATGVLICG+MASS+LRR FKLRFLRRQYKN FK PK+  KRS++SD P+FDDS DE  +KSP SYEDP  YRS+DRNFA
Subjt:  TWSIRILATLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFA

Query:  LQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSS-RAEKRRKWLHW
        LQSCS S  D YPS FHSTPGRR+FSKDEW+RFTKDSTEKALE LVSSPDF +WLVDNADRISITPQSS RAEK RKWL W
Subjt:  LQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSS-RAEKRRKWLHW

A0A6J1L1F6 uncharacterized protein LOC1114995419.0e-22886.28Show/hide
Query:  MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPN
        M PS  FRAS C+LFLALFFA AYSIP+P+  RLVVSESTTLQLSRGLPV+NSPGSKPGTVVVCERVYIQGLLRIKNL KLAHTVKVK+S+ NSSARIPN
Subjt:  MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPN

Query:  AEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILL
         EVCFHRN SLGIGMCPQ QWEKVAKGSWAQSMSPFD KL+DIRTSGLSLESFEVS EEEFF+YRIIF ILG+VLMSSASIL KSLVFYYGSAMTIG+LL
Subjt:  AEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILL

Query:  VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFV
        VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNP+A FLLAFIFL GAWLGFWVVHKFVLDEDGSIDTSTSLFV
Subjt:  VVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFV

Query:  TWSIRILATLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFA
        TW IRILA LLILQCSLDPLLATGVLICG+MASS+LRR FKLRFLRRQYKN FK PK+  KRS++SD P+FDDS DE  +KSP SYEDP+ Y S+DRNFA
Subjt:  TWSIRILATLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFA

Query:  LQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSS-RAEKRRKWLHW
        LQSCS S  D YPS FHSTPGRR+FSKDEW+RFTKDSTEKALE LVSSPDF +WLVDNADRI+ITPQSS RAEK RKWL W
Subjt:  LQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSS-RAEKRRKWLHW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G28760.1 Uncharacterized conserved protein (DUF2215)8.8e-5834.91Show/hide
Query:  CERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPNAEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLES----FEVSTEE
        CER+ + G  R K L K A++++V +        +   +VC HRN +LG+  C +  W+ +   S    +SP+D++ +D+R +G   +S      V+  E
Subjt:  CERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPNAEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLES----FEVSTEE

Query:  EFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILLVVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYN
        EF  +RI   + G++++  A ++S  L FYY S+M +G+ LVVL+I+FQ M+LLPTGRK+ + +  Y S VG GSF L     +++ IL+  G+SEDMYN
Subjt:  EFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILLVVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYN

Query:  PLAAFLLAFIFLTGAWLGFWVVHKFVLDED-GSIDTSTSLFVTWSIRILATLLILQCSLDPLLATGVLIC----GVMASSMLRRIFKLRFL-----RRQY
        P+A  +L  + +TGA  GFW V KFV+ +D G +D S + FV W++R +A   ILQ SLD  +A G  +     G + S     + K  +L     RR  
Subjt:  PLAAFLLAFIFLTGAWLGFWVVHKFVLDED-GSIDTSTSLFVTWSIRILATLLILQCSLDPLLATGVLIC----GVMASSMLRRIFKLRFL-----RRQY

Query:  KN----FFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFALQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEEL
         +    F   P            P +  SP    + S PS  + R+                  D Y S FH+TP R++ SK E++  T+++T +A+  L
Subjt:  KN----FFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFALQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEEL

Query:  VSSPDFSKWLVDNADRISITPQSS
         +SP FS WLV++ADRI + P  S
Subjt:  VSSPDFSKWLVDNADRISITPQSS

AT3G49840.1 Uncharacterized conserved protein (DUF2215)1.5e-9744.04Show/hide
Query:  VLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGL-LRIKNLRKLAHTVKVKVSITNSSARIPNAEVCFHRNTSL
        V+ L L   + +S+         V E+ +LQ++    V  SPG K G   +CER++I GL  R++++ + AH++K+ +   N+S  I   +VCFHRN+S 
Subjt:  VLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGL-LRIKNLRKLAHTVKVKVSITNSSARIPNAEVCFHRNTSL

Query:  GIGMCPQRQWEKVAKG-SWAQSMSPFDQKLLDIRTSGLS--LESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILLVVLMILFQ
         IGMCP  QW++V+KG  W   MSPFD K+LDIRT G S  + + E+  ++EFF+YRI+F I+G+VL+S AS LSKS+ FYY  AM+IGI+++V +I+ Q
Subjt:  GIGMCPQRQWEKVAKG-SWAQSMSPFDQKLLDIRTSGLS--LESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILLVVLMILFQ

Query:  GMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFVTWSIRILA
        G+K LPT  KS   +F Y+S +G+G +FL+YI GL+  +L+++ ISED+Y PLA  L+ F+F+ GAW GFW V KFV+ +DGS+D STS+FV+WSIR  A
Subjt:  GMKLLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFVTWSIRILA

Query:  TLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKW----PKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFALQSC
          LILQ SLDPLLA G LI G++ S +L+ I +    +R Y+   +     P  +H  S+ S  P                         R  N  L++ 
Subjt:  TLLILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKW----PKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFALQSC

Query:  SSSRHDVYPSAFHSTP-GRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSSRAEK
          S  D++PS+FH TP GRRK +K+E ++FTK+STE AL+ELVSSP F +W V NA RI++ P    + K
Subjt:  SSSRHDVYPSAFHSTP-GRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQSSRAEK

AT5G67610.1 Uncharacterized conserved protein (DUF2215)1.1e-11950.42Show/hide
Query:  VLFLALF-FASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPNAEVCFHRNTSL
        ++ L LF FAS  S  E +    VV ES  LQ++  L VK SPG KP    +CER++I GL R K+L K AH++K+ V+  + S +  N +VCFHRN S 
Subjt:  VLFLALF-FASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPNAEVCFHRNTSL

Query:  GIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILLVVLMILFQGMK
        GIGMCP  +WEK +KGSW Q+MSPFD K+LD+R    +  S EVS  EE F++RI+F +LG VL++SAS LS+SL FYY SAM +GI+LVVL++LFQGMK
Subjt:  GIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILLVVLMILFQGMK

Query:  LLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFVTWSIRILATLL
        LLPTGR SS A+F+Y++ +GLG F LRY+PGL   +L EMGI E+MY P A F+ AF+ L GA+ GFW V K +L EDGSID STSLFV+WSIRI+A +L
Subjt:  LLPTGRKSSLAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFVTWSIRILATLL

Query:  ILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSR--------DRNFALQS
        ILQ S+DPLLA G LI  ++ SS L++I +L+FL R ++        + +    +D P       +F  KSP    D   +R+R          N  ++ 
Subjt:  ILQCSLDPLLATGVLICGVMASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSR--------DRNFALQS

Query:  CSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQ--SSRAEKRRKWLHWF
           S  D +PS+FH TP R + +K+EW++ TKDST KA++ELVSSPDF KW   NADRI++TP+  SS   + RKW+ WF
Subjt:  CSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQ--SSRAEKRRKWLHWF

AT5G67610.2 Uncharacterized conserved protein (DUF2215)5.2e-11951.2Show/hide
Query:  VVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPNAEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMS
        VV ES  LQ++  L VK SPG KP    +CER++I GL R K+L K AH++K+ V+  + S +  N +VCFHRN S GIGMCP  +WEK +KGSW Q+MS
Subjt:  VVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPNAEVCFHRNTSLGIGMCPQRQWEKVAKGSWAQSMS

Query:  PFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILLVVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGS
        PFD K+LD+R    +  S EVS  EE F++RI+F +LG VL++SAS LS+SL FYY SAM +GI+LVVL++LFQGMKLLPTGR SS A+F+Y++ +GLG 
Subjt:  PFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILLVVLMILFQGMKLLPTGRKSSLAIFLYASAVGLGS

Query:  FFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFVTWSIRILATLLILQCSLDPLLATGVLICGVMASS
        F LRY+PGL   +L EMGI E+MY P A F+ AF+ L GA+ GFW V K +L EDGSID STSLFV+WSIRI+A +LILQ S+DPLLA G LI  ++ SS
Subjt:  FFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFVTWSIRILATLLILQCSLDPLLATGVLICGVMASS

Query:  MLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSR--------DRNFALQSCSSSRHDVYPSAFHSTPGRRKFS
         L++I +L+FL R ++        + +    +D P       +F  KSP    D   +R+R          N  ++    S  D +PS+FH TP R + +
Subjt:  MLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSR--------DRNFALQSCSSSRHDVYPSAFHSTPGRRKFS

Query:  KDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQ--SSRAEKRRKWLHWF
        K+EW++ TKDST KA++ELVSSPDF KW   NADRI++TP+  SS   + RKW+ WF
Subjt:  KDEWERFTKDSTEKALEELVSSPDFSKWLVDNADRISITPQ--SSRAEKRRKWLHWF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCCTTCGATGCCGTTTCGAGCTTCTGCTTGCGTTCTTTTTCTCGCACTGTTCTTCGCTTCAGCTTACTCTATTCCCGAACCTGAAGGCCATCGTCTTGTTGTTTC
TGAATCCACTACTTTGCAACTATCTCGTGGCTTGCCTGTGAAGAATTCTCCAGGTTCCAAACCTGGAACTGTGGTGGTTTGTGAAAGAGTATACATCCAAGGATTATTGA
GGATTAAGAATCTGAGGAAGCTTGCACACACAGTGAAAGTGAAGGTTTCAATTACAAATTCAAGTGCTCGAATACCAAATGCTGAGGTTTGTTTTCATAGGAACACATCA
CTTGGGATAGGAATGTGCCCTCAACGTCAGTGGGAGAAAGTTGCCAAAGGTTCTTGGGCTCAATCCATGTCGCCATTTGACCAGAAGCTATTAGATATTAGAACATCTGG
GTTATCCTTGGAAAGCTTTGAGGTATCGACTGAAGAAGAATTCTTTCTGTATCGCATAATCTTTTTCATCCTGGGCATGGTGTTGATGAGTTCGGCCTCCATTCTGAGCA
AGTCATTGGTATTCTATTATGGCAGTGCCATGACAATTGGAATTCTCCTTGTAGTGTTAATGATCCTTTTTCAGGGGATGAAGCTTCTACCTACTGGTCGGAAGAGCTCA
CTTGCAATTTTTCTATATGCATCTGCAGTTGGTCTGGGATCTTTTTTCCTCCGTTACATACCTGGATTATTGCATCAAATACTTTTGGAAATGGGTATAAGTGAGGACAT
GTATAATCCTCTAGCAGCATTTCTACTGGCATTTATTTTTCTGACTGGAGCATGGTTGGGCTTTTGGGTAGTCCACAAATTTGTCCTTGACGAAGATGGATCAATTGATA
CAAGTACATCACTTTTTGTCACGTGGTCCATACGGATTTTGGCTACTCTTCTGATTCTTCAGTGTTCTTTGGATCCCTTGCTGGCAACAGGAGTTTTAATATGTGGAGTA
ATGGCTTCATCAATGCTGAGGAGAATCTTTAAGTTGAGATTTCTTCGTCGCCAATACAAGAATTTCTTCAAATGGCCAAAGAAAATGCACAAGAGATCTTATGTCTCCGA
TTTTCCACAGTTTGATGATTCTCCCGATGAATTCACTTTGAAGTCCCCCCCAAGTTATGAAGACCCCAGGCTCTATAGATCTCGAGACAGAAACTTCGCCCTTCAGTCCT
GCAGTTCTTCTAGACATGATGTATATCCTTCAGCATTCCATTCCACTCCTGGACGGAGAAAATTTTCTAAGGATGAGTGGGAAAGGTTCACCAAGGACTCAACCGAAAAA
GCTTTGGAAGAATTAGTTTCTTCACCTGATTTCAGCAAGTGGCTTGTTGACAATGCAGATAGAATCAGCATAACTCCCCAAAGCAGTAGAGCTGAGAAGCGTCGCAAGTG
GCTCCATTGGTTCCTGATTGGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCCCTTCGATGCCGTTTCGAGCTTCTGCTTGCGTTCTTTTTCTCGCACTGTTCTTCGCTTCAGCTTACTCTATTCCCGAACCTGAAGGCCATCGTCTTGTTGTTTC
TGAATCCACTACTTTGCAACTATCTCGTGGCTTGCCTGTGAAGAATTCTCCAGGTTCCAAACCTGGAACTGTGGTGGTTTGTGAAAGAGTATACATCCAAGGATTATTGA
GGATTAAGAATCTGAGGAAGCTTGCACACACAGTGAAAGTGAAGGTTTCAATTACAAATTCAAGTGCTCGAATACCAAATGCTGAGGTTTGTTTTCATAGGAACACATCA
CTTGGGATAGGAATGTGCCCTCAACGTCAGTGGGAGAAAGTTGCCAAAGGTTCTTGGGCTCAATCCATGTCGCCATTTGACCAGAAGCTATTAGATATTAGAACATCTGG
GTTATCCTTGGAAAGCTTTGAGGTATCGACTGAAGAAGAATTCTTTCTGTATCGCATAATCTTTTTCATCCTGGGCATGGTGTTGATGAGTTCGGCCTCCATTCTGAGCA
AGTCATTGGTATTCTATTATGGCAGTGCCATGACAATTGGAATTCTCCTTGTAGTGTTAATGATCCTTTTTCAGGGGATGAAGCTTCTACCTACTGGTCGGAAGAGCTCA
CTTGCAATTTTTCTATATGCATCTGCAGTTGGTCTGGGATCTTTTTTCCTCCGTTACATACCTGGATTATTGCATCAAATACTTTTGGAAATGGGTATAAGTGAGGACAT
GTATAATCCTCTAGCAGCATTTCTACTGGCATTTATTTTTCTGACTGGAGCATGGTTGGGCTTTTGGGTAGTCCACAAATTTGTCCTTGACGAAGATGGATCAATTGATA
CAAGTACATCACTTTTTGTCACGTGGTCCATACGGATTTTGGCTACTCTTCTGATTCTTCAGTGTTCTTTGGATCCCTTGCTGGCAACAGGAGTTTTAATATGTGGAGTA
ATGGCTTCATCAATGCTGAGGAGAATCTTTAAGTTGAGATTTCTTCGTCGCCAATACAAGAATTTCTTCAAATGGCCAAAGAAAATGCACAAGAGATCTTATGTCTCCGA
TTTTCCACAGTTTGATGATTCTCCCGATGAATTCACTTTGAAGTCCCCCCCAAGTTATGAAGACCCCAGGCTCTATAGATCTCGAGACAGAAACTTCGCCCTTCAGTCCT
GCAGTTCTTCTAGACATGATGTATATCCTTCAGCATTCCATTCCACTCCTGGACGGAGAAAATTTTCTAAGGATGAGTGGGAAAGGTTCACCAAGGACTCAACCGAAAAA
GCTTTGGAAGAATTAGTTTCTTCACCTGATTTCAGCAAGTGGCTTGTTGACAATGCAGATAGAATCAGCATAACTCCCCAAAGCAGTAGAGCTGAGAAGCGTCGCAAGTG
GCTCCATTGGTTCCTGATTGGGTAG
Protein sequenceShow/hide protein sequence
MAPSMPFRASACVLFLALFFASAYSIPEPEGHRLVVSESTTLQLSRGLPVKNSPGSKPGTVVVCERVYIQGLLRIKNLRKLAHTVKVKVSITNSSARIPNAEVCFHRNTS
LGIGMCPQRQWEKVAKGSWAQSMSPFDQKLLDIRTSGLSLESFEVSTEEEFFLYRIIFFILGMVLMSSASILSKSLVFYYGSAMTIGILLVVLMILFQGMKLLPTGRKSS
LAIFLYASAVGLGSFFLRYIPGLLHQILLEMGISEDMYNPLAAFLLAFIFLTGAWLGFWVVHKFVLDEDGSIDTSTSLFVTWSIRILATLLILQCSLDPLLATGVLICGV
MASSMLRRIFKLRFLRRQYKNFFKWPKKMHKRSYVSDFPQFDDSPDEFTLKSPPSYEDPRLYRSRDRNFALQSCSSSRHDVYPSAFHSTPGRRKFSKDEWERFTKDSTEK
ALEELVSSPDFSKWLVDNADRISITPQSSRAEKRRKWLHWFLIG