; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010142 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010142
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSerine/threonine-protein kinase SRPK
Genome locationchr9:44930539..44935521
RNA-Seq ExpressionLag0010142
SyntenyLag0010142
Gene Ontology termsGO:0006468 - protein phosphorylation (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004672 - protein kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004145826.1 uncharacterized protein LOC101215373 [Cucumis sativus]1.7e-24584.46Show/hide
Query:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSG
        ME+GSD DPIEAEL+ DLEP +  NGP HHPSAP DE+FDISTTVDPSYIISLIRKLLP NA+  RNSCG+G D  D SV  MDE D Y SGDQ+ SSSG
Subjt:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSG

Query:  TVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEV
        TVSKC GI   DDS K AD+E EDEGACP+ EQ ISSSEEKVWEEYGCILWDLSAS+  AELMVQNLVLEVLSA LMVSQSVRVMEIS+GIIGNLACHEV
Subjt:  TVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEV

Query:  PMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKL
        PMKHIVAKSGLI TIV+QLFLDDAQCLCEVCRLL  GLQSSECV WAEALNSEHVLSRILWVSENTLN QLIEKSVGLLS IIESQQE+VH+LL CLMKL
Subjt:  PMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKL

Query:  GLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDL
        GL+SVLFNLF+FEMKILTNERSAER+SILDVILRA+EALSG EEHS+++CSNKELFQLV DLVKLPDAFEVSSSC+SAVVLIANIL+DVPDLAF+MSQDL
Subjt:  GLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDL

Query:  SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNH
        SFLQGLLDIFSF GDD EARDAVWSIIARILV VQENV+SRP+LFEYVSLLVSKTDLIEDDLLD   TESNKEE+ +TS+C KSNSRCISLRRII+ILNH
Subjt:  SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNH

Query:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSE
        WTASKDEGTDVRDEY +EDVDVNRLL CC KHSE
Subjt:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSE

XP_022944140.1 uncharacterized protein LOC111448685 isoform X1 [Cucurbita moschata]6.4e-25385.5Show/hide
Query:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSG
        ME+GSDSDPIEAEL+P+LE  EGG GP HHPSAP DELFDISTTVDPSYIISLIRKLLP +A+ LRNS G  DDD DASVTNMDE+DAY SGDQVLSSSG
Subjt:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSG

Query:  TVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEV
        TV++CQGI  AD SDK AD+E EDEGACPR EQ ISSSEE VWEEYGCILWDLSASK HAELMVQNLVLEVLSA LMVSQSVRVMEI +GIIGNLACHEV
Subjt:  TVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEV

Query:  PMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKL
        PMKHIV KSGLI  IVNQLFLDDAQCLCEVCRLL AGL SSEC  WAEALNSEHVLSRILWVSENTLN QLIEKSVGLLS IIESQQEVVH+LLPCLMKL
Subjt:  PMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKL

Query:  GLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDL
        GL+S LFNLF+FEMKILTNERS ERYSILD ILRA+EALSGIEEHSQ+ CSNK+LFQLV +LVKLPDAFEVSSSC+SAV+LIANIL+DVPDLAFDMSQDL
Subjt:  GLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDL

Query:  SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNH
        SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHV+E  +SRPR+FEYVSLLVSKTDLIEDDLLD R TE NK+E+ LTS+C KSNSRCISLRRII ILN 
Subjt:  SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNH

Query:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSEWPGS
        WT SKDEGTDVRDEY  ED+DVNRLL+CCCKHSEW GS
Subjt:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSEWPGS

XP_022986281.1 uncharacterized protein LOC111484077 [Cucurbita maxima]1.2e-25185.32Show/hide
Query:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSG
        ME+GSDSDPIEAEL+P+LE  EGG GP HHPSAP DELFDISTTVDPSYIISLIRKLLP NA+ LRNS G  DDD +ASVTNMDE+DAY SGDQVLSSSG
Subjt:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSG

Query:  TVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEV
        TV++CQGI  AD SDK AD+E EDEGACPR EQ ISSSEE VWEEYGCILWDLSASK HAELMVQNLVLEVLSA LMVSQSVRVMEI +GIIGNLACHEV
Subjt:  TVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEV

Query:  PMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKL
        PMKHIV KSGLI TIVNQLFLDDAQCLCEVCRLL AGLQSSEC  WA ALNSEHVLSRILWVSENTLN QLIEKSVGLLS IIESQQEVVH+LLPCLMKL
Subjt:  PMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKL

Query:  GLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDL
        GL+S LFNLF+FEMKILTNERSAERYSILD ILRA+EALSGIEEHSQ+ CSNK+LFQLV +LVKLPDAFEVSSSC+SAV+LIANIL+D+PDLAFDMSQDL
Subjt:  GLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDL

Query:  SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNH
        SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHV+E  +SRPR+FE VSLLVSKTDLIEDDLLD R TE NK+E+ LTS+C KSNSRCISL RII ILN 
Subjt:  SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNH

Query:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSEWPGS
        W ASKDEGTDVRDEY  ED+DVNRLL+CCCKHSEW GS
Subjt:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSEWPGS

XP_023512598.1 uncharacterized protein LOC111777294 isoform X1 [Cucurbita pepo subsp. pepo]5.1e-25084.97Show/hide
Query:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSG
        ME+GSDSDPIEAEL+P+LE  EGG GP HHPSAP DELFDISTTVDPSYIISLIRKLLP +A+ LRNS G  DDD DASVTNMDE+DAY SGDQVLSSSG
Subjt:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSG

Query:  TVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEV
        TV++CQGI  AD SDK AD+E EDEG CPR E+ ISSSEE VWEEYGCILWDLSASK HAELMVQNLVLEVLSA LMVSQSVRVMEI +GIIGNLACHEV
Subjt:  TVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEV

Query:  PMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKL
        PMKHIV KSGLI TIVNQLFLDDAQCLCEVCRLL AGL SSEC  WAEALNSEHVLSRILWVSENTLN QLIEKSVGLLS IIESQQEVVH+LLPCLMKL
Subjt:  PMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKL

Query:  GLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDL
        GL+S LFNLF+FEMKILTNERSAERYSILD ILRA+EALSGIEEHSQ+ CSNK+LFQLV +LVKLPDAFEVSSSC+SAV+LIANIL+DVPDLA DMSQDL
Subjt:  GLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDL

Query:  SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNH
        SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHV+E  +SRPR+FEYVSLLVSKTDLIEDDLLD R TE NK+E+ LTS+C KSNSRCISLRRII ILN 
Subjt:  SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNH

Query:  WTASKDEGTD-VRDEYHVEDVDVNRLLNCCCKHSEWPGS
        WTASKDEGT  VR EY  ED+DVNRLL+CCCKHSEW GS
Subjt:  WTASKDEGTD-VRDEYHVEDVDVNRLLNCCCKHSEWPGS

XP_038901476.1 uncharacterized protein LOC120088329 isoform X2 [Benincasa hispida]2.1e-24383.52Show/hide
Query:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSG
        ME+GSDSDPIEAELE DLEP E GNGP HHPSAP DELFDISTTVDPSYIISLIRKLLP NA+ L +S G+GD D D S+T MDE DAY SGDQVLS SG
Subjt:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSG

Query:  TVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEV
        +VSKC GI  AD SDK AD+  EDEGAC + EQ +SSSEEKVWEEYGCILWDLSASK HAELMVQN VLEVLSA LMVSQSVRVMEIS+GIIGNLACHEV
Subjt:  TVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEV

Query:  PMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKL
        PMKHIV KSGLI TIVNQLFLDDAQCLCEVCRLL AGLQSSECV WAEALNSEHVLSR+LW+SENTLN QLIEKSVGLLS IIESQQEVVHILLPCLMKL
Subjt:  PMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKL

Query:  GLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDL
        GL+SVLFNLF+ EMKILTNERS ER+SILDVILR  EALSG+EEHSQ+ICSNKELF+LV DLVKLPDAFEV SSC+S+VVLIANIL+DVPD AFD+SQDL
Subjt:  GLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDL

Query:  SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNH
        +FLQGLLDIFSF G+D EARDA+WSI ARILVHVQEN +SR RLFEYVSLLVSKTDLIEDDLLD   TES KEE+ +TS+  +SNSRCISLRRII+IL+H
Subjt:  SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNH

Query:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSE
        WTASKDEGTDVRD YHVEDVD+NRLLNCCCKHSE
Subjt:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSE

TrEMBL top hitse value%identityAlignment
A0A0A0KDI1 Uncharacterized protein8.2e-24684.46Show/hide
Query:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSG
        ME+GSD DPIEAEL+ DLEP +  NGP HHPSAP DE+FDISTTVDPSYIISLIRKLLP NA+  RNSCG+G D  D SV  MDE D Y SGDQ+ SSSG
Subjt:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSG

Query:  TVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEV
        TVSKC GI   DDS K AD+E EDEGACP+ EQ ISSSEEKVWEEYGCILWDLSAS+  AELMVQNLVLEVLSA LMVSQSVRVMEIS+GIIGNLACHEV
Subjt:  TVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEV

Query:  PMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKL
        PMKHIVAKSGLI TIV+QLFLDDAQCLCEVCRLL  GLQSSECV WAEALNSEHVLSRILWVSENTLN QLIEKSVGLLS IIESQQE+VH+LL CLMKL
Subjt:  PMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKL

Query:  GLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDL
        GL+SVLFNLF+FEMKILTNERSAER+SILDVILRA+EALSG EEHS+++CSNKELFQLV DLVKLPDAFEVSSSC+SAVVLIANIL+DVPDLAF+MSQDL
Subjt:  GLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDL

Query:  SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNH
        SFLQGLLDIFSF GDD EARDAVWSIIARILV VQENV+SRP+LFEYVSLLVSKTDLIEDDLLD   TESNKEE+ +TS+C KSNSRCISLRRII+ILNH
Subjt:  SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNH

Query:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSE
        WTASKDEGTDVRDEY +EDVDVNRLL CC KHSE
Subjt:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSE

A0A1S3C8G6 uncharacterized protein LOC103497988 isoform X11.0e-24384.46Show/hide
Query:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSG
        ME+GSDSDPIEAELE D+EP E  NGP HHPSAP DELFDISTTVDPSYIISLIRKLLP NA+  RNSC +G D  D SV  MDE D Y SGDQ+LSSSG
Subjt:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSG

Query:  TVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEV
        TVSKC G+  AD S K AD+E EDEGAC + EQ ISS EEKVWEEYGCILWDLSAS+  AELMVQNLVLEVLSA LMVSQSVRVMEIS+GIIGNLACHEV
Subjt:  TVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEV

Query:  PMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKL
        PMKHIVAKSGLI TIV+QLFLDDAQCLCEVCRLL  GLQSSECV WAEALN EHVLSRILWVSENTLN QLIEKSVGLLS IIES QEVVH LLPCLMKL
Subjt:  PMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKL

Query:  GLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDL
        GL+SVLFNLF+FEMKILTNERSAER+SILDVILRA+E LSGIEEHS ++CSNKELFQLV DLVKLPDAFEVSSSC+SAVVLIANIL+DVPDLAF+MSQDL
Subjt:  GLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDL

Query:  SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNH
        SFLQGL D FSFAGDDLEARDAVWSIIARILV VQENV+SRP+L EYVSLLVSKTDLIEDDLLD   TESNKEE+ +TS+C KSNSRCISLRRII+ILNH
Subjt:  SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNH

Query:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSE
        WTASKDEGTDVRDEY VEDVDVNRLL CC KHSE
Subjt:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSE

A0A6J1CG56 uncharacterized protein LOC111011366 isoform X11.7e-23883.12Show/hide
Query:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDP-----DASVTNMDENDAYSSGDQV
        MELG   DP+EAELE D EP EG NGP+HHPSAP DELFDISTTVDPSYIISLIRKLLP NA+   N C S +DD        SV  MDE+DA  SGD+V
Subjt:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDP-----DASVTNMDENDAYSSGDQV

Query:  LSSSGTVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNL
        LS SGTV KCQGI   D SDKFADQE E+EG+CPRLEQ ISSSEEKVWEEYGCILWDLSAS+ HAELMVQNLVLEVLSA LMVSQSVRV+EIS+GIIGNL
Subjt:  LSSSGTVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNL

Query:  ACHEVPMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLP
        ACHEVPMK IVAKSGLIA IVNQLFLDDAQCLCEVCRLLTAG+QS +C+TWAEAL+SEHVLSRILWVSENTLN QLIEKSVGLLSAI+ES+QEV  ILLP
Subjt:  ACHEVPMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLP

Query:  CLMKLGLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFD
        CL KLG +SVLFNLFAFEMKILTNERS ERYSILDVILRAIEALS IEEHSQ+I SNKELFQL+  LVKLPD  EVSSSCV AVVLIANIL+DVPDLAF+
Subjt:  CLMKLGLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFD

Query:  MSQDLSFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRII
        MSQDLSFLQGLLDIFSF GDDLEARDAVWSIIARILV VQENV++RPRLFEYVSLLVSKTDLIEDDLLDQR TES+KEE  L S+CMK  SRCISLR+II
Subjt:  MSQDLSFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRII

Query:  TILNHWTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSE
        TILNHWTA KDE T+VRDEYHVEDVDVNRLLNCCCKHSE
Subjt:  TILNHWTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSE

A0A6J1FYH6 uncharacterized protein LOC111448685 isoform X13.1e-25385.5Show/hide
Query:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSG
        ME+GSDSDPIEAEL+P+LE  EGG GP HHPSAP DELFDISTTVDPSYIISLIRKLLP +A+ LRNS G  DDD DASVTNMDE+DAY SGDQVLSSSG
Subjt:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSG

Query:  TVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEV
        TV++CQGI  AD SDK AD+E EDEGACPR EQ ISSSEE VWEEYGCILWDLSASK HAELMVQNLVLEVLSA LMVSQSVRVMEI +GIIGNLACHEV
Subjt:  TVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEV

Query:  PMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKL
        PMKHIV KSGLI  IVNQLFLDDAQCLCEVCRLL AGL SSEC  WAEALNSEHVLSRILWVSENTLN QLIEKSVGLLS IIESQQEVVH+LLPCLMKL
Subjt:  PMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKL

Query:  GLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDL
        GL+S LFNLF+FEMKILTNERS ERYSILD ILRA+EALSGIEEHSQ+ CSNK+LFQLV +LVKLPDAFEVSSSC+SAV+LIANIL+DVPDLAFDMSQDL
Subjt:  GLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDL

Query:  SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNH
        SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHV+E  +SRPR+FEYVSLLVSKTDLIEDDLLD R TE NK+E+ LTS+C KSNSRCISLRRII ILN 
Subjt:  SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNH

Query:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSEWPGS
        WT SKDEGTDVRDEY  ED+DVNRLL+CCCKHSEW GS
Subjt:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSEWPGS

A0A6J1J751 uncharacterized protein LOC1114840775.8e-25285.32Show/hide
Query:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSG
        ME+GSDSDPIEAEL+P+LE  EGG GP HHPSAP DELFDISTTVDPSYIISLIRKLLP NA+ LRNS G  DDD +ASVTNMDE+DAY SGDQVLSSSG
Subjt:  MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSG

Query:  TVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEV
        TV++CQGI  AD SDK AD+E EDEGACPR EQ ISSSEE VWEEYGCILWDLSASK HAELMVQNLVLEVLSA LMVSQSVRVMEI +GIIGNLACHEV
Subjt:  TVSKCQGIGNADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEV

Query:  PMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKL
        PMKHIV KSGLI TIVNQLFLDDAQCLCEVCRLL AGLQSSEC  WA ALNSEHVLSRILWVSENTLN QLIEKSVGLLS IIESQQEVVH+LLPCLMKL
Subjt:  PMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKL

Query:  GLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDL
        GL+S LFNLF+FEMKILTNERSAERYSILD ILRA+EALSGIEEHSQ+ CSNK+LFQLV +LVKLPDAFEVSSSC+SAV+LIANIL+D+PDLAFDMSQDL
Subjt:  GLASVLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDL

Query:  SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNH
        SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHV+E  +SRPR+FE VSLLVSKTDLIEDDLLD R TE NK+E+ LTS+C KSNSRCISL RII ILN 
Subjt:  SFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNH

Query:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSEWPGS
        W ASKDEGTDVRDEY  ED+DVNRLL+CCCKHSEW GS
Subjt:  WTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSEWPGS

SwissProt top hitse value%identityAlignment
Q803M5 Protein saal12.2e-0624.31Show/hide
Query:  EEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEVPMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGL-QSSE
        EE  C +WD++  K  A  + +    ++L   +  S + R+ EI +GI+GN+AC       +   S L A ++  L  +D   L E CRLL   L Q+  
Subjt:  EEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEVPMKHIVAKSGLIATIVNQLFLDDAQCLCEVCRLLTAGL-QSSE

Query:  CVTWAEALNSEH-VLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPC------LMKLGLASVLFNLFAFEMKILTNERSAERYSILDVILRA
           W E +  +  V S + ++  ++ N  L+ K   LL  + +  +E++   +        L       +L +L     ++ +    A     L+V L +
Subjt:  CVTWAEALNSEH-VLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPC------LMKLGLASVLFNLFAFEMKILTNERSAERYSILDVILRA

Query:  IEALSGIEEHSQQICSNK
        ++ L+ +EE  Q + S++
Subjt:  IEALSGIEEHSQQICSNK

Q96ER3 Protein SAAL17.1e-0521.29Show/hide
Query:  EQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEVPMKHIVAKSGLIATIVNQLFLDDAQCLCEVC
        E+ ++  +E++  E  C +WD+S  +  A  + +    ++    L  S+  R+ EI +GI+GN+AC +     I +   L   +++ L+  D   L E  
Subjt:  EQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEVPMKHIVAKSGLIATIVNQLFLDDAQCLCEVC

Query:  RLLTAGLQSSECVT-WAEALNSEH--VLSRILWVSENTLNTQLIEKSVGLLSAII------------------------ESQQEVVHILLPCLMKLGLAS
        RLL   L  +E  + W E +  EH  +   I ++  ++ N  L+ K   ++  +                         ES+++ V  L+PC+++     
Subjt:  RLLTAGLQSSECVT-WAEALNSEH--VLSRILWVSENTLNTQLIEKSVGLLSAII------------------------ESQQEVVHILLPCLMKLGLAS

Query:  VLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQIC----SNKELFQLVLDLV
                       +  +E    LDV +  ++ L+ +++  Q I     + K+++ L+ DLV
Subjt:  VLFNLFAFEMKILTNERSAERYSILDVILRAIEALSGIEEHSQQIC----SNKELFQLVLDLV

Arabidopsis top hitse value%identityAlignment
AT5G22820.1 ARM repeat superfamily protein1.6e-11650.85Show/hide
Query:  PTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPD-ASVTNMDENDAYSSGDQVLSSSGTVSKCQGIGNADDSDKFADQEDEDE
        P+HHP  P DELFDISTTVDPSY+ISLIRKLLP ++       GS +   D  +  N+ +     SG+ V+ +S    +   IG  D+ D+   +  E  
Subjt:  PTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPD-ASVTNMDENDAYSSGDQVLSSSGTVSKCQGIGNADDSDKFADQEDEDE

Query:  GACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEVPMKHIVAKSGLIATIVNQLFLDDAQ
         +CP       SS    WE++GC+LWDL+AS+ HAELMVQNL+LEVL A LMVS+S R+ EI +GII NLACHE  +KHI + +G++ T+V QLFLDD Q
Subjt:  GACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEVPMKHIVAKSGLIATIVNQLFLDDAQ

Query:  CLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKLGLASVLFNLFAFEMKILTNERSAER
        CL EVCR+LT GL  + C +WA  L S+ +L  ILW++ENTLN  LIEKSVGLL  IIE Q EV  +L+P LM LGL S+L NL +FEM  LT ER  ER
Subjt:  CLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKLGLASVLFNLFAFEMKILTNERSAER

Query:  YSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDLSFLQGLLDIFSFAGDDLEARDAVWS
        Y +L++ILRAIEALS  + +S++ICS+KELFQLV DL+KL D  EV++SCV+  VLIAN+L++  D   ++ +D SFL+GL     FA DD+EAR A+W+
Subjt:  YSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDLSFLQGLLDIFSFAGDDLEARDAVWS

Query:  IIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRII
        +IAR+L  V E+ I+   L +Y+ +L+S  D+IEDD LD +  +SN+  N   S  +KS++R I++   I
Subjt:  IIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRII

AT5G22820.2 ARM repeat superfamily protein4.6e-12449.51Show/hide
Query:  PTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPD-ASVTNMDENDAYSSGDQVLSSSGTVSKCQGIGNADDSDKFADQEDEDE
        P+HHP  P DELFDISTTVDPSY+ISLIRKLLP ++       GS +   D  +  N+ +     SG+ V+ +S    +   IG  D+ D+   +  E  
Subjt:  PTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPD-ASVTNMDENDAYSSGDQVLSSSGTVSKCQGIGNADDSDKFADQEDEDE

Query:  GACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEVPMKHIVAKSGLIATIVNQLFLDDAQ
         +CP       SS    WE++GC+LWDL+AS+ HAELMVQNL+LEVL A LMVS+S R+ EI +GII NLACHE  +KHI + +G++ T+V QLFLDD Q
Subjt:  GACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEVPMKHIVAKSGLIATIVNQLFLDDAQ

Query:  CLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKLGLASVLFNLFAFEMKILTNERSAER
        CL EVCR+LT GL  + C +WA  L S+ +L  ILW++ENTLN  LIEKSVGLL  IIE Q EV  +L+P LM LGL S+L NL +FEM  LT ER  ER
Subjt:  CLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKLGLASVLFNLFAFEMKILTNERSAER

Query:  YSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDLSFLQGLLDIFSFAGDDLEARDAVWS
        Y +L++ILRAIEALS  + +S++ICS+KELFQLV DL+KL D  EV++SCV+  VLIAN+L++  D   ++ +D SFL+GL     FA DD+EAR A+W+
Subjt:  YSILDVILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDLSFLQGLLDIFSFAGDDLEARDAVWS

Query:  IIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNHWTASKD--EGTDVRDEYHVEDVDVN
        +IAR+L  V E+ I+   L +Y+ +L+S  D+IEDD LD +  +SN+  N   S  +KS++R I++++I +ILN+W A K+  +   V     +   DV 
Subjt:  IIARILVHVQENVISRPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNHWTASKD--EGTDVRDEYHVEDVDVN

Query:  RLLNCCCKH
        RL +CC ++
Subjt:  RLLNCCCKH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCTGGGCTCAGATTCAGACCCTATAGAAGCAGAATTGGAGCCAGACCTTGAACCTGCAGAAGGCGGTAATGGACCCACTCATCACCCTTCTGCTCCATCTGACGA
GTTATTTGATATCTCAACGACGGTTGATCCTAGCTATATTATCTCTCTAATACGGAAACTTCTACCACCTAACGCAAATAAGCTGCGCAATTCTTGTGGAAGTGGAGATG
ACGACCCTGACGCCTCAGTAACTAACATGGATGAAAATGATGCCTATTCATCTGGCGACCAAGTCTTAAGTTCTTCAGGAACAGTGAGTAAATGTCAGGGCATTGGAAAT
GCGGATGATTCTGATAAATTTGCTGATCAAGAAGATGAGGATGAAGGTGCATGCCCTAGATTGGAGCAGCATATTTCATCATCGGAAGAAAAGGTCTGGGAAGAGTATGG
TTGCATTCTGTGGGATCTTTCTGCGAGTAAATTTCATGCAGAACTTATGGTTCAGAATCTTGTCCTTGAAGTTCTGTCTGCAACCCTTATGGTTTCACAATCTGTGCGTG
TCATGGAGATTAGCATTGGAATTATTGGAAACCTGGCATGTCATGAAGTTCCTATGAAACATATAGTCGCCAAGAGTGGATTAATTGCAACCATTGTGAACCAGCTGTTT
CTGGATGATGCTCAATGTTTATGTGAAGTTTGCAGGCTATTAACTGCGGGTCTTCAAAGTAGCGAATGTGTCACATGGGCTGAGGCTTTGAATTCTGAGCATGTTCTTTC
TCGTATTCTATGGGTTTCTGAGAACACCTTAAATACACAACTTATAGAAAAGAGTGTTGGGTTATTATCAGCCATCATTGAAAGTCAGCAAGAAGTCGTGCATATTCTTC
TCCCGTGTTTGATGAAGCTGGGTTTGGCGAGTGTTTTGTTCAACCTTTTTGCTTTTGAGATGAAAATATTAACAAATGAAAGATCAGCTGAAAGGTATTCAATTTTGGAC
GTGATTCTTCGGGCAATTGAAGCACTTTCTGGAATTGAAGAGCATTCTCAACAAATATGTTCAAATAAAGAACTTTTTCAGCTTGTACTCGATCTAGTCAAATTGCCAGA
TGCATTTGAGGTTTCCAGTTCTTGTGTCAGTGCTGTGGTTTTGATTGCAAATATTCTGACAGATGTACCTGATCTAGCCTTTGACATGTCTCAGGATTTGTCTTTCCTAC
AAGGTCTACTTGATATATTCTCTTTTGCTGGGGATGACTTAGAGGCACGTGATGCTGTTTGGAGCATCATTGCCAGGATACTGGTTCATGTTCAAGAAAATGTGATAAGC
AGACCAAGGCTATTTGAGTACGTGTCATTACTAGTGAGTAAGACTGACCTCATTGAGGATGATCTTCTAGACCAACGTCCGACTGAATCAAATAAAGAAGAGAATGTATT
GACCTCTTCCTGCATGAAATCAAACTCTAGATGCATATCTTTAAGAAGGATAATTACCATTTTAAATCATTGGACTGCTTCGAAGGATGAAGGGACAGATGTAAGAGACG
AATATCATGTAGAAGATGTTGATGTCAATAGATTGTTGAATTGCTGCTGTAAACATTCTGAATGGCCAGGGAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGCTGGGCTCAGATTCAGACCCTATAGAAGCAGAATTGGAGCCAGACCTTGAACCTGCAGAAGGCGGTAATGGACCCACTCATCACCCTTCTGCTCCATCTGACGA
GTTATTTGATATCTCAACGACGGTTGATCCTAGCTATATTATCTCTCTAATACGGAAACTTCTACCACCTAACGCAAATAAGCTGCGCAATTCTTGTGGAAGTGGAGATG
ACGACCCTGACGCCTCAGTAACTAACATGGATGAAAATGATGCCTATTCATCTGGCGACCAAGTCTTAAGTTCTTCAGGAACAGTGAGTAAATGTCAGGGCATTGGAAAT
GCGGATGATTCTGATAAATTTGCTGATCAAGAAGATGAGGATGAAGGTGCATGCCCTAGATTGGAGCAGCATATTTCATCATCGGAAGAAAAGGTCTGGGAAGAGTATGG
TTGCATTCTGTGGGATCTTTCTGCGAGTAAATTTCATGCAGAACTTATGGTTCAGAATCTTGTCCTTGAAGTTCTGTCTGCAACCCTTATGGTTTCACAATCTGTGCGTG
TCATGGAGATTAGCATTGGAATTATTGGAAACCTGGCATGTCATGAAGTTCCTATGAAACATATAGTCGCCAAGAGTGGATTAATTGCAACCATTGTGAACCAGCTGTTT
CTGGATGATGCTCAATGTTTATGTGAAGTTTGCAGGCTATTAACTGCGGGTCTTCAAAGTAGCGAATGTGTCACATGGGCTGAGGCTTTGAATTCTGAGCATGTTCTTTC
TCGTATTCTATGGGTTTCTGAGAACACCTTAAATACACAACTTATAGAAAAGAGTGTTGGGTTATTATCAGCCATCATTGAAAGTCAGCAAGAAGTCGTGCATATTCTTC
TCCCGTGTTTGATGAAGCTGGGTTTGGCGAGTGTTTTGTTCAACCTTTTTGCTTTTGAGATGAAAATATTAACAAATGAAAGATCAGCTGAAAGGTATTCAATTTTGGAC
GTGATTCTTCGGGCAATTGAAGCACTTTCTGGAATTGAAGAGCATTCTCAACAAATATGTTCAAATAAAGAACTTTTTCAGCTTGTACTCGATCTAGTCAAATTGCCAGA
TGCATTTGAGGTTTCCAGTTCTTGTGTCAGTGCTGTGGTTTTGATTGCAAATATTCTGACAGATGTACCTGATCTAGCCTTTGACATGTCTCAGGATTTGTCTTTCCTAC
AAGGTCTACTTGATATATTCTCTTTTGCTGGGGATGACTTAGAGGCACGTGATGCTGTTTGGAGCATCATTGCCAGGATACTGGTTCATGTTCAAGAAAATGTGATAAGC
AGACCAAGGCTATTTGAGTACGTGTCATTACTAGTGAGTAAGACTGACCTCATTGAGGATGATCTTCTAGACCAACGTCCGACTGAATCAAATAAAGAAGAGAATGTATT
GACCTCTTCCTGCATGAAATCAAACTCTAGATGCATATCTTTAAGAAGGATAATTACCATTTTAAATCATTGGACTGCTTCGAAGGATGAAGGGACAGATGTAAGAGACG
AATATCATGTAGAAGATGTTGATGTCAATAGATTGTTGAATTGCTGCTGTAAACATTCTGAATGGCCAGGGAGTTAG
Protein sequenceShow/hide protein sequence
MELGSDSDPIEAELEPDLEPAEGGNGPTHHPSAPSDELFDISTTVDPSYIISLIRKLLPPNANKLRNSCGSGDDDPDASVTNMDENDAYSSGDQVLSSSGTVSKCQGIGN
ADDSDKFADQEDEDEGACPRLEQHISSSEEKVWEEYGCILWDLSASKFHAELMVQNLVLEVLSATLMVSQSVRVMEISIGIIGNLACHEVPMKHIVAKSGLIATIVNQLF
LDDAQCLCEVCRLLTAGLQSSECVTWAEALNSEHVLSRILWVSENTLNTQLIEKSVGLLSAIIESQQEVVHILLPCLMKLGLASVLFNLFAFEMKILTNERSAERYSILD
VILRAIEALSGIEEHSQQICSNKELFQLVLDLVKLPDAFEVSSSCVSAVVLIANILTDVPDLAFDMSQDLSFLQGLLDIFSFAGDDLEARDAVWSIIARILVHVQENVIS
RPRLFEYVSLLVSKTDLIEDDLLDQRPTESNKEENVLTSSCMKSNSRCISLRRIITILNHWTASKDEGTDVRDEYHVEDVDVNRLLNCCCKHSEWPGS