; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS005131 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS005131
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionSerine/threonine-protein kinase SRPK
Genome locationscaffold176:2915806..2920255
RNA-Seq ExpressionMS005131
SyntenyMS005131
Gene Ontology termsGO:0006468 - protein phosphorylation (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004672 - protein kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458652.1 PREDICTED: uncharacterized protein LOC103497988 isoform X1 [Cucumis melo]2.3e-22679.66Show/hide
Query:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSP
        ME+G   DP+EAELE D EPVE  NGP+HHPSAP DELFDISTTVDPSYIISLIRKLLP NASN   S E+    D G  S  KMDE D  LSGD++LS 
Subjt:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSP

Query:  SGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACH
        SGTV KC G+EI DGS K AD+EGE+EG+C + EQ ISS EEKVWEEYGCILWDLSASRS AELMVQNLVLEVLSANLMVSQSVRV+EISLGIIGNLACH
Subjt:  SGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACH

Query:  EVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLT
        EVPMK IVAKSGLI  IV+QLFLDDAQCLCEVCRLL  G+QS +C+ WAEAL+ EHVLSRILWVSENTLNPQLIEKSVGLLS I+ES QEV   LLPCL 
Subjt:  EVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLT

Query:  KLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQ
        KLG SSVLFNLF+FEMKILTNERS ER+SILDVILRA+E LS IEEHS E+CSNKELFQL+  LVKLPD  EVSSSC+ AVVLIANILSDVPDLAFEMSQ
Subjt:  KLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQ

Query:  DLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITIL
        DLSFLQGL D FSF GDDLEARDAVWSIIARILVRVQENVM+RP+L EYVSLLVSKTDLIEDDLLD  MTES+KEE  + S C K  SRCISLR+II+IL
Subjt:  DLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITIL

Query:  NHWTALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE
        NHWTA KDE T+VRDEY VEDVDVNRLL CC KHSE
Subjt:  NHWTALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE

XP_022140789.1 uncharacterized protein LOC111011366 isoform X1 [Momordica charantia]1.1e-29499.44Show/hide
Query:  MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSPSGT
        MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGS AKMDESDACLSGDRVLSPSGT
Subjt:  MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSPSGT

Query:  VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP
        VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP
Subjt:  VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP

Query:  MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG
        MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLT GIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG
Subjt:  MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG

Query:  FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLS
        FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEI SNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLS
Subjt:  FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLS

Query:  FLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNHW
        FLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNHW
Subjt:  FLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNHW

Query:  TALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE
        TALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE
Subjt:  TALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE

XP_022140791.1 uncharacterized protein LOC111011366 isoform X2 [Momordica charantia]2.0e-29198.87Show/hide
Query:  MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSPSGT
        MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGS AKMDESDACLSGDRVLSPSGT
Subjt:  MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSPSGT

Query:  VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP
        VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP
Subjt:  VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP

Query:  MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG
        MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLT GIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG
Subjt:  MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG

Query:  FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLS
        FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEI SNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLS
Subjt:  FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLS

Query:  FLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNHW
        FLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCIS   IITILNHW
Subjt:  FLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNHW

Query:  TALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE
        TALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE
Subjt:  TALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE

XP_022944140.1 uncharacterized protein LOC111448685 isoform X1 [Cucurbita moschata]4.1e-22879.18Show/hide
Query:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKS--DEDDDRDDPGQGSAAKMDESDACLSGDRVL
        ME+G   DP+EAEL+ + E VEG  GP+HHPSAP DELFDISTTVDPSYIISLIRKLLPS+ASN   S    DDDRD     S   MDESDA LSGD+VL
Subjt:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKS--DEDDDRDDPGQGSAAKMDESDACLSGDRVL

Query:  SPSGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLA
        S SGTV +CQGIEI DGSDK AD+EGE+EG+CPR EQ ISSSEE VWEEYGCILWDLSAS+SHAELMVQNLVLEVLSANLMVSQSVRV+EI LGIIGNLA
Subjt:  SPSGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLA

Query:  CHEVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPC
        CHEVPMK IV KSGLI IIVNQLFLDDAQCLCEVCRLL  G+ S +C  WAEAL+SEHVLSRILWVSENTLNPQLIEKSVGLLS I+ES+QEV  +LLPC
Subjt:  CHEVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPC

Query:  LTKLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEM
        L KLG SS LFNLF+FEMKILTNERS ERYSILD ILRA+EALS IEEHSQE CSNK+LFQL+  LVKLPD  EVSSSC+ AV+LIANILSDVPDLAF+M
Subjt:  LTKLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEM

Query:  SQDLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIIT
        SQDLSFLQGLLDIFSF GDDLEARDAVWSIIARILV V+E  M+RPR+FEYVSLLVSKTDLIEDDLLD RMTE +K+E  L S C K  SRCISLR+II 
Subjt:  SQDLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIIT

Query:  ILNHWTALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE
        ILN WT  KDE T+VRDEY  ED+DVNRLL+CCCKHSE
Subjt:  ILNHWTALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE

XP_022986281.1 uncharacterized protein LOC111484077 [Cucurbita maxima]2.7e-22778.73Show/hide
Query:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSP
        ME+G   DP+EAEL+ + E VEG  GP+HHPSAP DELFDISTTVDPSYIISLIRKLLPSNASN    +    RDD G  S   MDESDA LSGD+VLS 
Subjt:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSP

Query:  SGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACH
        SGTV +CQGIEI DGSDK AD+EGE+EG+CPR EQ ISSSEE VWEEYGCILWDLSAS+SHAELMVQNLVLEVLSANLMVSQSVRV+EI LGIIGNLACH
Subjt:  SGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACH

Query:  EVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLT
        EVPMK IV KSGLI  IVNQLFLDDAQCLCEVCRLL  G+QS +C  WA AL+SEHVLSRILWVSENTLNPQLIEKSVGLLS I+ES+QEV  +LLPCL 
Subjt:  EVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLT

Query:  KLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQ
        KLG SS LFNLF+FEMKILTNERS ERYSILD ILRA+EALS IEEHSQE CSNK+LFQL+C LVKLPD  EVSSSC+ AV+LIANILSD+PDLAF+MSQ
Subjt:  KLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQ

Query:  DLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITIL
        DLSFLQGLLDIFSF GDDLEARDAVWSIIARILV V+E  M+RPR+FE VSLLVSKTDLIEDDLLD RMTE +K+E  L S C K  SRCISL +II IL
Subjt:  DLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITIL

Query:  NHWTALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE
        N W A KDE T+VRDEY  ED+DVNRLL+CCCKHSE
Subjt:  NHWTALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE

TrEMBL top hitse value%identityAlignment
A0A1S3C8G6 uncharacterized protein LOC103497988 isoform X11.1e-22679.66Show/hide
Query:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSP
        ME+G   DP+EAELE D EPVE  NGP+HHPSAP DELFDISTTVDPSYIISLIRKLLP NASN   S E+    D G  S  KMDE D  LSGD++LS 
Subjt:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSP

Query:  SGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACH
        SGTV KC G+EI DGS K AD+EGE+EG+C + EQ ISS EEKVWEEYGCILWDLSASRS AELMVQNLVLEVLSANLMVSQSVRV+EISLGIIGNLACH
Subjt:  SGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACH

Query:  EVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLT
        EVPMK IVAKSGLI  IV+QLFLDDAQCLCEVCRLL  G+QS +C+ WAEAL+ EHVLSRILWVSENTLNPQLIEKSVGLLS I+ES QEV   LLPCL 
Subjt:  EVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLT

Query:  KLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQ
        KLG SSVLFNLF+FEMKILTNERS ER+SILDVILRA+E LS IEEHS E+CSNKELFQL+  LVKLPD  EVSSSC+ AVVLIANILSDVPDLAFEMSQ
Subjt:  KLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQ

Query:  DLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITIL
        DLSFLQGL D FSF GDDLEARDAVWSIIARILVRVQENVM+RP+L EYVSLLVSKTDLIEDDLLD  MTES+KEE  + S C K  SRCISLR+II+IL
Subjt:  DLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITIL

Query:  NHWTALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE
        NHWTA KDE T+VRDEY VEDVDVNRLL CC KHSE
Subjt:  NHWTALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE

A0A6J1CG56 uncharacterized protein LOC111011366 isoform X15.5e-29599.44Show/hide
Query:  MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSPSGT
        MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGS AKMDESDACLSGDRVLSPSGT
Subjt:  MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSPSGT

Query:  VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP
        VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP
Subjt:  VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP

Query:  MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG
        MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLT GIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG
Subjt:  MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG

Query:  FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLS
        FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEI SNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLS
Subjt:  FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLS

Query:  FLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNHW
        FLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNHW
Subjt:  FLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNHW

Query:  TALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE
        TALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE
Subjt:  TALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE

A0A6J1CH38 uncharacterized protein LOC111011366 isoform X29.8e-29298.87Show/hide
Query:  MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSPSGT
        MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGS AKMDESDACLSGDRVLSPSGT
Subjt:  MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSPSGT

Query:  VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP
        VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP
Subjt:  VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP

Query:  MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG
        MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLT GIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG
Subjt:  MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG

Query:  FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLS
        FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEI SNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLS
Subjt:  FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLS

Query:  FLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNHW
        FLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCIS   IITILNHW
Subjt:  FLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNHW

Query:  TALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE
        TALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE
Subjt:  TALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE

A0A6J1FYH6 uncharacterized protein LOC111448685 isoform X12.0e-22879.18Show/hide
Query:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKS--DEDDDRDDPGQGSAAKMDESDACLSGDRVL
        ME+G   DP+EAEL+ + E VEG  GP+HHPSAP DELFDISTTVDPSYIISLIRKLLPS+ASN   S    DDDRD     S   MDESDA LSGD+VL
Subjt:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKS--DEDDDRDDPGQGSAAKMDESDACLSGDRVL

Query:  SPSGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLA
        S SGTV +CQGIEI DGSDK AD+EGE+EG+CPR EQ ISSSEE VWEEYGCILWDLSAS+SHAELMVQNLVLEVLSANLMVSQSVRV+EI LGIIGNLA
Subjt:  SPSGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLA

Query:  CHEVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPC
        CHEVPMK IV KSGLI IIVNQLFLDDAQCLCEVCRLL  G+ S +C  WAEAL+SEHVLSRILWVSENTLNPQLIEKSVGLLS I+ES+QEV  +LLPC
Subjt:  CHEVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPC

Query:  LTKLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEM
        L KLG SS LFNLF+FEMKILTNERS ERYSILD ILRA+EALS IEEHSQE CSNK+LFQL+  LVKLPD  EVSSSC+ AV+LIANILSDVPDLAF+M
Subjt:  LTKLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEM

Query:  SQDLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIIT
        SQDLSFLQGLLDIFSF GDDLEARDAVWSIIARILV V+E  M+RPR+FEYVSLLVSKTDLIEDDLLD RMTE +K+E  L S C K  SRCISLR+II 
Subjt:  SQDLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIIT

Query:  ILNHWTALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE
        ILN WT  KDE T+VRDEY  ED+DVNRLL+CCCKHSE
Subjt:  ILNHWTALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE

A0A6J1J751 uncharacterized protein LOC1114840771.3e-22778.73Show/hide
Query:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSP
        ME+G   DP+EAEL+ + E VEG  GP+HHPSAP DELFDISTTVDPSYIISLIRKLLPSNASN    +    RDD G  S   MDESDA LSGD+VLS 
Subjt:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSP

Query:  SGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACH
        SGTV +CQGIEI DGSDK AD+EGE+EG+CPR EQ ISSSEE VWEEYGCILWDLSAS+SHAELMVQNLVLEVLSANLMVSQSVRV+EI LGIIGNLACH
Subjt:  SGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACH

Query:  EVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLT
        EVPMK IV KSGLI  IVNQLFLDDAQCLCEVCRLL  G+QS +C  WA AL+SEHVLSRILWVSENTLNPQLIEKSVGLLS I+ES+QEV  +LLPCL 
Subjt:  EVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLT

Query:  KLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQ
        KLG SS LFNLF+FEMKILTNERS ERYSILD ILRA+EALS IEEHSQE CSNK+LFQL+C LVKLPD  EVSSSC+ AV+LIANILSD+PDLAF+MSQ
Subjt:  KLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQ

Query:  DLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITIL
        DLSFLQGLLDIFSF GDDLEARDAVWSIIARILV V+E  M+RPR+FE VSLLVSKTDLIEDDLLD RMTE +K+E  L S C K  SRCISL +II IL
Subjt:  DLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITIL

Query:  NHWTALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE
        N W A KDE T+VRDEY  ED+DVNRLL+CCCKHSE
Subjt:  NHWTALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE

SwissProt top hitse value%identityAlignment
Q6DCP5 Protein saal17.8e-0425Show/hide
Query:  IEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVPMKQIVA
        IE   G +  A+QE          E  +   EE   E   C +WD+S ++  A  + +    E+L   ++ S+  R+ EI +GI+GN++C + P   I  
Subjt:  IEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVPMKQIVA

Query:  KSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCI-TWAEALHSE-HVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEV
           L  + +  L   D   L E  RLL   +   +   TWAE       V   + ++  ++ N  L+ K   LL  + +  +++
Subjt:  KSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCI-TWAEALHSE-HVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEV

Q803M5 Protein saal13.2e-0522.74Show/hide
Query:  EEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCR-LLTVGIQSCK
        EE  C +WD++  +  A  + +    ++L   +  S + R+ EI +GI+GN+AC       +   S L A+++  L  +D   L E CR LLT   Q+  
Subjt:  EEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCR-LLTVGIQSCK

Query:  CITWAEALHSEH-VLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPC------LTKLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRA
           W E +  +  V S + ++  ++ N  L+ K   LL  + +  +E+ K  +        L       +L +L          +  +E    L+V L +
Subjt:  CITWAEALHSEH-VLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPC------LTKLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRA

Query:  IEALSAIEEHSQEICSNKE--------LFQLLCYLVKLPD----VLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLSFLQGLLDIFSFTGDDLEARDA
        ++ L+ +EE  Q + S++         + +LLC     PD    +L+   + +   + + + L     L   +S +L  L  L+ I  F  ++ ++  A
Subjt:  IEALSAIEEHSQEICSNKE--------LFQLLCYLVKLPD----VLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLSFLQGLLDIFSFTGDDLEARDA

Arabidopsis top hitse value%identityAlignment
AT5G22820.1 ARM repeat superfamily protein4.0e-12049.8Show/hide
Query:  LEQDFEPVEGRNG------PSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSPSGTVGKC
        LE++ E   GR        PSHHP  PPDELFDISTTVDPSY+ISLIRKLLP ++ +  + ++  + D+  QG  A        +SG+ V+  S   G  
Subjt:  LEQDFEPVEGRNG------PSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSPSGTVGKC

Query:  QGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVPMKQI
        + ++I D  D+   + GE   SCP    P        WE++GC+LWDL+ASR+HAELMVQNL+LEVL ANLMVS+S R+ EI LGII NLACHE  +K I
Subjt:  QGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVPMKQI

Query:  VAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLGFSSV
         + +G++  +V QLFLDD QCL EVCR+LT G+    C +WA  L S+ +L  ILW++ENTLNP LIEKSVGLL  I+E + EV ++L+P L  LG +S+
Subjt:  VAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLGFSSV

Query:  LFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLSFLQG
        L NL +FEM  LT ER  ERY +L++ILRAIEALSA + +S+EICS+KELFQL+C L+KL D  EV++SCV   VLIAN+LS+  D   E+ +D SFL+G
Subjt:  LFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLSFLQG

Query:  LLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKII
        L     F  DD+EAR A+W++IAR+L RV E+ +    L +Y+ +L+S  D+IEDD LD ++ E S E      + +K ++R I++   I
Subjt:  LLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKII

AT5G22820.2 ARM repeat superfamily protein1.8e-12848.96Show/hide
Query:  LEQDFEPVEGRNG------PSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSPSGTVGKC
        LE++ E   GR        PSHHP  PPDELFDISTTVDPSY+ISLIRKLLP ++ +  + ++  + D+  QG  A        +SG+ V+  S   G  
Subjt:  LEQDFEPVEGRNG------PSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSPSGTVGKC

Query:  QGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVPMKQI
        + ++I D  D+   + GE   SCP    P        WE++GC+LWDL+ASR+HAELMVQNL+LEVL ANLMVS+S R+ EI LGII NLACHE  +K I
Subjt:  QGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVPMKQI

Query:  VAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLGFSSV
         + +G++  +V QLFLDD QCL EVCR+LT G+    C +WA  L S+ +L  ILW++ENTLNP LIEKSVGLL  I+E + EV ++L+P L  LG +S+
Subjt:  VAKSGLIAIIVNQLFLDDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLGFSSV

Query:  LFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLSFLQG
        L NL +FEM  LT ER  ERY +L++ILRAIEALSA + +S+EICS+KELFQL+C L+KL D  EV++SCV   VLIAN+LS+  D   E+ +D SFL+G
Subjt:  LFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLSFLQG

Query:  LLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNHWTALK
        L     F  DD+EAR A+W++IAR+L RV E+ +    L +Y+ +L+S  D+IEDD LD ++ E S E      + +K ++R I+++KI +ILN+W A K
Subjt:  LLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNHWTALK

Query:  D--EETNVRDEYHVEDVDVNRLLNCCCKH
        +  +E  V     +   DV RL +CC ++
Subjt:  D--EETNVRDEYHVEDVDVNRLLNCCCKH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCTGGGCGACCCTGTGGAAGCGGAATTGGAGCAAGACTTTGAACCAGTAGAAGGTCGGAATGGACCCTCTCACCACCCTTCTGCGCCACCTGATGAGTTATTTGA
TATCTCGACGACGGTTGATCCTAGCTATATCATCTCTCTAATACGGAAACTTTTGCCATCTAACGCAAGTAACCCGTGCAAATCTGATGAAGATGATGATCGTGACGACC
CAGGACAAGGATCAGCGGCCAAAATGGATGAAAGTGATGCCTGTTTATCAGGCGACCGAGTCTTAAGTCCTTCAGGGACGGTAGGTAAATGCCAGGGAATTGAAATTACG
GATGGTTCTGATAAATTTGCTGATCAAGAAGGTGAGGAGGAAGGCTCCTGCCCTAGATTGGAGCAACCCATTTCATCATCAGAAGAAAAGGTCTGGGAAGAGTATGGCTG
CATTCTGTGGGATCTTTCTGCGAGTAGATCTCATGCAGAACTTATGGTTCAGAACCTTGTCCTTGAAGTCCTTTCTGCGAACCTTATGGTCTCACAATCTGTGCGTGTTT
TGGAGATTAGCCTTGGAATTATTGGAAACCTGGCCTGCCATGAAGTTCCCATGAAACAAATAGTCGCTAAGAGTGGATTGATTGCAATCATTGTGAACCAGTTGTTTCTA
GATGATGCTCAATGCTTATGTGAAGTTTGCAGGTTATTAACTGTGGGTATTCAAAGTTGCAAATGTATCACATGGGCCGAGGCTTTGCATTCTGAGCATGTTCTTTCTCG
TATTCTGTGGGTTTCTGAGAACACTTTAAATCCACAACTTATTGAAAAGAGTGTTGGACTATTATCAGCCATTGTTGAGAGTAAGCAGGAAGTTGCGAAAATTCTTCTCC
CTTGTTTGACGAAGCTGGGTTTTTCGAGTGTTTTGTTCAACCTTTTTGCTTTTGAGATGAAAATATTAACAAATGAAAGATCAACTGAAAGGTACTCAATTTTGGATGTG
ATTCTTCGGGCAATTGAAGCACTCTCTGCAATTGAAGAGCATTCTCAAGAAATATGTTCAAATAAAGAACTTTTTCAGCTACTTTGTTATCTAGTCAAATTGCCAGATGT
ATTAGAGGTTTCCAGTTCTTGTGTTTGTGCTGTGGTTTTGATTGCAAATATTCTGTCAGATGTACCTGACCTAGCCTTTGAGATGTCTCAGGATTTGTCTTTCCTACAAG
GTCTACTTGATATATTCTCTTTTACTGGGGATGATTTAGAGGCACGTGATGCTGTTTGGAGCATCATTGCCAGGATACTGGTTCGTGTTCAAGAAAATGTGATGACCCGG
CCAAGGCTATTTGAGTACGTCTCATTACTAGTGAGTAAGACTGATCTCATCGAGGATGATCTTCTGGACCAGCGGATGACTGAATCAAGTAAAGAAGAGGGTGAATTGAT
CTCAACCTGCATGAAACCAACCTCTAGATGTATATCTTTAAGAAAGATAATTACTATTTTAAATCATTGGACTGCTTTAAAGGATGAAGAGACAAACGTGAGAGATGAAT
ATCATGTCGAAGATGTAGATGTCAATAGATTGTTGAATTGCTGCTGTAAACATTCTGAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGCTGGGCGACCCTGTGGAAGCGGAATTGGAGCAAGACTTTGAACCAGTAGAAGGTCGGAATGGACCCTCTCACCACCCTTCTGCGCCACCTGATGAGTTATTTGA
TATCTCGACGACGGTTGATCCTAGCTATATCATCTCTCTAATACGGAAACTTTTGCCATCTAACGCAAGTAACCCGTGCAAATCTGATGAAGATGATGATCGTGACGACC
CAGGACAAGGATCAGCGGCCAAAATGGATGAAAGTGATGCCTGTTTATCAGGCGACCGAGTCTTAAGTCCTTCAGGGACGGTAGGTAAATGCCAGGGAATTGAAATTACG
GATGGTTCTGATAAATTTGCTGATCAAGAAGGTGAGGAGGAAGGCTCCTGCCCTAGATTGGAGCAACCCATTTCATCATCAGAAGAAAAGGTCTGGGAAGAGTATGGCTG
CATTCTGTGGGATCTTTCTGCGAGTAGATCTCATGCAGAACTTATGGTTCAGAACCTTGTCCTTGAAGTCCTTTCTGCGAACCTTATGGTCTCACAATCTGTGCGTGTTT
TGGAGATTAGCCTTGGAATTATTGGAAACCTGGCCTGCCATGAAGTTCCCATGAAACAAATAGTCGCTAAGAGTGGATTGATTGCAATCATTGTGAACCAGTTGTTTCTA
GATGATGCTCAATGCTTATGTGAAGTTTGCAGGTTATTAACTGTGGGTATTCAAAGTTGCAAATGTATCACATGGGCCGAGGCTTTGCATTCTGAGCATGTTCTTTCTCG
TATTCTGTGGGTTTCTGAGAACACTTTAAATCCACAACTTATTGAAAAGAGTGTTGGACTATTATCAGCCATTGTTGAGAGTAAGCAGGAAGTTGCGAAAATTCTTCTCC
CTTGTTTGACGAAGCTGGGTTTTTCGAGTGTTTTGTTCAACCTTTTTGCTTTTGAGATGAAAATATTAACAAATGAAAGATCAACTGAAAGGTACTCAATTTTGGATGTG
ATTCTTCGGGCAATTGAAGCACTCTCTGCAATTGAAGAGCATTCTCAAGAAATATGTTCAAATAAAGAACTTTTTCAGCTACTTTGTTATCTAGTCAAATTGCCAGATGT
ATTAGAGGTTTCCAGTTCTTGTGTTTGTGCTGTGGTTTTGATTGCAAATATTCTGTCAGATGTACCTGACCTAGCCTTTGAGATGTCTCAGGATTTGTCTTTCCTACAAG
GTCTACTTGATATATTCTCTTTTACTGGGGATGATTTAGAGGCACGTGATGCTGTTTGGAGCATCATTGCCAGGATACTGGTTCGTGTTCAAGAAAATGTGATGACCCGG
CCAAGGCTATTTGAGTACGTCTCATTACTAGTGAGTAAGACTGATCTCATCGAGGATGATCTTCTGGACCAGCGGATGACTGAATCAAGTAAAGAAGAGGGTGAATTGAT
CTCAACCTGCATGAAACCAACCTCTAGATGTATATCTTTAAGAAAGATAATTACTATTTTAAATCATTGGACTGCTTTAAAGGATGAAGAGACAAACGTGAGAGATGAAT
ATCATGTCGAAGATGTAGATGTCAATAGATTGTTGAATTGCTGCTGTAAACATTCTGAG
Protein sequenceShow/hide protein sequence
MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSAAKMDESDACLSGDRVLSPSGTVGKCQGIEIT
DGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVPMKQIVAKSGLIAIIVNQLFL
DDAQCLCEVCRLLTVGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLGFSSVLFNLFAFEMKILTNERSTERYSILDV
ILRAIEALSAIEEHSQEICSNKELFQLLCYLVKLPDVLEVSSSCVCAVVLIANILSDVPDLAFEMSQDLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTR
PRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNHWTALKDEETNVRDEYHVEDVDVNRLLNCCCKHSE