; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0235 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0235
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionSerine/threonine-protein kinase SRPK
Genome locationMC02:2191044..2197807
RNA-Seq ExpressionMC02g0235
SyntenyMC02g0235
Gene Ontology termsGO:0006468 - protein phosphorylation (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004672 - protein kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458652.1 PREDICTED: uncharacterized protein LOC103497988 isoform X1 [Cucumis melo]2.20e-27979.32Show/hide
Query:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSP
        ME+G   DP+EAELE D EPVE  NGP+HHPSAP DELFDISTTVDPSYIISLIRKLLP NASN   S E+    D G  SV KMDE D  LSGD++LS 
Subjt:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSP

Query:  SGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACH
        SGTV KC G+EI DGS K AD+EGE+EG+C + EQ ISS EEKVWEEYGCILWDLSASRS AELMVQNLVLEVLSANLMVSQSVRV+EISLGIIGNLACH
Subjt:  SGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACH

Query:  EVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLT
        EVPMK IVAKSGLI  IV+QLFLDDAQCLCEVCRLL  G+QS +C+ WAEAL+ EHVLSRILWVSENTLNPQLIEKSVGLLS I+ES QEV   LLPCL 
Subjt:  EVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLT

Query:  KLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMS
        KLG SSVLFNLF+FEMKILTNERS ER+SILDVILRA+E LS IEEHS E+ SNKELFQL+  LVKLPD  EV  SSC+ AVVLIANILSDVPDLAFEMS
Subjt:  KLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMS

Query:  QDLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITI
        QDLSFLQGL D FSF GDDLEARDAVWSIIARILVRVQENVM+RP+L EYVSLLVSKTDLIEDDLLD  MTES+KEE  + S C K  SRCISLR+II+I
Subjt:  QDLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITI

Query:  LNHWTALKDEETNVRDEYHVEDVDVNRLLNCC
        LNHWTA KDE T+VRDEY VEDVDVNRLL CC
Subjt:  LNHWTALKDEETNVRDEYHVEDVDVNRLLNCC

XP_022140789.1 uncharacterized protein LOC111011366 isoform X1 [Momordica charantia]0.099.62Show/hide
Query:  MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSPSGT
        MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSPSGT
Subjt:  MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSPSGT

Query:  VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP
        VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP
Subjt:  VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP

Query:  MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG
        MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG
Subjt:  MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG

Query:  FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMSQDL
        FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEV  SSCVCAVVLIANILSDVPDLAFEMSQDL
Subjt:  FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMSQDL

Query:  SFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNH
        SFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNH
Subjt:  SFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNH

Query:  WTALKDEETNVRDEYHVEDVDVNRLLNCC
        WTALKDEETNVRDEYHVEDVDVNRLLNCC
Subjt:  WTALKDEETNVRDEYHVEDVDVNRLLNCC

XP_022140791.1 uncharacterized protein LOC111011366 isoform X2 [Momordica charantia]0.099.05Show/hide
Query:  MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSPSGT
        MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSPSGT
Subjt:  MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSPSGT

Query:  VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP
        VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP
Subjt:  VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP

Query:  MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG
        MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG
Subjt:  MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG

Query:  FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMSQDL
        FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEV  SSCVCAVVLIANILSDVPDLAFEMSQDL
Subjt:  FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMSQDL

Query:  SFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNH
        SFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCIS   IITILNH
Subjt:  SFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNH

Query:  WTALKDEETNVRDEYHVEDVDVNRLLNCC
        WTALKDEETNVRDEYHVEDVDVNRLLNCC
Subjt:  WTALKDEETNVRDEYHVEDVDVNRLLNCC

XP_022944140.1 uncharacterized protein LOC111448685 isoform X1 [Cucurbita moschata]1.07e-28078.84Show/hide
Query:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSD--EDDDRDDPGQGSVAKMDESDACLSGDRVL
        ME+G   DP+EAEL+ + E VEG  GP+HHPSAP DELFDISTTVDPSYIISLIRKLLPS+ASN   S    DDDRD     SV  MDESDA LSGD+VL
Subjt:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSD--EDDDRDDPGQGSVAKMDESDACLSGDRVL

Query:  SPSGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLA
        S SGTV +CQGIEI DGSDK AD+EGE+EG+CPR EQ ISSSEE VWEEYGCILWDLSAS+SHAELMVQNLVLEVLSANLMVSQSVRV+EI LGIIGNLA
Subjt:  SPSGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLA

Query:  CHEVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPC
        CHEVPMK IV KSGLI IIVNQLFLDDAQCLCEVCRLL AG+ S +C  WAEAL+SEHVLSRILWVSENTLNPQLIEKSVGLLS I+ES+QEV  +LLPC
Subjt:  CHEVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPC

Query:  LTKLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFE
        L KLG SS LFNLF+FEMKILTNERS ERYSILD ILRA+EALS IEEHSQE  SNK+LFQL+  LVKLPD  EV  SSC+ AV+LIANILSDVPDLAF+
Subjt:  LTKLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFE

Query:  MSQDLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKII
        MSQDLSFLQGLLDIFSF GDDLEARDAVWSIIARILV V+E  M+RPR+FEYVSLLVSKTDLIEDDLLD RMTE +K+E  L S C K  SRCISLR+II
Subjt:  MSQDLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKII

Query:  TILNHWTALKDEETNVRDEYHVEDVDVNRLLNCC
         ILN WT  KDE T+VRDEY  ED+DVNRLL+CC
Subjt:  TILNHWTALKDEETNVRDEYHVEDVDVNRLLNCC

XP_022986281.1 uncharacterized protein LOC111484077 [Cucurbita maxima]6.16e-28078.57Show/hide
Query:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSP
        ME+G   DP+EAEL+ + E VEG  GP+HHPSAP DELFDISTTVDPSYIISLIRKLLPSNASN   S     RDD G  SV  MDESDA LSGD+VLS 
Subjt:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSP

Query:  SGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACH
        SGTV +CQGIEI DGSDK AD+EGE+EG+CPR EQ ISSSEE VWEEYGCILWDLSAS+SHAELMVQNLVLEVLSANLMVSQSVRV+EI LGIIGNLACH
Subjt:  SGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACH

Query:  EVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLT
        EVPMK IV KSGLI  IVNQLFLDDAQCLCEVCRLL AG+QS +C  WA AL+SEHVLSRILWVSENTLNPQLIEKSVGLLS I+ES+QEV  +LLPCL 
Subjt:  EVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLT

Query:  KLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMS
        KLG SS LFNLF+FEMKILTNERS ERYSILD ILRA+EALS IEEHSQE  SNK+LFQL+C LVKLPD  EV  SSC+ AV+LIANILSD+PDLAF+MS
Subjt:  KLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMS

Query:  QDLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITI
        QDLSFLQGLLDIFSF GDDLEARDAVWSIIARILV V+E  M+RPR+FE VSLLVSKTDLIEDDLLD RMTE +K+E  L S C K  SRCISL +II I
Subjt:  QDLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITI

Query:  LNHWTALKDEETNVRDEYHVEDVDVNRLLNCC
        LN W A KDE T+VRDEY  ED+DVNRLL+CC
Subjt:  LNHWTALKDEETNVRDEYHVEDVDVNRLLNCC

TrEMBL top hitse value%identityAlignment
A0A1S3C8G6 uncharacterized protein LOC103497988 isoform X11.06e-27979.32Show/hide
Query:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSP
        ME+G   DP+EAELE D EPVE  NGP+HHPSAP DELFDISTTVDPSYIISLIRKLLP NASN   S E+    D G  SV KMDE D  LSGD++LS 
Subjt:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSP

Query:  SGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACH
        SGTV KC G+EI DGS K AD+EGE+EG+C + EQ ISS EEKVWEEYGCILWDLSASRS AELMVQNLVLEVLSANLMVSQSVRV+EISLGIIGNLACH
Subjt:  SGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACH

Query:  EVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLT
        EVPMK IVAKSGLI  IV+QLFLDDAQCLCEVCRLL  G+QS +C+ WAEAL+ EHVLSRILWVSENTLNPQLIEKSVGLLS I+ES QEV   LLPCL 
Subjt:  EVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLT

Query:  KLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMS
        KLG SSVLFNLF+FEMKILTNERS ER+SILDVILRA+E LS IEEHS E+ SNKELFQL+  LVKLPD  EV  SSC+ AVVLIANILSDVPDLAFEMS
Subjt:  KLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMS

Query:  QDLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITI
        QDLSFLQGL D FSF GDDLEARDAVWSIIARILVRVQENVM+RP+L EYVSLLVSKTDLIEDDLLD  MTES+KEE  + S C K  SRCISLR+II+I
Subjt:  QDLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITI

Query:  LNHWTALKDEETNVRDEYHVEDVDVNRLLNCC
        LNHWTA KDE T+VRDEY VEDVDVNRLL CC
Subjt:  LNHWTALKDEETNVRDEYHVEDVDVNRLLNCC

A0A6J1CG56 uncharacterized protein LOC111011366 isoform X10.099.62Show/hide
Query:  MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSPSGT
        MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSPSGT
Subjt:  MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSPSGT

Query:  VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP
        VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP
Subjt:  VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP

Query:  MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG
        MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG
Subjt:  MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG

Query:  FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMSQDL
        FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEV  SSCVCAVVLIANILSDVPDLAFEMSQDL
Subjt:  FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMSQDL

Query:  SFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNH
        SFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNH
Subjt:  SFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNH

Query:  WTALKDEETNVRDEYHVEDVDVNRLLNCC
        WTALKDEETNVRDEYHVEDVDVNRLLNCC
Subjt:  WTALKDEETNVRDEYHVEDVDVNRLLNCC

A0A6J1CH38 uncharacterized protein LOC111011366 isoform X20.099.05Show/hide
Query:  MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSPSGT
        MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSPSGT
Subjt:  MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSPSGT

Query:  VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP
        VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP
Subjt:  VGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVP

Query:  MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG
        MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG
Subjt:  MKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLG

Query:  FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMSQDL
        FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEV  SSCVCAVVLIANILSDVPDLAFEMSQDL
Subjt:  FSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMSQDL

Query:  SFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNH
        SFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCIS   IITILNH
Subjt:  SFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNH

Query:  WTALKDEETNVRDEYHVEDVDVNRLLNCC
        WTALKDEETNVRDEYHVEDVDVNRLLNCC
Subjt:  WTALKDEETNVRDEYHVEDVDVNRLLNCC

A0A6J1FYH6 uncharacterized protein LOC111448685 isoform X15.17e-28178.84Show/hide
Query:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSD--EDDDRDDPGQGSVAKMDESDACLSGDRVL
        ME+G   DP+EAEL+ + E VEG  GP+HHPSAP DELFDISTTVDPSYIISLIRKLLPS+ASN   S    DDDRD     SV  MDESDA LSGD+VL
Subjt:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSD--EDDDRDDPGQGSVAKMDESDACLSGDRVL

Query:  SPSGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLA
        S SGTV +CQGIEI DGSDK AD+EGE+EG+CPR EQ ISSSEE VWEEYGCILWDLSAS+SHAELMVQNLVLEVLSANLMVSQSVRV+EI LGIIGNLA
Subjt:  SPSGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLA

Query:  CHEVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPC
        CHEVPMK IV KSGLI IIVNQLFLDDAQCLCEVCRLL AG+ S +C  WAEAL+SEHVLSRILWVSENTLNPQLIEKSVGLLS I+ES+QEV  +LLPC
Subjt:  CHEVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPC

Query:  LTKLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFE
        L KLG SS LFNLF+FEMKILTNERS ERYSILD ILRA+EALS IEEHSQE  SNK+LFQL+  LVKLPD  EV  SSC+ AV+LIANILSDVPDLAF+
Subjt:  LTKLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFE

Query:  MSQDLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKII
        MSQDLSFLQGLLDIFSF GDDLEARDAVWSIIARILV V+E  M+RPR+FEYVSLLVSKTDLIEDDLLD RMTE +K+E  L S C K  SRCISLR+II
Subjt:  MSQDLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKII

Query:  TILNHWTALKDEETNVRDEYHVEDVDVNRLLNCC
         ILN WT  KDE T+VRDEY  ED+DVNRLL+CC
Subjt:  TILNHWTALKDEETNVRDEYHVEDVDVNRLLNCC

A0A6J1J751 uncharacterized protein LOC1114840772.98e-28078.57Show/hide
Query:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSP
        ME+G   DP+EAEL+ + E VEG  GP+HHPSAP DELFDISTTVDPSYIISLIRKLLPSNASN   S     RDD G  SV  MDESDA LSGD+VLS 
Subjt:  MELG---DPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSP

Query:  SGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACH
        SGTV +CQGIEI DGSDK AD+EGE+EG+CPR EQ ISSSEE VWEEYGCILWDLSAS+SHAELMVQNLVLEVLSANLMVSQSVRV+EI LGIIGNLACH
Subjt:  SGTVGKCQGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACH

Query:  EVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLT
        EVPMK IV KSGLI  IVNQLFLDDAQCLCEVCRLL AG+QS +C  WA AL+SEHVLSRILWVSENTLNPQLIEKSVGLLS I+ES+QEV  +LLPCL 
Subjt:  EVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLT

Query:  KLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMS
        KLG SS LFNLF+FEMKILTNERS ERYSILD ILRA+EALS IEEHSQE  SNK+LFQL+C LVKLPD  EV  SSC+ AV+LIANILSD+PDLAF+MS
Subjt:  KLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMS

Query:  QDLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITI
        QDLSFLQGLLDIFSF GDDLEARDAVWSIIARILV V+E  M+RPR+FE VSLLVSKTDLIEDDLLD RMTE +K+E  L S C K  SRCISL +II I
Subjt:  QDLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITI

Query:  LNHWTALKDEETNVRDEYHVEDVDVNRLLNCC
        LN W A KDE T+VRDEY  ED+DVNRLL+CC
Subjt:  LNHWTALKDEETNVRDEYHVEDVDVNRLLNCC

SwissProt top hitse value%identityAlignment
Q6DCP5 Protein saal16.1e-0425Show/hide
Query:  IEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVPMKQIVA
        IE   G +  A+QE          E  +   EE   E   C +WD+S ++  A  + +    E+L   ++ S+  R+ EI +GI+GN++C + P   I  
Subjt:  IEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVPMKQIVA

Query:  KSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCI-TWAEALHSE-HVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEV
           L  + +  L   D   L E  RLL   +   +   TWAE       V   + ++  ++ N  L+ K   LL  + +  +++
Subjt:  KSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCI-TWAEALHSE-HVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEV

Q803M5 Protein saal11.1e-0523.67Show/hide
Query:  EEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCR-LLTAGIQSCK
        EE  C +WD++  +  A  + +    ++L   +  S + R+ EI +GI+GN+AC       +   S L A+++  L  +D   L E CR LLT   Q+  
Subjt:  EEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVPMKQIVAKSGLIAIIVNQLFLDDAQCLCEVCR-LLTAGIQSCK

Query:  CITWAEALHSEH-VLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPC------LTKLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRA
           W E +  +  V S + ++  ++ N  L+ K   LL  + +  +E+ K  +        L       +L +L          +  +E    L+V L +
Subjt:  CITWAEALHSEH-VLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPC------LTKLGFSSVLFNLFAFEMKILTNERSTERYSILDVILRA

Query:  IEALSAIEEHSQEIRSNKE--------LFQLLCYLVKLPD----VLEVQFSSCVCAVVLIANILSDVPDLAFEMSQDLSFLQGLLDIFSFTGDDLEARDA
        ++ L+ +EE  Q + S++         + +LLC     PD    +L+ Q +    A+ L++ + S    L   +S +L  L  L+ I  F  ++ ++  A
Subjt:  IEALSAIEEHSQEIRSNKE--------LFQLLCYLVKLPD----VLEVQFSSCVCAVVLIANILSDVPDLAFEMSQDLSFLQGLLDIFSFTGDDLEARDA

Arabidopsis top hitse value%identityAlignment
AT5G22820.1 ARM repeat superfamily protein6.5e-11849.69Show/hide
Query:  LEQDFEPVEGRNG------PSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSPSGTVGKC
        LE++ E   GR        PSHHP  PPDELFDISTTVDPSY+ISLIRKLLP ++ +  + ++  + D+  QG VA        +SG+ V+  S   G  
Subjt:  LEQDFEPVEGRNG------PSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSPSGTVGKC

Query:  QGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVPMKQI
        + ++I D  D+   + GE   SCP    P        WE++GC+LWDL+ASR+HAELMVQNL+LEVL ANLMVS+S R+ EI LGII NLACHE  +K I
Subjt:  QGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVPMKQI

Query:  VAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLGFSSV
         + +G++  +V QLFLDD QCL EVCR+LT G+    C +WA  L S+ +L  ILW++ENTLNP LIEKSVGLL  I+E + EV ++L+P L  LG +S+
Subjt:  VAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLGFSSV

Query:  LFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMSQDLSFLQ
        L NL +FEM  LT ER  ERY +L++ILRAIEALSA + +S+EI S+KELFQL+C L+KL D  EV  +SCV   VLIAN+LS+  D   E+ +D SFL+
Subjt:  LFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMSQDLSFLQ

Query:  GLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKII
        GL     F  DD+EAR A+W++IAR+L RV E+ +    L +Y+ +L+S  D+IEDD LD ++ E S E      + +K ++R I++   I
Subjt:  GLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKII

AT5G22820.2 ARM repeat superfamily protein1.7e-12649.05Show/hide
Query:  LEQDFEPVEGRNG------PSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSPSGTVGKC
        LE++ E   GR        PSHHP  PPDELFDISTTVDPSY+ISLIRKLLP ++ +  + ++  + D+  QG VA        +SG+ V+  S   G  
Subjt:  LEQDFEPVEGRNG------PSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSPSGTVGKC

Query:  QGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVPMKQI
        + ++I D  D+   + GE   SCP    P        WE++GC+LWDL+ASR+HAELMVQNL+LEVL ANLMVS+S R+ EI LGII NLACHE  +K I
Subjt:  QGIEITDGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVPMKQI

Query:  VAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLGFSSV
         + +G++  +V QLFLDD QCL EVCR+LT G+    C +WA  L S+ +L  ILW++ENTLNP LIEKSVGLL  I+E + EV ++L+P L  LG +S+
Subjt:  VAKSGLIAIIVNQLFLDDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLGFSSV

Query:  LFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMSQDLSFLQ
        L NL +FEM  LT ER  ERY +L++ILRAIEALSA + +S+EI S+KELFQL+C L+KL D  EV  +SCV   VLIAN+LS+  D   E+ +D SFL+
Subjt:  LFNLFAFEMKILTNERSTERYSILDVILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMSQDLSFLQ

Query:  GLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNHWTAL
        GL     F  DD+EAR A+W++IAR+L RV E+ +    L +Y+ +L+S  D+IEDD LD ++ E S E      + +K ++R I+++KI +ILN+W A 
Subjt:  GLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMTRPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNHWTAL

Query:  KD--EETNVRDEYHVEDVDVNRLLNCCY
        K+  +E  V     +   DV RL +CC+
Subjt:  KD--EETNVRDEYHVEDVDVNRLLNCCY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCTGGGCGACCCTGTGGAAGCGGAATTGGAGCAAGACTTTGAACCAGTAGAAGGTCGGAATGGACCCTCTCACCACCCTTCTGCGCCACCTGATGAGTTATTTGA
TATCTCGACGACGGTTGATCCTAGCTATATCATCTCTCTAATACGGAAACTTTTGCCATCTAACGCAAGTAACCCGTGCAAATCTGATGAAGATGATGATCGTGACGACC
CAGGACAAGGATCAGTGGCCAAAATGGATGAAAGTGATGCCTGTTTATCAGGCGACCGAGTCTTAAGTCCTTCAGGGACGGTAGGTAAATGCCAGGGAATTGAAATTACG
GATGGTTCTGATAAATTTGCTGATCAAGAAGGTGAGGAGGAAGGCTCCTGCCCTAGATTGGAGCAACCCATTTCATCATCAGAAGAAAAGGTCTGGGAAGAGTATGGCTG
CATTCTGTGGGATCTTTCTGCGAGTAGATCTCATGCAGAACTTATGGTTCAGAACCTTGTCCTTGAAGTCCTTTCTGCGAACCTTATGGTCTCACAATCTGTGCGTGTTT
TGGAGATTAGCCTTGGAATTATTGGAAACCTGGCCTGCCATGAAGTTCCCATGAAACAAATAGTCGCTAAGAGTGGATTGATTGCAATCATTGTGAACCAGTTGTTTCTA
GATGATGCTCAATGCTTATGTGAAGTTTGCAGGTTATTAACTGCGGGTATTCAAAGTTGCAAATGTATCACATGGGCCGAGGCTTTGCATTCTGAGCATGTTCTTTCTCG
TATTCTGTGGGTTTCTGAGAACACTTTAAATCCACAACTTATTGAAAAGAGTGTTGGACTATTATCAGCCATTGTTGAGAGTAAGCAGGAAGTTGCGAAAATTCTTCTCC
CTTGTTTGACGAAGCTGGGTTTTTCGAGTGTTTTGTTCAACCTTTTTGCTTTTGAGATGAAAATATTAACAAATGAAAGATCAACTGAAAGGTACTCAATTTTGGATGTG
ATTCTTCGGGCAATTGAAGCACTCTCTGCAATTGAAGAGCATTCTCAAGAAATACGTTCAAATAAAGAACTTTTTCAGCTACTTTGTTATCTAGTCAAATTGCCAGATGT
ATTAGAGGTACAGTTCAGTTCTTGTGTTTGTGCTGTGGTTTTGATTGCAAATATTCTGTCAGATGTACCTGACCTAGCCTTTGAGATGTCTCAGGATTTGTCTTTCCTAC
AAGGTCTACTTGATATATTCTCTTTTACTGGGGATGATTTAGAGGCACGTGATGCTGTTTGGAGCATCATTGCCAGGATACTGGTTCGTGTTCAAGAAAATGTGATGACC
CGGCCAAGGCTATTTGAGTACGTCTCGTTACTAGTGAGTAAGACTGATCTCATCGAGGATGATCTTCTGGACCAGCGGATGACTGAATCAAGTAAAGAAGAGGGTGAATT
GATCTCAACCTGCATGAAACCAACCTCTAGATGTATATCTTTAAGAAAGATAATTACTATTTTAAATCATTGGACTGCTTTAAAGGATGAAGAGACAAACGTGAGAGATG
AATATCATGTCGAAGATGTAGATGTCAATAGATTGTTGAATTGCTGCTATGACCAGGGAGTTAGCGAACAGAAATCCAAACGTGATAACGTATAG
mRNA sequenceShow/hide mRNA sequence
GTCTAAAATGGACAAAAGCTAAAAATATATTAATAAAAATAATATTTTAACATTTATTTAATTAAGTACAAGGTCATATATGGAATAAGATTGGATAAATATGGTCAATC
CAAATGTAACTCGACGGATATAAGACACCTATTTTTGTTCAAGTAAAAAATGATAAATATGAAATAATTATTGATAATTTGTGCTCGAAATATTGGGCCCGGCCCGATAA
CGAAATTGGTACCTGGTCGAGTGCCGGAGAGACTGGGAAGCCGGGAAATTTTCAGGCTATACGACGTTAGCTTTACAGAAACTTCTGGAGGGAAACCAAGGTTGAAGACG
ACTACTAGTCTACAAGCTCATCGTGTTCTTGAAGCTCGTTATGGAGCTGGGCGACCCTGTGGAAGCGGAATTGGAGCAAGACTTTGAACCAGTAGAAGGTCGGAATGGAC
CCTCTCACCACCCTTCTGCGCCACCTGATGAGTTATTTGATATCTCGACGACGGTTGATCCTAGCTATATCATCTCTCTAATACGGAAACTTTTGCCATCTAACGCAAGT
AACCCGTGCAAATCTGATGAAGATGATGATCGTGACGACCCAGGACAAGGATCAGTGGCCAAAATGGATGAAAGTGATGCCTGTTTATCAGGCGACCGAGTCTTAAGTCC
TTCAGGGACGGTAGGTAAATGCCAGGGAATTGAAATTACGGATGGTTCTGATAAATTTGCTGATCAAGAAGGTGAGGAGGAAGGCTCCTGCCCTAGATTGGAGCAACCCA
TTTCATCATCAGAAGAAAAGGTCTGGGAAGAGTATGGCTGCATTCTGTGGGATCTTTCTGCGAGTAGATCTCATGCAGAACTTATGGTTCAGAACCTTGTCCTTGAAGTC
CTTTCTGCGAACCTTATGGTCTCACAATCTGTGCGTGTTTTGGAGATTAGCCTTGGAATTATTGGAAACCTGGCCTGCCATGAAGTTCCCATGAAACAAATAGTCGCTAA
GAGTGGATTGATTGCAATCATTGTGAACCAGTTGTTTCTAGATGATGCTCAATGCTTATGTGAAGTTTGCAGGTTATTAACTGCGGGTATTCAAAGTTGCAAATGTATCA
CATGGGCCGAGGCTTTGCATTCTGAGCATGTTCTTTCTCGTATTCTGTGGGTTTCTGAGAACACTTTAAATCCACAACTTATTGAAAAGAGTGTTGGACTATTATCAGCC
ATTGTTGAGAGTAAGCAGGAAGTTGCGAAAATTCTTCTCCCTTGTTTGACGAAGCTGGGTTTTTCGAGTGTTTTGTTCAACCTTTTTGCTTTTGAGATGAAAATATTAAC
AAATGAAAGATCAACTGAAAGGTACTCAATTTTGGATGTGATTCTTCGGGCAATTGAAGCACTCTCTGCAATTGAAGAGCATTCTCAAGAAATACGTTCAAATAAAGAAC
TTTTTCAGCTACTTTGTTATCTAGTCAAATTGCCAGATGTATTAGAGGTACAGTTCAGTTCTTGTGTTTGTGCTGTGGTTTTGATTGCAAATATTCTGTCAGATGTACCT
GACCTAGCCTTTGAGATGTCTCAGGATTTGTCTTTCCTACAAGGTCTACTTGATATATTCTCTTTTACTGGGGATGATTTAGAGGCACGTGATGCTGTTTGGAGCATCAT
TGCCAGGATACTGGTTCGTGTTCAAGAAAATGTGATGACCCGGCCAAGGCTATTTGAGTACGTCTCGTTACTAGTGAGTAAGACTGATCTCATCGAGGATGATCTTCTGG
ACCAGCGGATGACTGAATCAAGTAAAGAAGAGGGTGAATTGATCTCAACCTGCATGAAACCAACCTCTAGATGTATATCTTTAAGAAAGATAATTACTATTTTAAATCAT
TGGACTGCTTTAAAGGATGAAGAGACAAACGTGAGAGATGAATATCATGTCGAAGATGTAGATGTCAATAGATTGTTGAATTGCTGCTATGACCAGGGAGTTAGCGAACA
GAAATCCAAACGTGATAACGTATAGTGTCACTCTCAGCCACTTTCAGTACGTACACTTAAGTTAAATGCCTCACCTTTCAGTACACAGTCCAGGATATTATTTTGAATTC
ATTATCAAGTGAAGTGGAAGAGTCGATACAAATATTTACTGAATGGAACATAGTTTGTGGCTGTACTATGCAACGAGGTCAAGCCTGTTTGTCGGATGGTATTTGTTGGT
CTCAGGCTTGACAGAGTCTCCAAACTTCCAGGCTCCTGTGAAATTTTCAGTTGTATACTACTTGCCTTATATGTGGTTTGAAGACGGTTTTAAGACTACAAATTAGCTGT
AATTGGTTATATCTAAAATCATATATTATTCAGTGATCACTGCATCGTGCTCAAAGAATTGACTGCTACTTCATCGGTTCCCACCCATCTTTTTAACCAATTTTCATTAA
GTATTTACATTTATGGTATCTGATCATAGGGTAGAGCGGTTACTAACTTTCCTATTTCATACCTTAGTTACTTCTCATTTCTATTTTGATGCATAACTTCTGATGTCTAT
GTCCCCCCCTTGTATAACAATTAACTCTTGCTCTCGTGCAGTTTAGTTGCTTTTGAATAAGTAACAAAATTAATACTTATTTCCCCTTCTATATTTTACCAAAAATTGTG
GTTAATGTCACGAGGAGCAGGTCTCAAGTGGCCGTCTTCAGCAGTGTGTTACACAACTTCTTTCATAGATGAAGAATGACAATAGTCAAAGATAGCACACTCTCTGCATG
TTATCTCTTTCAGACTTTATTTGTCTACAGAAGAAAGCAATAAAGCAAAGCACACGACCAACATTAAAGTAAATCCCTCGAACAGGAAAAAACTTTGGGACACTCAGGTG
TATGATTTATGAGATGAGAGAGATGCCACTGGTTAGCAGATTATCAGCCAAGAACTTGTTTGCAGCTTCTGATGGGTGGAATCCATCCCAGAACACGTATTCAGATGCAT
TGGTACATGTCCCTACGGATTCTGTGTTGCATAAGATTGACGTTTCAAGCAAGCCTGTGCCACAGCAGCCCTTCCTTGCCTCAAAGAAACCTACATATTGTTATCCATGA
GTTAACGACATATCCTGAATTCTGATGGTCAAATCTTGACGCATCAATGTGGATTTCCGGTCCTTGGTTCTATGTGAAATATTCGTGGCATATTTATTCTACTTTATTTT
AGATTTTGTATAACTCATTAACCTGTACTGTCACTCGGAACTAATCTCCATGGCATACCATTTTCAGCAGGTTTTGTGACAAGGTCATAGAGAGGTTGGTAAGTGTCCAA
GACAACCAGATTGAGGCCAGGAAGCTTGGTTTGTAAGCTCCGAGATGTGGCATTCAGCTTGCTATTGAATGAAACTGCGTCATTGTTTAGCTTAGCCACACACTCGTTGC
TGTCAGATCCAAAGATCGTTATGGCAGCCGGAAGACATCCTAGTGGTGGCAGTGTAGTAACTCCAATCTTCCGTGCTCCCAATGAATATAGATTCTGAAATCATATGTTT
GATGATACAATATTCGTATAAATGTTTCTGCAATTTATTACTGCATTACTGAAAAGAATTTGAGAGATCGAATTGATTTATGATTTTGATGGGGAGCCAAACCTCAATGA
AATTTATGTAGGATGTGATTAGAATGTTGGCAAACTGATGGACGGTATATTTCTTGTAAAGTAGAG
Protein sequenceShow/hide protein sequence
MELGDPVEAELEQDFEPVEGRNGPSHHPSAPPDELFDISTTVDPSYIISLIRKLLPSNASNPCKSDEDDDRDDPGQGSVAKMDESDACLSGDRVLSPSGTVGKCQGIEIT
DGSDKFADQEGEEEGSCPRLEQPISSSEEKVWEEYGCILWDLSASRSHAELMVQNLVLEVLSANLMVSQSVRVLEISLGIIGNLACHEVPMKQIVAKSGLIAIIVNQLFL
DDAQCLCEVCRLLTAGIQSCKCITWAEALHSEHVLSRILWVSENTLNPQLIEKSVGLLSAIVESKQEVAKILLPCLTKLGFSSVLFNLFAFEMKILTNERSTERYSILDV
ILRAIEALSAIEEHSQEIRSNKELFQLLCYLVKLPDVLEVQFSSCVCAVVLIANILSDVPDLAFEMSQDLSFLQGLLDIFSFTGDDLEARDAVWSIIARILVRVQENVMT
RPRLFEYVSLLVSKTDLIEDDLLDQRMTESSKEEGELISTCMKPTSRCISLRKIITILNHWTALKDEETNVRDEYHVEDVDVNRLLNCCYDQGVSEQKSKRDNV