; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g20380 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g20380
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionvesicle-associated protein 1-3
Genome locationchr7:14672751..14707883
RNA-Seq ExpressionMoc07g20380
SyntenyMoc07g20380
Gene Ontology termsGO:0061817 - endoplasmic reticulum-plasma membrane tethering (biological process)
GO:0090158 - endoplasmic reticulum membrane organization (biological process)
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0005886 - plasma membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR000535 - Major sperm protein (MSP) domain
IPR008962 - PapD-like superfamily
IPR013783 - Immunoglobulin-like fold
IPR016763 - Vesicle-associated membrane-protein-associated protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143495.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111013372 [Momordica charantia]3.7e-16256.07Show/hide
Query:  SGAKGHSLEQCNRFQKRVQELLDSKVLTVTKSHLKKGTNVVEDILV-----AEGSSDSLKPKPLTIFYREKPDAPSCR----------------------
        +GAKGH+LEQCN F+  VQELLDSK+LTV  SH KKG NVVED+ V     AEGSSD+LKPK LTIFY EKPDAP+C                       
Subjt:  SGAKGHSLEQCNRFQKRVQELLDSKVLTVTKSHLKKGTNVVEDILV-----AEGSSDSLKPKPLTIFYREKPDAPSCR----------------------

Query:  ---------QEVSSPSLPVDNITGVG--------------------------------------------------------------------------
                 Q+VSSP LPVDNITGVG                                                                          
Subjt:  ---------QEVSSPSLPVDNITGVG--------------------------------------------------------------------------

Query:  --------------------GRTPAKISILSLLLSSETHQNALFEALKQAFVPQDITVDNLSNVVGNITASSSITFTDEEIP-----------LEVSCWG
                            GRTPAKISILSLLLSSE H+N L E LKQAFV QDITVDNLSNVVGNITASSSITFTDEEIP           + V C  
Subjt:  --------------------GRTPAKISILSLLLSSETHQNALFEALKQAFVPQDITVDNLSNVVGNITASSSITFTDEEIP-----------LEVSCWG

Query:  ---------------------------DHGYIRQE----------------------QFHP----------------------LYI-----------RKL
                                   D  ++R                        Q  P                      L+I           +K+
Subjt:  ---------------------------DHGYIRQE----------------------QFHP----------------------LYI-----------RKL

Query:  NLRVDQKLVIISGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKLLRMAKNTKKFGLGYKPSRGDIIRV
           VDQKLVIISGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDN SLDKLLRMAKNTKKFGLGYKPSRGDIIRV
Subjt:  NLRVDQKLVIISGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKLLRMAKNTKKFGLGYKPSRGDIIRV

Query:  WSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGIIHQEYDESSVVAGVTEEREQVGPFVYPCSDGFELSNWSVITEIECDNDSKYELDTPIYNIESDE
         SLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAG IHQEYD SSVVA VTEEREQV PFVYPC DGFELSNWSV TEIECDNDSKYELDTPIYNIESD+
Subjt:  WSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGIIHQEYDESSVVAGVTEEREQVGPFVYPCSDGFELSNWSVITEIECDNDSKYELDTPIYNIESDE

Query:  EIDDEFSTELLRMLAEEEKMLGPHEELTETINLGSQAEAKEV
        EIDDE S ELLRML EEEKMLGPHEELTET+NLGSQAEAKE+
Subjt:  EIDDEFSTELLRMLAEEEKMLGPHEELTETINLGSQAEAKEV

XP_022147181.1 uncharacterized protein LOC111016192 [Momordica charantia]1.1e-16983.46Show/hide
Query:  GRTPAKISILSLLLSSETHQNALFEALKQAFVPQDITVDNLSNVVGNITASSSITFTDEEIPLE------------------------------------
        GRTPAKISILSLLLSSETHQNALFEALKQAFV QDITVDNLSNVVGNITASSSITFTDEEIPLE                                    
Subjt:  GRTPAKISILSLLLSSETHQNALFEALKQAFVPQDITVDNLSNVVGNITASSSITFTDEEIPLE------------------------------------

Query:  -VSCWGDHGYIRQEQFHPLYIRKLNLRVDQKLVIISGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKL
         VSCWGDHGYIRQEQFHPLYIRKLNLRVDQKLVII GQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKL
Subjt:  -VSCWGDHGYIRQEQFHPLYIRKLNLRVDQKLVIISGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKL

Query:  LRMAKNTKKFGLGYKPSRGDIIRVWSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGIIHQEYDESSVVAGVTEEREQVGPFVYPCSDGFELSNWSVI
        L MAKNTKKFGLGYKPSRGDIIRVWSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGIIHQEYDESSVVAGVTEEREQVGPFVYPC DGFELSNWSV+
Subjt:  LRMAKNTKKFGLGYKPSRGDIIRVWSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGIIHQEYDESSVVAGVTEEREQVGPFVYPCSDGFELSNWSVI

Query:  ------------TEIECDNDSKYELDTPIYNIESDEEIDDEFSTELLRMLAEEEKMLGPHEELTETINLGSQAEAKEVCLKASKKSK
                     EIECDNDSKYELDTPIYNIESDEEIDDEFS ELLRMLAEEEKMLGPHEELTETINLGSQAEAKEV +  +  S+
Subjt:  ------------TEIECDNDSKYELDTPIYNIESDEEIDDEFSTELLRMLAEEEKMLGPHEELTETINLGSQAEAKEVCLKASKKSK

XP_022147189.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia]1.2e-16356.02Show/hide
Query:  GAKGHSLEQCNRFQKRVQELLDSKVLTVTKSHLKKGTNVVEDILVAEGSSDSLKPKPLTIFYREKPDAPSCR----------------------------
        GAKGHSLEQCN F+ +VQELLDSK+LT   SH KK TNVVEDILVAEGSSDSLKPKPLTIFYREKPDAPSC                             
Subjt:  GAKGHSLEQCNRFQKRVQELLDSKVLTVTKSHLKKGTNVVEDILVAEGSSDSLKPKPLTIFYREKPDAPSCR----------------------------

Query:  ---QEVSSPSLPVDNITGVG--------------------------------------------------------------------------------
           Q+VSSPSLPVDNITGVG                                                                                
Subjt:  ---QEVSSPSLPVDNITGVG--------------------------------------------------------------------------------

Query:  --------------GRTPAKISILSLLLSSETHQNALFEALKQAFVPQDITVDNLSNVVGNITASSSITFTDEEIP-----------LEVSCWG------
                      GRTPA ISILSLLLSSE HQNAL EALKQAFV QDITVDNLSNVVGNITASSSI+FTDEEIP           + V C        
Subjt:  --------------GRTPAKISILSLLLSSETHQNALFEALKQAFVPQDITVDNLSNVVGNITASSSITFTDEEIP-----------LEVSCWG------

Query:  ---------------------DHGYIRQE----------------------QFHPLYI---------------------------------RKLNLRVDQ
                             D  ++R                        Q  P                                    +K+   VDQ
Subjt:  ---------------------DHGYIRQE----------------------QFHPLYI---------------------------------RKLNLRVDQ

Query:  KLVIISGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKLLRMAKNTKKFGLGYKPSRGDIIRVWSLEKA
        KLVIISGQEDILVSR ASM YVE AEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDN SLDKLLRMAKNTKKFGLGYKPSRGDIIRV SLEKA
Subjt:  KLVIISGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKLLRMAKNTKKFGLGYKPSRGDIIRVWSLEKA

Query:  KRLSRFENEERDYPRRTVPPLSHSFRSAGIIHQEYDESSVVAGVTEEREQVGPFVYPCSDGFELSNWSVI------------TEIECDNDSKYELDTPIY
        KRLSRFENEERDYPRR VPPL+HSFRSAG IHQEYDESSVVA VTEEREQVGPFVY C DGFELSNWSVI            TEIECDNDSKYELDTPIY
Subjt:  KRLSRFENEERDYPRRTVPPLSHSFRSAGIIHQEYDESSVVAGVTEEREQVGPFVYPCSDGFELSNWSVI------------TEIECDNDSKYELDTPIY

Query:  NIESDEEIDDEFSTELLRMLAEEEKMLGPHEELTETINLGSQAEAKEV
         IESDEEIDDE S ELLRML EEEKMLGPHEELTET+NLGSQAEAKE+
Subjt:  NIESDEEIDDEFSTELLRMLAEEEKMLGPHEELTETINLGSQAEAKEV

XP_022150030.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111018303 [Momordica charantia]4.4e-15556.44Show/hide
Query:  RVQELLDSKVLTVTKSHLKKGTNVVEDILVAEGSSDSLKPKPLTIFYREKPDAPSCR-------------------------------QEVSSPSLPVDN
        +VQELLDSK+LTV  SH KK TNVVEDILVAEGSSDS+KPK LTIFYREKPDAPSC                                Q+VSSPSLPVDN
Subjt:  RVQELLDSKVLTVTKSHLKKGTNVVEDILVAEGSSDSLKPKPLTIFYREKPDAPSCR-------------------------------QEVSSPSLPVDN

Query:  ITGVG-------------------------------------------------------------------------------GRTPAKISILSLLLSS
        ITGVG                                                                               GRTPAKISILSLLLSS
Subjt:  ITGVG-------------------------------------------------------------------------------GRTPAKISILSLLLSS

Query:  ETHQNALFEALKQAFVPQDITVDNLSNVVGNITASSSITFTDEEIP-----------LEVSCWG---------------------------DHGYIRQEQ
        E H+NAL EALKQAFV QDITVDNLSNVVGNI ASS ITFTDEEIP           + V C                             D  ++R   
Subjt:  ETHQNALFEALKQAFVPQDITVDNLSNVVGNITASSSITFTDEEIP-----------LEVSCWG---------------------------DHGYIRQEQ

Query:  F-----------------HPLYI--------------------------------------RKLNLRVDQKLVIISGQEDILVSRLASMPYVEAAEEAFE
                           P+ I                                      +K+   VDQKLVIISGQEDILVSRLASMPYVEAAEEAFE
Subjt:  F-----------------HPLYI--------------------------------------RKLNLRVDQKLVIISGQEDILVSRLASMPYVEAAEEAFE

Query:  SSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKLLRMAKNTKKFGLGYKPSRGDIIRVWSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGI
        SSFQSFEIANATTLHGKFGRPKPRLLE AFK +N SLDKLLRMAKNT++FGLGYKP+RGDIIRV S+EKAKRLSRFEN ERDY RRTVPPLSHS RSAG 
Subjt:  SSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKLLRMAKNTKKFGLGYKPSRGDIIRVWSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGI

Query:  IHQEYDESSVVAGVTEEREQVGPFVYPCSDGFELSNWSVITEIECDNDSKYELDTPIYNIESDEEIDDEFSTELLRMLAEEEKMLGPHEELTETINLGSQ
        IHQEYDESSV A VTEEREQV PFVYPC DGF+LSNWSV TEIECDNDSKYELDTPIYNIESDEEIDDE S ELLRML EEEKMLGPHEELTET+NLGSQ
Subjt:  IHQEYDESSVVAGVTEEREQVGPFVYPCSDGFELSNWSVITEIECDNDSKYELDTPIYNIESDEEIDDEFSTELLRMLAEEEKMLGPHEELTETINLGSQ

Query:  AEAKEV
        AEAKE+
Subjt:  AEAKEV

XP_022155744.1 uncharacterized protein LOC111022791 [Momordica charantia]1.9e-12654.77Show/hide
Query:  RVQELLDSKVLTVTKSHLKKGTNVVEDILVAEGSSDSLKPKPLTIFYREKPDAPSCRQEVSSPSLPVDNITGVG--------------------------
        +VQELLDSK+LTV  SH KK TNVVEDILVAEGSSDS+KPKPLTIFYREKPDAPSCRQ+VSSPSLPVDNIT VG                          
Subjt:  RVQELLDSKVLTVTKSHLKKGTNVVEDILVAEGSSDSLKPKPLTIFYREKPDAPSCRQEVSSPSLPVDNITGVG--------------------------

Query:  --------------------------------------------------------------------GRTPAKISILSLLLSSETHQNALFEALKQAFV
                                                                            GRTPAKISILSLLLSSE H+NAL EALKQAFV
Subjt:  --------------------------------------------------------------------GRTPAKISILSLLLSSETHQNALFEALKQAFV

Query:  PQDITVDNLSNVVGNITASSSITFTDEEIP-----------LEVSCWG---------------------------DHGYIR-------------------
         QDITVDNLSNVVGNI  SSSITFT+EEIP           + V C                             D  ++R                   
Subjt:  PQDITVDNLSNVVGNITASSSITFTDEEIP-----------LEVSCWG---------------------------DHGYIR-------------------

Query:  ------------QEQFHPLYI------------------------RKLNLRVDQKLVIISGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHG
                       F  + I                        +K+   +DQKLVIISGQEDILVSRLASMPYVEA EEAFESSFQSFEI NATTLHG
Subjt:  ------------QEQFHPLYI------------------------RKLNLRVDQKLVIISGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHG

Query:  KFGRPKPRLLETAFKGDNASLDKLLRMAKNTKKFGLGYKPSRGDIIRVWSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGIIHQEYDESSVVAGVTE
        KFGRPK RLLETAFKGDN SLDKLLRMAKNTKKFGLGYK SRGDIIRV SLEKAKRLSRFENEERDYPRR VPPL+HSFRSAG IHQEYDESSVVA VTE
Subjt:  KFGRPKPRLLETAFKGDNASLDKLLRMAKNTKKFGLGYKPSRGDIIRVWSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGIIHQEYDESSVVAGVTE

Query:  EREQVGPFVYPCSDGFELSNWSVI
        EREQVGPFVY C DGFELSNWSVI
Subjt:  EREQVGPFVYPCSDGFELSNWSVI

TrEMBL top hitse value%identityAlignment
A0A6J1CNY7 Ribonuclease H1.8e-16256.07Show/hide
Query:  SGAKGHSLEQCNRFQKRVQELLDSKVLTVTKSHLKKGTNVVEDILV-----AEGSSDSLKPKPLTIFYREKPDAPSCR----------------------
        +GAKGH+LEQCN F+  VQELLDSK+LTV  SH KKG NVVED+ V     AEGSSD+LKPK LTIFY EKPDAP+C                       
Subjt:  SGAKGHSLEQCNRFQKRVQELLDSKVLTVTKSHLKKGTNVVEDILV-----AEGSSDSLKPKPLTIFYREKPDAPSCR----------------------

Query:  ---------QEVSSPSLPVDNITGVG--------------------------------------------------------------------------
                 Q+VSSP LPVDNITGVG                                                                          
Subjt:  ---------QEVSSPSLPVDNITGVG--------------------------------------------------------------------------

Query:  --------------------GRTPAKISILSLLLSSETHQNALFEALKQAFVPQDITVDNLSNVVGNITASSSITFTDEEIP-----------LEVSCWG
                            GRTPAKISILSLLLSSE H+N L E LKQAFV QDITVDNLSNVVGNITASSSITFTDEEIP           + V C  
Subjt:  --------------------GRTPAKISILSLLLSSETHQNALFEALKQAFVPQDITVDNLSNVVGNITASSSITFTDEEIP-----------LEVSCWG

Query:  ---------------------------DHGYIRQE----------------------QFHP----------------------LYI-----------RKL
                                   D  ++R                        Q  P                      L+I           +K+
Subjt:  ---------------------------DHGYIRQE----------------------QFHP----------------------LYI-----------RKL

Query:  NLRVDQKLVIISGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKLLRMAKNTKKFGLGYKPSRGDIIRV
           VDQKLVIISGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDN SLDKLLRMAKNTKKFGLGYKPSRGDIIRV
Subjt:  NLRVDQKLVIISGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKLLRMAKNTKKFGLGYKPSRGDIIRV

Query:  WSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGIIHQEYDESSVVAGVTEEREQVGPFVYPCSDGFELSNWSVITEIECDNDSKYELDTPIYNIESDE
         SLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAG IHQEYD SSVVA VTEEREQV PFVYPC DGFELSNWSV TEIECDNDSKYELDTPIYNIESD+
Subjt:  WSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGIIHQEYDESSVVAGVTEEREQVGPFVYPCSDGFELSNWSVITEIECDNDSKYELDTPIYNIESDE

Query:  EIDDEFSTELLRMLAEEEKMLGPHEELTETINLGSQAEAKEV
        EIDDE S ELLRML EEEKMLGPHEELTET+NLGSQAEAKE+
Subjt:  EIDDEFSTELLRMLAEEEKMLGPHEELTETINLGSQAEAKEV

A0A6J1D099 Ribonuclease H5.6e-16456.02Show/hide
Query:  GAKGHSLEQCNRFQKRVQELLDSKVLTVTKSHLKKGTNVVEDILVAEGSSDSLKPKPLTIFYREKPDAPSCR----------------------------
        GAKGHSLEQCN F+ +VQELLDSK+LT   SH KK TNVVEDILVAEGSSDSLKPKPLTIFYREKPDAPSC                             
Subjt:  GAKGHSLEQCNRFQKRVQELLDSKVLTVTKSHLKKGTNVVEDILVAEGSSDSLKPKPLTIFYREKPDAPSCR----------------------------

Query:  ---QEVSSPSLPVDNITGVG--------------------------------------------------------------------------------
           Q+VSSPSLPVDNITGVG                                                                                
Subjt:  ---QEVSSPSLPVDNITGVG--------------------------------------------------------------------------------

Query:  --------------GRTPAKISILSLLLSSETHQNALFEALKQAFVPQDITVDNLSNVVGNITASSSITFTDEEIP-----------LEVSCWG------
                      GRTPA ISILSLLLSSE HQNAL EALKQAFV QDITVDNLSNVVGNITASSSI+FTDEEIP           + V C        
Subjt:  --------------GRTPAKISILSLLLSSETHQNALFEALKQAFVPQDITVDNLSNVVGNITASSSITFTDEEIP-----------LEVSCWG------

Query:  ---------------------DHGYIRQE----------------------QFHPLYI---------------------------------RKLNLRVDQ
                             D  ++R                        Q  P                                    +K+   VDQ
Subjt:  ---------------------DHGYIRQE----------------------QFHPLYI---------------------------------RKLNLRVDQ

Query:  KLVIISGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKLLRMAKNTKKFGLGYKPSRGDIIRVWSLEKA
        KLVIISGQEDILVSR ASM YVE AEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDN SLDKLLRMAKNTKKFGLGYKPSRGDIIRV SLEKA
Subjt:  KLVIISGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKLLRMAKNTKKFGLGYKPSRGDIIRVWSLEKA

Query:  KRLSRFENEERDYPRRTVPPLSHSFRSAGIIHQEYDESSVVAGVTEEREQVGPFVYPCSDGFELSNWSVI------------TEIECDNDSKYELDTPIY
        KRLSRFENEERDYPRR VPPL+HSFRSAG IHQEYDESSVVA VTEEREQVGPFVY C DGFELSNWSVI            TEIECDNDSKYELDTPIY
Subjt:  KRLSRFENEERDYPRRTVPPLSHSFRSAGIIHQEYDESSVVAGVTEEREQVGPFVYPCSDGFELSNWSVI------------TEIECDNDSKYELDTPIY

Query:  NIESDEEIDDEFSTELLRMLAEEEKMLGPHEELTETINLGSQAEAKEV
         IESDEEIDDE S ELLRML EEEKMLGPHEELTET+NLGSQAEAKE+
Subjt:  NIESDEEIDDEFSTELLRMLAEEEKMLGPHEELTETINLGSQAEAKEV

A0A6J1D1K4 uncharacterized protein LOC1110161925.2e-17083.46Show/hide
Query:  GRTPAKISILSLLLSSETHQNALFEALKQAFVPQDITVDNLSNVVGNITASSSITFTDEEIPLE------------------------------------
        GRTPAKISILSLLLSSETHQNALFEALKQAFV QDITVDNLSNVVGNITASSSITFTDEEIPLE                                    
Subjt:  GRTPAKISILSLLLSSETHQNALFEALKQAFVPQDITVDNLSNVVGNITASSSITFTDEEIPLE------------------------------------

Query:  -VSCWGDHGYIRQEQFHPLYIRKLNLRVDQKLVIISGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKL
         VSCWGDHGYIRQEQFHPLYIRKLNLRVDQKLVII GQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKL
Subjt:  -VSCWGDHGYIRQEQFHPLYIRKLNLRVDQKLVIISGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKL

Query:  LRMAKNTKKFGLGYKPSRGDIIRVWSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGIIHQEYDESSVVAGVTEEREQVGPFVYPCSDGFELSNWSVI
        L MAKNTKKFGLGYKPSRGDIIRVWSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGIIHQEYDESSVVAGVTEEREQVGPFVYPC DGFELSNWSV+
Subjt:  LRMAKNTKKFGLGYKPSRGDIIRVWSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGIIHQEYDESSVVAGVTEEREQVGPFVYPCSDGFELSNWSVI

Query:  ------------TEIECDNDSKYELDTPIYNIESDEEIDDEFSTELLRMLAEEEKMLGPHEELTETINLGSQAEAKEVCLKASKKSK
                     EIECDNDSKYELDTPIYNIESDEEIDDEFS ELLRMLAEEEKMLGPHEELTETINLGSQAEAKEV +  +  S+
Subjt:  ------------TEIECDNDSKYELDTPIYNIESDEEIDDEFSTELLRMLAEEEKMLGPHEELTETINLGSQAEAKEVCLKASKKSK

A0A6J1D7C7 Ribonuclease H2.1e-15556.44Show/hide
Query:  RVQELLDSKVLTVTKSHLKKGTNVVEDILVAEGSSDSLKPKPLTIFYREKPDAPSCR-------------------------------QEVSSPSLPVDN
        +VQELLDSK+LTV  SH KK TNVVEDILVAEGSSDS+KPK LTIFYREKPDAPSC                                Q+VSSPSLPVDN
Subjt:  RVQELLDSKVLTVTKSHLKKGTNVVEDILVAEGSSDSLKPKPLTIFYREKPDAPSCR-------------------------------QEVSSPSLPVDN

Query:  ITGVG-------------------------------------------------------------------------------GRTPAKISILSLLLSS
        ITGVG                                                                               GRTPAKISILSLLLSS
Subjt:  ITGVG-------------------------------------------------------------------------------GRTPAKISILSLLLSS

Query:  ETHQNALFEALKQAFVPQDITVDNLSNVVGNITASSSITFTDEEIP-----------LEVSCWG---------------------------DHGYIRQEQ
        E H+NAL EALKQAFV QDITVDNLSNVVGNI ASS ITFTDEEIP           + V C                             D  ++R   
Subjt:  ETHQNALFEALKQAFVPQDITVDNLSNVVGNITASSSITFTDEEIP-----------LEVSCWG---------------------------DHGYIRQEQ

Query:  F-----------------HPLYI--------------------------------------RKLNLRVDQKLVIISGQEDILVSRLASMPYVEAAEEAFE
                           P+ I                                      +K+   VDQKLVIISGQEDILVSRLASMPYVEAAEEAFE
Subjt:  F-----------------HPLYI--------------------------------------RKLNLRVDQKLVIISGQEDILVSRLASMPYVEAAEEAFE

Query:  SSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKLLRMAKNTKKFGLGYKPSRGDIIRVWSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGI
        SSFQSFEIANATTLHGKFGRPKPRLLE AFK +N SLDKLLRMAKNT++FGLGYKP+RGDIIRV S+EKAKRLSRFEN ERDY RRTVPPLSHS RSAG 
Subjt:  SSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKLLRMAKNTKKFGLGYKPSRGDIIRVWSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGI

Query:  IHQEYDESSVVAGVTEEREQVGPFVYPCSDGFELSNWSVITEIECDNDSKYELDTPIYNIESDEEIDDEFSTELLRMLAEEEKMLGPHEELTETINLGSQ
        IHQEYDESSV A VTEEREQV PFVYPC DGF+LSNWSV TEIECDNDSKYELDTPIYNIESDEEIDDE S ELLRML EEEKMLGPHEELTET+NLGSQ
Subjt:  IHQEYDESSVVAGVTEEREQVGPFVYPCSDGFELSNWSVITEIECDNDSKYELDTPIYNIESDEEIDDEFSTELLRMLAEEEKMLGPHEELTETINLGSQ

Query:  AEAKEV
        AEAKE+
Subjt:  AEAKEV

A0A6J1DR66 uncharacterized protein LOC1110227919.3e-12754.77Show/hide
Query:  RVQELLDSKVLTVTKSHLKKGTNVVEDILVAEGSSDSLKPKPLTIFYREKPDAPSCRQEVSSPSLPVDNITGVG--------------------------
        +VQELLDSK+LTV  SH KK TNVVEDILVAEGSSDS+KPKPLTIFYREKPDAPSCRQ+VSSPSLPVDNIT VG                          
Subjt:  RVQELLDSKVLTVTKSHLKKGTNVVEDILVAEGSSDSLKPKPLTIFYREKPDAPSCRQEVSSPSLPVDNITGVG--------------------------

Query:  --------------------------------------------------------------------GRTPAKISILSLLLSSETHQNALFEALKQAFV
                                                                            GRTPAKISILSLLLSSE H+NAL EALKQAFV
Subjt:  --------------------------------------------------------------------GRTPAKISILSLLLSSETHQNALFEALKQAFV

Query:  PQDITVDNLSNVVGNITASSSITFTDEEIP-----------LEVSCWG---------------------------DHGYIR-------------------
         QDITVDNLSNVVGNI  SSSITFT+EEIP           + V C                             D  ++R                   
Subjt:  PQDITVDNLSNVVGNITASSSITFTDEEIP-----------LEVSCWG---------------------------DHGYIR-------------------

Query:  ------------QEQFHPLYI------------------------RKLNLRVDQKLVIISGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHG
                       F  + I                        +K+   +DQKLVIISGQEDILVSRLASMPYVEA EEAFESSFQSFEI NATTLHG
Subjt:  ------------QEQFHPLYI------------------------RKLNLRVDQKLVIISGQEDILVSRLASMPYVEAAEEAFESSFQSFEIANATTLHG

Query:  KFGRPKPRLLETAFKGDNASLDKLLRMAKNTKKFGLGYKPSRGDIIRVWSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGIIHQEYDESSVVAGVTE
        KFGRPK RLLETAFKGDN SLDKLLRMAKNTKKFGLGYK SRGDIIRV SLEKAKRLSRFENEERDYPRR VPPL+HSFRSAG IHQEYDESSVVA VTE
Subjt:  KFGRPKPRLLETAFKGDNASLDKLLRMAKNTKKFGLGYKPSRGDIIRVWSLEKAKRLSRFENEERDYPRRTVPPLSHSFRSAGIIHQEYDESSVVAGVTE

Query:  EREQVGPFVYPCSDGFELSNWSVI
        EREQVGPFVY C DGFELSNWSVI
Subjt:  EREQVGPFVYPCSDGFELSNWSVI

SwissProt top hitse value%identityAlignment
Q84WW5 Vesicle-associated protein 1-31.3e-6161.42Show/hide
Query:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIFANPPSPVPEGSEE
        +VKTTNP+KYCVRPN GV+LP  + N+TVTMQA KE+P DMQCKDKFL+Q+VV  DG  SK++  E+FNK  G+ ++++K+RVVYI ANPPSPVPEGSEE
Subjt:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIFANPPSPVPEGSEE

Query:  GSPPRTTGLEDGNQNFALFDAVSRSLEEPKEKSSQASSAISKLTEEKAAALQQSQKLRQELELLRKESNGRQAGGFSVLFVVLVGLIGVLIGYLVKK
        G+ P  +  +  +Q+ +LFD VSR+ EE  EKSS+A S ISKLTEEK +A QQSQKLR ELE+LRKE++ +Q+GG S+L ++LVGL+G +IGYL+ +
Subjt:  GSPPRTTGLEDGNQNFALFDAVSRSLEEPKEKSSQASSAISKLTEEKAAALQQSQKLRQELELLRKESNGRQAGGFSVLFVVLVGLIGVLIGYLVKK

Q8VZ95 Vesicle-associated protein 1-14.4e-4951.5Show/hide
Query:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIF-ANPPSPVPEGSE
        +VKTTNPKKYCVRPN GV+LP ST  + VTMQA KE+P DMQCKDKFL+Q V+A  GV +K+++PE+F+K  G  V+E K+RV Y+    PPSPV EGSE
Subjt:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIF-ANPPSPVPEGSE

Query:  EGSPPRTTGLEDGN-QNFALFDAVSRSLEEPKEKSSQASSAISKLTEEKAAALQQSQKLRQELELLRKESNGRQAGGFSVLFVVLVGLIGVLIGYLVKKT
        EGS PR +  ++G+   F+    +  +    +E +S+A + I+KLTEEK +A+Q + +L++EL+ LR+ES   Q+GG   ++V+LVGLIG+++GY++K+T
Subjt:  EGSPPRTTGLEDGN-QNFALFDAVSRSLEEPKEKSSQASSAISKLTEEKAAALQQSQKLRQELELLRKESNGRQAGGFSVLFVVLVGLIGVLIGYLVKKT

Q9LVU1 Vesicle-associated protein 2-11.1e-2842.21Show/hide
Query:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIFANPPSPVPEGSEE
        +VKTT+PKKY VRPN GVI P  +  I VT+QA +E PPDMQCKDKFL+QS + P      ++  + F K  GK + E K++V YI    PS     SE 
Subjt:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIFANPPSPVPEGSEE

Query:  GSPPRTTGLEDGNQNFALFDAVSRSLEEPKEKSSQASSAISKLTEEKAAALQQSQKLRQELELLRKESNGRQAG-GFSVLFVVLVGLIGVLIGYLVKKT
        G+   T G  DG                   +SS+  S I +L EE+ AA++Q+Q+L+ ELE +R+  N R +G G S+    +VGLIG++IG+++K T
Subjt:  GSPPRTTGLEDGNQNFALFDAVSRSLEEPKEKSSQASSAISKLTEEKAAALQQSQKLRQELELLRKESNGRQAG-GFSVLFVVLVGLIGVLIGYLVKKT

Q9SHC8 Vesicle-associated protein 1-22.2e-4851.5Show/hide
Query:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIF-ANPPSPVPEGSE
        +VKTTNPKKYCVRPN GV+ P S+S + VTMQA KE+P D+QCKDKFL+Q VVA  G   KD++ E+F+K  G  V+E K+RVVY+    PPSPV EGSE
Subjt:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIF-ANPPSPVPEGSE

Query:  EGSPPRTTGLEDGN-QNFALFDAVSRSLEEPKEKSSQASSAISKLTEEKAAALQQSQKLRQELELLRKESNGRQAGGFSVLFVVLVGLIGVLIGYLVKKT
        EGS PR +  ++GN  +F      S    + ++ SS+A + ++KLTEEK +A+Q + +L+QEL+ LR+ES   ++GG   ++V+LVGLIG+++GY++K+T
Subjt:  EGSPPRTTGLEDGN-QNFALFDAVSRSLEEPKEKSSQASSAISKLTEEKAAALQQSQKLRQELELLRKESNGRQAGGFSVLFVVLVGLIGVLIGYLVKKT

Q9SYC9 Vesicle-associated protein 1-49.9e-2545.32Show/hide
Query:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIF-ANPPSPVPEGSE
        + KTTN KKY VRPN GV+LP S+  + V MQA KE+P DMQC+DK L Q  V      +KD++ E+F+K  G   +E +++V+Y+    PPSPV EG+E
Subjt:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIF-ANPPSPVPEGSE

Query:  EGSPPRTTGLEDGNQNFALFDAVSRSLEEPKEKSSQASS
        EGS PR +  ++GN + A  D + RSL  P   ++ +S+
Subjt:  EGSPPRTTGLEDGNQNFALFDAVSRSLEEPKEKSSQASS

Arabidopsis top hitse value%identityAlignment
AT2G45140.1 plant VAP homolog 121.6e-4951.5Show/hide
Query:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIF-ANPPSPVPEGSE
        +VKTTNPKKYCVRPN GV+ P S+S + VTMQA KE+P D+QCKDKFL+Q VVA  G   KD++ E+F+K  G  V+E K+RVVY+    PPSPV EGSE
Subjt:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIF-ANPPSPVPEGSE

Query:  EGSPPRTTGLEDGN-QNFALFDAVSRSLEEPKEKSSQASSAISKLTEEKAAALQQSQKLRQELELLRKESNGRQAGGFSVLFVVLVGLIGVLIGYLVKKT
        EGS PR +  ++GN  +F      S    + ++ SS+A + ++KLTEEK +A+Q + +L+QEL+ LR+ES   ++GG   ++V+LVGLIG+++GY++K+T
Subjt:  EGSPPRTTGLEDGN-QNFALFDAVSRSLEEPKEKSSQASSAISKLTEEKAAALQQSQKLRQELELLRKESNGRQAGGFSVLFVVLVGLIGVLIGYLVKKT

AT3G60600.1 vesicle associated protein3.1e-5051.5Show/hide
Query:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIF-ANPPSPVPEGSE
        +VKTTNPKKYCVRPN GV+LP ST  + VTMQA KE+P DMQCKDKFL+Q V+A  GV +K+++PE+F+K  G  V+E K+RV Y+    PPSPV EGSE
Subjt:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIF-ANPPSPVPEGSE

Query:  EGSPPRTTGLEDGN-QNFALFDAVSRSLEEPKEKSSQASSAISKLTEEKAAALQQSQKLRQELELLRKESNGRQAGGFSVLFVVLVGLIGVLIGYLVKKT
        EGS PR +  ++G+   F+    +  +    +E +S+A + I+KLTEEK +A+Q + +L++EL+ LR+ES   Q+GG   ++V+LVGLIG+++GY++K+T
Subjt:  EGSPPRTTGLEDGN-QNFALFDAVSRSLEEPKEKSSQASSAISKLTEEKAAALQQSQKLRQELELLRKESNGRQAGGFSVLFVVLVGLIGVLIGYLVKKT

AT3G60600.2 vesicle associated protein2.4e-3460.53Show/hide
Query:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIF-ANPPSPVPEGSE
        +VKTTNPKKYCVRPN GV+LP ST  + VTMQA KE+P DMQCKDKFL+Q V+A  GV +K+++PE+F+K  G  V+E K+RV Y+    PPSPV EGSE
Subjt:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIF-ANPPSPVPEGSE

Query:  EGSPPRTTGLEDGN
        EGS PR +  ++G+
Subjt:  EGSPPRTTGLEDGN

AT3G60600.3 vesicle associated protein6.3e-3558.68Show/hide
Query:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIF-ANPPSPVPEGSE
        +VKTTNPKKYCVRPN GV+LP ST  + VTMQA KE+P DMQCKDKFL+Q V+A  GV +K+++PE+F+K  G  V+E K+RV Y+    PPSPV EGSE
Subjt:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIF-ANPPSPVPEGSE

Query:  EGSPPRTTGLEDGN-QNFALF
        EGS PR +  ++G+   F++F
Subjt:  EGSPPRTTGLEDGN-QNFALF

AT4G00170.1 Plant VAMP (vesicle-associated membrane protein) family protein9.4e-6361.42Show/hide
Query:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIFANPPSPVPEGSEE
        +VKTTNP+KYCVRPN GV+LP  + N+TVTMQA KE+P DMQCKDKFL+Q+VV  DG  SK++  E+FNK  G+ ++++K+RVVYI ANPPSPVPEGSEE
Subjt:  QVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAVDEYKMRVVYIFANPPSPVPEGSEE

Query:  GSPPRTTGLEDGNQNFALFDAVSRSLEEPKEKSSQASSAISKLTEEKAAALQQSQKLRQELELLRKESNGRQAGGFSVLFVVLVGLIGVLIGYLVKK
        G+ P  +  +  +Q+ +LFD VSR+ EE  EKSS+A S ISKLTEEK +A QQSQKLR ELE+LRKE++ +Q+GG S+L ++LVGL+G +IGYL+ +
Subjt:  GSPPRTTGLEDGNQNFALFDAVSRSLEEPKEKSSQASSAISKLTEEKAAALQQSQKLRQELELLRKESNGRQAGGFSVLFVVLVGLIGVLIGYLVKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATCAGTGGGCAGTGGAGTGCCTATCTGGGGCAAAAGGTCACTCCCTGGAGCAATGCAACCGTTTTCAAAAGAGAGTTCAAGAGTTGTTAGATTCAAAAGTC
CTTACTGTCACAAAATCTCACCTGAAGAAAGGGACAAATGTCGTGGAAGATATCTTGGTTGCTGAGGGCTCAAGTGATTCTCTTAAACCAAAACCTCTCACCATC
TTTTACCGTGAAAAGCCAGATGCACCCAGCTGCAGGCAAGAGGTATCATCTCCTTCACTCCCAGTTGATAATATCACCGGAGTTGGAGGTCGAACACCCGCAAAG
ATCTCTATATTATCTTTACTGTTATCCTCTGAAACGCATCAGAATGCACTGTTTGAGGCCTTGAAGCAGGCTTTCGTTCCACAAGACATCACAGTGGATAATTTG
AGCAACGTTGTGGGGAATATAACGGCATCTAGCTCAATCACTTTTACAGATGAGGAGATACCACTAGAGGTTTCTTGCTGGGGCGACCATGGATACATTCGGCAG
GAGCAGTTCCATCCACTTTACATCAGAAAATTAAATTTGCGGGTTGACCAAAAGTTAGTGATCATATCGGGACAAGAAGACATTCTAGTCTCAAGGCTTGCTTCG
ATGCCATATGTTGAAGCAGCAGAAGAAGCTTTTGAGTCTTCATTCCAATCATTCGAGATTGCAAATGCTACAACTTTACATGGGAAGTTTGGAAGACCTAAGCCA
CGACTTTTAGAGACCGCCTTTAAAGGAGACAATGCGAGCTTAGACAAACTGCTGAGGATGGCTAAGAATACAAAGAAGTTCGGGTTGGGGTATAAACCAAGTAGA
GGCGACATCATTAGAGTGTGGAGTCTGGAAAAAGCAAAACGACTCTCAAGATTTGAGAATGAGGAGCGTGATTACCCTAGAAGGACTGTTCCACCTCTCAGCCAC
TCTTTCAGAAGTGCCGGCATAATCCATCAGGAGTACGATGAGAGCTCTGTAGTGGCAGGAGTGACAGAAGAAAGAGAGCAAGTCGGACCTTTTGTCTACCCGTGC
TCGGATGGTTTCGAGCTGAGCAATTGGAGTGTTATTACTGAGATTGAATGCGATAATGATTCGAAATACGAGCTCGATACACCTATATACAATATCGAGTCTGAT
GAGGAAATAGATGACGAGTTCTCTACTGAGTTATTGAGAATGCTAGCAGAAGAAGAAAAGATGTTGGGACCCCATGAAGAATTAACTGAGACAATTAACTTGGGA
TCACAAGCCGAAGCCAAAGAGGTTTGTTTGAAAGCTTCCAAGAAAAGCAAGTGGTACTTGGATAGTGGTTGCTCAAGGCACATGACGGGAGACCAATTCAAGTTT
GTCACTCTGTCCAAAAAGGATGGAGGTTTTGTAACCTTTGGTGACGACAAGAAATGTAAAATAATAGATCTGGAATGTAATGGTTCAGGCTTTATAGATCTGAAC
GTTTCTAGACCATTTCGTTTAAGATCTAGAACGAAATTCAAGCGTGGTCGAATACAAGTAACAATGGAAGCTATGGTGGAGCCCACAAGAGTTATCCTACCATCT
TCTCATGATCTTGAGATTGTCTTACAAGTTAACCATGGACATAAGAAACCGGTGAAAGACATTATGGAAAACTACTTGTCCAAGAAGTCGCATAAAGAATTTCAT
GGTCGAGAAAACAGAATGTTATTGGGGGTCGACCTTGTTGGGCCTCTCCCAATGGACAAAGGACAAACTAAATTTGTCATCATCGTAGTAGATTATTCAAGGGAT
CAAATGGTTGTTTATGTACCTTTATCTTTGGAGGCTCAAGCGTTTCGTCCACCACCGGTCGTACAGGATCCATTGCTCAAACTGAATCACGCGAACAACTCACTG
AAGGAGGCTGGCAGTAGGAATGAAGGAGGCGGGAAGCTACCGCGCTTGTTACTTTACAATCTATTGCTCCACAATCTTGAAAGAGAACAGGTTAAAACCACCAAT
CCCAAGAAGTACTGCGTTCGTCCAAATGCTGGAGTCATTTTGCCCAATAGTACAAGCAATATTACAGTTACCATGCAAGCTCCGAAGGAGTCGCCTCCTGATATG
CAGTGCAAGGATAAGTTCTTAATTCAAAGTGTTGTAGCACCAGATGGTGTAGCATCAAAGGACATTAGTCCAGAATTGTTTAACAAGGGAGACGGTAAAGCAGTA
GATGAGTATAAGATGAGGGTTGTTTACATTTTTGCAAATCCTCCATCACCCGTCCCAGAAGGATCCGAAGAAGGATCTCCTCCCAGGACCACTGGGCTTGAAGAT
GGAAACCAAAATTTTGCATTGTTTGATGCTGTCTCTAGATCTCTGGAGGAGCCTAAAGAGAAATCATCACAGGCATCGTCTGCCATTTCTAAGTTGACCGAGGAG
AAAGCTGCTGCTTTGCAACAGAGTCAGAAACTACGCCAGGAGCTGGAACTTCTGAGGAAGGAATCGAACGGAAGGCAGGCTGGCGGTTTCTCGGTATTGTTCGTG
GTGCTAGTCGGTCTGATCGGAGTTCTGATCGGCTACTTGGTGAAGAAAACATAG
mRNA sequenceShow/hide mRNA sequence
ATGTATCAGTGGGCAGTGGAGTGCCTATCTGGGGCAAAAGGTCACTCCCTGGAGCAATGCAACCGTTTTCAAAAGAGAGTTCAAGAGTTGTTAGATTCAAAAGTC
CTTACTGTCACAAAATCTCACCTGAAGAAAGGGACAAATGTCGTGGAAGATATCTTGGTTGCTGAGGGCTCAAGTGATTCTCTTAAACCAAAACCTCTCACCATC
TTTTACCGTGAAAAGCCAGATGCACCCAGCTGCAGGCAAGAGGTATCATCTCCTTCACTCCCAGTTGATAATATCACCGGAGTTGGAGGTCGAACACCCGCAAAG
ATCTCTATATTATCTTTACTGTTATCCTCTGAAACGCATCAGAATGCACTGTTTGAGGCCTTGAAGCAGGCTTTCGTTCCACAAGACATCACAGTGGATAATTTG
AGCAACGTTGTGGGGAATATAACGGCATCTAGCTCAATCACTTTTACAGATGAGGAGATACCACTAGAGGTTTCTTGCTGGGGCGACCATGGATACATTCGGCAG
GAGCAGTTCCATCCACTTTACATCAGAAAATTAAATTTGCGGGTTGACCAAAAGTTAGTGATCATATCGGGACAAGAAGACATTCTAGTCTCAAGGCTTGCTTCG
ATGCCATATGTTGAAGCAGCAGAAGAAGCTTTTGAGTCTTCATTCCAATCATTCGAGATTGCAAATGCTACAACTTTACATGGGAAGTTTGGAAGACCTAAGCCA
CGACTTTTAGAGACCGCCTTTAAAGGAGACAATGCGAGCTTAGACAAACTGCTGAGGATGGCTAAGAATACAAAGAAGTTCGGGTTGGGGTATAAACCAAGTAGA
GGCGACATCATTAGAGTGTGGAGTCTGGAAAAAGCAAAACGACTCTCAAGATTTGAGAATGAGGAGCGTGATTACCCTAGAAGGACTGTTCCACCTCTCAGCCAC
TCTTTCAGAAGTGCCGGCATAATCCATCAGGAGTACGATGAGAGCTCTGTAGTGGCAGGAGTGACAGAAGAAAGAGAGCAAGTCGGACCTTTTGTCTACCCGTGC
TCGGATGGTTTCGAGCTGAGCAATTGGAGTGTTATTACTGAGATTGAATGCGATAATGATTCGAAATACGAGCTCGATACACCTATATACAATATCGAGTCTGAT
GAGGAAATAGATGACGAGTTCTCTACTGAGTTATTGAGAATGCTAGCAGAAGAAGAAAAGATGTTGGGACCCCATGAAGAATTAACTGAGACAATTAACTTGGGA
TCACAAGCCGAAGCCAAAGAGGTTTGTTTGAAAGCTTCCAAGAAAAGCAAGTGGTACTTGGATAGTGGTTGCTCAAGGCACATGACGGGAGACCAATTCAAGTTT
GTCACTCTGTCCAAAAAGGATGGAGGTTTTGTAACCTTTGGTGACGACAAGAAATGTAAAATAATAGATCTGGAATGTAATGGTTCAGGCTTTATAGATCTGAAC
GTTTCTAGACCATTTCGTTTAAGATCTAGAACGAAATTCAAGCGTGGTCGAATACAAGTAACAATGGAAGCTATGGTGGAGCCCACAAGAGTTATCCTACCATCT
TCTCATGATCTTGAGATTGTCTTACAAGTTAACCATGGACATAAGAAACCGGTGAAAGACATTATGGAAAACTACTTGTCCAAGAAGTCGCATAAAGAATTTCAT
GGTCGAGAAAACAGAATGTTATTGGGGGTCGACCTTGTTGGGCCTCTCCCAATGGACAAAGGACAAACTAAATTTGTCATCATCGTAGTAGATTATTCAAGGGAT
CAAATGGTTGTTTATGTACCTTTATCTTTGGAGGCTCAAGCGTTTCGTCCACCACCGGTCGTACAGGATCCATTGCTCAAACTGAATCACGCGAACAACTCACTG
AAGGAGGCTGGCAGTAGGAATGAAGGAGGCGGGAAGCTACCGCGCTTGTTACTTTACAATCTATTGCTCCACAATCTTGAAAGAGAACAGGTTAAAACCACCAAT
CCCAAGAAGTACTGCGTTCGTCCAAATGCTGGAGTCATTTTGCCCAATAGTACAAGCAATATTACAGTTACCATGCAAGCTCCGAAGGAGTCGCCTCCTGATATG
CAGTGCAAGGATAAGTTCTTAATTCAAAGTGTTGTAGCACCAGATGGTGTAGCATCAAAGGACATTAGTCCAGAATTGTTTAACAAGGGAGACGGTAAAGCAGTA
GATGAGTATAAGATGAGGGTTGTTTACATTTTTGCAAATCCTCCATCACCCGTCCCAGAAGGATCCGAAGAAGGATCTCCTCCCAGGACCACTGGGCTTGAAGAT
GGAAACCAAAATTTTGCATTGTTTGATGCTGTCTCTAGATCTCTGGAGGAGCCTAAAGAGAAATCATCACAGGCATCGTCTGCCATTTCTAAGTTGACCGAGGAG
AAAGCTGCTGCTTTGCAACAGAGTCAGAAACTACGCCAGGAGCTGGAACTTCTGAGGAAGGAATCGAACGGAAGGCAGGCTGGCGGTTTCTCGGTATTGTTCGTG
GTGCTAGTCGGTCTGATCGGAGTTCTGATCGGCTACTTGGTGAAGAAAACATAG
Protein sequenceShow/hide protein sequence
MYQWAVECLSGAKGHSLEQCNRFQKRVQELLDSKVLTVTKSHLKKGTNVVEDILVAEGSSDSLKPKPLTIFYREKPDAPSCRQEVSSPSLPVDNITGVGGRTPAK
ISILSLLLSSETHQNALFEALKQAFVPQDITVDNLSNVVGNITASSSITFTDEEIPLEVSCWGDHGYIRQEQFHPLYIRKLNLRVDQKLVIISGQEDILVSRLAS
MPYVEAAEEAFESSFQSFEIANATTLHGKFGRPKPRLLETAFKGDNASLDKLLRMAKNTKKFGLGYKPSRGDIIRVWSLEKAKRLSRFENEERDYPRRTVPPLSH
SFRSAGIIHQEYDESSVVAGVTEEREQVGPFVYPCSDGFELSNWSVITEIECDNDSKYELDTPIYNIESDEEIDDEFSTELLRMLAEEEKMLGPHEELTETINLG
SQAEAKEVCLKASKKSKWYLDSGCSRHMTGDQFKFVTLSKKDGGFVTFGDDKKCKIIDLECNGSGFIDLNVSRPFRLRSRTKFKRGRIQVTMEAMVEPTRVILPS
SHDLEIVLQVNHGHKKPVKDIMENYLSKKSHKEFHGRENRMLLGVDLVGPLPMDKGQTKFVIIVVDYSRDQMVVYVPLSLEAQAFRPPPVVQDPLLKLNHANNSL
KEAGSRNEGGGKLPRLLLYNLLLHNLEREQVKTTNPKKYCVRPNAGVILPNSTSNITVTMQAPKESPPDMQCKDKFLIQSVVAPDGVASKDISPELFNKGDGKAV
DEYKMRVVYIFANPPSPVPEGSEEGSPPRTTGLEDGNQNFALFDAVSRSLEEPKEKSSQASSAISKLTEEKAAALQQSQKLRQELELLRKESNGRQAGGFSVLFV
VLVGLIGVLIGYLVKKT