CuGenDBv2

Moc03g24250 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g24250
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: protein PHLOEM PROTEIN 2-LIKE A1-like
Genome location: chr3:17246535..17253795
RNA-Seq Expression: Moc03g24250
Synteny: Moc03g24250
Gene Ontology terms: NA
InterPro domains: IPR025886 - Phloem protein 2-like
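The genome location string can be converted into a locus length with a few lines of Python. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the convention used by GFF-style annotation); `parse_location` is a hypothetical helper name, not a CuGenDB function.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr3:17246535..17253795")
# Assumption: both endpoints are part of the gene span.
span = end - start + 1
print(chrom, span)
```

Under that assumption the locus covers 7,261 bp on chr3.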


Homology
GenBank top hits (accession and description; e value; % identity; alignment)
XP_008465530.1 PREDICTED: protein PHLOEM PROTEIN 2-LIKE A1-like [Cucumis melo]; e value 4.9e-53; identity 49.42%
Query:  MGSGWSEEQAAQSQP-PPPATGSAA------SHGGA---------------KVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT----
        MGSGWSEEQAAQ QP   PA  +AA      S G +               K+ E  +LGHG E ILK AD  VDRSS++KLH+QL+ GIFLNKRT    
Subjt:  MGSGWSEEQAAQSQP-PPPATGSAA------SHGGA---------------KVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT----

Query:  -------------------------------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQE
                                                               K+K  ELSPG  YEAAF VMIKDPAYGWD+PVNIR+K+PDGSKQE
Subjt:  -------------------------------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQE

Query:  RKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTKG
         +E++E++PRGRW EIPIG+F V+DH+ GGEIEF M+EYEGG WKKGM LKGVVIR+KG
Subjt:  RKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTKG

XP_022142433.1 lectin-like [Momordica charantia]; e value 4.3e-89; identity 74.9%
Query:  MGSGWSEEQAAQSQPPPPATGSAASHGGAKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT--------------------------
        MGSGWSEEQAAQSQPPPPATGSAASHGGAKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT                          
Subjt:  MGSGWSEEQAAQSQPPPPATGSAASHGGAKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT--------------------------

Query:  ---------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGRWVEIPIGDFT
                                         KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGRWVEIPIGDFT
Subjt:  ---------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGRWVEIPIGDFT

Query:  VQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTKGSI
        VQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTKGS+
Subjt:  VQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTKGSI

XP_022142434.1 lectin-like [Momordica charantia]; e value 6.7e-58; identity 68.95%
Query:  MGSGWSEEQAAQSQPPPPATGSAASHGGAKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT--------------------------
        MGSGWSEEQAAQSQPPPPATGSAASHGGAKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT                          
Subjt:  MGSGWSEEQAAQSQPPPPATGSAASHGGAKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT--------------------------

Query:  ---------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGR
                                         KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGR
Subjt:  ---------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGR

XP_023001597.1 lectin-like [Cucurbita maxima]; e value 6.5e-53; identity 52.85%
Query:  MGSGWSEEQAAQSQPPPPATGSAASHG----------GAKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT----------------
        MGSGWS E+  Q+    PA  SAA+            G+  AEVK L HGLEAILKDAD A+DRSS+DKLH QLHAGI LNK T                
Subjt:  MGSGWSEEQAAQSQPPPPATGSAASHG----------GAKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT----------------

Query:  -------------------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGR
                                                   KIK  ELSPG  YEAAF+VMI DP+YGWDVPVNIRLK+PDGSK+E +ED+E++PRG+
Subjt:  -------------------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGR

Query:  WVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTK
        W EIPIGDF V DH NGGEIEFSMYEYEGG WKKGM LK VVIRTK
Subjt:  WVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTK

XP_038895126.1 lectin-like [Benincasa hispida]; e value 1.9e-57; identity 52.49%
Query:  MGSGWSEEQAAQSQPP-PPATGSAASHGG---------------------AKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT----
        MGSGWSEEQ   +QPP  PAT SAA                          K+ EVK LGHG E ILKDAD  VDRSS+DKLH+QL+AGIFLNKRT    
Subjt:  MGSGWSEEQAAQSQPP-PPATGSAASHGG---------------------AKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT----

Query:  -------------------------------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQE
                                                               K+K  ELSPG  YEAAF VMIK+PAYGWD+PVNIRLK+PDGSKQE
Subjt:  -------------------------------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQE

Query:  RKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTKGSI
        RKE++E++PRG+WVEIPI DF V DH+ GGEIEFSMYEYEGG WKKGM LKGVVIR+KGSI
Subjt:  RKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTKGSI

TrEMBL top hits (accession and description; e value; % identity; alignment)
A0A0A0LYN2 Uncharacterized protein; e value 9.1e-53; identity 47.86%
Query:  MGSGWSEEQAAQSQPPPPATGSAA---SHGG-----------------AKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT------
        MG GWSEEQAAQ QP P A  +A     H G                  K+ E  ++GHG+E ILKDAD  VDRSS+DKL++QL+ GIFLNKRT      
Subjt:  MGSGWSEEQAAQSQPPPPATGSAA---SHGG-----------------AKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT------

Query:  -----------------------------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERK
                                                             K+K  ELSPG  YEAAF VMIKDP+YGWD+PVNIRL++PDGSKQE K
Subjt:  -----------------------------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERK

Query:  EDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTKG
        E++E++PRGRW EIPIGDF V DH+  GEI+FSM+EYEGG WKKG+ LKG+ IR+KG
Subjt:  EDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTKG

A0A1S3CQJ8 protein PHLOEM PROTEIN 2-LIKE A1-like; e value 2.4e-53; identity 49.42%
Query:  MGSGWSEEQAAQSQP-PPPATGSAA------SHGGA---------------KVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT----
        MGSGWSEEQAAQ QP   PA  +AA      S G +               K+ E  +LGHG E ILK AD  VDRSS++KLH+QL+ GIFLNKRT    
Subjt:  MGSGWSEEQAAQSQP-PPPATGSAA------SHGGA---------------KVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT----

Query:  -------------------------------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQE
                                                               K+K  ELSPG  YEAAF VMIKDPAYGWD+PVNIR+K+PDGSKQE
Subjt:  -------------------------------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQE

Query:  RKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTKG
         +E++E++PRGRW EIPIG+F V+DH+ GGEIEF M+EYEGG WKKGM LKGVVIR+KG
Subjt:  RKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTKG

A0A6J1CLI9 lectin-like; e value 3.2e-58; identity 68.95%
Query:  MGSGWSEEQAAQSQPPPPATGSAASHGGAKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT--------------------------
        MGSGWSEEQAAQSQPPPPATGSAASHGGAKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT                          
Subjt:  MGSGWSEEQAAQSQPPPPATGSAASHGGAKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT--------------------------

Query:  ---------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGR
                                         KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGR
Subjt:  ---------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGR

A0A6J1CN87 lectin-like; e value 2.1e-89; identity 74.9%
Query:  MGSGWSEEQAAQSQPPPPATGSAASHGGAKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT--------------------------
        MGSGWSEEQAAQSQPPPPATGSAASHGGAKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT                          
Subjt:  MGSGWSEEQAAQSQPPPPATGSAASHGGAKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT--------------------------

Query:  ---------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGRWVEIPIGDFT
                                         KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGRWVEIPIGDFT
Subjt:  ---------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGRWVEIPIGDFT

Query:  VQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTKGSI
        VQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTKGS+
Subjt:  VQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTKGSI

A0A6J1KH05 lectin-like; e value 3.1e-53; identity 52.85%
Query:  MGSGWSEEQAAQSQPPPPATGSAASHG----------GAKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT----------------
        MGSGWS E+  Q+    PA  SAA+            G+  AEVK L HGLEAILKDAD A+DRSS+DKLH QLHAGI LNK T                
Subjt:  MGSGWSEEQAAQSQPPPPATGSAASHG----------GAKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRT----------------

Query:  -------------------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGR
                                                   KIK  ELSPG  YEAAF+VMI DP+YGWDVPVNIRLK+PDGSK+E +ED+E++PRG+
Subjt:  -------------------------------------------KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGR

Query:  WVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTK
        W EIPIGDF V DH NGGEIEFSMYEYEGG WKKGM LK VVIRTK
Subjt:  WVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTK

SwissProt top hits (accession and description; e value; % identity; alignment)
C0HJV2 Lectin; e value 4.7e-22; identity 34.29%
Query:  GAKVAEVKQLGHGLEAILKDADSAV-DRSSMDKLHDQLHAGIFLNKRTK---------------------------------------------------
        G  V    ++GH LEAILK  D  V    S  KL+DQ+ AGIFLN RTK                                                   
Subjt:  GAKVAEVKQLGHGLEAILKDADSAV-DRSSMDKLHDQLHAGIFLNKRTK---------------------------------------------------

Query:  --------IKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGM
                I+ S LSPG  YEAAF VM+ + A GW +PV+++LK PDGS+QE + ++++KPRG W  I +G F +   +  G IEFS+ +++  + K+G+
Subjt:  --------IKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGM

Query:  FLKGVVIRTK
         +KG+VI+ K
Subjt:  FLKGVVIRTK

O81865 Protein PHLOEM PROTEIN 2-LIKE A1; e value 2.5e-23; identity 34.33%
Query:  HGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRTKIK------------------------------------------------------------A
        H  EAIL+DAD  +  SS++ L +QL +G+FL  + +IK                                                             
Subjt:  HGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRTKIK------------------------------------------------------------A

Query:  SELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSK--QERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRT
          L+PG  YE  F V ++DPAYGWD PVN++L  P+G +  QE+K  + E PR +WV++ +G+F V +    GEI FSMYE+  G WKKG+ LKGV IR 
Subjt:  SELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSK--QERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRT

Query:  K
        K
Subjt:  K

O81866 Protein PHLOEM PROTEIN 2-LIKE A2; e value 1.3e-19; identity 44.66%
Query:  KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVI
        K +  +L+P + YE  FVV + D A GWD  VN +L  P G  +ER+E++    R +WVEIP G+F +      G+IEFSM E +  QWK G+ +KGV I
Subjt:  KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVI

Query:  RTK
        R K
Subjt:  RTK

Q9C8U9 Uncharacterized protein PHLOEM PROTEIN 2-LIKE A4; e value 3.1e-18; identity 43.65%
Query:  SMDKLHDQ--LHAGIFLNKRTKIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSK--QERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEI
        S +KL D   L A  +L+   K    EL+    YE  +VV ++D A GW++PVN++L  PDG K  QER   ++E    RW++I  G+F V   DN GEI
Subjt:  SMDKLHDQ--LHAGIFLNKRTKIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSK--QERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEI

Query:  EFSMYEYEGGQWKKGMFLKGVVIRTK
         FSMYE +   WK+G+F+K V IR K
Subjt:  EFSMYEYEGGQWKKGMFLKGVVIRTK

Q9FHE8 Protein PHLOEM PROTEIN 2-LIKE A6; e value 8.9e-13; identity 40.38%
Query:  IKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLK----RPDGSKQERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKG
        I    L+PGA YEA FVV +++ A GW+ PVN++LK      D  + +R E++ +     WV+I  G F V        I F+MY+YE    KKG+ +KG
Subjt:  IKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLK----RPDGSKQERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKG

Query:  VVIR
        V IR
Subjt:  VVIR
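The SwissProt hits are not listed in order of significance, so it can help to rank them by e value. The snippet below is a small sketch that does this for the five SwissProt entries; the tuple layout and the `rank_by_evalue` helper are illustrative assumptions, and the numbers are the ones reported in the hit list.

```python
# (accession, e value, % identity) as reported for the SwissProt hits
swissprot_hits = [
    ("C0HJV2", "4.7e-22", 34.29),
    ("O81865", "2.5e-23", 34.33),
    ("O81866", "1.3e-19", 44.66),
    ("Q9C8U9", "3.1e-18", 43.65),
    ("Q9FHE8", "8.9e-13", 40.38),
]

def rank_by_evalue(hits):
    """Return hits sorted from smallest (most significant) to largest e value."""
    return sorted(hits, key=lambda hit: float(hit[1]))

ranked = rank_by_evalue(swissprot_hits)
for accession, evalue, identity in ranked:
    print(f"{accession}\t{evalue}\t{identity:.2f}")
```

Ranked this way, O81865 (PHLOEM PROTEIN 2-LIKE A1) comes first even though O81866 has the highest percent identity, which is expected when the alignments cover regions of different length.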

Arabidopsis top hits (accession and description; e value; % identity; alignment)
AT1G33920.1 phloem protein 2-A4; e value 2.2e-19; identity 43.65%
Query:  SMDKLHDQ--LHAGIFLNKRTKIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSK--QERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEI
        S +KL D   L A  +L+   K    EL+    YE  +VV ++D A GW++PVN++L  PDG K  QER   ++E    RW++I  G+F V   DN GEI
Subjt:  SMDKLHDQ--LHAGIFLNKRTKIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSK--QERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEI

Query:  EFSMYEYEGGQWKKGMFLKGVVIRTK
         FSMYE +   WK+G+F+K V IR K
Subjt:  EFSMYEYEGGQWKKGMFLKGVVIRTK

AT1G65390.1 phloem protein 2 A5; e value 1.1e-13; identity 37.07%
Query:  LHAGIFLNKRTKIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPD--GSKQERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGG
        L +  +L+   K     L+P   YE  FVV + +  + W+  V ++L  P+     QE+  DM +    +W++IP+G+FT     N GEI F+MYE+E  
Subjt:  LHAGIFLNKRTKIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPD--GSKQERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGG

Query:  QWKKGMFLKGVVIRTK
         WK G+F+KGV IR K
Subjt:  QWKKGMFLKGVVIRTK

AT4G19840.1 phloem protein 2-A1; e value 1.8e-24; identity 34.33%
Query:  HGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRTKIK------------------------------------------------------------A
        H  EAIL+DAD  +  SS++ L +QL +G+FL  + +IK                                                             
Subjt:  HGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRTKIK------------------------------------------------------------A

Query:  SELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSK--QERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRT
          L+PG  YE  F V ++DPAYGWD PVN++L  P+G +  QE+K  + E PR +WV++ +G+F V +    GEI FSMYE+  G WKKG+ LKGV IR 
Subjt:  SELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSK--QERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRT

Query:  K
        K
Subjt:  K

AT4G19850.1 lectin-related; e value 9.1e-21; identity 44.66%
Query:  KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVI
        K +  +L+P + YE  FVV + D A GWD  VN +L  P G  +ER+E++    R +WVEIP G+F +      G+IEFSM E +  QWK G+ +KGV I
Subjt:  KIKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLKRPDGSKQERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVI

Query:  RTK
        R K
Subjt:  RTK

AT5G45080.1 phloem protein 2-A6; e value 6.3e-14; identity 40.38%
Query:  IKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLK----RPDGSKQERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKG
        I    L+PGA YEA FVV +++ A GW+ PVN++LK      D  + +R E++ +     WV+I  G F V        I F+MY+YE    KKG+ +KG
Subjt:  IKASELSPGAWYEAAFVVMIKDPAYGWDVPVNIRLK----RPDGSKQERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKG

Query:  VVIR
        V IR
Subjt:  VVIR


Sequences
CDS sequence
ATGGGCTCTGGGTGGTCAGAAGAGCAGGCTGCACAGTCGCAGCCGCCACCGCCAGCCACTGGCAGCGCCGCCAGCCACGGCGGAGCAAAGGTGGCAGAAGTGAAG
CAACTGGGTCATGGGTTGGAAGCTATTCTGAAAGATGCAGACTCCGCAGTGGACAGATCCTCCATGGATAAGCTTCATGATCAACTCCATGCTGGAATCTTCTTG
AACAAAAGAACAAAGATCAAGGCATCTGAGCTCTCACCAGGAGCATGGTACGAGGCAGCATTTGTGGTGATGATCAAAGATCCAGCCTACGGATGGGATGTTCCA
GTGAACATAAGGCTCAAGAGGCCAGATGGGAGCAAGCAAGAGCGCAAAGAAGACATGGAGGAGAAGCCACGAGGGCGGTGGGTCGAGATCCCGATAGGCGATTTC
ACGGTACAAGATCATGACAACGGTGGCGAGATCGAGTTTAGCATGTATGAATATGAAGGAGGGCAATGGAAGAAGGGGATGTTCCTCAAAGGTGTTGTCATTCGA
ACCAAGGGATCAATATTGGAAGTGGTGCACACGTTGGTTGACTCCCTGTCCATTTCGGACCGGTTTACGTACTTTACCGAGCTCGCGCCCGTGCTTGTTTCTACT
CAGCCGAGCTTATCTCCGCCATCGCCTGAGCTGGGTGTCCAGTTCTTCAATTCGGGTCTCGCAGACCGCATTACTCTAAGCCACATAAAGCCATTGTATCCTCTA
GCTGCACTTACATTGAATGAAGTGGGAGATGTTAAGGACAACGGTACCCAAGTACCCCACCTTATCTCCACAATGGGACAAGTACTAACTGAAGGTTGTGAAAAG
AAGGGACTATATGAACTGCAAACACTGCAGCCGAAAATTGAAACGGGAGCCTTTGTTGAACCTGCTGTTTTTGTTGCATCTACTGCTATTTCATTAAAGACAGCT
TCTCATGTTACAAGGAAGAAGGATCACTTAGCAAGGACATTCTTGGCATTCATACCAATAGAACAAGAGTCAATCAATGTGTGGGCATATACAATGGCTTTCGAC
CAGGCTGCAAACTCTTACTCAGAAGGCACAAAATGGTCAGACAACGAGAATGAATTAGGTGTTACTGATGCTGGAGTTAATACACCTCCCTTAGAAGCATTGCAA
ATTGAACCACACAATGTGCCGTCAGAGACAGCAGCCACCACCAGCTTGGACTTCTCTACTACTCAAGAAGCAGATCCTACACTGACCAACTCAGAAACTGATACA
GATTTGCTTTCTTCAACTCTAAATACTACTCCGCCTTGCTCTAACTTAATAAATTGTGAGTCTCTCCCTGATATCAGCAATAGTATTTACATTGAACTTCCCTTC
TCTGATGCAGAAAATACTCACACACCTACAGATCCACCAACTCACTCACAAGATAACAGCACTCACAACATGGTCACAAGAGGGAACTGGCTAAAAATCCTCAAC
TTGATCCAGTCCTATCCAAGGAAAAGCTCATCACCTGAAGGTTGCCAAGAATCACTCATATATTGCTGA
mRNA sequence
ATGGGCTCTGGGTGGTCAGAAGAGCAGGCTGCACAGTCGCAGCCGCCACCGCCAGCCACTGGCAGCGCCGCCAGCCACGGCGGAGCAAAGGTGGCAGAAGTGAAG
CAACTGGGTCATGGGTTGGAAGCTATTCTGAAAGATGCAGACTCCGCAGTGGACAGATCCTCCATGGATAAGCTTCATGATCAACTCCATGCTGGAATCTTCTTG
AACAAAAGAACAAAGATCAAGGCATCTGAGCTCTCACCAGGAGCATGGTACGAGGCAGCATTTGTGGTGATGATCAAAGATCCAGCCTACGGATGGGATGTTCCA
GTGAACATAAGGCTCAAGAGGCCAGATGGGAGCAAGCAAGAGCGCAAAGAAGACATGGAGGAGAAGCCACGAGGGCGGTGGGTCGAGATCCCGATAGGCGATTTC
ACGGTACAAGATCATGACAACGGTGGCGAGATCGAGTTTAGCATGTATGAATATGAAGGAGGGCAATGGAAGAAGGGGATGTTCCTCAAAGGTGTTGTCATTCGA
ACCAAGGGATCAATATTGGAAGTGGTGCACACGTTGGTTGACTCCCTGTCCATTTCGGACCGGTTTACGTACTTTACCGAGCTCGCGCCCGTGCTTGTTTCTACT
CAGCCGAGCTTATCTCCGCCATCGCCTGAGCTGGGTGTCCAGTTCTTCAATTCGGGTCTCGCAGACCGCATTACTCTAAGCCACATAAAGCCATTGTATCCTCTA
GCTGCACTTACATTGAATGAAGTGGGAGATGTTAAGGACAACGGTACCCAAGTACCCCACCTTATCTCCACAATGGGACAAGTACTAACTGAAGGTTGTGAAAAG
AAGGGACTATATGAACTGCAAACACTGCAGCCGAAAATTGAAACGGGAGCCTTTGTTGAACCTGCTGTTTTTGTTGCATCTACTGCTATTTCATTAAAGACAGCT
TCTCATGTTACAAGGAAGAAGGATCACTTAGCAAGGACATTCTTGGCATTCATACCAATAGAACAAGAGTCAATCAATGTGTGGGCATATACAATGGCTTTCGAC
CAGGCTGCAAACTCTTACTCAGAAGGCACAAAATGGTCAGACAACGAGAATGAATTAGGTGTTACTGATGCTGGAGTTAATACACCTCCCTTAGAAGCATTGCAA
ATTGAACCACACAATGTGCCGTCAGAGACAGCAGCCACCACCAGCTTGGACTTCTCTACTACTCAAGAAGCAGATCCTACACTGACCAACTCAGAAACTGATACA
GATTTGCTTTCTTCAACTCTAAATACTACTCCGCCTTGCTCTAACTTAATAAATTGTGAGTCTCTCCCTGATATCAGCAATAGTATTTACATTGAACTTCCCTTC
TCTGATGCAGAAAATACTCACACACCTACAGATCCACCAACTCACTCACAAGATAACAGCACTCACAACATGGTCACAAGAGGGAACTGGCTAAAAATCCTCAAC
TTGATCCAGTCCTATCCAAGGAAAAGCTCATCACCTGAAGGTTGCCAAGAATCACTCATATATTGCTGA
Protein sequence
MGSGWSEEQAAQSQPPPPATGSAASHGGAKVAEVKQLGHGLEAILKDADSAVDRSSMDKLHDQLHAGIFLNKRTKIKASELSPGAWYEAAFVVMIKDPAYGWDVP
VNIRLKRPDGSKQERKEDMEEKPRGRWVEIPIGDFTVQDHDNGGEIEFSMYEYEGGQWKKGMFLKGVVIRTKGSILEVVHTLVDSLSISDRFTYFTELAPVLVST
QPSLSPPSPELGVQFFNSGLADRITLSHIKPLYPLAALTLNEVGDVKDNGTQVPHLISTMGQVLTEGCEKKGLYELQTLQPKIETGAFVEPAVFVASTAISLKTA
SHVTRKKDHLARTFLAFIPIEQESINVWAYTMAFDQAANSYSEGTKWSDNENELGVTDAGVNTPPLEALQIEPHNVPSETAATTSLDFSTTQEADPTLTNSETDT
DLLSSTLNTTPPCSNLINCESLPDISNSIYIELPFSDAENTHTPTDPPTHSQDNSTHNMVTRGNWLKILNLIQSYPRKSSSPEGCQESLIYC
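The CDS and protein sequences can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, assuming the standard nuclear code (NCBI translation table 1) applies and that the CDS starts in frame; `translate` is a hypothetical helper written for this illustration, and only the first 30 nt of the CDS are used in the example.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 nt of the CDS sequence listed above.
cds_start = "ATGGGCTCTGGGTGGTCAGAAGAGCAGGCT"
print(translate(cds_start))
```

The translated prefix matches the first ten residues of the protein sequence (MGSGWSEEQA). Passing the full CDS, with its lines joined, should reproduce the full protein if the assumptions hold.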