; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g21480 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g21480
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionActin cytoskeleton-regulatory complex protein PAN1-like
Genome locationchr7:15708322..15709344
RNA-Seq ExpressionMoc07g21480
SyntenyMoc07g21480
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004246107.1 uncharacterized protein LOC101258478 [Solanum lycopersicum]2.4e-4851.67Show/hide
Query:  TVSLSLSQAEHYPPPPPPPPPPPPRFPSPIPNPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLT
        T+SLS S +    PPPPPPPPPPP           P  R R L    KSETIP PYPWAT HRA I SL+ L  N I  I GE++C++C    ++ F+L 
Subjt:  TVSLSLSQAEHYPPPPPPPPPPPPRFPSPIPNPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLT

Query:  EKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEP-VTGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCK
        +KF +V SFIS NK  MHQRAP  W  P   +C  C  E   +P ++ KK+ +NW+FLLLGQ IGF +L+ LKY CKH   HRTGAKDR++Y  Y+CLC+
Subjt:  EKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEP-VTGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCK

Query:  QLHPTGPYD
        QL  TGP+D
Subjt:  QLHPTGPYD

XP_006480845.1 uncharacterized protein LOC102624229 [Citrus sinensis]2.4e-4853.67Show/hide
Query:  NPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQ
        N  +P+ + RP    GK+ETIP P+PWAT  RA + SL+ LT + + KI GE++CK+C    ++E++L  KFMEV SFIS NK  MH RAP  W  P   
Subjt:  NPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQ

Query:  DCGCCSGEGCTEPVTGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD
        +C  C    C +P+ GKK+ +NWLFLLLGQM+G   L  LKY CKHTRNHRTGAKDR++Y+ Y+ LCKQL P G YD
Subjt:  DCGCCSGEGCTEPVTGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD

XP_016552700.1 PREDICTED: uncharacterized protein LOC107852155 [Capsicum annuum]4.2e-4851.67Show/hide
Query:  TVSLSLSQAEHYPPPPPPPPPPPPRFPSPIPNPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLT
        T+SLS   A     PPPPPPPPP     P+  P     R R L    KS+TI PPYPWAT  RA + SL+ L  NG+  I GE++CKKC    ++ FNL 
Subjt:  TVSLSLSQAEHYPPPPPPPPPPPPRFPSPIPNPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLT

Query:  EKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-TGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCK
        EKF +V SFI++NK+ MH RAP  W  P    C  C  E   +PV   KK+ +NWLFLLLGQ IG  +LE LKY CKH +NHRTGAKDR++Y+ Y+ LC+
Subjt:  EKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-TGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCK

Query:  QLHPTGPYD
        QL P+GP+D
Subjt:  QLHPTGPYD

XP_022157692.1 uncharacterized protein LOC111024349 [Momordica charantia]1.9e-202100Show/hide
Query:  MHRQGFDQNLPLRNGQEEDDGGGGNDNPPLDLSLSLSLSGYNPPPLPQPLPQSLTPNPALLHTQLLSLADYNLMHFSSPSSYSTPIPIAPMMWDRQLFFG
        MHRQGFDQNLPLRNGQEEDDGGGGNDNPPLDLSLSLSLSGYNPPPLPQPLPQSLTPNPALLHTQLLSLADYNLMHFSSPSSYSTPIPIAPMMWDRQLFFG
Subjt:  MHRQGFDQNLPLRNGQEEDDGGGGNDNPPLDLSLSLSLSGYNPPPLPQPLPQSLTPNPALLHTQLLSLADYNLMHFSSPSSYSTPIPIAPMMWDRQLFFG

Query:  HSHPLPPYPSSLSPAAPMQQGWQLLQWEKGETVSLSLSQAEHYPPPPPPPPPPPPRFPSPIPNPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSL
        HSHPLPPYPSSLSPAAPMQQGWQLLQWEKGETVSLSLSQAEHYPPPPPPPPPPPPRFPSPIPNPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSL
Subjt:  HSHPLPPYPSSLSPAAPMQQGWQLLQWEKGETVSLSLSQAEHYPPPPPPPPPPPPRFPSPIPNPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSL

Query:  DDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGKKREMNWLFLLLGQMIGFSSLE
        DDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGKKREMNWLFLLLGQMIGFSSLE
Subjt:  DDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGKKREMNWLFLLLGQMIGFSSLE

Query:  HLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYDL
        HLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYDL
Subjt:  HLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYDL

XP_039002770.1 uncharacterized protein LOC120129312 [Hibiscus syriacus]2.4e-4856.44Show/hide
Query:  GKSETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-
        GKSETI PPYPWAT H+A +  L  L  NGI  I G+++CK+C    +MEF+L EKF E+  +I+ NK  MH RAP  W  P    C  C  E   +PV 
Subjt:  GKSETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-

Query:  TGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD
        + KK+ +NWLFLLLGQMIG  +LEHLKY CKHT  HRT AKDR++Y+AY+CLCKQL PT P+D
Subjt:  TGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD

TrEMBL top hitse value%identityAlignment
A0A1U8FAH5 uncharacterized protein LOC1078521552.0e-4851.67Show/hide
Query:  TVSLSLSQAEHYPPPPPPPPPPPPRFPSPIPNPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLT
        T+SLS   A     PPPPPPPPP     P+  P     R R L    KS+TI PPYPWAT  RA + SL+ L  NG+  I GE++CKKC    ++ FNL 
Subjt:  TVSLSLSQAEHYPPPPPPPPPPPPRFPSPIPNPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLT

Query:  EKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-TGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCK
        EKF +V SFI++NK+ MH RAP  W  P    C  C  E   +PV   KK+ +NWLFLLLGQ IG  +LE LKY CKH +NHRTGAKDR++Y+ Y+ LC+
Subjt:  EKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-TGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCK

Query:  QLHPTGPYD
        QL P+GP+D
Subjt:  QLHPTGPYD

A0A2H5NYI9 Uncharacterized protein1.2e-4853.67Show/hide
Query:  NPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQ
        N  +P+ + RP    GK+ETIP P+PWAT  RA + SL+ LT + + KI GE++CK+C    ++E++L  KFMEV SFIS NK  MH RAP  W  P   
Subjt:  NPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQ

Query:  DCGCCSGEGCTEPVTGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD
        +C  C    C +P+ GKK+ +NWLFLLLGQM+G   L  LKY CKHTRNHRTGAKDR++Y+ Y+ LCKQL P G YD
Subjt:  DCGCCSGEGCTEPVTGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD

A0A3Q7HWE7 Uncharacterized protein1.2e-4851.67Show/hide
Query:  TVSLSLSQAEHYPPPPPPPPPPPPRFPSPIPNPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLT
        T+SLS S +    PPPPPPPPPPP           P  R R L    KSETIP PYPWAT HRA I SL+ L  N I  I GE++C++C    ++ F+L 
Subjt:  TVSLSLSQAEHYPPPPPPPPPPPPRFPSPIPNPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLT

Query:  EKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEP-VTGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCK
        +KF +V SFIS NK  MHQRAP  W  P   +C  C  E   +P ++ KK+ +NW+FLLLGQ IGF +L+ LKY CKH   HRTGAKDR++Y  Y+CLC+
Subjt:  EKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEP-VTGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCK

Query:  QLHPTGPYD
        QL  TGP+D
Subjt:  QLHPTGPYD

A0A6A3AEM8 Uncharacterized protein1.2e-4856.44Show/hide
Query:  GKSETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-
        GKSETI PPYPWAT H+A +  L  L  NGI  I G+++CK+C    +MEF+L EKF E+  +I+ NK  MH RAP  W  P    C  C  E   +PV 
Subjt:  GKSETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-

Query:  TGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD
        + KK+ +NWLFLLLGQMIG  +LEHLKY CKHT  HRT AKDR++Y+AY+CLCKQL PT P+D
Subjt:  TGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYD

A0A6J1DV57 uncharacterized protein LOC1110243499.2e-203100Show/hide
Query:  MHRQGFDQNLPLRNGQEEDDGGGGNDNPPLDLSLSLSLSGYNPPPLPQPLPQSLTPNPALLHTQLLSLADYNLMHFSSPSSYSTPIPIAPMMWDRQLFFG
        MHRQGFDQNLPLRNGQEEDDGGGGNDNPPLDLSLSLSLSGYNPPPLPQPLPQSLTPNPALLHTQLLSLADYNLMHFSSPSSYSTPIPIAPMMWDRQLFFG
Subjt:  MHRQGFDQNLPLRNGQEEDDGGGGNDNPPLDLSLSLSLSGYNPPPLPQPLPQSLTPNPALLHTQLLSLADYNLMHFSSPSSYSTPIPIAPMMWDRQLFFG

Query:  HSHPLPPYPSSLSPAAPMQQGWQLLQWEKGETVSLSLSQAEHYPPPPPPPPPPPPRFPSPIPNPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSL
        HSHPLPPYPSSLSPAAPMQQGWQLLQWEKGETVSLSLSQAEHYPPPPPPPPPPPPRFPSPIPNPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSL
Subjt:  HSHPLPPYPSSLSPAAPMQQGWQLLQWEKGETVSLSLSQAEHYPPPPPPPPPPPPRFPSPIPNPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSL

Query:  DDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGKKREMNWLFLLLGQMIGFSSLE
        DDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGKKREMNWLFLLLGQMIGFSSLE
Subjt:  DDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGKKREMNWLFLLLGQMIGFSSLE

Query:  HLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYDL
        HLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYDL
Subjt:  HLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYDL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein4.9e-3936.65Show/hide
Query:  LSLADYNLMHFSSPSSYSTPIPIAP--------MMWDRQLFFGHSHPLPPYPSSLSPAAPMQQGWQLLQWEKGETVSLSLSQAEHYPPPPPPPPPPPPRF
        LSL   +  + S      +P+PIAP          W     F  +  + P P   S   P+   W    +++       ++   H+ PP    PP     
Subjt:  LSLADYNLMHFSSPSSYSTPIPIAP--------MMWDRQLFFGHSHPLPPYPSSLSPAAPMQQGWQLLQWEKGETVSLSLSQAEHYPPPPPPPPPPPPRF

Query:  PSPIPNPTT---PMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPR
        P P+  P T    ++R R  +   KS+TI PP+PWAT  R  I+SL+ L  N I  I GE++C+ C    ++ +NL E+F EV  F    K +M  RA +
Subjt:  PSPIPNPTT---PMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPR

Query:  GWECPPRQDCGCCSGEGCTEPVTG-KKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHP
         W  P ++ C  C  E   +PV   +K ++NWLFLLLGQ +GF +LE LK  CKH++NHRTGAKDR++Y+ YM LCK L P
Subjt:  GWECPPRQDCGCCSGEGCTEPVTG-KKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHP

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)1.7e-3637.5Show/hide
Query:  PPYPSSLSPAAPMQQGWQLLQWEKGETVSLSLSQAEHYPPPPP--------PPPP-----PPPRFPSPIPNPTTPMWRGRPLLGLGKSET----------
        PP  +  SP  P      +  +  G    +  +QA     PPP        P  P     PPP+             RGRP  G  +  +          
Subjt:  PPYPSSLSPAAPMQQGWQLLQWEKGETVSLSLSQAEHYPPPPP--------PPPP-----PPPRFPSPIPNPTTPMWRGRPLLGLGKSET----------

Query:  -----IPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-
             I PPYPWAT+    I+S  DL+ N I  I G++ CK C     +E+NL EKF E+  +I +NK EM  RAP  W  P    C  C  E   +PV 
Subjt:  -----IPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-

Query:  TGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYDL
        + +K E+NWLFLLLGQM+G  +L+ L+Y C+    HRTG+KDR+VYI Y+ LCKQL P GP++L
Subjt:  TGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTGAKDRLVYIAYMCLCKQLHPTGPYDL

AT2G16190.2 FUNCTIONS IN: molecular_function unknown1.8e-2034.07Show/hide
Query:  PPYPSSLSPAAPMQQGWQLLQWEKGETVSLSLSQAEHYPPPPP--------PPPP-----PPPRFPSPIPNPTTPMWRGRPLLGLGKSET----------
        PP  +  SP  P      +  +  G    +  +QA     PPP        P  P     PPP+             RGRP  G  +  +          
Subjt:  PPYPSSLSPAAPMQQGWQLLQWEKGETVSLSLSQAEHYPPPPP--------PPPP-----PPPRFPSPIPNPTTPMWRGRPLLGLGKSET----------

Query:  -----IPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-
             I PPYPWAT+    I+S  DL+ N I  I G++ CK C     +E+NL EKF E+  +I +NK EM  RAP  W  P    C  C  E   +PV 
Subjt:  -----IPPPYPWATEHRAIIRSLDDLTRNGIEKIRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPV-

Query:  TGKKREMNWLFLLLGQMIGFSSLEHL
        + +K E+NWLFLLLGQM+G  +L+ L
Subjt:  TGKKREMNWLFLLLGQMIGFSSLEHL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATCGTCAAGGTTTTGATCAAAATCTTCCCCTTAGAAATGGCCAAGAGGAAGACGACGGTGGTGGCGGCAACGATAATCCTCCTCTTGATCTCTCACTATCA
CTCTCACTCTCCGGATACAATCCACCACCGCTCCCGCAGCCGCTGCCACAGTCGTTGACTCCAAATCCTGCCCTGCTGCACACGCAATTACTCTCGTTAGCCGAT
TACAATCTAATGCATTTCTCATCCCCGTCTTCGTACTCGACTCCAATCCCTATTGCTCCCATGATGTGGGACAGACAATTGTTCTTTGGACACTCTCATCCGCTC
CCACCATATCCATCGTCCCTAAGCCCTGCTGCGCCCATGCAGCAGGGCTGGCAATTACTCCAATGGGAGAAAGGTGAGACAGTGTCACTCTCACTCTCGCAAGCT
GAGCACTATCCTCCGCCCCCGCCCCCGCCGCCACCTCCACCACCGCGGTTCCCATCGCCAATTCCAAACCCTACCACTCCCATGTGGCGGGGCAGGCCATTACTC
GGGCTGGGAAAGAGCGAGACAATCCCGCCACCGTATCCATGGGCGACGGAGCACCGAGCAATCATACGCAGCCTGGACGATCTCACCCGAAACGGGATAGAGAAA
ATCAGAGGGGAAATGAAGTGCAAGAAGTGCGGAGTAGATAGCAAGATGGAATTCAATCTGACAGAGAAGTTCATGGAAGTAGAGAGTTTCATATCGATGAACAAG
TCGGAGATGCACCAGCGAGCCCCGAGGGGTTGGGAGTGCCCTCCGCGGCAGGACTGCGGCTGTTGCAGCGGAGAGGGCTGCACGGAGCCGGTGACGGGGAAGAAG
AGGGAGATGAATTGGCTGTTCTTGTTGCTAGGGCAAATGATTGGATTCTCTAGCTTAGAACATCTGAAATACTTGTGTAAGCACACGAGGAACCACAGGACAGGC
GCAAAAGACAGGCTTGTGTACATTGCCTACATGTGTTTGTGCAAACAACTTCATCCAACAGGACCTTATGATCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCATCGTCAAGGTTTTGATCAAAATCTTCCCCTTAGAAATGGCCAAGAGGAAGACGACGGTGGTGGCGGCAACGATAATCCTCCTCTTGATCTCTCACTATCA
CTCTCACTCTCCGGATACAATCCACCACCGCTCCCGCAGCCGCTGCCACAGTCGTTGACTCCAAATCCTGCCCTGCTGCACACGCAATTACTCTCGTTAGCCGAT
TACAATCTAATGCATTTCTCATCCCCGTCTTCGTACTCGACTCCAATCCCTATTGCTCCCATGATGTGGGACAGACAATTGTTCTTTGGACACTCTCATCCGCTC
CCACCATATCCATCGTCCCTAAGCCCTGCTGCGCCCATGCAGCAGGGCTGGCAATTACTCCAATGGGAGAAAGGTGAGACAGTGTCACTCTCACTCTCGCAAGCT
GAGCACTATCCTCCGCCCCCGCCCCCGCCGCCACCTCCACCACCGCGGTTCCCATCGCCAATTCCAAACCCTACCACTCCCATGTGGCGGGGCAGGCCATTACTC
GGGCTGGGAAAGAGCGAGACAATCCCGCCACCGTATCCATGGGCGACGGAGCACCGAGCAATCATACGCAGCCTGGACGATCTCACCCGAAACGGGATAGAGAAA
ATCAGAGGGGAAATGAAGTGCAAGAAGTGCGGAGTAGATAGCAAGATGGAATTCAATCTGACAGAGAAGTTCATGGAAGTAGAGAGTTTCATATCGATGAACAAG
TCGGAGATGCACCAGCGAGCCCCGAGGGGTTGGGAGTGCCCTCCGCGGCAGGACTGCGGCTGTTGCAGCGGAGAGGGCTGCACGGAGCCGGTGACGGGGAAGAAG
AGGGAGATGAATTGGCTGTTCTTGTTGCTAGGGCAAATGATTGGATTCTCTAGCTTAGAACATCTGAAATACTTGTGTAAGCACACGAGGAACCACAGGACAGGC
GCAAAAGACAGGCTTGTGTACATTGCCTACATGTGTTTGTGCAAACAACTTCATCCAACAGGACCTTATGATCTTTGA
Protein sequenceShow/hide protein sequence
MHRQGFDQNLPLRNGQEEDDGGGGNDNPPLDLSLSLSLSGYNPPPLPQPLPQSLTPNPALLHTQLLSLADYNLMHFSSPSSYSTPIPIAPMMWDRQLFFGHSHPL
PPYPSSLSPAAPMQQGWQLLQWEKGETVSLSLSQAEHYPPPPPPPPPPPPRFPSPIPNPTTPMWRGRPLLGLGKSETIPPPYPWATEHRAIIRSLDDLTRNGIEK
IRGEMKCKKCGVDSKMEFNLTEKFMEVESFISMNKSEMHQRAPRGWECPPRQDCGCCSGEGCTEPVTGKKREMNWLFLLLGQMIGFSSLEHLKYLCKHTRNHRTG
AKDRLVYIAYMCLCKQLHPTGPYDL