; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g01870 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g01870
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionprotein FAR1-RELATED SEQUENCE 4-like
Genome locationchr3:1406421..1409166
RNA-Seq ExpressionMoc03g01870
SyntenyMoc03g01870
Gene Ontology termsGO:0006310 - DNA recombination (biological process)
GO:0006508 - proteolysis (biological process)
GO:0032196 - transposition (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151004.1 uncharacterized protein LOC111019024 [Momordica charantia]5.0e-8772.2Show/hide
Query:  LKIMNPAKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVREYLDDIGKERWARCFQTQLRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRG-
        +K+   AKFK+DAKAVEELFLKAAK YRESYFNSI AQLRAY GVREYLDDIGKERWARCFQTQLRY QMTTNIAESVN+LFRHA KL VTALLDHIRG 
Subjt:  LKIMNPAKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVREYLDDIGKERWARCFQTQLRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRG-

Query:  -------------------------------------------AIMRNINPYSPCDEAYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPR
                                                   A+MRNINPYS CDEAYT NSWILAYAEPIFPVGHVSTWNSSPEFVNI VEPPKTVPR
Subjt:  -------------------------------------------AIMRNINPYSPCDEAYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPR

Query:  VRRQKTARIPSTGEVRQTRKCDRYGAWGHNRKTCSEPLTTL
        V R+KT RIPSTGEVRQTRKC R GAWGHNRKTCSEPL TL
Subjt:  VRRQKTARIPSTGEVRQTRKCDRYGAWGHNRKTCSEPLTTL

XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia]9.4e-10240.13Show/hide
Query:  EGHSEAECGNEEHDDALDEELEPDVEQVHTEIRRDEEAVRPPGCNGLIGHPNDEKLQLIVQSSGTNDVNEGDVFDTKKELSLKMHLVAMRMNF-------
        EG  EAE  N+++DDALDEE EPDVEQVH EI RDE AV+  GC+GL G  N E LQLIVQSSGTNDV EG+VFDTKKELSL+MHLV MR+NF       
Subjt:  EGHSEAECGNEEHDDALDEELEPDVEQVHTEIRRDEEAVRPPGCNGLIGHPNDEKLQLIVQSSGTNDVNEGDVFDTKKELSLKMHLVAMRMNF-------

Query:  ---------------------------------------------------------------------------------------SLNYDRAWRSNEE
                                                                                               +L+YD+AWRS+EE
Subjt:  ---------------------------------------------------------------------------------------SLNYDRAWRSNEE

Query:  ALRLIRGDPSSSYGILPAYGEALKIMNP------------------------------------------------------------------------
        ALRLIRGDP+SSYG+LP YGEALKIMNP                                                                        
Subjt:  ALRLIRGDPSSSYGILPAYGEALKIMNP------------------------------------------------------------------------

Query:  -----------------------------------------------------------AKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVRE
                                                                   AKFK DAKA+EELFLKAAK YRESYFNSI AQL AY GVRE
Subjt:  -----------------------------------------------------------AKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVRE

Query:  YLDDIGKERWARCFQTQLRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRG------------------------------------------------
        YLDDIGKERWARCFQT+LRY QMT+N AESVN+LFRHA KL VTALLDHIRG                                                
Subjt:  YLDDIGKERWARCFQTQLRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRG------------------------------------------------

Query:  ------------------------------------AIMRNINPYSPCDEAYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPRVRRQKTA
                                            A+MRNINPY+ CDEAYT NSW++AYAEPIFP+GHVSTWNSSP+FV+  VE P  VPRV R++T 
Subjt:  ------------------------------------AIMRNINPYSPCDEAYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPRVRRQKTA

Query:  RIPSTGEVRQTRKCDRYGAWGHNRKTCSEPLTT
        RIPSTGEVRQTRKC R G  GHN KTC+EPL T
Subjt:  RIPSTGEVRQTRKCDRYGAWGHNRKTCSEPLTT

XP_022154610.1 uncharacterized protein LOC111021833 [Momordica charantia]4.1e-8160.85Show/hide
Query:  LKIMNPAKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVREYLDDIGKERWARCFQTQLRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRG-
        +K+    KFK+DAKAVEELFLKA K YRESYFNSI AQLRAY GVREYLDDIGKERWARCFQTQLRY QMTTNIAESVN+ FRHA KL VTALLDHIRG 
Subjt:  LKIMNPAKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVREYLDDIGKERWARCFQTQLRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRG-

Query:  -----------------------------------------------------------------------------------AIMRNINPYSPCDEAYT
                                                                                           A+MRNINPYS CDEAYT
Subjt:  -----------------------------------------------------------------------------------AIMRNINPYSPCDEAYT

Query:  MNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPRVRRQKTARIPSTGEVRQTRKCDRYGAWGHNRKTCSEPLTTL
         NSWILAYAEPIFPVGHVSTWNSSPEFVNI VEPPKTVPRV R+ T RIPSTGEVRQTRKC R GAWGHNRKTCSEPLTTL
Subjt:  MNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPRVRRQKTARIPSTGEVRQTRKCDRYGAWGHNRKTCSEPLTTL

XP_022155970.1 uncharacterized protein LOC111022954 [Momordica charantia]1.5e-12847.51Show/hide
Query:  FIRDMMPRVFIIFDGEWNDSEKYYVGGHTRGLIVNSTITYGEF--LACIPSFQRPTYPIPSFHSSSSNPSSSRQPHPSYGHIGHDVEGLTPLGSDVVPCN
        F  +M  RVFI F GEWNDSEK YVGG  RGL V+S         +ACIPSFQRPT PIPSF SSSSNPSSS+QPH  YGH+GHD+ GLTPL SDVVPCN
Subjt:  FIRDMMPRVFIIFDGEWNDSEKYYVGGHTRGLIVNSTITYGEF--LACIPSFQRPTYPIPSFHSSSSNPSSSRQPHPSYGHIGHDVEGLTPLGSDVVPCN

Query:  LGDDRVCDWDVSGVWNDNEDESNESYDPLAEF-EGHSEAECGNEEHDDALDEELEPDVEQVHTEIRRDEEAVRPPGCNGLIGHPNDEKLQLIVQSSGTND
        LGDDRVC W++ G+WNDN+DES+ESYD L +  EG  EAE  N+++DDA DE+ EPDVEQV  EIRRDE  V   GC+GLIG PNDEKLQLIVQSSGTND
Subjt:  LGDDRVCDWDVSGVWNDNEDESNESYDPLAEF-EGHSEAECGNEEHDDALDEELEPDVEQVHTEIRRDEEAVRPPGCNGLIGHPNDEKLQLIVQSSGTND

Query:  VNEGDVFDTKKELSLKMHLVAMRMNF--------------------------------------------------------------------------
        V EG VFDTKKELSL+ HLVAM +NF                                                                          
Subjt:  VNEGDVFDTKKELSLKMHLVAMRMNF--------------------------------------------------------------------------

Query:  --------------------SLNYDRAWRSNEEALRLIRGDPSSSYGILPAYG---------------------------------------------EA
                            +L+YD+AW+S EEALRLIRGDP++SYG+LPAYG                                              A
Subjt:  --------------------SLNYDRAWRSNEEALRLIRGDPSSSYGILPAYG---------------------------------------------EA

Query:  LKIMNP------------------------------------AKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVREYLDDIGKERWARCFQTQ
        L ++N                                     AKFK DAKA+EELFLKAAK Y+ESYFNSI AQL AY G+REYLDDIGKERW RCFQT+
Subjt:  LKIMNP------------------------------------AKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVREYLDDIGKERWARCFQTQ

Query:  LRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRGAIMRNINPYSPCDEAYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPRVRRQKTAR
        LRY QMT+N AESVN+LFRHA  L VTALLDHIR                           EPIFP+ HVSTW SSP+FV+I  E P  VPRV ++++ R
Subjt:  LRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRGAIMRNINPYSPCDEAYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPRVRRQKTAR

Query:  IP
        IP
Subjt:  IP

XP_022156660.1 uncharacterized protein LOC111023509 [Momordica charantia]9.4e-8647.61Show/hide
Query:  SLNYDRAWRSNEEALRLIRGDPSSSYGILPAYGEALKIMNP-----------------------------------------------------------
        +L+YDRAWRS+EEALRLIR DP+SSYG+LPAYGEALKIM P                                                           
Subjt:  SLNYDRAWRSNEEALRLIRGDPSSSYGILPAYGEALKIMNP-----------------------------------------------------------

Query:  --------------------------------------------AKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVREYLDDIGKERWARCFQ
                                                    AKFKDD K VEELFLKAAK Y ESYFNSI AQLRAYLGVREYLDDIGKERWARCFQ
Subjt:  --------------------------------------------AKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVREYLDDIGKERWARCFQ

Query:  TQLRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRG---------------------------------------------------------------
        TQLRY QMTTNIAESVN+LFRHA KL VTALLDHIRG                                                               
Subjt:  TQLRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRG---------------------------------------------------------------

Query:  --------------------AIMRNINPYSPCDEAYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPRVRRQKTARIPSTGEVRQTRKCDR
                            A MRNINPYS CDEAYT NSWILAYAEPIFP+ ++STW+SSPEFVNI VEPPKTVPRV R+KTARIPS GEVRQ  KC R
Subjt:  --------------------AIMRNINPYSPCDEAYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPRVRRQKTARIPSTGEVRQTRKCDR

Query:  YGAWGHNRKTCSEPLTTL
         GAWGHNRKTCS+PLTTL
Subjt:  YGAWGHNRKTCSEPLTTL

TrEMBL top hitse value%identityAlignment
A0A6J1DBQ5 uncharacterized protein LOC1110190242.4e-8772.2Show/hide
Query:  LKIMNPAKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVREYLDDIGKERWARCFQTQLRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRG-
        +K+   AKFK+DAKAVEELFLKAAK YRESYFNSI AQLRAY GVREYLDDIGKERWARCFQTQLRY QMTTNIAESVN+LFRHA KL VTALLDHIRG 
Subjt:  LKIMNPAKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVREYLDDIGKERWARCFQTQLRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRG-

Query:  -------------------------------------------AIMRNINPYSPCDEAYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPR
                                                   A+MRNINPYS CDEAYT NSWILAYAEPIFPVGHVSTWNSSPEFVNI VEPPKTVPR
Subjt:  -------------------------------------------AIMRNINPYSPCDEAYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPR

Query:  VRRQKTARIPSTGEVRQTRKCDRYGAWGHNRKTCSEPLTTL
        V R+KT RIPSTGEVRQTRKC R GAWGHNRKTCSEPL TL
Subjt:  VRRQKTARIPSTGEVRQTRKCDRYGAWGHNRKTCSEPLTTL

A0A6J1DJT1 uncharacterized protein LOC1110207154.6e-10240.13Show/hide
Query:  EGHSEAECGNEEHDDALDEELEPDVEQVHTEIRRDEEAVRPPGCNGLIGHPNDEKLQLIVQSSGTNDVNEGDVFDTKKELSLKMHLVAMRMNF-------
        EG  EAE  N+++DDALDEE EPDVEQVH EI RDE AV+  GC+GL G  N E LQLIVQSSGTNDV EG+VFDTKKELSL+MHLV MR+NF       
Subjt:  EGHSEAECGNEEHDDALDEELEPDVEQVHTEIRRDEEAVRPPGCNGLIGHPNDEKLQLIVQSSGTNDVNEGDVFDTKKELSLKMHLVAMRMNF-------

Query:  ---------------------------------------------------------------------------------------SLNYDRAWRSNEE
                                                                                               +L+YD+AWRS+EE
Subjt:  ---------------------------------------------------------------------------------------SLNYDRAWRSNEE

Query:  ALRLIRGDPSSSYGILPAYGEALKIMNP------------------------------------------------------------------------
        ALRLIRGDP+SSYG+LP YGEALKIMNP                                                                        
Subjt:  ALRLIRGDPSSSYGILPAYGEALKIMNP------------------------------------------------------------------------

Query:  -----------------------------------------------------------AKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVRE
                                                                   AKFK DAKA+EELFLKAAK YRESYFNSI AQL AY GVRE
Subjt:  -----------------------------------------------------------AKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVRE

Query:  YLDDIGKERWARCFQTQLRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRG------------------------------------------------
        YLDDIGKERWARCFQT+LRY QMT+N AESVN+LFRHA KL VTALLDHIRG                                                
Subjt:  YLDDIGKERWARCFQTQLRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRG------------------------------------------------

Query:  ------------------------------------AIMRNINPYSPCDEAYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPRVRRQKTA
                                            A+MRNINPY+ CDEAYT NSW++AYAEPIFP+GHVSTWNSSP+FV+  VE P  VPRV R++T 
Subjt:  ------------------------------------AIMRNINPYSPCDEAYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPRVRRQKTA

Query:  RIPSTGEVRQTRKCDRYGAWGHNRKTCSEPLTT
        RIPSTGEVRQTRKC R G  GHN KTC+EPL T
Subjt:  RIPSTGEVRQTRKCDRYGAWGHNRKTCSEPLTT

A0A6J1DK35 uncharacterized protein LOC1110218332.0e-8160.85Show/hide
Query:  LKIMNPAKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVREYLDDIGKERWARCFQTQLRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRG-
        +K+    KFK+DAKAVEELFLKA K YRESYFNSI AQLRAY GVREYLDDIGKERWARCFQTQLRY QMTTNIAESVN+ FRHA KL VTALLDHIRG 
Subjt:  LKIMNPAKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVREYLDDIGKERWARCFQTQLRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRG-

Query:  -----------------------------------------------------------------------------------AIMRNINPYSPCDEAYT
                                                                                           A+MRNINPYS CDEAYT
Subjt:  -----------------------------------------------------------------------------------AIMRNINPYSPCDEAYT

Query:  MNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPRVRRQKTARIPSTGEVRQTRKCDRYGAWGHNRKTCSEPLTTL
         NSWILAYAEPIFPVGHVSTWNSSPEFVNI VEPPKTVPRV R+ T RIPSTGEVRQTRKC R GAWGHNRKTCSEPLTTL
Subjt:  MNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPRVRRQKTARIPSTGEVRQTRKCDRYGAWGHNRKTCSEPLTTL

A0A6J1DP00 uncharacterized protein LOC1110229547.4e-12947.51Show/hide
Query:  FIRDMMPRVFIIFDGEWNDSEKYYVGGHTRGLIVNSTITYGEF--LACIPSFQRPTYPIPSFHSSSSNPSSSRQPHPSYGHIGHDVEGLTPLGSDVVPCN
        F  +M  RVFI F GEWNDSEK YVGG  RGL V+S         +ACIPSFQRPT PIPSF SSSSNPSSS+QPH  YGH+GHD+ GLTPL SDVVPCN
Subjt:  FIRDMMPRVFIIFDGEWNDSEKYYVGGHTRGLIVNSTITYGEF--LACIPSFQRPTYPIPSFHSSSSNPSSSRQPHPSYGHIGHDVEGLTPLGSDVVPCN

Query:  LGDDRVCDWDVSGVWNDNEDESNESYDPLAEF-EGHSEAECGNEEHDDALDEELEPDVEQVHTEIRRDEEAVRPPGCNGLIGHPNDEKLQLIVQSSGTND
        LGDDRVC W++ G+WNDN+DES+ESYD L +  EG  EAE  N+++DDA DE+ EPDVEQV  EIRRDE  V   GC+GLIG PNDEKLQLIVQSSGTND
Subjt:  LGDDRVCDWDVSGVWNDNEDESNESYDPLAEF-EGHSEAECGNEEHDDALDEELEPDVEQVHTEIRRDEEAVRPPGCNGLIGHPNDEKLQLIVQSSGTND

Query:  VNEGDVFDTKKELSLKMHLVAMRMNF--------------------------------------------------------------------------
        V EG VFDTKKELSL+ HLVAM +NF                                                                          
Subjt:  VNEGDVFDTKKELSLKMHLVAMRMNF--------------------------------------------------------------------------

Query:  --------------------SLNYDRAWRSNEEALRLIRGDPSSSYGILPAYG---------------------------------------------EA
                            +L+YD+AW+S EEALRLIRGDP++SYG+LPAYG                                              A
Subjt:  --------------------SLNYDRAWRSNEEALRLIRGDPSSSYGILPAYG---------------------------------------------EA

Query:  LKIMNP------------------------------------AKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVREYLDDIGKERWARCFQTQ
        L ++N                                     AKFK DAKA+EELFLKAAK Y+ESYFNSI AQL AY G+REYLDDIGKERW RCFQT+
Subjt:  LKIMNP------------------------------------AKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVREYLDDIGKERWARCFQTQ

Query:  LRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRGAIMRNINPYSPCDEAYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPRVRRQKTAR
        LRY QMT+N AESVN+LFRHA  L VTALLDHIR                           EPIFP+ HVSTW SSP+FV+I  E P  VPRV ++++ R
Subjt:  LRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRGAIMRNINPYSPCDEAYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPRVRRQKTAR

Query:  IP
        IP
Subjt:  IP

A0A6J1DQY6 uncharacterized protein LOC1110235094.6e-8647.61Show/hide
Query:  SLNYDRAWRSNEEALRLIRGDPSSSYGILPAYGEALKIMNP-----------------------------------------------------------
        +L+YDRAWRS+EEALRLIR DP+SSYG+LPAYGEALKIM P                                                           
Subjt:  SLNYDRAWRSNEEALRLIRGDPSSSYGILPAYGEALKIMNP-----------------------------------------------------------

Query:  --------------------------------------------AKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVREYLDDIGKERWARCFQ
                                                    AKFKDD K VEELFLKAAK Y ESYFNSI AQLRAYLGVREYLDDIGKERWARCFQ
Subjt:  --------------------------------------------AKFKDDAKAVEELFLKAAKVYRESYFNSILAQLRAYLGVREYLDDIGKERWARCFQ

Query:  TQLRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRG---------------------------------------------------------------
        TQLRY QMTTNIAESVN+LFRHA KL VTALLDHIRG                                                               
Subjt:  TQLRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRG---------------------------------------------------------------

Query:  --------------------AIMRNINPYSPCDEAYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPRVRRQKTARIPSTGEVRQTRKCDR
                            A MRNINPYS CDEAYT NSWILAYAEPIFP+ ++STW+SSPEFVNI VEPPKTVPRV R+KTARIPS GEVRQ  KC R
Subjt:  --------------------AIMRNINPYSPCDEAYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNISVEPPKTVPRVRRQKTARIPSTGEVRQTRKCDR

Query:  YGAWGHNRKTCSEPLTTL
         GAWGHNRKTCS+PLTTL
Subjt:  YGAWGHNRKTCSEPLTTL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCTGGACTAATCTAGAAGCATATTTAGTCTGGGACATGTCTTGTACCAGTCTTGCTCCGAGATTTGTTCAGGACAAGTCTCGGACCAAGACTGATCCAAGATTCAT
CCGAGATATGATGCCTCGTGTTTTCATAATATTCGATGGAGAATGGAATGATAGCGAAAAATATTATGTCGGCGGTCATACGAGAGGATTGATAGTGAATAGTACAATCA
CGTACGGAGAATTTCTAGCTTGTATTCCCTCATTCCAAAGACCCACATATCCTATACCCTCATTTCATTCTTCATCATCGAACCCCTCTTCTTCCCGACAGCCACACCCC
TCCTACGGGCATATAGGTCATGATGTAGAGGGTTTAACACCATTGGGATCAGATGTTGTTCCATGTAATCTGGGAGATGACAGGGTGTGTGATTGGGATGTGTCGGGAGT
GTGGAATGATAACGAAGATGAAAGTAATGAATCATATGACCCGTTGGCAGAGTTTGAAGGACACTCTGAAGCAGAATGTGGGAACGAAGAGCATGATGATGCGCTTGATG
AAGAGCTTGAGCCTGATGTGGAACAGGTGCACACTGAGATTCGCAGGGATGAAGAAGCGGTCCGGCCACCGGGATGTAATGGTCTCATCGGACACCCTAATGATGAGAAA
TTGCAACTCATAGTACAGTCTTCTGGGACAAATGATGTTAATGAGGGCGATGTCTTTGATACTAAGAAGGAGTTGAGTTTGAAAATGCATTTAGTTGCAATGCGGATGAA
TTTCAGTTTAAATTATGATAGAGCATGGCGTTCTAATGAAGAAGCACTCCGGCTTATTAGAGGGGATCCATCATCGTCATACGGTATACTTCCAGCTTATGGTGAAGCTT
TGAAAATCATGAACCCAGCAAAATTTAAAGACGATGCGAAGGCAGTCGAGGAACTATTTTTAAAGGCTGCAAAGGTGTATCGCGAGTCATATTTCAACTCGATCTTGGCC
CAACTTCGTGCATACCTCGGTGTACGGGAATATCTAGACGATATTGGGAAGGAGCGTTGGGCTCGTTGTTTCCAAACTCAATTGAGGTATATACAGATGACTACAAATAT
CGCGGAGTCCGTAAATTCCCTCTTCAGGCACGCCCCTAAGTTGTCGGTTACCGCCTTACTTGACCACATTAGAGGGGCGATAATGCGAAATATAAATCCATACAGTCCGT
GTGACGAGGCATATACGATGAACTCCTGGATATTGGCTTATGCAGAACCTATATTTCCAGTCGGACACGTCTCAACATGGAACAGTTCGCCAGAGTTTGTCAACATATCG
GTGGAACCACCGAAGACTGTTCCAAGAGTTAGGAGGCAGAAGACGGCTAGGATTCCTTCCACGGGCGAGGTACGTCAAACACGTAAGTGCGATCGCTATGGTGCATGGGG
GCATAATCGCAAAACATGTAGTGAACCCCTTACCACATTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCCTGGACTAATCTAGAAGCATATTTAGTCTGGGACATGTCTTGTACCAGTCTTGCTCCGAGATTTGTTCAGGACAAGTCTCGGACCAAGACTGATCCAAGATTCAT
CCGAGATATGATGCCTCGTGTTTTCATAATATTCGATGGAGAATGGAATGATAGCGAAAAATATTATGTCGGCGGTCATACGAGAGGATTGATAGTGAATAGTACAATCA
CGTACGGAGAATTTCTAGCTTGTATTCCCTCATTCCAAAGACCCACATATCCTATACCCTCATTTCATTCTTCATCATCGAACCCCTCTTCTTCCCGACAGCCACACCCC
TCCTACGGGCATATAGGTCATGATGTAGAGGGTTTAACACCATTGGGATCAGATGTTGTTCCATGTAATCTGGGAGATGACAGGGTGTGTGATTGGGATGTGTCGGGAGT
GTGGAATGATAACGAAGATGAAAGTAATGAATCATATGACCCGTTGGCAGAGTTTGAAGGACACTCTGAAGCAGAATGTGGGAACGAAGAGCATGATGATGCGCTTGATG
AAGAGCTTGAGCCTGATGTGGAACAGGTGCACACTGAGATTCGCAGGGATGAAGAAGCGGTCCGGCCACCGGGATGTAATGGTCTCATCGGACACCCTAATGATGAGAAA
TTGCAACTCATAGTACAGTCTTCTGGGACAAATGATGTTAATGAGGGCGATGTCTTTGATACTAAGAAGGAGTTGAGTTTGAAAATGCATTTAGTTGCAATGCGGATGAA
TTTCAGTTTAAATTATGATAGAGCATGGCGTTCTAATGAAGAAGCACTCCGGCTTATTAGAGGGGATCCATCATCGTCATACGGTATACTTCCAGCTTATGGTGAAGCTT
TGAAAATCATGAACCCAGCAAAATTTAAAGACGATGCGAAGGCAGTCGAGGAACTATTTTTAAAGGCTGCAAAGGTGTATCGCGAGTCATATTTCAACTCGATCTTGGCC
CAACTTCGTGCATACCTCGGTGTACGGGAATATCTAGACGATATTGGGAAGGAGCGTTGGGCTCGTTGTTTCCAAACTCAATTGAGGTATATACAGATGACTACAAATAT
CGCGGAGTCCGTAAATTCCCTCTTCAGGCACGCCCCTAAGTTGTCGGTTACCGCCTTACTTGACCACATTAGAGGGGCGATAATGCGAAATATAAATCCATACAGTCCGT
GTGACGAGGCATATACGATGAACTCCTGGATATTGGCTTATGCAGAACCTATATTTCCAGTCGGACACGTCTCAACATGGAACAGTTCGCCAGAGTTTGTCAACATATCG
GTGGAACCACCGAAGACTGTTCCAAGAGTTAGGAGGCAGAAGACGGCTAGGATTCCTTCCACGGGCGAGGTACGTCAAACACGTAAGTGCGATCGCTATGGTGCATGGGG
GCATAATCGCAAAACATGTAGTGAACCCCTTACCACATTGTGA
Protein sequenceShow/hide protein sequence
MSWTNLEAYLVWDMSCTSLAPRFVQDKSRTKTDPRFIRDMMPRVFIIFDGEWNDSEKYYVGGHTRGLIVNSTITYGEFLACIPSFQRPTYPIPSFHSSSSNPSSSRQPHP
SYGHIGHDVEGLTPLGSDVVPCNLGDDRVCDWDVSGVWNDNEDESNESYDPLAEFEGHSEAECGNEEHDDALDEELEPDVEQVHTEIRRDEEAVRPPGCNGLIGHPNDEK
LQLIVQSSGTNDVNEGDVFDTKKELSLKMHLVAMRMNFSLNYDRAWRSNEEALRLIRGDPSSSYGILPAYGEALKIMNPAKFKDDAKAVEELFLKAAKVYRESYFNSILA
QLRAYLGVREYLDDIGKERWARCFQTQLRYIQMTTNIAESVNSLFRHAPKLSVTALLDHIRGAIMRNINPYSPCDEAYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIS
VEPPKTVPRVRRQKTARIPSTGEVRQTRKCDRYGAWGHNRKTCSEPLTTL