; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g01800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g01800
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionBAHD acyltransferase At3g29680-like
Genome locationchr3:1366914..1368434
RNA-Seq ExpressionMoc03g01800
SyntenyMoc03g01800
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]7.6e-9250.67Show/hide
Query:  MRSVKASRPNSELGKSSSAFFLLIFQLLFLTSCFNCLATMVCGFTSSVKRKSKGRAHALKTVQSTKPATSAVAQPAVQDKAGPSSEVPTLLIELGSTGGH
        +R ++ASRPNSEL                          MVCGFTSSVKRKSKGRAHALK VQS+ P T AV Q A QD+AGPSS  PT +IEL STG  
Subjt:  MRSVKASRPNSELGKSSSAFFLLIFQLLFLTSCFNCLATMVCGFTSSVKRKSKGRAHALKTVQSTKPATSAVAQPAVQDKAGPSSEVPTLLIELGSTGGH

Query:  SSEKRSRNQSVALDVSPLCEVREGSPLKRRRKKKKATSSSEVEPRGSLPTSHVDLVDDPEARMGGHPTRASKFVSDLGSVLQRTIDHAAEAFIASIHSAV
        S EKRSR++S ALDVSPL EVR                                                                              
Subjt:  SSEKRSRNQSVALDVSPLCEVREGSPLKRRRKKKKATSSSEVEPRGSLPTSHVDLVDDPEARMGGHPTRASKFVSDLGSVLQRTIDHAAEAFIASIHSAV

Query:  MIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRATH----------------------ALEAKDAA
                                                        EAKAELLKREDERHK HLRA H                      ALE KDAA
Subjt:  MIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRATH----------------------ALEAKDAA

Query:  IGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSYVEKWASGPNGTPGPASLVEKYVRDLDSDYSDL
        IGRL AELK EKE L NGALLEAAFRQHPDFDGFAKDFS+AGFKFLMKGIAAD+PHL+VDLGDLKK Y EKWASGPNGT GPASLV+KYVRDLDSDYSDL
Subjt:  IGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSYVEKWASGPNGTPGPASLVEKYVRDLDSDYSDL

Query:  EEDDAPSQEPNEVGITQEEVPSQQGGSQEVNLLGSQGELSSHLGSS
        +ED+ PSQEP EVG TQE VPSQQ GSQEVNLLGSQGELSSHLGSS
Subjt:  EEDDAPSQEPNEVGITQEEVPSQQGGSQEVNLLGSQGELSSHLGSS

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]8.2e-9473.64Show/hide
Query:  RASKFVSDLGSVLQRTIDHAAEAFIASIHSAVMIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRA
        RAS+FVSD GSVLQRTID+AAEAFIASIHSAVM+KAELDGREAL AKEREN ST LEAATTLKGELLKA+ +VDIL+AEV+AK +LLK+E E+HK HLRA
Subjt:  RASKFVSDLGSVLQRTIDHAAEAFIASIHSAVMIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRA

Query:  THA----------------------LEAKDAAIGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSY
         HA                      LE KDA+IGRLT ELK  KE L +GALLE +FRQHP+FDGFAKDFS+AGFKFLMKGIAADMPHLQ+DL DLKK Y
Subjt:  THA----------------------LEAKDAAIGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSY

Query:  VEKWASGPNGTPGPASLVEKYVRDLDSDYSDLEEDDAPSQEPNEVGITQEEVPSQQGG
         E WASGPNGTPGP SLV+KYVR+LDSDYSD+EE+DAPSQEP +VG TQEE PSQ GG
Subjt:  VEKWASGPNGTPGPASLVEKYVRDLDSDYSDLEEDDAPSQEPNEVGITQEEVPSQQGG

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]9.3e-10676.62Show/hide
Query:  RASKFVSDLGSVLQRTIDHAAEAFIASIHSAVMIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRA
        RASKFVSD GSVLQRTID+AAEAF+ASIHSA+M+KAELDGREALAAKERENSS ALEAATTLKGELLKA+ +V IL+AEV+AKAELLK+E E+HK HLRA
Subjt:  RASKFVSDLGSVLQRTIDHAAEAFIASIHSAVMIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRA

Query:  THA----------------------LEAKDAAIGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSY
         HA                      LE KD +IGRLTAELK  KE L NG+LLE +FRQH DFDGFAKDFS+AGFKFLMKGIAADMPHLQ+DL +LKK Y
Subjt:  THA----------------------LEAKDAAIGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSY

Query:  VEKWASGPNGTPGPASLVEKYVRDLDSDYSDLEEDDAPSQEPNEVGITQEEVPSQQGGSQEVNLLGSQGELSSHLGSS
         EKWASGPNGTPGP SLV KYVR+LDSDYSD+EE+DAPSQEPNE+G TQEEVPSQQ GSQEVNLLGS+GELSSHLGSS
Subjt:  VEKWASGPNGTPGPASLVEKYVRDLDSDYSDLEEDDAPSQEPNEVGITQEEVPSQQGGSQEVNLLGSQGELSSHLGSS

XP_022158409.1 uncharacterized protein LOC111024898 [Momordica charantia]9.7e-8773.17Show/hide
Query:  MIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRATH----------------------ALEAKDAA
        MIKAELDGREALAAKE+ENS  ALEAATT+K ELLKARS+V ILKA+V+ KAE+LK+E E+HK HL A H                      ALE  DA 
Subjt:  MIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRATH----------------------ALEAKDAA

Query:  IGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSYVEKWASGPNGTPGPASLVEKYVRDLDSDYSDL
        IGRL+ ELK  KE L NG LLE AF+QHPDFDGFAKDFS+AGFKFLMKGIA DM HLQ+DL D+KK Y EKWASGPNGTPGP SLV+KYVR+LDSDYSD+
Subjt:  IGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSYVEKWASGPNGTPGPASLVEKYVRDLDSDYSDL

Query:  EEDDAPSQEPNEVGITQEEVPSQQGGSQEVNLLGSQGELSSHLGSS
        EE DAPSQEPNEVG TQEEVPSQ GGSQEVNLLGSQGELSSHLGSS
Subjt:  EEDDAPSQEPNEVGITQEEVPSQQGGSQEVNLLGSQGELSSHLGSS

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]3.2e-13863.54Show/hide
Query:  MRSVKASRPNSELGKSSSAFFLLIFQLLFLTSCFNCLATMVCGFTSSVKRKSKGRAHALKTVQSTKPATSAVAQPAVQDKAGPSSEVPTLLIELGSTGGH
        +R ++ASRPNSEL                          MVCGFT SVKRKSKGRAHALKTV  T+P T  V +   Q  +GPSS VPT +IEL  +GG 
Subjt:  MRSVKASRPNSELGKSSSAFFLLIFQLLFLTSCFNCLATMVCGFTSSVKRKSKGRAHALKTVQSTKPATSAVAQPAVQDKAGPSSEVPTLLIELGSTGGH

Query:  SSEKRSRNQSVALDVSPLCEVREGSPLKRRRKKKKATSSSEVEPRGSLPTSHVDLVDDPEARMGG-------------------------------HPTR
        S EKRSR +S ALDVSPL EVR  SPL+RRRKKKK +SSSE   RG+LPTSH DLVDDPEARM G                               +  R
Subjt:  SSEKRSRNQSVALDVSPLCEVREGSPLKRRRKKKKATSSSEVEPRGSLPTSHVDLVDDPEARMGG-------------------------------HPTR

Query:  ASKFVSDLGSVLQRTIDHAAEAFIASIHSAVMIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRAT
        ASKFVSD GSVLQRTID+ AEAFIASIH AVM+KAELDGREALAAKERENS  ALEAATTLKGELLKA+ +VDIL+AEV+AK +LLK+E E+HK HLRA 
Subjt:  ASKFVSDLGSVLQRTIDHAAEAFIASIHSAVMIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRAT

Query:  HA----------------------LEAKDAAIGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSYV
        HA                      LE KDA+IGRLT ELK  KE L NG LLE +FRQHPDFDGFAKDFS+AGFKFLMKGIAADMPHLQ+DL  LKK Y 
Subjt:  HA----------------------LEAKDAAIGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSYV

Query:  EKWASGPNGTPGPASLVEKYVRDLDSDYSDLEEDDAPSQEPNEVGITQEEVPSQQGGS
        EKWASGPNGTP P SLV+KYVR+LDSDYSD+EE+DAPSQEP EVG TQEEVPSQQGGS
Subjt:  EKWASGPNGTPGPASLVEKYVRDLDSDYSDLEEDDAPSQEPNEVGITQEEVPSQQGGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124673.7e-9250.67Show/hide
Query:  MRSVKASRPNSELGKSSSAFFLLIFQLLFLTSCFNCLATMVCGFTSSVKRKSKGRAHALKTVQSTKPATSAVAQPAVQDKAGPSSEVPTLLIELGSTGGH
        +R ++ASRPNSEL                          MVCGFTSSVKRKSKGRAHALK VQS+ P T AV Q A QD+AGPSS  PT +IEL STG  
Subjt:  MRSVKASRPNSELGKSSSAFFLLIFQLLFLTSCFNCLATMVCGFTSSVKRKSKGRAHALKTVQSTKPATSAVAQPAVQDKAGPSSEVPTLLIELGSTGGH

Query:  SSEKRSRNQSVALDVSPLCEVREGSPLKRRRKKKKATSSSEVEPRGSLPTSHVDLVDDPEARMGGHPTRASKFVSDLGSVLQRTIDHAAEAFIASIHSAV
        S EKRSR++S ALDVSPL EVR                                                                              
Subjt:  SSEKRSRNQSVALDVSPLCEVREGSPLKRRRKKKKATSSSEVEPRGSLPTSHVDLVDDPEARMGGHPTRASKFVSDLGSVLQRTIDHAAEAFIASIHSAV

Query:  MIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRATH----------------------ALEAKDAA
                                                        EAKAELLKREDERHK HLRA H                      ALE KDAA
Subjt:  MIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRATH----------------------ALEAKDAA

Query:  IGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSYVEKWASGPNGTPGPASLVEKYVRDLDSDYSDL
        IGRL AELK EKE L NGALLEAAFRQHPDFDGFAKDFS+AGFKFLMKGIAAD+PHL+VDLGDLKK Y EKWASGPNGT GPASLV+KYVRDLDSDYSDL
Subjt:  IGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSYVEKWASGPNGTPGPASLVEKYVRDLDSDYSDL

Query:  EEDDAPSQEPNEVGITQEEVPSQQGGSQEVNLLGSQGELSSHLGSS
        +ED+ PSQEP EVG TQE VPSQQ GSQEVNLLGSQGELSSHLGSS
Subjt:  EEDDAPSQEPNEVGITQEEVPSQQGGSQEVNLLGSQGELSSHLGSS

A0A6J1D1N9 uncharacterized protein LOC1110161933.9e-9473.64Show/hide
Query:  RASKFVSDLGSVLQRTIDHAAEAFIASIHSAVMIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRA
        RAS+FVSD GSVLQRTID+AAEAFIASIHSAVM+KAELDGREAL AKEREN ST LEAATTLKGELLKA+ +VDIL+AEV+AK +LLK+E E+HK HLRA
Subjt:  RASKFVSDLGSVLQRTIDHAAEAFIASIHSAVMIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRA

Query:  THA----------------------LEAKDAAIGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSY
         HA                      LE KDA+IGRLT ELK  KE L +GALLE +FRQHP+FDGFAKDFS+AGFKFLMKGIAADMPHLQ+DL DLKK Y
Subjt:  THA----------------------LEAKDAAIGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSY

Query:  VEKWASGPNGTPGPASLVEKYVRDLDSDYSDLEEDDAPSQEPNEVGITQEEVPSQQGG
         E WASGPNGTPGP SLV+KYVR+LDSDYSD+EE+DAPSQEP +VG TQEE PSQ GG
Subjt:  VEKWASGPNGTPGPASLVEKYVRDLDSDYSDLEEDDAPSQEPNEVGITQEEVPSQQGG

A0A6J1DF31 uncharacterized protein LOC1110199094.5e-10676.62Show/hide
Query:  RASKFVSDLGSVLQRTIDHAAEAFIASIHSAVMIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRA
        RASKFVSD GSVLQRTID+AAEAF+ASIHSA+M+KAELDGREALAAKERENSS ALEAATTLKGELLKA+ +V IL+AEV+AKAELLK+E E+HK HLRA
Subjt:  RASKFVSDLGSVLQRTIDHAAEAFIASIHSAVMIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRA

Query:  THA----------------------LEAKDAAIGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSY
         HA                      LE KD +IGRLTAELK  KE L NG+LLE +FRQH DFDGFAKDFS+AGFKFLMKGIAADMPHLQ+DL +LKK Y
Subjt:  THA----------------------LEAKDAAIGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSY

Query:  VEKWASGPNGTPGPASLVEKYVRDLDSDYSDLEEDDAPSQEPNEVGITQEEVPSQQGGSQEVNLLGSQGELSSHLGSS
         EKWASGPNGTPGP SLV KYVR+LDSDYSD+EE+DAPSQEPNE+G TQEEVPSQQ GSQEVNLLGS+GELSSHLGSS
Subjt:  VEKWASGPNGTPGPASLVEKYVRDLDSDYSDLEEDDAPSQEPNEVGITQEEVPSQQGGSQEVNLLGSQGELSSHLGSS

A0A6J1DZB3 uncharacterized protein LOC1110256651.5e-13863.54Show/hide
Query:  MRSVKASRPNSELGKSSSAFFLLIFQLLFLTSCFNCLATMVCGFTSSVKRKSKGRAHALKTVQSTKPATSAVAQPAVQDKAGPSSEVPTLLIELGSTGGH
        +R ++ASRPNSEL                          MVCGFT SVKRKSKGRAHALKTV  T+P T  V +   Q  +GPSS VPT +IEL  +GG 
Subjt:  MRSVKASRPNSELGKSSSAFFLLIFQLLFLTSCFNCLATMVCGFTSSVKRKSKGRAHALKTVQSTKPATSAVAQPAVQDKAGPSSEVPTLLIELGSTGGH

Query:  SSEKRSRNQSVALDVSPLCEVREGSPLKRRRKKKKATSSSEVEPRGSLPTSHVDLVDDPEARMGG-------------------------------HPTR
        S EKRSR +S ALDVSPL EVR  SPL+RRRKKKK +SSSE   RG+LPTSH DLVDDPEARM G                               +  R
Subjt:  SSEKRSRNQSVALDVSPLCEVREGSPLKRRRKKKKATSSSEVEPRGSLPTSHVDLVDDPEARMGG-------------------------------HPTR

Query:  ASKFVSDLGSVLQRTIDHAAEAFIASIHSAVMIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRAT
        ASKFVSD GSVLQRTID+ AEAFIASIH AVM+KAELDGREALAAKERENS  ALEAATTLKGELLKA+ +VDIL+AEV+AK +LLK+E E+HK HLRA 
Subjt:  ASKFVSDLGSVLQRTIDHAAEAFIASIHSAVMIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRAT

Query:  HA----------------------LEAKDAAIGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSYV
        HA                      LE KDA+IGRLT ELK  KE L NG LLE +FRQHPDFDGFAKDFS+AGFKFLMKGIAADMPHLQ+DL  LKK Y 
Subjt:  HA----------------------LEAKDAAIGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSYV

Query:  EKWASGPNGTPGPASLVEKYVRDLDSDYSDLEEDDAPSQEPNEVGITQEEVPSQQGGS
        EKWASGPNGTP P SLV+KYVR+LDSDYSD+EE+DAPSQEP EVG TQEEVPSQQGGS
Subjt:  EKWASGPNGTPGPASLVEKYVRDLDSDYSDLEEDDAPSQEPNEVGITQEEVPSQQGGS

A0A6J1DZB5 uncharacterized protein LOC1110248984.7e-8773.17Show/hide
Query:  MIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRATH----------------------ALEAKDAA
        MIKAELDGREALAAKE+ENS  ALEAATT+K ELLKARS+V ILKA+V+ KAE+LK+E E+HK HL A H                      ALE  DA 
Subjt:  MIKAELDGREALAAKERENSSTALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRATH----------------------ALEAKDAA

Query:  IGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSYVEKWASGPNGTPGPASLVEKYVRDLDSDYSDL
        IGRL+ ELK  KE L NG LLE AF+QHPDFDGFAKDFS+AGFKFLMKGIA DM HLQ+DL D+KK Y EKWASGPNGTPGP SLV+KYVR+LDSDYSD+
Subjt:  IGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAADMPHLQVDLGDLKKSYVEKWASGPNGTPGPASLVEKYVRDLDSDYSDL

Query:  EEDDAPSQEPNEVGITQEEVPSQQGGSQEVNLLGSQGELSSHLGSS
        EE DAPSQEPNEVG TQEEVPSQ GGSQEVNLLGSQGELSSHLGSS
Subjt:  EEDDAPSQEPNEVGITQEEVPSQQGGSQEVNLLGSQGELSSHLGSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTTCGGTTAAAGCTTCGAGACCGAACTCTGAACTTGGTAAGTCGAGCTCGGCCTTCTTTCTACTTATATTTCAATTGTTATTTCTGACATCTTGCTTTAACTGCCT
TGCAACAATGGTGTGCGGATTCACCAGCAGTGTAAAGCGCAAGTCTAAGGGCCGTGCTCACGCCCTTAAGACTGTTCAAAGCACTAAGCCTGCGACTTCTGCTGTGGCTC
AGCCTGCGGTTCAGGACAAGGCTGGGCCATCCTCTGAAGTTCCAACTCTGTTGATCGAGTTGGGCTCTACTGGGGGACACTCCAGCGAGAAGCGCTCGAGGAACCAATCC
GTGGCGCTAGACGTGTCGCCTCTTTGCGAGGTGAGGGAGGGCTCTCCTCTGAAGAGGAGAAGGAAAAAGAAGAAAGCCACCTCCTCCTCGGAGGTTGAACCTCGTGGTTC
CCTGCCCACGAGCCATGTCGACCTGGTGGACGACCCTGAAGCTCGGATGGGGGGACATCCGACGAGGGCGTCCAAGTTCGTAAGTGACCTTGGGTCTGTACTGCAAAGGA
CCATTGACCACGCTGCCGAGGCGTTTATTGCTTCCATTCATTCAGCGGTTATGATCAAGGCCGAGCTGGATGGAAGGGAGGCCTTGGCAGCGAAGGAGAGGGAGAACTCC
TCTACTGCCTTAGAGGCTGCCACTACGCTGAAGGGCGAGCTGCTGAAGGCCCGGAGCGACGTGGATATTTTGAAGGCCGAGGTGGAAGCCAAGGCCGAGCTGCTGAAGAG
GGAGGATGAGAGGCATAAGGGCCACCTCCGAGCTACCCACGCCCTTGAGGCGAAGGACGCTGCAATTGGGCGTCTCACTGCTGAGCTCAAGGTGGAGAAGGAATGCCTCG
CCAACGGAGCTCTTCTAGAAGCAGCCTTCAGGCAACACCCAGACTTTGATGGGTTTGCCAAGGATTTCAGCAATGCAGGCTTCAAGTTTTTGATGAAAGGCATTGCTGCT
GACATGCCCCACCTCCAGGTCGACCTCGGCGATCTGAAGAAGAGCTATGTTGAGAAATGGGCTTCTGGGCCTAACGGCACTCCAGGTCCTGCTTCCCTGGTAGAAAAGTA
CGTCAGAGATCTAGACTCTGACTACTCCGACCTGGAAGAAGACGATGCCCCTAGTCAGGAGCCTAACGAGGTCGGCATTACCCAAGAAGAAGTTCCTTCGCAGCAGGGCG
GATCTCAGGAGGTCAACCTTCTGGGTTCCCAAGGCGAGCTATCCTCTCACCTCGGGAGCAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGTTCGGTTAAAGCTTCGAGACCGAACTCTGAACTTGGTAAGTCGAGCTCGGCCTTCTTTCTACTTATATTTCAATTGTTATTTCTGACATCTTGCTTTAACTGCCT
TGCAACAATGGTGTGCGGATTCACCAGCAGTGTAAAGCGCAAGTCTAAGGGCCGTGCTCACGCCCTTAAGACTGTTCAAAGCACTAAGCCTGCGACTTCTGCTGTGGCTC
AGCCTGCGGTTCAGGACAAGGCTGGGCCATCCTCTGAAGTTCCAACTCTGTTGATCGAGTTGGGCTCTACTGGGGGACACTCCAGCGAGAAGCGCTCGAGGAACCAATCC
GTGGCGCTAGACGTGTCGCCTCTTTGCGAGGTGAGGGAGGGCTCTCCTCTGAAGAGGAGAAGGAAAAAGAAGAAAGCCACCTCCTCCTCGGAGGTTGAACCTCGTGGTTC
CCTGCCCACGAGCCATGTCGACCTGGTGGACGACCCTGAAGCTCGGATGGGGGGACATCCGACGAGGGCGTCCAAGTTCGTAAGTGACCTTGGGTCTGTACTGCAAAGGA
CCATTGACCACGCTGCCGAGGCGTTTATTGCTTCCATTCATTCAGCGGTTATGATCAAGGCCGAGCTGGATGGAAGGGAGGCCTTGGCAGCGAAGGAGAGGGAGAACTCC
TCTACTGCCTTAGAGGCTGCCACTACGCTGAAGGGCGAGCTGCTGAAGGCCCGGAGCGACGTGGATATTTTGAAGGCCGAGGTGGAAGCCAAGGCCGAGCTGCTGAAGAG
GGAGGATGAGAGGCATAAGGGCCACCTCCGAGCTACCCACGCCCTTGAGGCGAAGGACGCTGCAATTGGGCGTCTCACTGCTGAGCTCAAGGTGGAGAAGGAATGCCTCG
CCAACGGAGCTCTTCTAGAAGCAGCCTTCAGGCAACACCCAGACTTTGATGGGTTTGCCAAGGATTTCAGCAATGCAGGCTTCAAGTTTTTGATGAAAGGCATTGCTGCT
GACATGCCCCACCTCCAGGTCGACCTCGGCGATCTGAAGAAGAGCTATGTTGAGAAATGGGCTTCTGGGCCTAACGGCACTCCAGGTCCTGCTTCCCTGGTAGAAAAGTA
CGTCAGAGATCTAGACTCTGACTACTCCGACCTGGAAGAAGACGATGCCCCTAGTCAGGAGCCTAACGAGGTCGGCATTACCCAAGAAGAAGTTCCTTCGCAGCAGGGCG
GATCTCAGGAGGTCAACCTTCTGGGTTCCCAAGGCGAGCTATCCTCTCACCTCGGGAGCAGCTGA
Protein sequenceShow/hide protein sequence
MRSVKASRPNSELGKSSSAFFLLIFQLLFLTSCFNCLATMVCGFTSSVKRKSKGRAHALKTVQSTKPATSAVAQPAVQDKAGPSSEVPTLLIELGSTGGHSSEKRSRNQS
VALDVSPLCEVREGSPLKRRRKKKKATSSSEVEPRGSLPTSHVDLVDDPEARMGGHPTRASKFVSDLGSVLQRTIDHAAEAFIASIHSAVMIKAELDGREALAAKERENS
STALEAATTLKGELLKARSDVDILKAEVEAKAELLKREDERHKGHLRATHALEAKDAAIGRLTAELKVEKECLANGALLEAAFRQHPDFDGFAKDFSNAGFKFLMKGIAA
DMPHLQVDLGDLKKSYVEKWASGPNGTPGPASLVEKYVRDLDSDYSDLEEDDAPSQEPNEVGITQEEVPSQQGGSQEVNLLGSQGELSSHLGSS