; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS012071 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS012071
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPolyketide cyclase/dehydrase and lipid transporter
Genome locationscaffold708:63397..65905
RNA-Seq ExpressionMS012071
SyntenyMS012071
Gene Ontology termsNA
InterPro domainsIPR005031 - Coenzyme Q-binding protein COQ10, START domain
IPR023393 - START-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008461396.1 PREDICTED: uncharacterized protein LOC103499992 isoform X1 [Cucumis melo]3.1e-10384.05Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS A +T NS PNS  L H SI+RRNG+LFMAIPT RSINS+S SLP+LVF+IPR S KRSRNP  PR+KF SPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL
        REAIPKWMPFISSVKV+EDNPSLSRWSLKY AFGQDIEFSWLA+NLQ        PT NQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLL
Subjt:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL

Query:  SPVASALQPLLERLLKRGLESFATFAKKYQTA
        SPVASALQPLLERLL+RGL+SFATFAKKYQTA
Subjt:  SPVASALQPLLERLLKRGLESFATFAKKYQTA

XP_022148300.1 uncharacterized protein LOC111016984 isoform X1 [Momordica charantia]4.7e-12095.26Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSFAAVTCNSNPNSPNLSHGSIKRRNGVL MAIPTRRSINSKSPSLPELVFRIPRGSLKRS NPNFPRVKF SPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL
        REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQ        PTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL
Subjt:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL

Query:  SPVASALQPLLERLLKRGLESFATFAKKYQTA
        SPVASALQPLLERLLKRGLESFATFAKKYQTA
Subjt:  SPVASALQPLLERLLKRGLESFATFAKKYQTA

XP_022960550.1 uncharacterized protein LOC111461253 [Cucurbita moschata]1.2e-10786.64Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNSNPN+ NLSH SI+R NGVLFMAIPTRRSINS+S SLP+L+FRIPR S K  RNP  PRVKF SPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL
        REAIPKWMPFISSVKV+EDNP+LSRWSLKY AFGQDIEFSWLA+NLQ        PTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLL
Subjt:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL

Query:  SPVASALQPLLERLLKRGLESFATFAKKYQTA
        SPVASALQPLLERLLKRGLESFATFAKKYQTA
Subjt:  SPVASALQPLLERLLKRGLESFATFAKKYQTA

XP_023004647.1 uncharacterized protein LOC111497878 [Cucurbita maxima]6.0e-10786.21Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNSNPN+ NLSH SI+R NGVLFMAIPTRRSIN +S SLP+L+FRIPR S K  RNP  PRVKF SPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL
        REAIPKWMPFISSVKV+EDNP+LSRWSLKY AFGQDIEFSWLA+NLQ        PTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLL
Subjt:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL

Query:  SPVASALQPLLERLLKRGLESFATFAKKYQTA
        SPVASALQPLLERLLKRGLESFATFAKKYQTA
Subjt:  SPVASALQPLLERLLKRGLESFATFAKKYQTA

XP_023514604.1 uncharacterized protein LOC111778852 [Cucurbita pepo subsp. pepo]5.4e-10886.64Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNSNPN+ NLSH SI+R NGVLFMAIPTRR+INS+S SLP+L+FRIPRGS K  RNP  PRVKF SPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL
        REAIPKWMPFISSVKV+EDNP+LSRWSLKY AFGQDIEFSWLA+NLQ        PTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLL
Subjt:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL

Query:  SPVASALQPLLERLLKRGLESFATFAKKYQTA
        SPVASALQPLLERLLKRGLESFATFAKKYQTA
Subjt:  SPVASALQPLLERLLKRGLESFATFAKKYQTA

TrEMBL top hitse value%identityAlignment
A0A1S3CEM7 uncharacterized protein LOC103499992 isoform X11.5e-10384.05Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS A +T NS PNS  L H SI+RRNG+LFMAIPT RSINS+S SLP+LVF+IPR S KRSRNP  PR+KF SPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL
        REAIPKWMPFISSVKV+EDNPSLSRWSLKY AFGQDIEFSWLA+NLQ        PT NQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLL
Subjt:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL

Query:  SPVASALQPLLERLLKRGLESFATFAKKYQTA
        SPVASALQPLLERLL+RGL+SFATFAKKYQTA
Subjt:  SPVASALQPLLERLLKRGLESFATFAKKYQTA

A0A6J1D2J2 uncharacterized protein LOC111016984 isoform X12.3e-12095.26Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSFAAVTCNSNPNSPNLSHGSIKRRNGVL MAIPTRRSINSKSPSLPELVFRIPRGSLKRS NPNFPRVKF SPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL
        REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQ        PTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL
Subjt:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL

Query:  SPVASALQPLLERLLKRGLESFATFAKKYQTA
        SPVASALQPLLERLLKRGLESFATFAKKYQTA
Subjt:  SPVASALQPLLERLLKRGLESFATFAKKYQTA

A0A6J1D4Q5 uncharacterized protein LOC111016984 isoform X27.4e-10395.05Show/hide
Query:  MAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVVEDNPSLSRWSLKY
        MAIPTRRSINSKSPSLPELVFRIPRGSLKRS NPNFPRVKF SPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVVEDNPSLSRWSLKY
Subjt:  MAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVVEDNPSLSRWSLKY

Query:  KAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQ
        KAFGQDIEFSWLAKNLQ        PTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQ
Subjt:  KAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQ

Query:  TA
        TA
Subjt:  TA

A0A6J1H9D8 uncharacterized protein LOC1114612535.9e-10886.64Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNSNPN+ NLSH SI+R NGVLFMAIPTRRSINS+S SLP+L+FRIPR S K  RNP  PRVKF SPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL
        REAIPKWMPFISSVKV+EDNP+LSRWSLKY AFGQDIEFSWLA+NLQ        PTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLL
Subjt:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL

Query:  SPVASALQPLLERLLKRGLESFATFAKKYQTA
        SPVASALQPLLERLLKRGLESFATFAKKYQTA
Subjt:  SPVASALQPLLERLLKRGLESFATFAKKYQTA

A0A6J1KR05 uncharacterized protein LOC1114978782.9e-10786.21Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNSNPN+ NLSH SI+R NGVLFMAIPTRRSIN +S SLP+L+FRIPR S K  RNP  PRVKF SPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL
        REAIPKWMPFISSVKV+EDNP+LSRWSLKY AFGQDIEFSWLA+NLQ        PTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLL
Subjt:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLL

Query:  SPVASALQPLLERLLKRGLESFATFAKKYQTA
        SPVASALQPLLERLLKRGLESFATFAKKYQTA
Subjt:  SPVASALQPLLERLLKRGLESFATFAKKYQTA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02470.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein8.8e-4857.14Show/hide
Query:  PVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNR
        PVM+WQ+ T KM VD PASVAYK Y+DRE  PKWMPF+SSV+ +E +P LSR+ +K ++FGQ+IE+ +LAKNLQ        P  ++KIHWRS+EG  NR
Subjt:  PVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNR

Query:  GVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAK
        G VRF+P+G SSCLVE++ SYEVP   +PVA A++P +E++++ GLE FA F K
Subjt:  GVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAK

AT1G02470.2 Polyketide cyclase/dehydrase and lipid transport superfamily protein2.2e-4656.77Show/hide
Query:  PVMEWQNCT-AKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPN
        PVM+WQ+ T  KM VD PASVAYK Y+DRE  PKWMPF+SSV+ +E +P LSR+ +K ++FGQ+IE+ +LAKNLQ        P  ++KIHWRS+EG  N
Subjt:  PVMEWQNCT-AKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPN

Query:  RGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAK
        RG VRF+P+G SSCLVE++ SYEVP   +PVA A++P +E++++ GLE FA F K
Subjt:  RGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAK

AT1G02475.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein1.0e-5961.83Show/hide
Query:  PSLPELVFRIPRGSLKRSRNPNF--PRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSW
        P  P+ +      S   SR   F  P+ +  S  MEWQ+C+ KMEVD+P SVAY  Y DRE+ PKWMPFISSV+V++D P LSRWSLKY AFGQDI++SW
Subjt:  PSLPELVFRIPRGSLKRSRNPNF--PRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSW

Query:  LAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAK
        LA+NLQ        PT NQKIHWRSLEGLPN+G VRF+PKG SSC+VELTVSYEVP LL+PVAS L+P +E LL+ GLE FA  AK
Subjt:  LAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAK

AT4G01883.1 Polyketide cyclase / dehydrase and lipid transport protein3.7e-5450.64Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFA--SPVMEWQNCTAKMEVDIPASVAYKCY
        MS  A+   +NP       G+    +   F    +R   +S SP  P  +    R S   S N +F    F     +MEWQ C  KM+V++P SVAY  Y
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFA--SPVMEWQNCTAKMEVDIPASVAYKCY

Query:  SDREAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPP
        S+RE+IPKWM FISSVKV++D P LSRW+LKYKAFGQ++E++WLAKNLQ        P  NQKIHW SLEGLPN+G VRF+P G SSC VELT +YEVP 
Subjt:  SDREAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPP

Query:  LLSPVASALQPLLERLLKRGLESFATFAKKYQT
        LL P A+ALQPL++ L+K  LE FA  AK  +T
Subjt:  LLSPVASALQPLLERLLKRGLESFATFAKKYQT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTTTGCAGCAGTCACCTGTAATTCCAATCCCAACTCTCCGAATCTCAGTCATGGATCCATCAAACGGAGAAATGGCGTTCTTTTCATGGCGATTCCCACTCGCAG
AAGCATCAATTCGAAGTCCCCATCTCTACCCGAGCTCGTTTTCAGAATCCCACGCGGTTCTTTGAAGCGCAGTAGAAACCCCAATTTCCCTCGGGTTAAATTCGCCTCCC
CTGTGATGGAATGGCAAAATTGCACGGCCAAGATGGAAGTCGACATACCTGCCTCCGTTGCCTATAAATGCTACTCAGATCGGGAAGCCATCCCCAAATGGATGCCCTTC
ATTTCATCTGTGAAGGTAGTGGAAGATAATCCTAGTTTATCACGGTGGTCATTAAAATACAAGGCTTTTGGTCAAGATATTGAGTTCTCTTGGCTTGCTAAAAACTTGCA
GGCAAACAATTCTCTTCCTCCTTCCCCGACCCTAAATCAAAAAATCCATTGGCGGTCTCTCGAAGGTCTTCCAAACAGAGGTGTTGTACGATTTTATCCAAAAGGTTCCT
CATCCTGCCTTGTAGAACTGACAGTCTCTTATGAAGTTCCTCCACTTTTGTCTCCAGTGGCATCTGCACTGCAGCCTTTGCTTGAGAGATTACTTAAACGGGGTCTTGAA
AGCTTTGCTACTTTTGCCAAGAAATACCAAACGGCT
mRNA sequenceShow/hide mRNA sequence
ATGTCTTTTGCAGCAGTCACCTGTAATTCCAATCCCAACTCTCCGAATCTCAGTCATGGATCCATCAAACGGAGAAATGGCGTTCTTTTCATGGCGATTCCCACTCGCAG
AAGCATCAATTCGAAGTCCCCATCTCTACCCGAGCTCGTTTTCAGAATCCCACGCGGTTCTTTGAAGCGCAGTAGAAACCCCAATTTCCCTCGGGTTAAATTCGCCTCCC
CTGTGATGGAATGGCAAAATTGCACGGCCAAGATGGAAGTCGACATACCTGCCTCCGTTGCCTATAAATGCTACTCAGATCGGGAAGCCATCCCCAAATGGATGCCCTTC
ATTTCATCTGTGAAGGTAGTGGAAGATAATCCTAGTTTATCACGGTGGTCATTAAAATACAAGGCTTTTGGTCAAGATATTGAGTTCTCTTGGCTTGCTAAAAACTTGCA
GGCAAACAATTCTCTTCCTCCTTCCCCGACCCTAAATCAAAAAATCCATTGGCGGTCTCTCGAAGGTCTTCCAAACAGAGGTGTTGTACGATTTTATCCAAAAGGTTCCT
CATCCTGCCTTGTAGAACTGACAGTCTCTTATGAAGTTCCTCCACTTTTGTCTCCAGTGGCATCTGCACTGCAGCCTTTGCTTGAGAGATTACTTAAACGGGGTCTTGAA
AGCTTTGCTACTTTTGCCAAGAAATACCAAACGGCT
Protein sequenceShow/hide protein sequence
MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSRNPNFPRVKFASPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPF
ISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQANNSLPPSPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLE
SFATFAKKYQTA