; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g0512 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g0512
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPolyketide_cyc domain-containing protein
Genome locationMC09:4661002..4666342
RNA-Seq ExpressionMC09g0512
SyntenyMC09g0512
Gene Ontology termsNA
InterPro domainsIPR005031 - Coenzyme Q-binding protein COQ10, START domain
IPR023393 - START-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022148300.1 uncharacterized protein LOC111016984 isoform X1 [Momordica charantia]1.74e-15999.55Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSFAAVTCNSNPNSPNLSHGSIKRRNGVL MAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

XP_022148307.1 uncharacterized protein LOC111016984 isoform X2 [Momordica charantia]6.23e-137100Show/hide
Query:  MAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVVEDNPSLSRWSLKY
        MAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVVEDNPSLSRWSLKY
Subjt:  MAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVVEDNPSLSRWSLKY

Query:  KAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQTA
        KAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQTA
Subjt:  KAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQTA

XP_022960550.1 uncharacterized protein LOC111461253 [Cucurbita moschata]2.42e-14189.73Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNSNPN+ NLSH SI+R NGVLFMAIPTRRSINS+S SLP+L+FRIPR S K   NP  PRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKV+EDNP+LSRWSLKY AFGQDIEFSWLA+NLQPTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

XP_023004647.1 uncharacterized protein LOC111497878 [Cucurbita maxima]1.99e-14089.29Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNSNPN+ NLSH SI+R NGVLFMAIPTRRSIN +S SLP+L+FRIPR S K   NP  PRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKV+EDNP+LSRWSLKY AFGQDIEFSWLA+NLQPTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

XP_023514604.1 uncharacterized protein LOC111778852 [Cucurbita pepo subsp. pepo]8.44e-14289.73Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNSNPN+ NLSH SI+R NGVLFMAIPTRR+INS+S SLP+L+FRIPRGS K   NP  PRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKV+EDNP+LSRWSLKY AFGQDIEFSWLA+NLQPTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

TrEMBL top hitse value%identityAlignment
A0A1S3CEM7 uncharacterized protein LOC103499992 isoform X17.24e-13687.05Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS A +T NS PNS  L H SI+RRNG+LFMAIPT RSINS+S SLP+LVF+IPR S KRS NP  PR+KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKV+EDNPSLSRWSLKY AFGQDIEFSWLA+NLQPT NQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLL+RGL+SFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

A0A6J1D2J2 uncharacterized protein LOC111016984 isoform X18.44e-16099.55Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSFAAVTCNSNPNSPNLSHGSIKRRNGVL MAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

A0A6J1D4Q5 uncharacterized protein LOC111016984 isoform X23.02e-137100Show/hide
Query:  MAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVVEDNPSLSRWSLKY
        MAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVVEDNPSLSRWSLKY
Subjt:  MAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVVEDNPSLSRWSLKY

Query:  KAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQTA
        KAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQTA
Subjt:  KAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQTA

A0A6J1H9D8 uncharacterized protein LOC1114612531.17e-14189.73Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNSNPN+ NLSH SI+R NGVLFMAIPTRRSINS+S SLP+L+FRIPR S K   NP  PRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKV+EDNP+LSRWSLKY AFGQDIEFSWLA+NLQPTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

A0A6J1KR05 uncharacterized protein LOC1114978789.61e-14189.29Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNSNPN+ NLSH SI+R NGVLFMAIPTRRSIN +S SLP+L+FRIPR S K   NP  PRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKV+EDNP+LSRWSLKY AFGQDIEFSWLA+NLQPTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02470.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein4.1e-5048.06Show/hide
Query:  RNGVLFMAIPTRRSINS----KSPSLPELVFRIPRGSLKRSTNPNFPRVKFVS-------PVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISS
        R+ + F  IP   ++ S     S S P ++  +   S   +   N      +S       PVM+WQ+ T KM VD PASVAYK Y+DRE  PKWMPF+SS
Subjt:  RNGVLFMAIPTRRSINS----KSPSLPELVFRIPRGSLKRSTNPNFPRVKFVS-------PVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISS

Query:  VKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLES
        V+ +E +P LSR+ +K ++FGQ+IE+ +LAKNLQP  ++KIHWRS+EG  NRG VRF+P+G SSCLVE++ SYEVP   +PVA A++P +E++++ GLE 
Subjt:  VKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLES

Query:  FATFAK
        FA F K
Subjt:  FATFAK

AT1G02470.2 Polyketide cyclase/dehydrase and lipid transport superfamily protein1.0e-4847.83Show/hide
Query:  RNGVLFMAIPTRRSINS----KSPSLPELVFRIPRGSLKRSTNPNFPRVKFVS-------PVMEWQNCT-AKMEVDIPASVAYKCYSDREAIPKWMPFIS
        R+ + F  IP   ++ S     S S P ++  +   S   +   N      +S       PVM+WQ+ T  KM VD PASVAYK Y+DRE  PKWMPF+S
Subjt:  RNGVLFMAIPTRRSINS----KSPSLPELVFRIPRGSLKRSTNPNFPRVKFVS-------PVMEWQNCT-AKMEVDIPASVAYKCYSDREAIPKWMPFIS

Query:  SVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLE
        SV+ +E +P LSR+ +K ++FGQ+IE+ +LAKNLQP  ++KIHWRS+EG  NRG VRF+P+G SSCLVE++ SYEVP   +PVA A++P +E++++ GLE
Subjt:  SVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLE

Query:  SFATFAK
         FA F K
Subjt:  SFATFAK

AT1G02475.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein2.7e-6268.1Show/hide
Query:  SLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHW
        S+ R +    P+ +  S  MEWQ+C+ KMEVD+P SVAY  Y DRE+ PKWMPFISSV+V++D P LSRWSLKY AFGQDI++SWLA+NLQPT NQKIHW
Subjt:  SLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHW

Query:  RSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAK
        RSLEGLPN+G VRF+PKG SSC+VELTVSYEVP LL+PVAS L+P +E LL+ GLE FA  AK
Subjt:  RSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAK

AT4G01883.1 Polyketide cyclase / dehydrase and lipid transport protein3.4e-5752.89Show/hide
Query:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKF--VSPVMEWQNCTAKMEVDIPASVAYKCY
        MS  A+   +NP       G+    +   F    +R   +S SP  P  +    R S   STN +F    F     +MEWQ C  KM+V++P SVAY  Y
Subjt:  MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKF--VSPVMEWQNCTAKMEVDIPASVAYKCY

Query:  SDREAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASA
        S+RE+IPKWM FISSVKV++D P LSRW+LKYKAFGQ++E++WLAKNLQP  NQKIHW SLEGLPN+G VRF+P G SSC VELT +YEVP LL P A+A
Subjt:  SDREAIPKWMPFISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASA

Query:  LQPLLERLLKRGLESFATFAKKYQT
        LQPL++ L+K  LE FA  AK  +T
Subjt:  LQPLLERLLKRGLESFATFAKKYQT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTTTGCAGCAGTCACCTGTAATTCCAATCCCAACTCTCCGAATCTCAGTCATGGATCCATCAAACGGAGAAATGGCGTTCTTTTCATGGCGATTCCCACTCGCAG
AAGCATCAATTCGAAGTCCCCATCTCTACCCGAGCTCGTTTTCAGAATCCCACGCGGTTCTTTGAAGCGCAGTACAAACCCCAATTTCCCTCGGGTTAAATTCGTCTCCC
CTGTGATGGAATGGCAAAATTGCACGGCCAAGATGGAAGTCGACATACCTGCCTCCGTTGCCTATAAATGCTACTCAGATCGGGAAGCCATCCCCAAATGGATGCCCTTC
ATTTCATCTGTGAAGGTAGTGGAAGATAATCCTAGTTTATCACGGTGGTCATTAAAATACAAGGCTTTTGGTCAAGATATTGAGTTCTCTTGGCTTGCTAAAAACTTGCA
GCCGACCCTAAATCAAAAAATCCATTGGCGGTCTCTCGAAGGTCTTCCAAACAGAGGTGTTGTACGATTTTATCCAAAAGGTTCCTCATCCTGCCTTGTAGAACTGACAG
TCTCTTATGAAGTTCCTCCACTTTTGTCTCCAGTGGCATCTGCACTGCAGCCTTTGCTTGAGAGATTACTTAAACGGGGTCTTGAAAGCTTTGCTACTTTTGCCAAGAAA
TACCAAACGGCTTGA
mRNA sequenceShow/hide mRNA sequence
AAAAATGATTTTTTTTTTTAGTTTCACCATTTCACTGATCACCTTCCGTTCTTCGGTCGTTTCGATCTCTGACCGCTCTGATATTTCTTGTGGCTTTCGAAAGACCGAAT
TCAGAGGCTGTTCTTGCATTTGGTGGCGTCAATTCTCTGCTCACCACCCCCTTTAATGTCCCTTTCCCGCCTTCTTCTTCGCTAATTGCAAATTCCACTATGTCTTTTGC
AGCAGTCACCTGTAATTCCAATCCCAACTCTCCGAATCTCAGTCATGGATCCATCAAACGGAGAAATGGCGTTCTTTTCATGGCGATTCCCACTCGCAGAAGCATCAATT
CGAAGTCCCCATCTCTACCCGAGCTCGTTTTCAGAATCCCACGCGGTTCTTTGAAGCGCAGTACAAACCCCAATTTCCCTCGGGTTAAATTCGTCTCCCCTGTGATGGAA
TGGCAAAATTGCACGGCCAAGATGGAAGTCGACATACCTGCCTCCGTTGCCTATAAATGCTACTCAGATCGGGAAGCCATCCCCAAATGGATGCCCTTCATTTCATCTGT
GAAGGTAGTGGAAGATAATCCTAGTTTATCACGGTGGTCATTAAAATACAAGGCTTTTGGTCAAGATATTGAGTTCTCTTGGCTTGCTAAAAACTTGCAGCCGACCCTAA
ATCAAAAAATCCATTGGCGGTCTCTCGAAGGTCTTCCAAACAGAGGTGTTGTACGATTTTATCCAAAAGGTTCCTCATCCTGCCTTGTAGAACTGACAGTCTCTTATGAA
GTTCCTCCACTTTTGTCTCCAGTGGCATCTGCACTGCAGCCTTTGCTTGAGAGATTACTTAAACGGGGTCTTGAAAGCTTTGCTACTTTTGCCAAGAAATACCAAACGGC
TTGAAAACCCGAGTTATTTGTCTTAATGGGTATCCCACATTATCATCTTTTTTCAAAAACTCTTATAAATTAATGTAAAATCTTGAATAGAATACATAACAGAATCTGTT
GTACTGAATATCTCAATGTATTTTACAAAAATATGCTGAGTTCAGCTGATTTATTGAAGCTATGTATGCCTGCAATCTGAGGGAGTAGATATCAGAAAACTATTTACATT
CCAATAAATGGGATTGCAGAGCTGAACGCTTCAATCTGTTATAAATGAGGGGTTTTACAGGAAATAGTTACACTCATTGGCATCCACTGTTGTCTAACCATCTCTGCAAG
CTACAAAGGGCCTGTGCAGAGAGGCTGCAGTATCTCAATTTTCGCTGCAGTGGCAACGGATTGTCACGGTGGAAACTAGAGAAGCAAACAACGACAACAGTGAGATTATC
GAATGAGTCTAGTCGCAAAGCCTGCAGGACGAGGTCTCGAGCACATCGCTCAGGGTCGTCGTGTCGCTGAAGTCCCTGTCGAACCACATTGACTGCTTGCTGGCTAGACA
TCACGTCCCAGATCCCATCGCAGGCTATGATCAGGAACTCATCCTCCTCTGTTAAAACCACCTGCCTGCATTCTGGTTCTGCAATGAGAGGTGAAGGAGAACCATGGGGG
AGCTTCATGTCCCAATCCCCCAAGGCCCTGGATACCGATAAAACGCCGTTAAGATAGCCGCCATCAACGTAGCCACCCAACTCTTCCACCCGTTGCCTCTCCGAGGTATA
AATTGGCCGGTGGTCTTGAGACATATCTACCGCCTCTCCTTTCCGAGAGAGAACTGCCCGGCAATCTCCTACGTTAGCCACCATTAAAAGCCTGAAGAACAAAAACATGA
TCAGCCACAGAACTATGGCTCCCCATTTGTGCATTATGAACAAAAAAACAATTGCACTCATGACAGTAAAATTGGAAGAACATACATATTGGTAGGTTAACGAAACCTAT
AACAGCAGTCTACAGATACAGAGATTGCTATCAGGTATGAAGGTTCAAGAACATGGATACATTGGTACGCTACAGCTTACAAATGCAGAGATTGCTATCAGTTATAAAGG
ATCTTTACATGGGGGAGGAAGAACGCATACCTTCCAAGTATCAGAGCAGTCAGCGCCGTCGTCCCGGAAGAACTACTGACATCAGAATCATCTGCCAGAGCTTGATCAGC
CAGAAGAAACGCTTCGCGAAGAAAGTTTTCGATGTCCTCCGGCAAGACTTCATCAACATCTGGCATTAGAGGAAACCTAACATCCTCAAAGAATAGTCTTAGAACATTCT
TTCTGATATAAGCTGCTGCTTCAGGTCCTCCATGGCCATCAAATACCTGCTCAACATATAAAAAAACTCCACTCAGAATCAGATCACGAAAACTTCCGTATCCCTAGTTC
CTAGAAAGCACTTCTGACTCAAAACTTACCGCATAGAAAGCACTTGGCTTGGGAAACGTGAAGACCGAGGATCCTAAATGAGAAGACAGATCATCTATCCTTACATGTTC
ATCTTCCATGTATCTCCTAGGTCCAATATCAGCAAAGCTCCCCGACCGGAGGCTCGGAAGAAATTCCTCAACGCCGGCCGAACCGTCGTCGGAGTCCGGAATTTTCTTCA
TGGGTTTAATATCCTGAAGAGAGGAACGAACATCATATGAACTCAAGAGCAGAAGAGCCGAAAACCCCATCAGATCATATTTTTGCAGTCGCAGCTAGAACTAAATTATT
ACACTATACCCCAAAATCGGCCAAAGATTTTGAAGAAGAAGAAGGAAAACATACCGATTGAGAGCTAGACAACTCGGCGGAGACCGATTCCGAAGCTCGAACTCGGTCGA
GAGTCGGCGAAATCGCGTCGCCGATCGACTCAATTTGATCGAAATACTGCACCTCCAAAACCGGGAGGCTCTTCGGGCAAACAACAACCTCAGCTTCTGCTACCATATTT
GTTTTTTCTCCTTTTTCACATTAACCAGAAACGGAAACCAGGGATGATCGGCTCCTCGTGATCGGACTTCGAAATCGAGATCGTGTAATCAGAAGGGTCGGCGATGAAGA
AGAATATATAGAAGTCAGTGGAGAAGACGGGAAGCAGCCGCCGCGGGTGAGAGTAAATTCGTGGAAGGAGACGGAACCACGTAACGCCGTTACCGGGACTTGCCCCGTGA
ACACACGGTAACGTTTTCACGTCGACACGTGGCTGGGTTTTGCTTTTGTGAAATTACTTAAATGCGCTCCCTTTGAATCGCGTTGACTTTTTATTTTTGTTTTTTTATTT
TTATTATTTTTTTTCCTTCTTGGAGCGGGTATGGGTATGAGTGACGTAGGACCGTGGCAGAATTTTCTAGCGACTGAGGGGAAGGTTCTTTATCTTCCCACGCGCCGACG
TGGAAATTAAGTTATTTTTGTTGTTTATTTTATTAATGTGTTCAAATTATTGTGCAATAAAGGGGGGAATTTGAAAGTACATTAAAGAAATAAATTTTATAAATTAAATT
ATAA
Protein sequenceShow/hide protein sequence
MSFAAVTCNSNPNSPNLSHGSIKRRNGVLFMAIPTRRSINSKSPSLPELVFRIPRGSLKRSTNPNFPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPF
ISSVKVVEDNPSLSRWSLKYKAFGQDIEFSWLAKNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGSSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKK
YQTA