; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS002747 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS002747
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionDUF3082 domain-containing protein
Genome locationscaffold66:239531..242578
RNA-Seq ExpressionMS002747
SyntenyMS002747
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021434 - Protein of unknown function DUF3082


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600385.1 hypothetical protein SDJN03_05618, partial [Cucurbita argyrosperma subsp. sororia]2.2e-9875.62Show/hide
Query:  MLQTHNFLSSI-FPSTLSLTHKSCLSPPSLSSLHRPITFPFLSTHRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVATSVLLTG
        ML T N LSS  FP +LSLTH   LSPP  SSLHRPIT P      R     C PQ SELS+A A+F +DDGP+ELP TIFATTDDPSS+QVATSVLLTG
Subjt:  MLQTHNFLSSI-FPSTLSLTHKSCLSPPSLSSLHRPITFPFLSTHRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVATSVLLTG

Query:  AISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFSLDKGVLI
        AIS+FLFRSLRRRA+R KELKFRS GVKKSLKEEA++SLKAISTGPI+SKS PSP+QAFLGAIAAGVIALILYKFTTTIEAALNRQT+SDNFS+ +  + 
Subjt:  AISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFSLDKGVLI

Query:  ISVSFIRRTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQS
        I      RTIVNGICYLATFVFGINAVGLFLYSGQLA+NSIME+GS  KE AT  DKQVS PNSTVE  LD TESSS+KDDQS
Subjt:  ISVSFIRRTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQS

KAG7031048.1 hypothetical protein SDJN02_05087, partial [Cucurbita argyrosperma subsp. argyrosperma]4.4e-9973.84Show/hide
Query:  MLQTHNFLSSI-FPSTLSLTHKSCLSPPSLSSLHRPITFPFLSTHRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVATSVLLTG
        ML T N LSS  FP +LSLTH   LSPP  SSLHRPIT P      R     C PQ SELS+A A+F +DDGP+ELP TIFATTDDPSS+QVATSVLLTG
Subjt:  MLQTHNFLSSI-FPSTLSLTHKSCLSPPSLSSLHRPITFPFLSTHRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVATSVLLTG

Query:  AISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFSL-DKG--
        AIS+FLFRSLRRRA+R KELKFRS GVKKSLKEEA++SLKAISTGPI+SKS PSP+QAFLGAIAAGVIALILYKFTTTIEAALNRQT+SDNFS+  +G  
Subjt:  AISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFSL-DKG--

Query:  ----VLIISVSF------IR------RTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDD
            VLIIS+SF      +R      RTIVNGICYLATFVFGINAVGLFLYSGQLA+NSIME+GS  KE AT  DKQVS PNSTVET LD TESSS+KDD
Subjt:  ----VLIISVSF------IR------RTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDD

Query:  QS
        QS
Subjt:  QS

XP_022142844.1 uncharacterized protein LOC111012858 [Momordica charantia]2.0e-13695.47Show/hide
Query:  MLQTHNFLSSIFPSTLSLTHKSCLSPPSLSSLHRPITFPFLSTHRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVATSVLLTGA
        MLQTHNFLSSIFPSTLSLTHKSCLSPPSLSSLHRPITFPFLSTHRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVATSVLLTGA
Subjt:  MLQTHNFLSSIFPSTLSLTHKSCLSPPSLSSLHRPITFPFLSTHRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVATSVLLTGA

Query:  ISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFSLDKGVLII
        ISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFS+ +  + I
Subjt:  ISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFSLDKGVLII

Query:  SVSFIRRTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSNLQ
              RTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSNLQ
Subjt:  SVSFIRRTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSNLQ

XP_023535820.1 uncharacterized protein LOC111797135 [Cucurbita pepo subsp. pepo]4.1e-9776.14Show/hide
Query:  MLQTHNFLSSI-FPSTLSLTHKSCLSPPSLSSLHRPITFPFLSTHRRLRIQQ--CSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVATSVLL
        ML T N LSS  FP +LSLTH   LSPP  S+LHRPIT P +     LR QQ  C PQ SELS A A    DDGP+ELP TIFATTDDPSS+QVATSVLL
Subjt:  MLQTHNFLSSI-FPSTLSLTHKSCLSPPSLSSLHRPITFPFLSTHRRLRIQQ--CSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVATSVLL

Query:  TGAISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFSLDKGV
        TGAIS+FLFRSLRRRA+R KELKFRS GVKKSLKEEA++SLKAISTGPIESKS PSPIQ FLGAIAAGVIALILYKFTTTIEAALNRQT+SDNFS+ +  
Subjt:  TGAISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFSLDKGV

Query:  LIISVSFIRRTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQS
        + I      RTIVNGICYLATFVFGINAVGLFLYSGQLA+NS+ME+GS DKE AT  DKQVS PNSTVET LD TESSS+KDDQS
Subjt:  LIISVSFIRRTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQS

XP_038905614.1 uncharacterized protein LOC120091579 [Benincasa hispida]1.6e-9674.5Show/hide
Query:  MLQTHNFLSSIFP-STLSLT--HKSCLSPP-----SLSSLHRPITFPF---LSTHRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSL
        MLQT N LSS FP  TLSLT  HK  LSPP     S SSLHRPI F     L+TH       C PQ SEL  A ATF +D+GPVELP TIFATTDDPSSL
Subjt:  MLQTHNFLSSIFP-STLSLT--HKSCLSPP-----SLSSLHRPITFPF---LSTHRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSL

Query:  QVATSVLLTGAISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD
        QVATSVLLTGAIS+FLFRSLRRRA+R KELKFRS GVKKSLKEEA+DSLKAISTGPIESKSTPSPIQAFLGAIAAGVIA+ILYKFTTTIEAALNRQT+SD
Subjt:  QVATSVLLTGAISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD

Query:  NFSLDKGVLIISVSFIRRTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSNLQ
        NFS+ +  + I      RTIVNG+CYLATFVFGINA+GLFLYSGQLA+NS+ME+GS DKE     DKQVSPPNST ET L+STESS+++DDQSSSN Q
Subjt:  NFSLDKGVLIISVSFIRRTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSNLQ

TrEMBL top hitse value%identityAlignment
A0A1S4E0Y7 LOW QUALITY PROTEIN: uncharacterized protein LOC1034962101.6e-9472.79Show/hide
Query:  MLQTHNFLSSIFP----STLSLTHKSCLSPP-SLSSLHRPITFPFLS---THRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVA
        M  T N LSS  P    S  +  HK  LSPP +LSSLHRPITF  +S   THR      C PQ ++L  A ATF +D+GPVELPPTIFATTD+PSSLQVA
Subjt:  MLQTHNFLSSIFP----STLSLTHKSCLSPP-SLSSLHRPITFPFLS---THRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVA

Query:  TSVLLTGAISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFS
        TSVLLTGAIS+FLFRSLRRRA+R KELKFRS GVKKSLKEEA+DSLKAISTGPI SKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQT+SDNFS
Subjt:  TSVLLTGAISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFS

Query:  LDKGVLIISVSFIRRTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSNL
        + +  + I      RTIVNG+CYLATFVFGINA+GLFLYSGQLA+NS+ME+GS DKE     D+QVSPP ST ET L+STESS++KDDQSSSNL
Subjt:  LDKGVLIISVSFIRRTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSNL

A0A5A7UXD2 DUF3082 domain-containing protein1.6e-9472.79Show/hide
Query:  MLQTHNFLSSIFP----STLSLTHKSCLSPP-SLSSLHRPITFPFLS---THRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVA
        M  T N LSS  P    S  +  HK  LSPP +LSSLHRPITF  +S   THR      C PQ ++L  A ATF +D+GPVELPPTIFATTD+PSSLQVA
Subjt:  MLQTHNFLSSIFP----STLSLTHKSCLSPP-SLSSLHRPITFPFLS---THRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVA

Query:  TSVLLTGAISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFS
        TSVLLTGAIS+FLFRSLRRRA+R KELKFRS GVKKSLKEEA+DSLKAISTGPI SKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQT+SDNFS
Subjt:  TSVLLTGAISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFS

Query:  LDKGVLIISVSFIRRTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSNL
        + +  + I      RTIVNG+CYLATFVFGINA+GLFLYSGQLA+NS+ME+GS DKE     D+QVSPP ST ET L+STESS++KDDQSSSNL
Subjt:  LDKGVLIISVSFIRRTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSNL

A0A5B7BP03 Uncharacterized protein (Fragment)3.6e-7561.22Show/hide
Query:  MLQTHNFLSSIFPSTLSLTHKSCLSPPS----LSSLHRPITFPFLSTHRRLRIQQCSPQISELSEATATFDEDDGPVELP---PTIFATTDDPSSLQVAT
        MLQ+ + LSS FP  L   H    S  S    L+ L+RPIT   +S H R R +     ++++ E   T  ED+GP+ELP   P+IFA TDDPS+LQVAT
Subjt:  MLQTHNFLSSIFPSTLSLTHKSCLSPPS----LSSLHRPITFPFLSTHRRLRIQQCSPQISELSEATATFDEDDGPVELP---PTIFATTDDPSSLQVAT

Query:  SVLLTGAISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFSL
        SVLLTGAIS+FLFRSLRRRA+RAKELKFRS G KKSLKEEA+DSLKA++  P+++KS PSP+QA LG + AGVIALILYKFTTTIEAALNRQT+SDNFS+
Subjt:  SVLLTGAISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFSL

Query:  DKGVLIISVSFIRRTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSNLQ
         +  + I      RTIVNGICYLATFVFGIN+VGL LYSGQLA+NSIM D ST KET    + Q+S PNST ++  DS+E SS+  DQSS   Q
Subjt:  DKGVLIISVSFIRRTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSNLQ

A0A5D3CH90 DUF3082 domain-containing protein8.3e-9673.47Show/hide
Query:  MLQTHNFLSSIFP----STLSLTHKSCLSPP-SLSSLHRPITFPFLS---THRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVA
        M  T N LSS FP    S  +  HK  LSPP +LSSLHRPITF  +S   THR      C PQ ++L  A ATF +D+GPVELPPTIFATTD+PSSLQVA
Subjt:  MLQTHNFLSSIFP----STLSLTHKSCLSPP-SLSSLHRPITFPFLS---THRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVA

Query:  TSVLLTGAISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFS
        TSVLLTGAIS+FLFRSLRRRA+R KELKFRS GVKKSLKEEA+DSLKAISTGPI SKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQT+SDNFS
Subjt:  TSVLLTGAISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFS

Query:  LDKGVLIISVSFIRRTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSNL
        + +  + I      RTIVNG+CYLATFVFGINA+GLFLYSGQLA+NS+ME+GS DKE     D+QVSPP ST ET LDSTESS++KDDQSSSNL
Subjt:  LDKGVLIISVSFIRRTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSNL

A0A6J1CM21 uncharacterized protein LOC1110128589.7e-13795.47Show/hide
Query:  MLQTHNFLSSIFPSTLSLTHKSCLSPPSLSSLHRPITFPFLSTHRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVATSVLLTGA
        MLQTHNFLSSIFPSTLSLTHKSCLSPPSLSSLHRPITFPFLSTHRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVATSVLLTGA
Subjt:  MLQTHNFLSSIFPSTLSLTHKSCLSPPSLSSLHRPITFPFLSTHRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVATSVLLTGA

Query:  ISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFSLDKGVLII
        ISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFS+ +  + I
Subjt:  ISIFLFRSLRRRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFSLDKGVLII

Query:  SVSFIRRTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSNLQ
              RTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSNLQ
Subjt:  SVSFIRRTIVNGICYLATFVFGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSNLQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G15110.1 unknown protein6.4e-5655Show/hide
Query:  LSTHRRLRIQQCSPQISELSEATATFD----EDDGPVELPP----------TIFATTDDPSSLQVATSVLLTGAISIFLFRSLRRRARRAKELKFRSVGV
        LS+  R+R     P I  LS      D    E+DGP+ELP           +IFAT+DDP+ LQ+ATSVLLTGAI++FL RS+RRRA+RAKEL FRS G 
Subjt:  LSTHRRLRIQQCSPQISELSEATATFD----EDDGPVELPP----------TIFATTDDPSSLQVATSVLLTGAISIFLFRSLRRRARRAKELKFRSVGV

Query:  KKSLKEEALDSLKAISTGPIE-SKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFSLDKGVLIISVSFIRRTIVNGICYLATFVFGINA
        KKSLKEEA+D+LKA+S+ PIE   STPS  QAFLGAIAAGVIALILYKFT T+E+ LNRQT+SDNFS      +  ++   RTI+NGICYLATFVFG+NA
Subjt:  KKSLKEEALDSLKAISTGPIE-SKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFSLDKGVLIISVSFIRRTIVNGICYLATFVFGINA

Query:  VGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSN
         GL LYSGQLA N   ED + +   AT       P +S   ++ D++E + + +DQSS +
Subjt:  VGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGCAGACCCACAACTTTCTCTCCTCCATCTTCCCTTCCACTCTCTCTCTAACCCACAAATCCTGTCTCTCTCCTCCCTCCCTCTCCTCTCTCCACAGACCCATCAC
CTTCCCCTTTCTTTCCACCCACCGTCGCCTCAGAATTCAGCAATGTTCGCCCCAAATTTCTGAACTATCAGAGGCCACCGCCACTTTTGATGAAGACGACGGCCCAGTTG
AGCTTCCACCCACCATTTTTGCTACCACGGATGACCCTTCTTCTCTCCAAGTGGCTACCAGTGTTCTCCTCACGGGGGCCATCTCCATTTTCCTCTTCCGCTCCCTCCGC
CGCCGCGCTCGGCGGGCCAAAGAGCTGAAATTTAGGTCTGTTGGAGTAAAGAAGTCCTTGAAGGAGGAAGCATTGGATAGCTTGAAAGCAATCAGTACAGGTCCAATTGA
ATCTAAGTCTACACCTTCACCGATACAAGCATTCTTGGGAGCAATAGCAGCTGGTGTCATTGCGTTAATCTTATATAAGTTCACCACCACCATTGAAGCTGCTCTGAACC
GACAGACAATGTCCGATAACTTCTCGCTCGATAAAGGAGTATTGATCATTTCTGTATCTTTTATTCGCAGAACTATTGTGAACGGAATATGCTACCTTGCAACATTTGTT
TTTGGAATTAATGCTGTTGGTTTGTTCCTTTACTCCGGTCAGTTGGCCGTAAATTCCATAATGGAAGATGGTTCCACAGATAAAGAAACTGCAACTATAGTTGACAAGCA
AGTTAGCCCACCAAATTCAACGGTTGAAACAGCGCTTGATAGCACCGAATCAAGCAGCAACAAGGATGATCAAAGTTCAAGTAATTTGCAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGCAGACCCACAACTTTCTCTCCTCCATCTTCCCTTCCACTCTCTCTCTAACCCACAAATCCTGTCTCTCTCCTCCCTCCCTCTCCTCTCTCCACAGACCCATCAC
CTTCCCCTTTCTTTCCACCCACCGTCGCCTCAGAATTCAGCAATGTTCGCCCCAAATTTCTGAACTATCAGAGGCCACCGCCACTTTTGATGAAGACGACGGCCCAGTTG
AGCTTCCACCCACCATTTTTGCTACCACGGATGACCCTTCTTCTCTCCAAGTGGCTACCAGTGTTCTCCTCACGGGGGCCATCTCCATTTTCCTCTTCCGCTCCCTCCGC
CGCCGCGCTCGGCGGGCCAAAGAGCTGAAATTTAGGTCTGTTGGAGTAAAGAAGTCCTTGAAGGAGGAAGCATTGGATAGCTTGAAAGCAATCAGTACAGGTCCAATTGA
ATCTAAGTCTACACCTTCACCGATACAAGCATTCTTGGGAGCAATAGCAGCTGGTGTCATTGCGTTAATCTTATATAAGTTCACCACCACCATTGAAGCTGCTCTGAACC
GACAGACAATGTCCGATAACTTCTCGCTCGATAAAGGAGTATTGATCATTTCTGTATCTTTTATTCGCAGAACTATTGTGAACGGAATATGCTACCTTGCAACATTTGTT
TTTGGAATTAATGCTGTTGGTTTGTTCCTTTACTCCGGTCAGTTGGCCGTAAATTCCATAATGGAAGATGGTTCCACAGATAAAGAAACTGCAACTATAGTTGACAAGCA
AGTTAGCCCACCAAATTCAACGGTTGAAACAGCGCTTGATAGCACCGAATCAAGCAGCAACAAGGATGATCAAAGTTCAAGTAATTTGCAG
Protein sequenceShow/hide protein sequence
MLQTHNFLSSIFPSTLSLTHKSCLSPPSLSSLHRPITFPFLSTHRRLRIQQCSPQISELSEATATFDEDDGPVELPPTIFATTDDPSSLQVATSVLLTGAISIFLFRSLR
RRARRAKELKFRSVGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDNFSLDKGVLIISVSFIRRTIVNGICYLATFV
FGINAVGLFLYSGQLAVNSIMEDGSTDKETATIVDKQVSPPNSTVETALDSTESSSNKDDQSSSNLQ