CuGenDBv2

Moc07g01150 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g01150
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Trigger_N domain-containing protein
Genome location: chr7:948556..952876
RNA-Seq Expression: Moc07g01150
Synteny: Moc07g01150
Gene Ontology terms:
GO:0000413 - protein peptidyl-prolyl isomerization (biological process)
GO:0015031 - protein transport (biological process)
GO:0043335 - protein unfolding (biological process)
GO:0051083 - 'de novo' cotranslational protein folding (biological process)
GO:0061077 - chaperone-mediated protein folding (biological process)
GO:0003755 - peptidyl-prolyl cis-trans isomerase activity (molecular function)
GO:0043022 - ribosome binding (molecular function)
GO:0044183 - protein folding chaperone (molecular function)
InterPro domains:
IPR005215 - Trigger factor
IPR008881 - Trigger factor, ribosome-binding, bacterial
IPR036611 - Trigger factor ribosome-binding domain superfamily
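
The genome location above is written in `chr:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the string could be split and the gene span computed as follows. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr7:948556..952876")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under a 0-based, half-open convention the span would instead be `end - start`, so the convention should be confirmed before the value is relied on.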


Homology
GenBank top hits | e value | %identity | Alignment
XP_008460020.1 PREDICTED: uncharacterized protein LOC103498959 [Cucumis melo] | 3.9e-98 | 82.89
Query:  ASATATVTNLA-SEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV
        A+ATATVTN+A SE+ R  F KV VKGYC + L C N GGVEFDNVQ+GLCRRSSFS S H++GSRFLS+PTSIASSGLEAA+TDYKGNAI LKNAK+VV
Subjt:  ASATATVTNLA-SEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV

Query:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTA
        ESE+EN IQLRVDL+GDETQK+FDQVLTNLARSAPPMPGFR QKGGKTSNVPKSFLLEVLG++RVTKFVIQEILNSTMADYAKKEN+NVKD KVNTTQTA
Subjt:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTA

Query:  DELKLLFTPGKEFGFNAILELESANSED
        DELK+LF PGKEFGFNAILELESA+  +
Subjt:  DELKLLFTPGKEFGFNAILELESANSED

XP_011656737.1 uncharacterized protein LOC101212225 [Cucumis sativus] | 4.6e-99 | 82.82
Query:  ASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVVE
        A+ATATVTN+ASE+ R  F KVPVKGYC + L C N GGVEFDNV++GLCRRSSFS S HK+GSRFLS+PTSIASSGLEAAITDYKGN I LKNAK+VVE
Subjt:  ASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVVE

Query:  SEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTAD
        SE+EN IQLRVDL+GDETQK+FDQVLTNLARSAPPMPGFR QKGGKTSNVPKSFLLEVLG++RVTKF+IQEILNSTM DYAKKEN+NVKDKKV+TTQTAD
Subjt:  SEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTAD

Query:  ELKLLFTPGKEFGFNAILELESANSED
        ELK+LF PGKEFGFNAILELESA+  +
Subjt:  ELKLLFTPGKEFGFNAILELESANSED

XP_022155406.1 uncharacterized protein LOC111022553 [Momordica charantia] | 2.1e-120 | 100
Query:  MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV
        MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV
Subjt:  MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV

Query:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTA
        ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTA
Subjt:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTA

Query:  DELKLLFTPGKEFGFNAILELESANSEDES
        DELKLLFTPGKEFGFNAILELESANSEDES
Subjt:  DELKLLFTPGKEFGFNAILELESANSEDES

XP_023513631.1 uncharacterized protein LOC111778179 isoform X1 [Cucurbita pepo subsp. pepo] | 3.9e-90 | 81.53
Query:  MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV
        MASATATV N+A E+RR  F KVPV G  R+ L C N GGVEF+NVQ+ LC RSS S SP  +GSRFLSRPT+IASSGLEAAITDYKG AI LKNAKIVV
Subjt:  MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV

Query:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTA
        ESEDE++IQLRVDL+GDETQK+FDQVLTNLARSAPPMPGFR QKGGKTSNVPKSFLLEVLG++RVTKFVIQEILNSTMADYAKKE++ VKDK VNTTQT 
Subjt:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTA

Query:  DELKLLFTPGKEFGFNAILELE
        DELK+LF PGKEFGFNAILELE
Subjt:  DELKLLFTPGKEFGFNAILELE

XP_038906537.1 uncharacterized protein LOC120092511 [Benincasa hispida] | 3.2e-100 | 84.14
Query:  ASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVVE
        A+ATAT TN+ASE+R   F +VPVKGYCR+ L C N GGVEFDNVQ+GLCRRSSFS S HK+GSRFLS+PTSIASSGLEAAITDYKGNAI LKNAK+VVE
Subjt:  ASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVVE

Query:  SEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTAD
        SE+EN IQLRVDL+GDETQK+FDQVLTNLARSAPPMPGFR QKGGKTSNVPKSFLLEVLG++RV KFVIQEILNSTMADYAKKEN+NVKDKKVNTTQTAD
Subjt:  SEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTAD

Query:  ELKLLFTPGKEFGFNAILELESANSED
        ELK+LF+PGKEFGFNAILELESA+  +
Subjt:  ELKLLFTPGKEFGFNAILELESANSED

TrEMBL top hits | e value | %identity | Alignment
A0A0A0K988 Trigger_N domain-containing protein | 2.2e-99 | 82.82
Query:  ASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVVE
        A+ATATVTN+ASE+ R  F KVPVKGYC + L C N GGVEFDNV++GLCRRSSFS S HK+GSRFLS+PTSIASSGLEAAITDYKGN I LKNAK+VVE
Subjt:  ASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVVE

Query:  SEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTAD
        SE+EN IQLRVDL+GDETQK+FDQVLTNLARSAPPMPGFR QKGGKTSNVPKSFLLEVLG++RVTKF+IQEILNSTM DYAKKEN+NVKDKKV+TTQTAD
Subjt:  SEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTAD

Query:  ELKLLFTPGKEFGFNAILELESANSED
        ELK+LF PGKEFGFNAILELESA+  +
Subjt:  ELKLLFTPGKEFGFNAILELESANSED

A0A1S3CBN2 uncharacterized protein LOC103498959 | 1.9e-98 | 82.89
Query:  ASATATVTNLA-SEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV
        A+ATATVTN+A SE+ R  F KV VKGYC + L C N GGVEFDNVQ+GLCRRSSFS S H++GSRFLS+PTSIASSGLEAA+TDYKGNAI LKNAK+VV
Subjt:  ASATATVTNLA-SEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV

Query:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTA
        ESE+EN IQLRVDL+GDETQK+FDQVLTNLARSAPPMPGFR QKGGKTSNVPKSFLLEVLG++RVTKFVIQEILNSTMADYAKKEN+NVKD KVNTTQTA
Subjt:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTA

Query:  DELKLLFTPGKEFGFNAILELESANSED
        DELK+LF PGKEFGFNAILELESA+  +
Subjt:  DELKLLFTPGKEFGFNAILELESANSED

A0A6J1DRL2 uncharacterized protein LOC111022553 | 1.0e-120 | 100
Query:  MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV
        MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV
Subjt:  MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV

Query:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTA
        ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTA
Subjt:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTA

Query:  DELKLLFTPGKEFGFNAILELESANSEDES
        DELKLLFTPGKEFGFNAILELESANSEDES
Subjt:  DELKLLFTPGKEFGFNAILELESANSEDES

A0A6J1H8V8 uncharacterized protein LOC111461115 | 5.5e-90 | 81.98
Query:  MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV
        MASATATV N+ASE+RR  F KVPV G  R+ L C N GGVEF+NVQ+ LC RSS S SP  +GSRFLSRPT+IASSGLEAAITDYKG AI LKNAKIVV
Subjt:  MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVV

Query:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTA
        ESEDEN IQLRVDL+GDETQK+FDQVLTNLARSAP MPGFR QKGGKTSNVPKSFLLEVLG++RVTKFVIQEILNSTM DYAKKEN+ VKDK VNTTQT 
Subjt:  ESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTA

Query:  DELKLLFTPGKEFGFNAILELE
        DELK LF PGKEFGFNAILELE
Subjt:  DELKLLFTPGKEFGFNAILELE

A0A6J1KVH6 uncharacterized protein LOC111497577 | 2.1e-89 | 78.63
Query:  ASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVVE
        A+ATATV N+ASE+RR  F KVPV G  R+ L C N GGVEF+NVQ+ LC RSS S S   +GSRFLSRPT+IASSGLEAAITDYKG AI LKNAKIVVE
Subjt:  ASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVVE

Query:  SEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTAD
        SEDEN IQLRV+L+GDETQK+FDQVLTNLARSAPPMPGFR QKGGKTSNVPKSFLLEVLG++RVTKFVIQEILNSTMADYAKKEN+ VKDK VNTTQT D
Subjt:  SEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTAD

Query:  ELKLLFTPGKEFGFNAILELE-----SANSEDES
        ELK+LF PGKEFGFNAILELE      + SEDE+
Subjt:  ELKLLFTPGKEFGFNAILELE-----SANSEDES

SwissProt top hits | e value | %identity | Alignment
B1XL18 Trigger factor | 1.0e-05 | 25.6
Query:  KIVVESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNT
        K++ E    + + L +++  D TQK +D  +  LAR+   +PGFR  K      VPK  L++ LG  R+   V++++++ ++     +EN+         
Subjt:  KIVVESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNT

Query:  TQTADELKLLFTPGKEFGFNAILEL
          + D+L   + PG+   F A +++
Subjt:  TQTADELKLLFTPGKEFGFNAILEL

Q3M725 Trigger factor | 4.0e-05 | 26.98
Query:  KIVVESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNT
        K+  E    + I L ++++ + TQK ++QV+ NL+R+   +PGFR  K      VP+  LL+ LG+  +    ++E+L   +    K+E++    +    
Subjt:  KIVVESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNT

Query:  TQTADELKLLFTPGKEFGFNAILELE
          + D+L   + PG+   F A +++E
Subjt:  TQTADELKLLFTPGKEFGFNAILELE

Q8YQX9 Trigger factor | 4.0e-05 | 26.98
Query:  KIVVESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNT
        K+  E    + I L ++++ + TQK ++QV+ NL+R+   +PGFR  K      VP+  LL+ LG+  +    ++E+L   +    K+E++    +    
Subjt:  KIVVESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNT

Query:  TQTADELKLLFTPGKEFGFNAILELE
          + D+L   + PG+   F A +++E
Subjt:  TQTADELKLLFTPGKEFGFNAILELE

Arabidopsis top hits | e value | %identity | Alignment
AT2G30695.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: protein folding, protein transport; LOCATED IN: chloroplast stroma, chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Trigger factor, ribosome-binding, bacterial (InterPro:IPR008881); Has 253 Blast hits to 253 proteins in 72 species: Archae - 0; Bacteria - 138; Metazoa - 0; Fungi - 0; Plants - 40; Viruses - 0; Other Eukaryotes - 75 (source: NCBI BLink). | 3.9e-24 | 41.27
Query:  VESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQT
        VE+E  N +++ V +SG++TQ +F+ V   +  +A P+PGFR  KGGKT N+PK  LLE+LG  +V K VI++++NS + DY K+E++ V  K++   Q+
Subjt:  VESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQT

Query:  ADELKLLFTPGKEFGFNAILELESAN
         ++L+  F PG+ F F+A ++L+ A+
Subjt:  ADELKLLFTPGKEFGFNAILELESAN

AT2G30695.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: protein folding, protein transport; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Trigger factor, ribosome-binding, bacterial (InterPro:IPR008881); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). | 3.9e-24 | 41.27
Query:  VESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQT
        VE+E  N +++ V +SG++TQ +F+ V   +  +A P+PGFR  KGGKT N+PK  LLE+LG  +V K VI++++NS + DY K+E++ V  K++   Q+
Subjt:  VESEDENLIQLRVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQT

Query:  ADELKLLFTPGKEFGFNAILELESAN
         ++L+  F PG+ F F+A ++L+ A+
Subjt:  ADELKLLFTPGKEFGFNAILELESAN


Sequences
CDS sequence
ATGGCGTCCGCAACCGCAACGGTAACCAATCTCGCCTCAGAATACCGGCGGGCAAGCTTTATCAAAGTTCCTGTTAAAGGCTACTGCCGCAGTGTTCTGGCCTGTTCGAA
CTCCGGAGGCGTTGAATTCGATAATGTACAAAATGGGCTGTGCAGGAGGTCGTCATTTTCTTGTAGTCCTCACAAATTGGGTTCCAGATTTTTGTCCAGACCGACTTCAA
TAGCTAGTTCAGGTTTGGAGGCAGCCATCACAGATTACAAAGGCAATGCAATAGCCTTAAAAAATGCTAAGATAGTTGTAGAGTCTGAAGATGAAAACTTGATACAGCTT
CGAGTGGACTTGAGTGGGGACGAGACACAAAAAATTTTCGATCAGGTTTTGACAAATTTGGCCCGTTCCGCACCGCCAATGCCAGGATTTCGTATGCAAAAAGGAGGGAA
AACATCAAATGTCCCAAAAAGCTTCCTATTAGAAGTCCTTGGTGAGGAGCGTGTCACAAAGTTTGTCATACAAGAAATATTGAACTCAACCATGGCAGATTATGCAAAGA
AGGAAAATGTAAATGTGAAGGACAAGAAGGTTAACACAACACAAACAGCAGATGAACTGAAACTGTTGTTTACTCCAGGAAAAGAGTTTGGATTCAATGCCATACTTGAG
CTTGAATCTGCTAATTCAGAAGATGAAAGTTGA
mRNA sequence
ATGGCGTCCGCAACCGCAACGGTAACCAATCTCGCCTCAGAATACCGGCGGGCAAGCTTTATCAAAGTTCCTGTTAAAGGCTACTGCCGCAGTGTTCTGGCCTGTTCGAA
CTCCGGAGGCGTTGAATTCGATAATGTACAAAATGGGCTGTGCAGGAGGTCGTCATTTTCTTGTAGTCCTCACAAATTGGGTTCCAGATTTTTGTCCAGACCGACTTCAA
TAGCTAGTTCAGGTTTGGAGGCAGCCATCACAGATTACAAAGGCAATGCAATAGCCTTAAAAAATGCTAAGATAGTTGTAGAGTCTGAAGATGAAAACTTGATACAGCTT
CGAGTGGACTTGAGTGGGGACGAGACACAAAAAATTTTCGATCAGGTTTTGACAAATTTGGCCCGTTCCGCACCGCCAATGCCAGGATTTCGTATGCAAAAAGGAGGGAA
AACATCAAATGTCCCAAAAAGCTTCCTATTAGAAGTCCTTGGTGAGGAGCGTGTCACAAAGTTTGTCATACAAGAAATATTGAACTCAACCATGGCAGATTATGCAAAGA
AGGAAAATGTAAATGTGAAGGACAAGAAGGTTAACACAACACAAACAGCAGATGAACTGAAACTGTTGTTTACTCCAGGAAAAGAGTTTGGATTCAATGCCATACTTGAG
CTTGAATCTGCTAATTCAGAAGATGAAAGTTGA
Protein sequence
MASATATVTNLASEYRRASFIKVPVKGYCRSVLACSNSGGVEFDNVQNGLCRRSSFSCSPHKLGSRFLSRPTSIASSGLEAAITDYKGNAIALKNAKIVVESEDENLIQL
RVDLSGDETQKIFDQVLTNLARSAPPMPGFRMQKGGKTSNVPKSFLLEVLGEERVTKFVIQEILNSTMADYAKKENVNVKDKKVNTTQTADELKLLFTPGKEFGFNAILE
LESANSEDES
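
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code applies to this gene; the function name `translate` is illustrative only. To keep it short, it translates only the first 36 and last 33 nucleotides of the CDS listed above rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped input
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 36 nt and final 33 nt of the CDS shown above (both in frame).
print(translate("ATGGCGTCCGCAACCGCAACGGTAACCAATCTCGCC"))  # MASATATVTNLA
print(translate("CTTGAATCTGCTAATTCAGAAGATGAAAGTTGA"))     # LESANSEDES
```

Both fragments agree with the start and end of the listed protein sequence. Passing the full wrapped CDS to `translate` also works, since whitespace is removed before translation.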