; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg001781 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg001781
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionoral cancer-overexpressed protein 1 homolog
Genome locationscaffold8:31904550..31921752
RNA-Seq ExpressionSpg001781
SyntenySpg001781
Gene Ontology termsGO:0005542 - folic acid binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR019191 - Essential protein Yae1, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047887.1 formimidoyltransferase-cyclodeaminase isoform X2 [Cucumis melo var. makuwa]8.5e-5785.61Show/hide
Query:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL
        SSLNLEE H KEGYA GY+DGLVAGKEE +QVGLKVGFEVGEE+GFY GCVDVWNSAIRI+PERFS+RVRKSVK +EEL+EKYPLQDPENEQVQELMEGL
Subjt:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL

Query:  RLKFRAICATLGVKLEYNGYPKSASDGNEIDF
        RLKFRAI ATLGVKLEYNGYP+S SDG +ID+
Subjt:  RLKFRAICATLGVKLEYNGYPKSASDGNEIDF

XP_022947461.1 oral cancer-overexpressed protein 1 homolog [Cucurbita moschata]2.5e-5687.69Show/hide
Query:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL
        SSLNLEETH KEGYA+GYKDGLVAGKEE KQVGLKVGFEVGEE+GF+ GCVDVW S I IDPERFS RV+KSVKQMEELVE+YPLQDPENEQVQELMEGL
Subjt:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL

Query:  RLKFRAICATLGVKLEYNGYPKSASDGNEI
        RLKFR ICATLGVKLEY+G+PKSASDG EI
Subjt:  RLKFRAICATLGVKLEYNGYPKSASDGNEI

XP_022970845.1 oral cancer-overexpressed protein 1 homolog [Cucurbita maxima]4.2e-5686.92Show/hide
Query:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL
        SSLNLEETH KEGYA+GYKDGLVAGKEE KQVGLKVGFEVGEE+GF+ GCVDVWNS IRI PERFS RV+KSVKQMEELVE+YPL+DPENEQVQE+MEGL
Subjt:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL

Query:  RLKFRAICATLGVKLEYNGYPKSASDGNEI
        RLKFR ICATLGVKLEY+G+PKSASDG EI
Subjt:  RLKFRAICATLGVKLEYNGYPKSASDGNEI

XP_023532878.1 oral cancer-overexpressed protein 1 homolog [Cucurbita pepo subsp. pepo]2.5e-5687.69Show/hide
Query:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL
        SSLNLEETH KEGYA+GYKDGLVAGKEE KQVGLKVGFEVGEE+GF+ GCVDVW S I IDPERFS RV+KSVKQMEELVE+YPLQDPENEQVQELMEGL
Subjt:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL

Query:  RLKFRAICATLGVKLEYNGYPKSASDGNEI
        RLKFR ICATLGVKLEY+G+PKSASDG EI
Subjt:  RLKFRAICATLGVKLEYNGYPKSASDGNEI

XP_038902357.1 formimidoyltransferase-cyclodeaminase-like [Benincasa hispida]1.2e-5889.39Show/hide
Query:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL
        SSLNLEETH KEG+A+GYKDGLVAGKEE +QVGLKVGFEVGEELGFY GCVDVWNS IRI+PERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL
Subjt:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL

Query:  RLKFRAICATLGVKLEYNGYPKSASDGNEIDF
        RLKFRAI ATLGVKLEY GYPKS SDG +I+F
Subjt:  RLKFRAICATLGVKLEYNGYPKSASDGNEIDF

TrEMBL top hitse value%identityAlignment
A0A0A0KE52 Yae1_N domain-containing protein2.4e-5787.12Show/hide
Query:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL
        SSLNLEE H KEGYA GYKDGLVAGKEE +QVGLKVGFEVGEELGFY GCVDVWNS IRI+PERFSIRVRKSVK MEEL+EKYPLQDPENEQVQELMEGL
Subjt:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL

Query:  RLKFRAICATLGVKLEYNGYPKSASDGNEIDF
        RLKFRA+ ATLGVKLEY+GYPKS SDG +I+F
Subjt:  RLKFRAICATLGVKLEYNGYPKSASDGNEIDF

A0A5D3BVB4 Formimidoyltransferase-cyclodeaminase isoform X24.1e-5785.61Show/hide
Query:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL
        SSLNLEE H KEGYA GY+DGLVAGKEE +QVGLKVGFEVGEE+GFY GCVDVWNSAIRI+PERFS+RVRKSVK +EEL+EKYPLQDPENEQVQELMEGL
Subjt:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL

Query:  RLKFRAICATLGVKLEYNGYPKSASDGNEIDF
        RLKFRAI ATLGVKLEYNGYP+S SDG +ID+
Subjt:  RLKFRAICATLGVKLEYNGYPKSASDGNEIDF

A0A6J1CFK0 formimidoyltransferase-cyclodeaminase-like isoform X29.4e-5492.37Show/hide
Query:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL
        SSLNLEETH KEGYA+GYKDGLVAGKEE KQVGLKVGF+VGEELGFY GCVD WNSAI IDPERFS+RVRKS+KQMEELVEKYPLQDPENEQVQELMEGL
Subjt:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL

Query:  RLKFRAICATLGVKLEYN
        RLKFRAICATLGVKLEYN
Subjt:  RLKFRAICATLGVKLEYN

A0A6J1G6H8 oral cancer-overexpressed protein 1 homolog1.2e-5687.69Show/hide
Query:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL
        SSLNLEETH KEGYA+GYKDGLVAGKEE KQVGLKVGFEVGEE+GF+ GCVDVW S I IDPERFS RV+KSVKQMEELVE+YPLQDPENEQVQELMEGL
Subjt:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL

Query:  RLKFRAICATLGVKLEYNGYPKSASDGNEI
        RLKFR ICATLGVKLEY+G+PKSASDG EI
Subjt:  RLKFRAICATLGVKLEYNGYPKSASDGNEI

A0A6J1I414 oral cancer-overexpressed protein 1 homolog2.0e-5686.92Show/hide
Query:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL
        SSLNLEETH KEGYA+GYKDGLVAGKEE KQVGLKVGFEVGEE+GF+ GCVDVWNS IRI PERFS RV+KSVKQMEELVE+YPL+DPENEQVQE+MEGL
Subjt:  SSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGL

Query:  RLKFRAICATLGVKLEYNGYPKSASDGNEI
        RLKFR ICATLGVKLEY+G+PKSASDG EI
Subjt:  RLKFRAICATLGVKLEYNGYPKSASDGNEI

SwissProt top hitse value%identityAlignment
Q75JW3 Protein LTO1 homolog2.3e-0930.77Show/hide
Query:  EEDGAISIVSKVYSSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPE------------RFSIRVRKSVKQ
        E D  +S+ S  Y S        KE   +G  DG   G  EG Q+G + G E+G+E+G+Y  CV VWN  + I+              +FS+R  +++++
Subjt:  EEDGAISIVSKVYSSLNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPE------------RFSIRVRKSVKQ

Query:  MEELVEKYPLQDPENEQVQELMEGLRLKFRAICATLGVKLEYN
        + +L+E Y L D  +E +   +  +RLKF+     LG++ + N
Subjt:  MEELVEKYPLQDPENEQVQELMEGLRLKFRAICATLGVKLEYN

Q8CH62 Protein LTO1 homolog3.6e-1031.03Show/hide
Query:  EETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGLRLKFR
        +E    EGY +GY++G   G  EGK+ G+  G ++G E+G Y G    W   +         R  K V+ +  L++ +P  DP  E++ E ++ +R KFR
Subjt:  EETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGLRLKFR

Query:  AICATLGVKLEYNGYP
         +C+ L V+ ++   P
Subjt:  AICATLGVKLEYNGYP

Q8WV07 Protein LTO1 homolog1.1e-0927.56Show/hide
Query:  EETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGLRLKFR
        +E    EGY +GY++G   G  EG+Q G   G ++G E+G Y G    W   +         R  K ++ +  +++K+P  DP  +++ E ++ +R KF+
Subjt:  EETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGLRLKFR

Query:  AICATLGVKLEYNGYPKSASDGNEIDF
          C+ L V+ ++    K +++G+ + F
Subjt:  AICATLGVKLEYNGYPKSASDGNEIDF

Arabidopsis top hitse value%identityAlignment
AT2G20830.2 transferases;folic acid binding2.0e-2750Show/hide
Query:  LNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGLRL
        + LEETH ++G+ +GY++GLV+G+E+ + +GLK+GFE GE +GFY GC  +WNSA+RIDP RFS ++ K +     L++K PL DPE+E    + + LR+
Subjt:  LNLEETHQKEGYAKGYKDGLVAGKEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGLRL

Query:  KFRAICATLG
        KF  ICA+LG
Subjt:  KFRAICATLG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATTAATCATTCTCCCCACCTTAAAAGCTTTCGTCCTCGAAAGACTACTGCTCAAACAGCTCTGGGTACTTGGATCTATTTTCGACTTCTCTTTCACCAGCCCCCC
CCTGCCGATCTCGAGGCGCCGTCGTCGTCTGGGTCGCGTCGCCGCCCCGTTCCGTCAGTGGTGTCGACGCAGCAAGCCACCGCGTGGTTCTTCTTCCCCCGCGAGTTCTC
TCTTCGCGGATCCCTCTCTCTGCACCGTCTCTCTCTCGCTCCCATCGATTTCTCTCCCTCCGTCGCAGCCCGAGTGCCGCCGCTCAGACCTGCAGCCGTTGCCGCTGCCC
CTTGCAACCGTTGCTCAAGTTCAGCCGCCGCTTGCTCGCGCCGCCGCCTTCCTCTCTGTTTCGTGGGTTAGATTCGTGGCTCTCGCTCTCTCTGTCCATGCGTTTTTGGC
CGAGATAAGTCGTGGATCTCACGCGGACAGCAGCTCGAAGCCCCGTCTCCTCCACGTTTCAGCCTTAAAAATATTGTATCTCGCGTGTCTAGCAATTCGGAGTCCCGTCG
ACCTCGTTTCAGTCGATTCCGCCTCTGTCCAACAGCATTCTTGGTTGTTGTTGGCGTCGTTGGGCGTTTCCGCGCCGTCTGAGTGTTCGATTAAGTTCGAAACGCTTCGA
CTTGGATACCCACTGCCCAAGGAGCGTTCTAGCTCGCTGGTTGTGGTTTGTAAAACCCGTCTAACGTGGAAGCGGGTTCGTTTGGCTGCTGAAAAGCCTTATCTGGAGTC
GTTTGGCGAACACCCATTGCCCGTTTTCGAGGTGCTTGTGGAGCTTGGTGAGTGTGGCCTCGTTGGGTTTTGGAAGCGTGTTTTCATGAGATATGTGACTGTCTTGCTGT
GGTTGTTGATTAAACATGTTGTGTTGGGTGTCAAGCATGTGCACAGCGTGGCTCGTAATTCACGTGATTGGTTTTGCGTAGCGTGGCGCGTAATGCACGTGTTTGTCTTG
TTAGAGGAGATAGGTTGTGAAATCTGGTTGAAGTTTAGTGGAGTTGGGTTGGTTGCCTGTTTAGAAGTGCTAAATGTCTTTTATCAGCCCTCGGGCTTTGTTACCTTCAA
AGCCCACCAAGAGCTCTCTGATTTGGAGATTTTTCGGATATCTCGTTGCCTCGAGGATGCTGCTCTCTTGATTTTGGCAGAAGAGAACCACCCACGTGGACAAGACCATA
TCTTTGCCAAAATTGAAGCCATCCTGACAGCCCCAAAAAGCAATTTTGGACCACCCGACGCACAAGGGGCTGACGAGGACGTCCGGGCGAAAATAGGGCTAGGAGATCGA
CCCAGAGGAAAAGCCGACCAAAGGGCCGGGCCAACTTGGCCCGACCCATATGGTCGGCCTCGGCCCAAGGCCGAGGCTGACCATTCGGCCCGCTTGCGCGGGCAGAGCTC
GGTCACCTCCTCTCGGTCCCTGATGCCTCTAGCCGCCCCGGTTTCCCCTGGTTTCATCGGAGGCGGTGTGGCTAGCACCACACCGGTGTGCAGGTTTTCAGTTTTGCAGG
CCACGTCTTCCCCCTCATCTACAAATTTACCGTTGGTGGCACGTGAAGGTCAGCGAACCCTCTGGCAAACTGGAATTGCCGGTCCCCAACACGAGACCCAGCTCTCTAGC
ACCTTCGAACTGTCGGCTCATTCCTTATATTTCGATGCTGCTCCCGTCGCCGAATGCCAGCGACAGAAGGATGAGCAGATTCCGGCCATTCGCCGCCATGACGGCGTAGA
TTCCATGAGCCCGGCCGAGCACCGTCGAGTTGATAGCTGGGCCGGCGATTTCAGATTCTACAATACCCAAAACTTGTTTTTCAGCAATGGCGAGGGAAGGGGAGATGAGG
AAGACGGCGCCATTTCTATAGTGTCCAAGGTCTATTCCTCTCTCAATCTTGAAGAGACCCACCAGAAGGAAGGCTACGCCAAGGGCTACAAAGATGGCCTCGTGGCTGGC
AAAGAAGAGGGAAAACAAGTGGGCCTTAAAGTTGGTTTCGAGGTAGGCGAGGAACTGGGATTCTACAGTGGGTGTGTGGACGTGTGGAATTCTGCAATTCGGATCGACCC
AGAACGGTTTTCGATTCGGGTCCGGAAGAGTGTGAAGCAGATGGAGGAGTTGGTGGAGAAGTACCCGCTTCAGGACCCTGAGAATGAGCAAGTTCAGGAGCTGATGGAAG
GGTTGAGGCTCAAGTTCAGAGCGATTTGCGCCACTCTTGGTGTCAAATTGGAGTATAATGGCTATCCGAAATCGGCTTCAGATGGAAACGAGATTGATTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAATTAATCATTCTCCCCACCTTAAAAGCTTTCGTCCTCGAAAGACTACTGCTCAAACAGCTCTGGGTACTTGGATCTATTTTCGACTTCTCTTTCACCAGCCCCCC
CCTGCCGATCTCGAGGCGCCGTCGTCGTCTGGGTCGCGTCGCCGCCCCGTTCCGTCAGTGGTGTCGACGCAGCAAGCCACCGCGTGGTTCTTCTTCCCCCGCGAGTTCTC
TCTTCGCGGATCCCTCTCTCTGCACCGTCTCTCTCTCGCTCCCATCGATTTCTCTCCCTCCGTCGCAGCCCGAGTGCCGCCGCTCAGACCTGCAGCCGTTGCCGCTGCCC
CTTGCAACCGTTGCTCAAGTTCAGCCGCCGCTTGCTCGCGCCGCCGCCTTCCTCTCTGTTTCGTGGGTTAGATTCGTGGCTCTCGCTCTCTCTGTCCATGCGTTTTTGGC
CGAGATAAGTCGTGGATCTCACGCGGACAGCAGCTCGAAGCCCCGTCTCCTCCACGTTTCAGCCTTAAAAATATTGTATCTCGCGTGTCTAGCAATTCGGAGTCCCGTCG
ACCTCGTTTCAGTCGATTCCGCCTCTGTCCAACAGCATTCTTGGTTGTTGTTGGCGTCGTTGGGCGTTTCCGCGCCGTCTGAGTGTTCGATTAAGTTCGAAACGCTTCGA
CTTGGATACCCACTGCCCAAGGAGCGTTCTAGCTCGCTGGTTGTGGTTTGTAAAACCCGTCTAACGTGGAAGCGGGTTCGTTTGGCTGCTGAAAAGCCTTATCTGGAGTC
GTTTGGCGAACACCCATTGCCCGTTTTCGAGGTGCTTGTGGAGCTTGGTGAGTGTGGCCTCGTTGGGTTTTGGAAGCGTGTTTTCATGAGATATGTGACTGTCTTGCTGT
GGTTGTTGATTAAACATGTTGTGTTGGGTGTCAAGCATGTGCACAGCGTGGCTCGTAATTCACGTGATTGGTTTTGCGTAGCGTGGCGCGTAATGCACGTGTTTGTCTTG
TTAGAGGAGATAGGTTGTGAAATCTGGTTGAAGTTTAGTGGAGTTGGGTTGGTTGCCTGTTTAGAAGTGCTAAATGTCTTTTATCAGCCCTCGGGCTTTGTTACCTTCAA
AGCCCACCAAGAGCTCTCTGATTTGGAGATTTTTCGGATATCTCGTTGCCTCGAGGATGCTGCTCTCTTGATTTTGGCAGAAGAGAACCACCCACGTGGACAAGACCATA
TCTTTGCCAAAATTGAAGCCATCCTGACAGCCCCAAAAAGCAATTTTGGACCACCCGACGCACAAGGGGCTGACGAGGACGTCCGGGCGAAAATAGGGCTAGGAGATCGA
CCCAGAGGAAAAGCCGACCAAAGGGCCGGGCCAACTTGGCCCGACCCATATGGTCGGCCTCGGCCCAAGGCCGAGGCTGACCATTCGGCCCGCTTGCGCGGGCAGAGCTC
GGTCACCTCCTCTCGGTCCCTGATGCCTCTAGCCGCCCCGGTTTCCCCTGGTTTCATCGGAGGCGGTGTGGCTAGCACCACACCGGTGTGCAGGTTTTCAGTTTTGCAGG
CCACGTCTTCCCCCTCATCTACAAATTTACCGTTGGTGGCACGTGAAGGTCAGCGAACCCTCTGGCAAACTGGAATTGCCGGTCCCCAACACGAGACCCAGCTCTCTAGC
ACCTTCGAACTGTCGGCTCATTCCTTATATTTCGATGCTGCTCCCGTCGCCGAATGCCAGCGACAGAAGGATGAGCAGATTCCGGCCATTCGCCGCCATGACGGCGTAGA
TTCCATGAGCCCGGCCGAGCACCGTCGAGTTGATAGCTGGGCCGGCGATTTCAGATTCTACAATACCCAAAACTTGTTTTTCAGCAATGGCGAGGGAAGGGGAGATGAGG
AAGACGGCGCCATTTCTATAGTGTCCAAGGTCTATTCCTCTCTCAATCTTGAAGAGACCCACCAGAAGGAAGGCTACGCCAAGGGCTACAAAGATGGCCTCGTGGCTGGC
AAAGAAGAGGGAAAACAAGTGGGCCTTAAAGTTGGTTTCGAGGTAGGCGAGGAACTGGGATTCTACAGTGGGTGTGTGGACGTGTGGAATTCTGCAATTCGGATCGACCC
AGAACGGTTTTCGATTCGGGTCCGGAAGAGTGTGAAGCAGATGGAGGAGTTGGTGGAGAAGTACCCGCTTCAGGACCCTGAGAATGAGCAAGTTCAGGAGCTGATGGAAG
GGTTGAGGCTCAAGTTCAGAGCGATTTGCGCCACTCTTGGTGTCAAATTGGAGTATAATGGCTATCCGAAATCGGCTTCAGATGGAAACGAGATTGATTTTTGA
Protein sequenceShow/hide protein sequence
MELIILPTLKAFVLERLLLKQLWVLGSIFDFSFTSPPLPISRRRRRLGRVAAPFRQWCRRSKPPRGSSSPASSLFADPSLCTVSLSLPSISLPPSQPECRRSDLQPLPLP
LATVAQVQPPLARAAAFLSVSWVRFVALALSVHAFLAEISRGSHADSSSKPRLLHVSALKILYLACLAIRSPVDLVSVDSASVQQHSWLLLASLGVSAPSECSIKFETLR
LGYPLPKERSSSLVVVCKTRLTWKRVRLAAEKPYLESFGEHPLPVFEVLVELGECGLVGFWKRVFMRYVTVLLWLLIKHVVLGVKHVHSVARNSRDWFCVAWRVMHVFVL
LEEIGCEIWLKFSGVGLVACLEVLNVFYQPSGFVTFKAHQELSDLEIFRISRCLEDAALLILAEENHPRGQDHIFAKIEAILTAPKSNFGPPDAQGADEDVRAKIGLGDR
PRGKADQRAGPTWPDPYGRPRPKAEADHSARLRGQSSVTSSRSLMPLAAPVSPGFIGGGVASTTPVCRFSVLQATSSPSSTNLPLVAREGQRTLWQTGIAGPQHETQLSS
TFELSAHSLYFDAAPVAECQRQKDEQIPAIRRHDGVDSMSPAEHRRVDSWAGDFRFYNTQNLFFSNGEGRGDEEDGAISIVSKVYSSLNLEETHQKEGYAKGYKDGLVAG
KEEGKQVGLKVGFEVGEELGFYSGCVDVWNSAIRIDPERFSIRVRKSVKQMEELVEKYPLQDPENEQVQELMEGLRLKFRAICATLGVKLEYNGYPKSASDGNEIDF