; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g08940 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g08940
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionNuclear transcription factor Y subunit C-4, putative
Genome locationchr4:6599730..6600749
RNA-Seq ExpressionMoc04g08940
SyntenyMoc04g08940
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031439.1 uncharacterized protein E6C27_scaffold139G001960 [Cucumis melo var. makuwa]1.3e-6564.22Show/hide
Query:  LVILTLPCIVSILGRES-SSEFLSVSDLVDSGQLDLLFRDFGYEGITINGRKAVILSSG-TDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVF
        +VILT PCIVSILG+ES SSEF SVSD+VDS +LDL FRD G+EG + NG K +ILSS  T GL Q+RV+D DE KL+IV+DSDFD++GLFSDDSFDFV 
Subjt:  LVILTLPCIVSILGRES-SSEFLSVSDLVDSGQLDLLFRDFGYEGITINGRKAVILSSG-TDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVF

Query:  AWGNVDSDFMDRILKTGGILAFP-FTNSGPSNHFQKKPNYRPVFLSRYSSIIVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTK
        +W  +DSDF+DRILKTGGI+AFP   N+ PSNHF+KKPNY+P+FL+RY+SIIVAMEKTA+ D++VY+SASRR L + S  TT AA+R LE+         
Subjt:  AWGNVDSDFMDRILKTGGILAFP-FTNSGPSNHFQKKPNYRPVFLSRYSSIIVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTK

Query:  AVAKPSNLMRKIKYITDL
         V KP+ L RKI Y+TD+
Subjt:  AVAKPSNLMRKIKYITDL

XP_008455527.1 PREDICTED: uncharacterized protein LOC103495679 [Cucumis melo]5.0e-7862.22Show/hide
Query:  MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRES-SSEFLSVSDLVDSGQLDLLFRDFGYEGITI
        MD ARFNR         F N    +WNS THLVI FP+ RIL VIS S F A+VILT PCIVSILG+ES SSEF SVSD+VDS +LDL FRD G+EG + 
Subjt:  MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRES-SSEFLSVSDLVDSGQLDLLFRDFGYEGITI

Query:  NGRKAVILSSG-TDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFP-FTNSGPSNHFQKKPNYRPVFLSRY
        NG K +ILSS  T GL Q+RV+D DE KL+IV+DSDFD++GLFSDDSFDFV +W  +DSDF+DRILKTGGI+AFP   N+ PSNHF+KKPNY+P+FL+RY
Subjt:  NGRKAVILSSG-TDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFP-FTNSGPSNHFQKKPNYRPVFLSRY

Query:  SSIIVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDL
        +SIIVAMEKTA+ D++VY+SASRR L + S  TT AA+R LE+          V KP+ L RKI Y+TD+
Subjt:  SSIIVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDL

XP_011659719.1 uncharacterized protein LOC105436238 [Cucumis sativus]1.2e-7460.37Show/hide
Query:  MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRESS-SEFLSVSDLVDSGQLDLLFRDFGYEGITI
        MD ARFNR         F N    +WNS THLVI FP  +IL VIS S F A+VILT PCIVSILG+E+  SEF SV D+VDS +LDL FRD G+EG + 
Subjt:  MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRESS-SEFLSVSDLVDSGQLDLLFRDFGYEGITI

Query:  NGRKAVILSSG-TDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFPFTNSG-PSNHFQKKPNYRPVFLSRY
        NG K +ILSS  T+GL Q+RV+D DE KL+IV+DSDFD++GLFSDDSFDFV +WG +DSDF+DRILK GGI+AFP  N+  PS+HF+KKPNY+PVFL+RY
Subjt:  NGRKAVILSSG-TDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFPFTNSG-PSNHFQKKPNYRPVFLSRY

Query:  SSIIVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDL
        +SIIVAMEKT M D +VY+SASRR L + S  T  AA+R LE+          V KP+ L RKIKY+ D+
Subjt:  SSIIVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDL

XP_022141924.1 uncharacterized protein LOC111012177 [Momordica charantia]1.3e-187100Show/hide
Query:  MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRESSSEFLSVSDLVDSGQLDLLFRDFGYEGITIN
        MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRESSSEFLSVSDLVDSGQLDLLFRDFGYEGITIN
Subjt:  MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRESSSEFLSVSDLVDSGQLDLLFRDFGYEGITIN

Query:  GRKAVILSSGTDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFPFTNSGPSNHFQKKPNYRPVFLSRYSSI
        GRKAVILSSGTDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFPFTNSGPSNHFQKKPNYRPVFLSRYSSI
Subjt:  GRKAVILSSGTDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFPFTNSGPSNHFQKKPNYRPVFLSRYSSI

Query:  IVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDLVDGSLKRLRQTVSNFVTVGLPEENGDMIQYFDQ
        IVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDLVDGSLKRLRQTVSNFVTVGLPEENGDMIQYFDQ
Subjt:  IVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDLVDGSLKRLRQTVSNFVTVGLPEENGDMIQYFDQ

Query:  NYPRKGQAEESKSSITVRNAAVDWLKAGFVEKLMEMRAM
        NYPRKGQAEESKSSITVRNAAVDWLKAGFVEKLMEMRAM
Subjt:  NYPRKGQAEESKSSITVRNAAVDWLKAGFVEKLMEMRAM

XP_038889013.1 uncharacterized protein LOC120078778 [Benincasa hispida]3.5e-8766.3Show/hide
Query:  MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRESSSEFLSVSDLVDSGQLDLLFRDFGYEGITIN
        MDF  FNR  +    K F      +WNS THLVIKFP+ +IL VIS SLF A+ ILT P IVSILG+ES SEF SVSD++DS QLDL FRD G+EG+TIN
Subjt:  MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRESSSEFLSVSDLVDSGQLDLLFRDFGYEGITIN

Query:  GRKAVILSSG-TDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFPFTNSGPSNHFQKKPNYRPVFLSRYSS
        G KA+ILSS  T GL Q+RV+D DE KL+IV+DSDFD+SGLFSDDSFDFV + G VDSDF+DRILK GGI+AFP  N+ PSNHFQKKPNYRPVFL+RYSS
Subjt:  GRKAVILSSG-TDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFPFTNSGPSNHFQKKPNYRPVFLSRYSS

Query:  IIVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDLVDGSLKRL
        IIV MEKTAM D +VY+S+SRR L QFS  T  AA+R L ED+L E P K VAKP+ L RK+KY+ DLVD +  RL
Subjt:  IIVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDLVDGSLKRL

TrEMBL top hitse value%identityAlignment
A0A0A0K451 Uncharacterized protein5.6e-7560.37Show/hide
Query:  MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRESS-SEFLSVSDLVDSGQLDLLFRDFGYEGITI
        MD ARFNR         F N    +WNS THLVI FP  +IL VIS S F A+VILT PCIVSILG+E+  SEF SV D+VDS +LDL FRD G+EG + 
Subjt:  MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRESS-SEFLSVSDLVDSGQLDLLFRDFGYEGITI

Query:  NGRKAVILSSG-TDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFPFTNSG-PSNHFQKKPNYRPVFLSRY
        NG K +ILSS  T+GL Q+RV+D DE KL+IV+DSDFD++GLFSDDSFDFV +WG +DSDF+DRILK GGI+AFP  N+  PS+HF+KKPNY+PVFL+RY
Subjt:  NGRKAVILSSG-TDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFPFTNSG-PSNHFQKKPNYRPVFLSRY

Query:  SSIIVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDL
        +SIIVAMEKT M D +VY+SASRR L + S  T  AA+R LE+          V KP+ L RKIKY+ D+
Subjt:  SSIIVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDL

A0A1S3C0P0 uncharacterized protein LOC1034956792.4e-7862.22Show/hide
Query:  MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRES-SSEFLSVSDLVDSGQLDLLFRDFGYEGITI
        MD ARFNR         F N    +WNS THLVI FP+ RIL VIS S F A+VILT PCIVSILG+ES SSEF SVSD+VDS +LDL FRD G+EG + 
Subjt:  MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRES-SSEFLSVSDLVDSGQLDLLFRDFGYEGITI

Query:  NGRKAVILSSG-TDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFP-FTNSGPSNHFQKKPNYRPVFLSRY
        NG K +ILSS  T GL Q+RV+D DE KL+IV+DSDFD++GLFSDDSFDFV +W  +DSDF+DRILKTGGI+AFP   N+ PSNHF+KKPNY+P+FL+RY
Subjt:  NGRKAVILSSG-TDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFP-FTNSGPSNHFQKKPNYRPVFLSRY

Query:  SSIIVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDL
        +SIIVAMEKTA+ D++VY+SASRR L + S  TT AA+R LE+          V KP+ L RKI Y+TD+
Subjt:  SSIIVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDL

A0A5A7SQ50 Uncharacterized protein6.2e-6664.22Show/hide
Query:  LVILTLPCIVSILGRES-SSEFLSVSDLVDSGQLDLLFRDFGYEGITINGRKAVILSSG-TDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVF
        +VILT PCIVSILG+ES SSEF SVSD+VDS +LDL FRD G+EG + NG K +ILSS  T GL Q+RV+D DE KL+IV+DSDFD++GLFSDDSFDFV 
Subjt:  LVILTLPCIVSILGRES-SSEFLSVSDLVDSGQLDLLFRDFGYEGITINGRKAVILSSG-TDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVF

Query:  AWGNVDSDFMDRILKTGGILAFP-FTNSGPSNHFQKKPNYRPVFLSRYSSIIVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTK
        +W  +DSDF+DRILKTGGI+AFP   N+ PSNHF+KKPNY+P+FL+RY+SIIVAMEKTA+ D++VY+SASRR L + S  TT AA+R LE+         
Subjt:  AWGNVDSDFMDRILKTGGILAFP-FTNSGPSNHFQKKPNYRPVFLSRYSSIIVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTK

Query:  AVAKPSNLMRKIKYITDL
         V KP+ L RKI Y+TD+
Subjt:  AVAKPSNLMRKIKYITDL

A0A6J1CK51 uncharacterized protein LOC1110121776.4e-188100Show/hide
Query:  MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRESSSEFLSVSDLVDSGQLDLLFRDFGYEGITIN
        MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRESSSEFLSVSDLVDSGQLDLLFRDFGYEGITIN
Subjt:  MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRESSSEFLSVSDLVDSGQLDLLFRDFGYEGITIN

Query:  GRKAVILSSGTDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFPFTNSGPSNHFQKKPNYRPVFLSRYSSI
        GRKAVILSSGTDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFPFTNSGPSNHFQKKPNYRPVFLSRYSSI
Subjt:  GRKAVILSSGTDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFPFTNSGPSNHFQKKPNYRPVFLSRYSSI

Query:  IVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDLVDGSLKRLRQTVSNFVTVGLPEENGDMIQYFDQ
        IVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDLVDGSLKRLRQTVSNFVTVGLPEENGDMIQYFDQ
Subjt:  IVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDLVDGSLKRLRQTVSNFVTVGLPEENGDMIQYFDQ

Query:  NYPRKGQAEESKSSITVRNAAVDWLKAGFVEKLMEMRAM
        NYPRKGQAEESKSSITVRNAAVDWLKAGFVEKLMEMRAM
Subjt:  NYPRKGQAEESKSSITVRNAAVDWLKAGFVEKLMEMRAM

A0A6J5W009 Uncharacterized protein5.1e-5240.75Show/hide
Query:  MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRESSSE--FLSVSDLVDSGQLDLLFRDFGYEGIT
        M+ A    AK+Q K+K    G  G  +S+THLVIK PD ++L +ISRS+FL LVILTLPCI S+L   S SE  + + S++ +  QL  LF D   EG+ 
Subjt:  MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRESSSE--FLSVSDLVDSGQLDLLFRDFGYEGIT

Query:  INGRKAVILSSGTDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFPFTNSGPSNHFQKKPNYRPVFLSRYS
            KA+I+S    G+    +   D    DIV+DSD ++   F D+S DFVFA+  VD+ F+DRILK GGI+A P +N  PSN F+ KPNY+ V+L RY+
Subjt:  INGRKAVILSSGTDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFPFTNSGPSNHFQKKPNYRPVFLSRYS

Query:  SIIVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDLVDGSLKRLRQTVSNFVTVGLPEENGDMIQYF
        S  VAM KT+    +   S   R L QF +   K  ++GL ED++ E P + +AK +  ++KIK++ +L+  SL+   + V  FV V L E+N  + ++F
Subjt:  SIIVAMEKTAMPDHMVYSSASRRLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDLVDGSLKRLRQTVSNFVTVGLPEENGDMIQYF

Query:  DQNYPR----------KGQAEESKSSITV--RNAAVDWLKAGFVEK
         QNYP+          + + EE  S+     RN   DWLK    E+
Subjt:  DQNYPR----------KGQAEESKSSITV--RNAAVDWLKAGFVEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G58120.1 BEST Arabidopsis thaliana protein match is: methyltransferases (TAIR:AT5G01710.1)7.3e-1928.89Show/hide
Query:  SDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSI-LGRESSSEFLSV-SDLVDSGQLDLLFRDFGYEGITINGRKAVILSSGTDGLTQVRVMDN-DE
        S     +K     +L +  RS  LAL+ L+   +  +  G  +++   SV SDL +   L LL  D   +G+   G KA+ LS G D +T         E
Subjt:  SDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSI-LGRESSSEFLSV-SDLVDSGQLDLLFRDFGYEGITINGRKAVILSSGTDGLTQVRVMDN-DE

Query:  RKLDIVLDSDFDQSGLFSDDSFDFVFAWG-NVDS-DFMDRILKTGGILAFPFTNSGPSNHFQKKPNYRPVFLSRYSSIIVAMEKTAMPDH-MVYSSASRR
          + +V  SD +   +  D++FDF FA   ++DS +F+DR LK GGI            +F K PNY  V++      ++ M KT   +      +  R+
Subjt:  RKLDIVLDSDFDQSGLFSDDSFDFVFAWG-NVDS-DFMDRILKTGGILAFPFTNSGPSNHFQKKPNYRPVFLSRYSSIIVAMEKTAMPDH-MVYSSASRR

Query:  LLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDLVDGSLKRLRQTVSNFVTVGLPEENGDMIQYFDQNYPRKGQAEESKSSITVRN-A
        LL        + A+R L ED+L E P  A  K     ++ +Y+ DL+  +L     +   F+ VG  + +  M ++F +NYP + Q  E     TV +  
Subjt:  LLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDLVDGSLKRLRQTVSNFVTVGLPEENGDMIQYFDQNYPRKGQAEESKSSITVRN-A

Query:  AVDWLKAGFVEKLME
        +++  K G  E L E
Subjt:  AVDWLKAGFVEKLME


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTTGCCCGTTTTAATCGAGCCAAGAACCAACTCAAGATCAAAGGATTCAGAAATGGCGCCGGTGGCGCCTGGAATTCTGACACGCATCTGGTTATCAAGTTTCC
TGATCCACGGATCCTTCACGTAATCTCTCGTTCGTTGTTTTTGGCTTTGGTTATTCTCACTCTGCCTTGTATTGTTTCCATTCTCGGGCGAGAAAGTAGCTCTGAGTTTT
TATCTGTATCTGATTTGGTCGATTCTGGGCAATTGGATCTGCTTTTTCGTGATTTCGGTTACGAAGGCATCACCATTAATGGCCGTAAGGCTGTCATTTTGAGCTCTGGA
ACCGACGGCCTGACTCAGGTTCGTGTGATGGACAATGACGAACGCAAACTTGACATTGTTCTGGACTCTGATTTTGATCAGAGTGGGTTGTTTTCTGATGATTCTTTTGA
TTTTGTGTTTGCTTGGGGCAATGTGGACTCTGATTTCATGGATAGAATCCTCAAAACTGGTGGCATTTTGGCCTTCCCATTCACCAACAGTGGCCCATCAAACCATTTCC
AAAAGAAACCAAATTACAGACCTGTGTTTCTCAGCAGATACAGCTCCATTATTGTGGCAATGGAGAAGACAGCCATGCCTGATCACATGGTTTATTCTTCAGCTTCAAGA
AGACTCCTCACCCAATTCTCATCACACACCACTAAGGCTGCAATGAGAGGCCTTGAGGAGGATATTCTACATGAGCTACCAACCAAGGCCGTGGCAAAACCAAGCAACCT
TATGAGGAAAATCAAGTACATCACCGACCTTGTCGACGGTTCTCTCAAACGCCTGAGGCAAACGGTCTCCAACTTCGTCACGGTTGGCCTGCCTGAAGAGAATGGGGACA
TGATCCAATACTTTGATCAGAACTACCCAAGAAAGGGGCAAGCCGAGGAGTCAAAGTCATCAATCACGGTGAGAAATGCAGCTGTGGATTGGCTGAAGGCGGGATTTGTT
GAGAAACTGATGGAAATGAGGGCAATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTTGCCCGTTTTAATCGAGCCAAGAACCAACTCAAGATCAAAGGATTCAGAAATGGCGCCGGTGGCGCCTGGAATTCTGACACGCATCTGGTTATCAAGTTTCC
TGATCCACGGATCCTTCACGTAATCTCTCGTTCGTTGTTTTTGGCTTTGGTTATTCTCACTCTGCCTTGTATTGTTTCCATTCTCGGGCGAGAAAGTAGCTCTGAGTTTT
TATCTGTATCTGATTTGGTCGATTCTGGGCAATTGGATCTGCTTTTTCGTGATTTCGGTTACGAAGGCATCACCATTAATGGCCGTAAGGCTGTCATTTTGAGCTCTGGA
ACCGACGGCCTGACTCAGGTTCGTGTGATGGACAATGACGAACGCAAACTTGACATTGTTCTGGACTCTGATTTTGATCAGAGTGGGTTGTTTTCTGATGATTCTTTTGA
TTTTGTGTTTGCTTGGGGCAATGTGGACTCTGATTTCATGGATAGAATCCTCAAAACTGGTGGCATTTTGGCCTTCCCATTCACCAACAGTGGCCCATCAAACCATTTCC
AAAAGAAACCAAATTACAGACCTGTGTTTCTCAGCAGATACAGCTCCATTATTGTGGCAATGGAGAAGACAGCCATGCCTGATCACATGGTTTATTCTTCAGCTTCAAGA
AGACTCCTCACCCAATTCTCATCACACACCACTAAGGCTGCAATGAGAGGCCTTGAGGAGGATATTCTACATGAGCTACCAACCAAGGCCGTGGCAAAACCAAGCAACCT
TATGAGGAAAATCAAGTACATCACCGACCTTGTCGACGGTTCTCTCAAACGCCTGAGGCAAACGGTCTCCAACTTCGTCACGGTTGGCCTGCCTGAAGAGAATGGGGACA
TGATCCAATACTTTGATCAGAACTACCCAAGAAAGGGGCAAGCCGAGGAGTCAAAGTCATCAATCACGGTGAGAAATGCAGCTGTGGATTGGCTGAAGGCGGGATTTGTT
GAGAAACTGATGGAAATGAGGGCAATGTAA
Protein sequenceShow/hide protein sequence
MDFARFNRAKNQLKIKGFRNGAGGAWNSDTHLVIKFPDPRILHVISRSLFLALVILTLPCIVSILGRESSSEFLSVSDLVDSGQLDLLFRDFGYEGITINGRKAVILSSG
TDGLTQVRVMDNDERKLDIVLDSDFDQSGLFSDDSFDFVFAWGNVDSDFMDRILKTGGILAFPFTNSGPSNHFQKKPNYRPVFLSRYSSIIVAMEKTAMPDHMVYSSASR
RLLTQFSSHTTKAAMRGLEEDILHELPTKAVAKPSNLMRKIKYITDLVDGSLKRLRQTVSNFVTVGLPEENGDMIQYFDQNYPRKGQAEESKSSITVRNAAVDWLKAGFV
EKLMEMRAM