; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g10470 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g10470
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:8043482..8046862
RNA-Seq ExpressionMoc07g10470
SyntenyMoc07g10470
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022149029.1 uncharacterized protein LOC111017548 [Momordica charantia]7.7e-8060.44Show/hide
Query:  EQLRRLFINKFSARQSLKLPPSHLGTMKQRDNESLTEYIARFMDEHVKVVSCTDDIAMI-----------------------------------------
        +QLRRLFIN+FSARQ LKLPPSHL T+KQRDNESLTEYIAR MDEHVKVVSCTDDIAM+                                         
Subjt:  EQLRRLFINKFSARQSLKLPPSHLGTMKQRDNESLTEYIARFMDEHVKVVSCTDDIAMI-----------------------------------------

Query:  --------RGRDWDHKSPPSKKWHGDDRNSSRRVDDDKNRGRRDERVPSNCHGLNFDRSTPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLY
                RG+D D +S P KK H DD++SSR+  DD++RG+ DER  S+  G  FD+ TPLNAS+AEIYAT+++TD++ALF  P+KL RPS KRDKRLY
Subjt:  --------RGRDWDHKSPPSKKWHGDDRNSSRRVDDDKNRGRRDERVPSNCHGLNFDRSTPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLY

Query:  CRFHKDHGHDTSRCFHLKEQVEDLILWGYLKKYVGNRERAEPKGSAREKKQEKLQPPKRKEDRPAAINTIHGG
        CRFHKDHGH++SRCFHLKEQV+DLI  GYLKKYVG+RERA+P+GS RE+K+E+ QPP RKEDRPA INTIHGG
Subjt:  CRFHKDHGHDTSRCFHLKEQVEDLILWGYLKKYVGNRERAEPKGSAREKKQEKLQPPKRKEDRPAAINTIHGG

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]1.0e-5238.38Show/hide
Query:  GRDWDHKSPPSKKWHGDDRNSSRRVDDDKNRGRRDERVPSNCHGLNFDRSTPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLYCRFHKDHGH
        G+D ++  P SK     D+ S     + +   RR E  P+      ++R TP    I+EI   I+++ +E L   PEKLR    +R K  YCRFH++HGH
Subjt:  GRDWDHKSPPSKKWHGDDRNSSRRVDDDKNRGRRDERVPSNCHGLNFDRSTPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLYCRFHKDHGH

Query:  DTSRCFHLKEQVEDLILWGYLKKYVGNRERAEPKGSAREKKQE--KLQPPKRKEDRPAAINTIHGG----------------------------------
        +TS  + LK Q+EDLI  GY KK+VG     +P+ S+ EKK+E  + + P R+ DRPA INTI GG                                  
Subjt:  DTSRCFHLKEQVEDLILWGYLKKYVGNRERAEPKGSAREKKQE--KLQPPKRKEDRPAAINTIHGG----------------------------------

Query:  ------------------------HVKVRRVLVDGGASTNILSFLTYSALGWERRHLKHSPTPLVGFAGETVIGEGCILLPVTIGEGE------------
                                HV VRRVLVDGGAS NILS  TY ALGW R  LK SPTPLVGF+GE+V+ EGCI LPVT+G+ +            
Subjt:  ------------------------HVKVRRVLVDGGASTNILSFLTYSALGWERRHLKHSPTPLVGFAGETVIGEGCILLPVTIGEGE------------

Query:  -------------------------HQVLKYPTPTGIATVQGEQRTSRECYAAAMKGTATCAAIMELGTIEPQHDLEAELNCR---TPVEELELVP
                                 HQVLKY TP G+ TV+GEQ  SRECYA+ +KGT+ C A+  L + +   + EA+L  R    P EELELVP
Subjt:  -------------------------HQVLKYPTPTGIATVQGEQRTSRECYAAAMKGTATCAAIMELGTIEPQHDLEAELNCR---TPVEELELVP

XP_022152367.1 uncharacterized protein LOC111020111 [Momordica charantia]2.6e-5138.52Show/hide
Query:  RDWDHKSPPSKKWHGDDRNSSRRVDDDKNRGRRDERVPSNCHGLNFDRSTPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLYCRFHKDHGHD
        R  D KS          R   RR++ D NR R             ++R TP    I+EI A I+++ +E L   PEKL+    KR+K  YCRF +DHGH+
Subjt:  RDWDHKSPPSKKWHGDDRNSSRRVDDDKNRGRRDERVPSNCHGLNFDRSTPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLYCRFHKDHGHD

Query:  TSRCFHLKEQVEDLILWGYLKKYVGNRERAEPKGSAREKKQEKLQPPKRKEDRPAAI----------------------NTIHGGH------------VK
        TS C+ LK Q+EDLI  GY KK+VG    +  +   +E+++++ + P R++DRPA I                        +H  H            V 
Subjt:  TSRCFHLKEQVEDLILWGYLKKYVGNRERAEPKGSAREKKQEKLQPPKRKEDRPAAI----------------------NTIHGGH------------VK

Query:  VRRVLVDGGASTNILSFLTYSALGWERRHLKHSPTPLVGFAGETVIGEGCILLPVTIGEGE-------------------------------------HQ
        VRRVLVDGGAS NILS  TY ALGW R  LK SPTPLVGF+GE+V  EGCI LPVT+G+                                       HQ
Subjt:  VRRVLVDGGASTNILSFLTYSALGWERRHLKHSPTPLVGFAGETVIGEGCILLPVTIGEGE-------------------------------------HQ

Query:  VLKYPTPTGIATVQGEQRTSRECYAAAMKGTATCAAIMELGTIEPQHDLEAELNCRTPVEELELVP
        VLKY TP G+ TV+GEQ+TSRECYA+ +KG++ C    +    +P    E +      VEELELVP
Subjt:  VLKYPTPTGIATVQGEQRTSRECYAAAMKGTATCAAIMELGTIEPQHDLEAELNCRTPVEELELVP

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]4.2e-5433.47Show/hide
Query:  QASLDSYSAPRVEVMDRRGVGRWHLRIRSGPPRRSVSEGDRGPQEQLRRLFINKFSARQSLKLPPSHLGTMKQRDNESLTEYIARFMDEHVKVVSCTDDI
        QA+ D+      ++        W+ R+    P R +S        QLR+ FI++FS+R   +  P+HL T++Q++ E+L EY+ RF +E +KV  C+DD 
Subjt:  QASLDSYSAPRVEVMDRRGVGRWHLRIRSGPPRRSVSEGDRGPQEQLRRLFINKFSARQSLKLPPSHLGTMKQRDNESLTEYIARFMDEHVKVVSCTDDI

Query:  AM----------------------------------IRGRDW--DHKSPPSK--------KWHGDDRNSSRRVDDDKNRGRRDERVPSNCHGLN--FDRS
        AM                                  I G++        P K        K  G   + SR      +  R D R  ++ H  +  ++  
Subjt:  AM----------------------------------IRGRDW--DHKSPPSK--------KWHGDDRNSSRRVDDDKNRGRRDERVPSNCHGLN--FDRS

Query:  TPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLILWGYLKKYVGNRERAEPKGSAREKKQE--KLQPP
        TP    I EI   I++T +E L   PEKLR    KR+   YCRFH+DHGH+TS  + LK Q+EDLI  GY KK+VG     +P+ ++ EKK+E  +L+ P
Subjt:  TPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLILWGYLKKYVGNRERAEPKGSAREKKQE--KLQPP

Query:  KRKEDRPAAINT---------------------------------IHGGH------------VKVRRVLVDGGASTNILSFLTYSALGWERRHLKHSPTP
         R++DRPA IN                                  +H  H            V VRR+LVDGGAS NILS  TY ALGW R  LK SPTP
Subjt:  KRKEDRPAAINT---------------------------------IHGGH------------VKVRRVLVDGGASTNILSFLTYSALGWERRHLKHSPTP

Query:  LVGFAGETVIGEGCILLPVTIGEGE-------------------------------------HQVLKYPTPTGIATVQGEQRTSRECYAAAMKGTATCA
        LVGF+GE++  EGCI LPV+I + +                                     HQVLKY T  G+ TV+GE +TSRECYA+  K ++ CA
Subjt:  LVGFAGETVIGEGCILLPVTIGEGE-------------------------------------HQVLKYPTPTGIATVQGEQRTSRECYAAAMKGTATCA

XP_022158844.1 uncharacterized protein LOC111025310 [Momordica charantia]3.3e-9150.74Show/hide
Query:  MDEHVKVVSCTDDIAMI-------------------------------------------------RGRDWDHKSPPSKKWHGDDRNSSRRVDDDKNRGR
        MDEHVKVVSCTDDIAM+                                                 RGRD DHKSPPSKK   DDR+SSRR DDDK+R R
Subjt:  MDEHVKVVSCTDDIAMI-------------------------------------------------RGRDWDHKSPPSKKWHGDDRNSSRRVDDDKNRGR

Query:  RDERVPSNCHGLNFDRSTPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLILWGYLKKYVGNRERAEP
        RDERV SN  G  FD+ TPLNASIAEIYA ++DTD+E LFA+PEKLRRPS KR+KRLYCRFHKDHGHDTSRCFHLKEQVEDLI  GYLKKYVG+RE+AE 
Subjt:  RDERVPSNCHGLNFDRSTPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLILWGYLKKYVGNRERAEP

Query:  KGSAREKKQEKLQPPKRKEDRPAAINTIHGG----------------------------------------------------------HVKVRRVLVDG
        +GSARE+K+E+ QPP+ KEDRPA INTIHGG                                                          HVKVRRV VDG
Subjt:  KGSAREKKQEKLQPPKRKEDRPAAINTIHGG----------------------------------------------------------HVKVRRVLVDG

Query:  GASTNILSFLTYSALGWERRHLKHSPTPLVGFAGETVIGEGCILLPVTIGEGEHQVLKYPTPTGI--ATVQGEQRTSRECYAAAMKGTATCAAIMELGTI
        GAS NI SF TY+ALGWERRHLKH  T LVGFA E+V  EGCI LPVTI EGEHQV +      I  ++        R+C       + +C      G  
Subjt:  GASTNILSFLTYSALGWERRHLKHSPTPLVGFAGETVIGEGCILLPVTIGEGEHQVLKYPTPTGI--ATVQGEQRTSRECYAAAMKGTATCAAIMELGTI

Query:  EPQH
        +P+H
Subjt:  EPQH

TrEMBL top hitse value%identityAlignment
A0A6J1D5T3 uncharacterized protein LOC1110175483.7e-8060.44Show/hide
Query:  EQLRRLFINKFSARQSLKLPPSHLGTMKQRDNESLTEYIARFMDEHVKVVSCTDDIAMI-----------------------------------------
        +QLRRLFIN+FSARQ LKLPPSHL T+KQRDNESLTEYIAR MDEHVKVVSCTDDIAM+                                         
Subjt:  EQLRRLFINKFSARQSLKLPPSHLGTMKQRDNESLTEYIARFMDEHVKVVSCTDDIAMI-----------------------------------------

Query:  --------RGRDWDHKSPPSKKWHGDDRNSSRRVDDDKNRGRRDERVPSNCHGLNFDRSTPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLY
                RG+D D +S P KK H DD++SSR+  DD++RG+ DER  S+  G  FD+ TPLNAS+AEIYAT+++TD++ALF  P+KL RPS KRDKRLY
Subjt:  --------RGRDWDHKSPPSKKWHGDDRNSSRRVDDDKNRGRRDERVPSNCHGLNFDRSTPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLY

Query:  CRFHKDHGHDTSRCFHLKEQVEDLILWGYLKKYVGNRERAEPKGSAREKKQEKLQPPKRKEDRPAAINTIHGG
        CRFHKDHGH++SRCFHLKEQV+DLI  GYLKKYVG+RERA+P+GS RE+K+E+ QPP RKEDRPA INTIHGG
Subjt:  CRFHKDHGHDTSRCFHLKEQVEDLILWGYLKKYVGNRERAEPKGSAREKKQEKLQPPKRKEDRPAAINTIHGG

A0A6J1DD03 uncharacterized protein LOC1110198995.1e-5338.38Show/hide
Query:  GRDWDHKSPPSKKWHGDDRNSSRRVDDDKNRGRRDERVPSNCHGLNFDRSTPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLYCRFHKDHGH
        G+D ++  P SK     D+ S     + +   RR E  P+      ++R TP    I+EI   I+++ +E L   PEKLR    +R K  YCRFH++HGH
Subjt:  GRDWDHKSPPSKKWHGDDRNSSRRVDDDKNRGRRDERVPSNCHGLNFDRSTPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLYCRFHKDHGH

Query:  DTSRCFHLKEQVEDLILWGYLKKYVGNRERAEPKGSAREKKQE--KLQPPKRKEDRPAAINTIHGG----------------------------------
        +TS  + LK Q+EDLI  GY KK+VG     +P+ S+ EKK+E  + + P R+ DRPA INTI GG                                  
Subjt:  DTSRCFHLKEQVEDLILWGYLKKYVGNRERAEPKGSAREKKQE--KLQPPKRKEDRPAAINTIHGG----------------------------------

Query:  ------------------------HVKVRRVLVDGGASTNILSFLTYSALGWERRHLKHSPTPLVGFAGETVIGEGCILLPVTIGEGE------------
                                HV VRRVLVDGGAS NILS  TY ALGW R  LK SPTPLVGF+GE+V+ EGCI LPVT+G+ +            
Subjt:  ------------------------HVKVRRVLVDGGASTNILSFLTYSALGWERRHLKHSPTPLVGFAGETVIGEGCILLPVTIGEGE------------

Query:  -------------------------HQVLKYPTPTGIATVQGEQRTSRECYAAAMKGTATCAAIMELGTIEPQHDLEAELNCR---TPVEELELVP
                                 HQVLKY TP G+ TV+GEQ  SRECYA+ +KGT+ C A+  L + +   + EA+L  R    P EELELVP
Subjt:  -------------------------HQVLKYPTPTGIATVQGEQRTSRECYAAAMKGTATCAAIMELGTIEPQHDLEAELNCR---TPVEELELVP

A0A6J1DG07 uncharacterized protein LOC1110201111.2e-5138.52Show/hide
Query:  RDWDHKSPPSKKWHGDDRNSSRRVDDDKNRGRRDERVPSNCHGLNFDRSTPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLYCRFHKDHGHD
        R  D KS          R   RR++ D NR R             ++R TP    I+EI A I+++ +E L   PEKL+    KR+K  YCRF +DHGH+
Subjt:  RDWDHKSPPSKKWHGDDRNSSRRVDDDKNRGRRDERVPSNCHGLNFDRSTPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLYCRFHKDHGHD

Query:  TSRCFHLKEQVEDLILWGYLKKYVGNRERAEPKGSAREKKQEKLQPPKRKEDRPAAI----------------------NTIHGGH------------VK
        TS C+ LK Q+EDLI  GY KK+VG    +  +   +E+++++ + P R++DRPA I                        +H  H            V 
Subjt:  TSRCFHLKEQVEDLILWGYLKKYVGNRERAEPKGSAREKKQEKLQPPKRKEDRPAAI----------------------NTIHGGH------------VK

Query:  VRRVLVDGGASTNILSFLTYSALGWERRHLKHSPTPLVGFAGETVIGEGCILLPVTIGEGE-------------------------------------HQ
        VRRVLVDGGAS NILS  TY ALGW R  LK SPTPLVGF+GE+V  EGCI LPVT+G+                                       HQ
Subjt:  VRRVLVDGGASTNILSFLTYSALGWERRHLKHSPTPLVGFAGETVIGEGCILLPVTIGEGE-------------------------------------HQ

Query:  VLKYPTPTGIATVQGEQRTSRECYAAAMKGTATCAAIMELGTIEPQHDLEAELNCRTPVEELELVP
        VLKY TP G+ TV+GEQ+TSRECYA+ +KG++ C    +    +P    E +      VEELELVP
Subjt:  VLKYPTPTGIATVQGEQRTSRECYAAAMKGTATCAAIMELGTIEPQHDLEAELNCRTPVEELELVP

A0A6J1DHB3 uncharacterized protein LOC1110204792.1e-5433.47Show/hide
Query:  QASLDSYSAPRVEVMDRRGVGRWHLRIRSGPPRRSVSEGDRGPQEQLRRLFINKFSARQSLKLPPSHLGTMKQRDNESLTEYIARFMDEHVKVVSCTDDI
        QA+ D+      ++        W+ R+    P R +S        QLR+ FI++FS+R   +  P+HL T++Q++ E+L EY+ RF +E +KV  C+DD 
Subjt:  QASLDSYSAPRVEVMDRRGVGRWHLRIRSGPPRRSVSEGDRGPQEQLRRLFINKFSARQSLKLPPSHLGTMKQRDNESLTEYIARFMDEHVKVVSCTDDI

Query:  AM----------------------------------IRGRDW--DHKSPPSK--------KWHGDDRNSSRRVDDDKNRGRRDERVPSNCHGLN--FDRS
        AM                                  I G++        P K        K  G   + SR      +  R D R  ++ H  +  ++  
Subjt:  AM----------------------------------IRGRDW--DHKSPPSK--------KWHGDDRNSSRRVDDDKNRGRRDERVPSNCHGLN--FDRS

Query:  TPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLILWGYLKKYVGNRERAEPKGSAREKKQE--KLQPP
        TP    I EI   I++T +E L   PEKLR    KR+   YCRFH+DHGH+TS  + LK Q+EDLI  GY KK+VG     +P+ ++ EKK+E  +L+ P
Subjt:  TPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLILWGYLKKYVGNRERAEPKGSAREKKQE--KLQPP

Query:  KRKEDRPAAINT---------------------------------IHGGH------------VKVRRVLVDGGASTNILSFLTYSALGWERRHLKHSPTP
         R++DRPA IN                                  +H  H            V VRR+LVDGGAS NILS  TY ALGW R  LK SPTP
Subjt:  KRKEDRPAAINT---------------------------------IHGGH------------VKVRRVLVDGGASTNILSFLTYSALGWERRHLKHSPTP

Query:  LVGFAGETVIGEGCILLPVTIGEGE-------------------------------------HQVLKYPTPTGIATVQGEQRTSRECYAAAMKGTATCA
        LVGF+GE++  EGCI LPV+I + +                                     HQVLKY T  G+ TV+GE +TSRECYA+  K ++ CA
Subjt:  LVGFAGETVIGEGCILLPVTIGEGE-------------------------------------HQVLKYPTPTGIATVQGEQRTSRECYAAAMKGTATCA

A0A6J1E0L8 uncharacterized protein LOC1110253101.6e-9150.74Show/hide
Query:  MDEHVKVVSCTDDIAMI-------------------------------------------------RGRDWDHKSPPSKKWHGDDRNSSRRVDDDKNRGR
        MDEHVKVVSCTDDIAM+                                                 RGRD DHKSPPSKK   DDR+SSRR DDDK+R R
Subjt:  MDEHVKVVSCTDDIAMI-------------------------------------------------RGRDWDHKSPPSKKWHGDDRNSSRRVDDDKNRGR

Query:  RDERVPSNCHGLNFDRSTPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLILWGYLKKYVGNRERAEP
        RDERV SN  G  FD+ TPLNASIAEIYA ++DTD+E LFA+PEKLRRPS KR+KRLYCRFHKDHGHDTSRCFHLKEQVEDLI  GYLKKYVG+RE+AE 
Subjt:  RDERVPSNCHGLNFDRSTPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLILWGYLKKYVGNRERAEP

Query:  KGSAREKKQEKLQPPKRKEDRPAAINTIHGG----------------------------------------------------------HVKVRRVLVDG
        +GSARE+K+E+ QPP+ KEDRPA INTIHGG                                                          HVKVRRV VDG
Subjt:  KGSAREKKQEKLQPPKRKEDRPAAINTIHGG----------------------------------------------------------HVKVRRVLVDG

Query:  GASTNILSFLTYSALGWERRHLKHSPTPLVGFAGETVIGEGCILLPVTIGEGEHQVLKYPTPTGI--ATVQGEQRTSRECYAAAMKGTATCAAIMELGTI
        GAS NI SF TY+ALGWERRHLKH  T LVGFA E+V  EGCI LPVTI EGEHQV +      I  ++        R+C       + +C      G  
Subjt:  GASTNILSFLTYSALGWERRHLKHSPTPLVGFAGETVIGEGCILLPVTIGEGEHQVLKYPTPTGI--ATVQGEQRTSRECYAAAMKGTATCAAIMELGTI

Query:  EPQH
        +P+H
Subjt:  EPQH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGATCCCGTAGTGGGTCGAGACTTCCACCTTGCTTCCGACCAGTTTCCAACGCTCCAGCCTCAAAGGAACAGTCTGCCACCCCGTACTCCACGCCTTCGCAGCTG
GGGAAACACGAGTAAACGTTTCGGGACAGACACCGAAGTAGGTGTGGATCCTATTGTTGTAGCCAATGTAATTGCTGAGCTGACAGAGGTCAAGGCGCGGCTCGAAGCAG
TTGAGAGGGGAAGCGAGATGTCAGACTCCTCCTCCTCCAGAGACCCCGCTAGAGGGAAGGAGCCAATGCACCTAACCAAGAGGACAGAGTATCAGTTCCGACATGCCAGG
AAGATCCGAGCAGGAACGCCCTCGCGGAGGCTTCAACGCCTGGGGTTGGGCGGTACCCCGAGGGGACAGGACTTGAACAAGGGGCCCCACCCTCTCATTGGCCGAGAGGG
ACTTTCTCCCTCCCTTCAATCAAGGTGCTACCACAAGCATGATCCCGAGACCAAAGAGGATAGCGAGGAAGATCCGGTGGTGGTGTTCGAGGGGAATTCACTGAAGAAAC
GTCTTCAAAGGTTTGGAATTTCCTCTGTATTTTCTTCTTTAAAACTAGATTTTGTATCCCGAAAATCAGCAACGAAAATCCGCTTCCGCTCTGGGATTCACAATGATCAG
GTGGAAAACCAACGCCCAAGGGTCCGGCTGCCTCGGACCCATCAGGCATCGCTCGACAGCTACAGCGCCCCTAGGGTTGAGGTGATGGACCGTCGAGGCGTCGGTAGATG
GCACCTGAGGATCAGGAGTGGACCTCCTCGGAGATCAGTTTCAGAGGGAGATAGAGGACCTCAAGAGCAGCTGAGAAGGCTGTTCATCAACAAGTTTTCGGCTAGACAGT
CATTGAAGTTGCCGCCCTCTCACCTTGGAACAATGAAGCAACGGGACAATGAGTCCCTTACGGAGTACATCGCTCGGTTCATGGACGAGCATGTCAAGGTGGTAAGCTGT
ACTGACGACATCGCCATGATCCGCGGTAGAGACTGGGACCATAAGTCTCCACCCTCCAAGAAGTGGCACGGTGATGATCGAAATTCATCTCGGCGAGTCGATGACGACAA
GAATAGAGGCCGGCGTGATGAAAGAGTCCCTTCAAATTGTCACGGGCTAAATTTTGACAGGTCTACTCCGCTGAATGCTTCGATCGCTGAGATCTACGCGACAATCAAAG
ACACCGACCTAGAGGCGCTGTTCGCAACCCCAGAAAAGCTTCGCCGACCTTCGAGAAAGCGAGACAAGCGACTCTACTGCCGATTCCATAAGGATCATGGCCACGATACC
TCCCGTTGCTTCCATCTAAAGGAGCAAGTCGAGGATCTGATCCTGTGGGGTTATTTGAAGAAATATGTCGGCAACAGAGAGCGAGCAGAACCAAAAGGGTCTGCTCGGGA
AAAGAAGCAAGAAAAGTTACAGCCGCCCAAACGAAAAGAGGATCGCCCTGCAGCAATAAATACCATTCACGGGGGTCATGTGAAGGTCAGAAGAGTTCTTGTTGATGGCG
GGGCGTCGACCAACATATTGTCCTTTTTGACCTACTCGGCTTTGGGGTGGGAGAGAAGGCACCTGAAGCATAGTCCAACGCCTCTAGTCGGTTTTGCAGGGGAGACAGTC
ATCGGGGAAGGATGCATCTTGCTCCCTGTGACCATTGGCGAAGGGGAGCACCAAGTTTTGAAATATCCTACTCCGACCGGAATTGCAACGGTCCAAGGCGAACAAAGAAC
CTCGAGAGAATGCTACGCAGCTGCTATGAAAGGAACCGCCACTTGCGCAGCGATTATGGAGCTCGGAACGATCGAACCACAGCATGATCTCGAAGCAGAGCTCAACTGCC
GCACACCGGTAGAGGAGTTAGAACTTGTCCCGTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGATCCCGTAGTGGGTCGAGACTTCCACCTTGCTTCCGACCAGTTTCCAACGCTCCAGCCTCAAAGGAACAGTCTGCCACCCCGTACTCCACGCCTTCGCAGCTG
GGGAAACACGAGTAAACGTTTCGGGACAGACACCGAAGTAGGTGTGGATCCTATTGTTGTAGCCAATGTAATTGCTGAGCTGACAGAGGTCAAGGCGCGGCTCGAAGCAG
TTGAGAGGGGAAGCGAGATGTCAGACTCCTCCTCCTCCAGAGACCCCGCTAGAGGGAAGGAGCCAATGCACCTAACCAAGAGGACAGAGTATCAGTTCCGACATGCCAGG
AAGATCCGAGCAGGAACGCCCTCGCGGAGGCTTCAACGCCTGGGGTTGGGCGGTACCCCGAGGGGACAGGACTTGAACAAGGGGCCCCACCCTCTCATTGGCCGAGAGGG
ACTTTCTCCCTCCCTTCAATCAAGGTGCTACCACAAGCATGATCCCGAGACCAAAGAGGATAGCGAGGAAGATCCGGTGGTGGTGTTCGAGGGGAATTCACTGAAGAAAC
GTCTTCAAAGGTTTGGAATTTCCTCTGTATTTTCTTCTTTAAAACTAGATTTTGTATCCCGAAAATCAGCAACGAAAATCCGCTTCCGCTCTGGGATTCACAATGATCAG
GTGGAAAACCAACGCCCAAGGGTCCGGCTGCCTCGGACCCATCAGGCATCGCTCGACAGCTACAGCGCCCCTAGGGTTGAGGTGATGGACCGTCGAGGCGTCGGTAGATG
GCACCTGAGGATCAGGAGTGGACCTCCTCGGAGATCAGTTTCAGAGGGAGATAGAGGACCTCAAGAGCAGCTGAGAAGGCTGTTCATCAACAAGTTTTCGGCTAGACAGT
CATTGAAGTTGCCGCCCTCTCACCTTGGAACAATGAAGCAACGGGACAATGAGTCCCTTACGGAGTACATCGCTCGGTTCATGGACGAGCATGTCAAGGTGGTAAGCTGT
ACTGACGACATCGCCATGATCCGCGGTAGAGACTGGGACCATAAGTCTCCACCCTCCAAGAAGTGGCACGGTGATGATCGAAATTCATCTCGGCGAGTCGATGACGACAA
GAATAGAGGCCGGCGTGATGAAAGAGTCCCTTCAAATTGTCACGGGCTAAATTTTGACAGGTCTACTCCGCTGAATGCTTCGATCGCTGAGATCTACGCGACAATCAAAG
ACACCGACCTAGAGGCGCTGTTCGCAACCCCAGAAAAGCTTCGCCGACCTTCGAGAAAGCGAGACAAGCGACTCTACTGCCGATTCCATAAGGATCATGGCCACGATACC
TCCCGTTGCTTCCATCTAAAGGAGCAAGTCGAGGATCTGATCCTGTGGGGTTATTTGAAGAAATATGTCGGCAACAGAGAGCGAGCAGAACCAAAAGGGTCTGCTCGGGA
AAAGAAGCAAGAAAAGTTACAGCCGCCCAAACGAAAAGAGGATCGCCCTGCAGCAATAAATACCATTCACGGGGGTCATGTGAAGGTCAGAAGAGTTCTTGTTGATGGCG
GGGCGTCGACCAACATATTGTCCTTTTTGACCTACTCGGCTTTGGGGTGGGAGAGAAGGCACCTGAAGCATAGTCCAACGCCTCTAGTCGGTTTTGCAGGGGAGACAGTC
ATCGGGGAAGGATGCATCTTGCTCCCTGTGACCATTGGCGAAGGGGAGCACCAAGTTTTGAAATATCCTACTCCGACCGGAATTGCAACGGTCCAAGGCGAACAAAGAAC
CTCGAGAGAATGCTACGCAGCTGCTATGAAAGGAACCGCCACTTGCGCAGCGATTATGGAGCTCGGAACGATCGAACCACAGCATGATCTCGAAGCAGAGCTCAACTGCC
GCACACCGGTAGAGGAGTTAGAACTTGTCCCGTTCTAG
Protein sequenceShow/hide protein sequence
MDDPVVGRDFHLASDQFPTLQPQRNSLPPRTPRLRSWGNTSKRFGTDTEVGVDPIVVANVIAELTEVKARLEAVERGSEMSDSSSSRDPARGKEPMHLTKRTEYQFRHAR
KIRAGTPSRRLQRLGLGGTPRGQDLNKGPHPLIGREGLSPSLQSRCYHKHDPETKEDSEEDPVVVFEGNSLKKRLQRFGISSVFSSLKLDFVSRKSATKIRFRSGIHNDQ
VENQRPRVRLPRTHQASLDSYSAPRVEVMDRRGVGRWHLRIRSGPPRRSVSEGDRGPQEQLRRLFINKFSARQSLKLPPSHLGTMKQRDNESLTEYIARFMDEHVKVVSC
TDDIAMIRGRDWDHKSPPSKKWHGDDRNSSRRVDDDKNRGRRDERVPSNCHGLNFDRSTPLNASIAEIYATIKDTDLEALFATPEKLRRPSRKRDKRLYCRFHKDHGHDT
SRCFHLKEQVEDLILWGYLKKYVGNRERAEPKGSAREKKQEKLQPPKRKEDRPAAINTIHGGHVKVRRVLVDGGASTNILSFLTYSALGWERRHLKHSPTPLVGFAGETV
IGEGCILLPVTIGEGEHQVLKYPTPTGIATVQGEQRTSRECYAAAMKGTATCAAIMELGTIEPQHDLEAELNCRTPVEELELVPF