CuGenDBv2

Moc04g20330 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g20330
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr4:14782086..14783910
RNA-Seq Expression: Moc04g20330
Synteny: Moc04g20330
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]; e value: 3.5e-107; % identity: 82.38
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGECLAKDESGRSFFDVPTRFGNLVSIRPVPELTQVSFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGE LAKDES              V+IRPVPELTQ SFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGECLAKDESGRSFFDVPTRFGNLVSIRPVPELTQVSFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFSSGVKRKSKGRAHALEAAQSSKPATPAVAEPASEDPAPVIELESSGGPSREKRPRDQTEAVDARTEAVDVPPLGEEVREE
        AVRPIESSRPNSELAMVCGF+S VKRKSKG+AHALEAAQSSKP TPAV  PASEDPAPVIELESS GPSREKRPRDQ       TEAVDV PLGEEVREE
Subjt:  AVRPIESSRPNSELAMVCGFSSGVKRKSKGRAHALEAAQSSKPATPAVAEPASEDPAPVIELESSGGPSREKRPRDQTEAVDARTEAVDVPPLGEEVREE

Query:  APLKRRRKKKKAISPSEVGACTVLPASFADRVDDPAARMGGTSDVTALFRIEPSSSGVRDQ
         PLKRRRKKKK  SP EVGA  VLPASFADRVDDP ARMGGT DVT  FR+EPSSSGVRDQ
Subjt:  APLKRRRKKKKAISPSEVGACTVLPASFADRVDDPAARMGGTSDVTALFRIEPSSSGVRDQ

XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]; e value: 1.3e-101; % identity: 49.71
Query:  RRRKKKKAISPSEVGACTVLPASFADRVDDPAARMGGTSDVTALFRIEPSSSGVRDQVSRISAASLDRCLRRASKF-----------------AFVASIQ
        +RRKKKKAIS SEVGAC VLPA FADRVDDPAARMGGTSDVTA FRIEPSSSGVRDQVSRISAASLDRCLRRASKF                 AFVASIQ
Subjt:  RRRKKKKAISPSEVGACTVLPASFADRVDDPAARMGGTSDVTALFRIEPSSSGVRDQVSRISAASLDRCLRRASKF-----------------AFVASIQ

Query:  SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAKVESQ---------------------------------------------
        SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKA+VESQ                                             
Subjt:  SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAKVESQ---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAK
                                               AELLK+E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQALE KD  +    AEL+  K
Subjt:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAK

Query:  ERLNNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------Q
        ERL NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DL  LK+RYAEKWASGP GT GP +LVD+YVRDLDSDYSD +ED        +
Subjt:  ERLNNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------Q

Query:  VGSTQDGAP
        VG+TQ+G P
Subjt:  VGSTQDGAP

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]; e value: 1.5e-102; % identity: 94.61
Query:  IAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGECLAKDESGRSFFDVPTRFGNLVSIRPVPELTQVSFDTLKYYKERFPRGRKVGTLVTDELLL
        IAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGE LAKDESGRSFFDVPTRFGNLVSIRPVPELTQ SFDTLKYYKERFPRGRKVGTLVTDELLL
Subjt:  IAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGECLAKDESGRSFFDVPTRFGNLVSIRPVPELTQVSFDTLKYYKERFPRGRKVGTLVTDELLL

Query:  ESGLLDYNPAVRPIESSRPNSELAMVCGFSSGVKRKSKGRAHALEAAQSSKPATPAVAEPASEDPAPVIELESSGGPSREKRPRDQTEAVDARTEAVDVP
        ESGLLDYNPAVRPIE SRPNS LAMVC F+SGVKRKSKGRAHALEAAQSSKP TPAV  PASEDPAPVIELESSGGPSREKRPRDQTEAVDA+TEA DVP
Subjt:  ESGLLDYNPAVRPIESSRPNSELAMVCGFSSGVKRKSKGRAHALEAAQSSKPATPAVAEPASEDPAPVIELESSGGPSREKRPRDQTEAVDARTEAVDVP

Query:  PLGE
        PLGE
Subjt:  PLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]; e value: 1.2e-123; % identity: 88.42
Query:  GTSDVTALFRIEPSSSGVRDQVSRISAASLDRCLRRASKF-----------------AFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A  RIEPSSSGVRDQVSRISAASLDRCLRRASKF                 AFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTALFRIEPSSSGVRDQVSRISAASLDRCLRRASKF-----------------AFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAKVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLNNGVLLEESFRQHPDFD
        ELLKAHSEVETLKA+VESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERL+NGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAKVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLNNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQDGAPQAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQ+GA   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQDGAPQAGS

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]; e value: 6.5e-178; % identity: 65.75
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGECLAKDESGRSFFDVPTRFGNLVSIRPVPELTQVSFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGE LAKDESGR+FFDVPTRFGNLVSI+ +PEL Q +FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGECLAKDESGRSFFDVPTRFGNLVSIRPVPELTQVSFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFSSGVKRKSKGRAHALEAAQSSKPATPAV--------AEPASEDPAPVIELESSGGPSREKRPRDQTEAVDARTEAVDVPP
         VR IE+SRPNSELAMVCGF+  VKRKSKGRAHAL+    ++P TP V        + P+S  P PVIEL+ SGG S EKR R++       +EA+DV P
Subjt:  AVRPIESSRPNSELAMVCGFSSGVKRKSKGRAHALEAAQSSKPATPAV--------AEPASEDPAPVIELESSGGPSREKRPRDQTEAVDARTEAVDVPP

Query:  LGEEVREEAPLKRRRKKKKAISPSEVGACTVLPASFADRVDDPAARMGGTSDVTALFRIEPSSSGVRDQVSRISAASLDRCLRRASKF------------
        L  EVR E+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V   F +EPSSSGV+DQVSRISA  LDR LRRASKF            
Subjt:  LGEEVREEAPLKRRRKKKKAISPSEVGACTVLPASFADRVDDPAARMGGTSDVTALFRIEPSSSGVRDQVSRISAASLDRCLRRASKF------------

Query:  -----AFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAKVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLL
             AF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+A+V+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLL
Subjt:  -----AFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAKVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLL

Query:  KEKDDMLQALEAKDKELEHATAELETAKERLNNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQA
        KEKDD+ Q LE KD  +   T EL+  KERL NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+
Subjt:  KEKDDMLQALEAKDKELEHATAELETAKERLNNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQA

Query:  LVDQYVRDLDSDYSDPEED--------QVGSTQDGAP--QAGS
        LVD+YVR+LDSDYSD EE+        +VG+TQ+  P  Q GS
Subjt:  LVDQYVRDLDSDYSDPEED--------QVGSTQDGAP--QAGS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298; e value: 1.7e-107; % identity: 82.38
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGECLAKDESGRSFFDVPTRFGNLVSIRPVPELTQVSFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGE LAKDES              V+IRPVPELTQ SFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGECLAKDESGRSFFDVPTRFGNLVSIRPVPELTQVSFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFSSGVKRKSKGRAHALEAAQSSKPATPAVAEPASEDPAPVIELESSGGPSREKRPRDQTEAVDARTEAVDVPPLGEEVREE
        AVRPIESSRPNSELAMVCGF+S VKRKSKG+AHALEAAQSSKP TPAV  PASEDPAPVIELESS GPSREKRPRDQ       TEAVDV PLGEEVREE
Subjt:  AVRPIESSRPNSELAMVCGFSSGVKRKSKGRAHALEAAQSSKPATPAVAEPASEDPAPVIELESSGGPSREKRPRDQTEAVDARTEAVDVPPLGEEVREE

Query:  APLKRRRKKKKAISPSEVGACTVLPASFADRVDDPAARMGGTSDVTALFRIEPSSSGVRDQ
         PLKRRRKKKK  SP EVGA  VLPASFADRVDDP ARMGGT DVT  FR+EPSSSGVRDQ
Subjt:  APLKRRRKKKKAISPSEVGACTVLPASFADRVDDPAARMGGTSDVTALFRIEPSSSGVRDQ

A0A6J1CLV1 uncharacterized protein LOC111012467; e value: 6.3e-102; % identity: 49.71
Query:  RRRKKKKAISPSEVGACTVLPASFADRVDDPAARMGGTSDVTALFRIEPSSSGVRDQVSRISAASLDRCLRRASKF-----------------AFVASIQ
        +RRKKKKAIS SEVGAC VLPA FADRVDDPAARMGGTSDVTA FRIEPSSSGVRDQVSRISAASLDRCLRRASKF                 AFVASIQ
Subjt:  RRRKKKKAISPSEVGACTVLPASFADRVDDPAARMGGTSDVTALFRIEPSSSGVRDQVSRISAASLDRCLRRASKF-----------------AFVASIQ

Query:  SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAKVESQ---------------------------------------------
        SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKA+VESQ                                             
Subjt:  SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAKVESQ---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAK
                                               AELLK+E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQALE KD  +    AEL+  K
Subjt:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAK

Query:  ERLNNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------Q
        ERL NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DL  LK+RYAEKWASGP GT GP +LVD+YVRDLDSDYSD +ED        +
Subjt:  ERLNNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------Q

Query:  VGSTQDGAP
        VG+TQ+G P
Subjt:  VGSTQDGAP

A0A6J1CR42 uncharacterized protein LOC111013826; e value: 7.4e-103; % identity: 94.61
Query:  IAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGECLAKDESGRSFFDVPTRFGNLVSIRPVPELTQVSFDTLKYYKERFPRGRKVGTLVTDELLL
        IAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGE LAKDESGRSFFDVPTRFGNLVSIRPVPELTQ SFDTLKYYKERFPRGRKVGTLVTDELLL
Subjt:  IAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGECLAKDESGRSFFDVPTRFGNLVSIRPVPELTQVSFDTLKYYKERFPRGRKVGTLVTDELLL

Query:  ESGLLDYNPAVRPIESSRPNSELAMVCGFSSGVKRKSKGRAHALEAAQSSKPATPAVAEPASEDPAPVIELESSGGPSREKRPRDQTEAVDARTEAVDVP
        ESGLLDYNPAVRPIE SRPNS LAMVC F+SGVKRKSKGRAHALEAAQSSKP TPAV  PASEDPAPVIELESSGGPSREKRPRDQTEAVDA+TEA DVP
Subjt:  ESGLLDYNPAVRPIESSRPNSELAMVCGFSSGVKRKSKGRAHALEAAQSSKPATPAVAEPASEDPAPVIELESSGGPSREKRPRDQTEAVDARTEAVDVP

Query:  PLGE
        PLGE
Subjt:  PLGE

A0A6J1D971 uncharacterized protein LOC111018538; e value: 5.8e-124; % identity: 88.42
Query:  GTSDVTALFRIEPSSSGVRDQVSRISAASLDRCLRRASKF-----------------AFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A  RIEPSSSGVRDQVSRISAASLDRCLRRASKF                 AFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTALFRIEPSSSGVRDQVSRISAASLDRCLRRASKF-----------------AFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAKVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLNNGVLLEESFRQHPDFD
        ELLKAHSEVETLKA+VESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERL+NGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAKVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLNNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQDGAPQAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQ+GA   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQDGAPQAGS

A0A6J1DZB3 uncharacterized protein LOC111025665; e value: 3.2e-178; % identity: 65.75
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGECLAKDESGRSFFDVPTRFGNLVSIRPVPELTQVSFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGE LAKDESGR+FFDVPTRFGNLVSI+ +PEL Q +FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGECLAKDESGRSFFDVPTRFGNLVSIRPVPELTQVSFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFSSGVKRKSKGRAHALEAAQSSKPATPAV--------AEPASEDPAPVIELESSGGPSREKRPRDQTEAVDARTEAVDVPP
         VR IE+SRPNSELAMVCGF+  VKRKSKGRAHAL+    ++P TP V        + P+S  P PVIEL+ SGG S EKR R++       +EA+DV P
Subjt:  AVRPIESSRPNSELAMVCGFSSGVKRKSKGRAHALEAAQSSKPATPAV--------AEPASEDPAPVIELESSGGPSREKRPRDQTEAVDARTEAVDVPP

Query:  LGEEVREEAPLKRRRKKKKAISPSEVGACTVLPASFADRVDDPAARMGGTSDVTALFRIEPSSSGVRDQVSRISAASLDRCLRRASKF------------
        L  EVR E+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V   F +EPSSSGV+DQVSRISA  LDR LRRASKF            
Subjt:  LGEEVREEAPLKRRRKKKKAISPSEVGACTVLPASFADRVDDPAARMGGTSDVTALFRIEPSSSGVRDQVSRISAASLDRCLRRASKF------------

Query:  -----AFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAKVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLL
             AF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+A+V+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLL
Subjt:  -----AFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAKVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLL

Query:  KEKDDMLQALEAKDKELEHATAELETAKERLNNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQA
        KEKDD+ Q LE KD  +   T EL+  KERL NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+
Subjt:  KEKDDMLQALEAKDKELEHATAELETAKERLNNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQA

Query:  LVDQYVRDLDSDYSDPEED--------QVGSTQDGAP--QAGS
        LVD+YVR+LDSDYSD EE+        +VG+TQ+  P  Q GS
Subjt:  LVDQYVRDLDSDYSDPEED--------QVGSTQDGAP--QAGS
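
The % identity values reported for the hits above can be related to the alignment midlines with a small sketch. It assumes the BLAST-style convention in which a letter on the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap, and that identity is the number of identical positions divided by the alignment length. The function name `percent_identity` and the toy strings are hypothetical and are not part of CuGenDB; the sketch is not claimed to reproduce the exact figures listed on this page.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style alignment midline.

    Letters on the midline mark identical residues. '+' (conservative
    substitution) and spaces (mismatch or gap) are not counted.
    aligned_length is the full alignment length, gaps included.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Toy example (hypothetical, not one of the hits above):
query = "MCARKGAGGIVK"
midline = "MCARKGA GIVK"
print(round(percent_identity(midline, len(query)), 2))
```

The alignment length is passed in separately because trailing spaces on a midline are easily lost when a page is copied as text, so the midline's own length is not a reliable denominator.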

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGATAGCTAAGAAGCCCGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGC
TTCCGGGGAATGTCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGTCT
CCTTCGATACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACTGACGAACTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCT
GCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTTCAAGCGGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGC
CGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGCAGAGCCTGCTTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCC
CCAGGGATCAGACCGAGGCGGTGGACGCCCGAACCGAGGCGGTGGATGTCCCGCCTTTGGGCGAGGAGGTGAGGGAGGAAGCCCCTCTGAAGCGAAGAAGGAAGAAAAAG
AAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCACGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGACGTGACGGC
ACTGTTCAGAATTGAGCCGTCAAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGCGTTCGTGG
CTTCCATTCAATCGGCTCTGGCTGTCAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAAGCTGCTTCCTCCACC
ATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCAAGGTGGAGTCCCAGGCCGAGCTACTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACT
CCGAGCTGCCCACGCTATTACCAGGGGCCTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGCTGGAGC
ATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAACAATGGAGTCCTACTGGAGGAATCATTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGATTTTTCT
GACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGATCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAATGGGCGTCTGGGCC
TGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGGGATCTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCCACTCAAGATGGTGCTC
CCCAAGCAGGCTCTTAG
mRNA sequence
ATGATAGCTAAGAAGCCCGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGC
TTCCGGGGAATGTCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGTCT
CCTTCGATACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACTGACGAACTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCT
GCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTTCAAGCGGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGC
CGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGCAGAGCCTGCTTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCC
CCAGGGATCAGACCGAGGCGGTGGACGCCCGAACCGAGGCGGTGGATGTCCCGCCTTTGGGCGAGGAGGTGAGGGAGGAAGCCCCTCTGAAGCGAAGAAGGAAGAAAAAG
AAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCACGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGACGTGACGGC
ACTGTTCAGAATTGAGCCGTCAAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGCGTTCGTGG
CTTCCATTCAATCGGCTCTGGCTGTCAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAAGCTGCTTCCTCCACC
ATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCAAGGTGGAGTCCCAGGCCGAGCTACTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACT
CCGAGCTGCCCACGCTATTACCAGGGGCCTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGCTGGAGC
ATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAACAATGGAGTCCTACTGGAGGAATCATTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGATTTTTCT
GACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGATCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAATGGGCGTCTGGGCC
TGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGGGATCTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCCACTCAAGATGGTGCTC
CCCAAGCAGGCTCTTAG
Protein sequence
MIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGECLAKDESGRSFFDVPTRFGNLVSIRPVPELTQVSFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
AVRPIESSRPNSELAMVCGFSSGVKRKSKGRAHALEAAQSSKPATPAVAEPASEDPAPVIELESSGGPSREKRPRDQTEAVDARTEAVDVPPLGEEVREEAPLKRRRKKK
KAISPSEVGACTVLPASFADRVDDPAARMGGTSDVTALFRIEPSSSGVRDQVSRISAASLDRCLRRASKFAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASST
MKDELLKAHSEVETLKAKVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLNNGVLLEESFRQHPDFDGFAKDFS
DAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQDGAPQAGS
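
As a consistency check between the CDS and protein records above, the CDS can be translated and compared with the listed protein. The following is a minimal sketch that assumes the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (whitespace and line breaks allowed) into protein.

    A single terminal stop codon is dropped from the result. Codons with
    characters other than A, C, G, T raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


# First 12 codons of the CDS above and the final codons including the stop.
print(translate("ATGATAGCTAAGAAGCCCGGTCGGTTCTATATGTGC"))
print(translate("CAAGCAGGCTCTTAG"))
```

Pasting the full CDS (line breaks included) into `translate` and comparing the result with the protein sequence, after joining its lines, extends the same check to the whole record.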