; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g07140 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g07140
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:5174311..5176082
RNA-Seq ExpressionMoc06g07140
SyntenyMoc06g07140
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]1.6e-8474.8Show/hide
Query:  MCASKGADGIVKGPTSIKGWVRKWFYASGEWLAKDESDRSFFDVPTRFGNLVPELTQASFDTLKYYNEHFPRGTKP------------------------
        MCA KGA GIVKGPTSIKGWVRKWFYASGEWLAKDES      V  R    VPELTQASFDTLKYY EHFPRG K                         
Subjt:  MCASKGADGIVKGPTSIKGWVRKWFYASGEWLAKDESDRSFFDVPTRFGNLVPELTQASFDTLKYYNEHFPRGTKP------------------------

Query:  -------------WFAGNVKRKSKRRAHALEAAQSSKPVTPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAVDVSPLGEEVREEVSLKRRRKKKKT
                      FA NVKRKSK +AHALEAAQSSKPVTPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAVDVSPLGEEVREEV LKRRRKKKKT
Subjt:  -------------WFAGNVKRKSKRRAHALEAAQSSKPVTPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAVDVSPLGEEVREEVSLKRRRKKKKT

Query:  TSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGV
        TSPLEV ARGVLPASFADRVDDPEARMGGT DVT RFRVEPSSSGV
Subjt:  TSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGV

XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]4.0e-8344.4Show/hide
Query:  RRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGV-----------------ARSNFL----------LGVFPQAFVASIQ
        +RRKKKK  S  EV A  VLPA FADRVDDP ARMGGTSDVTARFR+EPSSSGV                   S F+          +    +AFVASIQ
Subjt:  RRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGV-----------------ARSNFL----------LGVFPQAFVASIQ

Query:  SALAGKAELDEREVLAAREKKEFSAALEAASSTMKDELLKAHSEVEILKAEVET----------------------------------------------
        SALA KAELD REVLAAREK+EFSAALEAASSTMKDELLKAHSEVE LKAEVE+                                              
Subjt:  SALAGKAELDEREVLAAREKKEFSAALEAASSTMKDELLKAHSEVEILKAEVET----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------KAELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKDDVLQALEAKDEKLKHATAELETAK
                                              KAELLK+E+ R KA LRAAHAIT+GLEKEKFQLLKEKDD+LQALE KD  +    AEL+  K
Subjt:  --------------------------------------KAELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKDDVLQALEAKDEKLKHATAELETAK

Query:  ERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPDGTPGPQALVDKYVRDLDSDYSDLEED--------Q
        ERL+NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGI +D+P L++DL  LKKRYAE+WASGP+GT GP +LVDKYVRDLDSDYSDL+ED        +
Subjt:  ERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPDGTPGPQALVDKYVRDLDSDYSDLEED--------Q

Query:  VGTTQEGAP
        VGTTQEG P
Subjt:  VGTTQEGAP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]4.3e-10175.09Show/hide
Query:  GTSDVTARFRVEPSSSGV-----------------ARSNFL----------LGVFPQAFVASIQSALAGKAELDEREVLAAREKKEFSAALEAASSTMKD
        G   + A+ R+EPSSSGV                   S F+          +    +AFVASIQSALA KAELD REVLAAREK+EFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGV-----------------ARSNFL----------LGVFPQAFVASIQSALAGKAELDEREVLAAREKKEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVETKAELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKDDVLQALEAKDEKLKHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEE+RR+AQLRAAHAITRGLE+EKFQLLKEKDD+LQALEAKD++L+HATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVETKAELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKDDVLQALEAKDEKLKHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPDGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQADS
        GFAKDFSDAGFKFLMKGI SDMPDLQIDLSGLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD EEDQVG+TQEGA    S
Subjt:  GFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPDGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQADS

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]5.4e-8062.54Show/hide
Query:  MGGTSDVTARFRVEPSSSGV-----------------ARSNFL----------LGVFPQAFVASIQSALAGKAELDEREVLAAREKKEFSAALEAASSTM
        MGGT DV  RFR+EPSSSGV                   S F+          +    +AFVASI SA+  KAELD RE LAA+E++  SAALEAA +T+
Subjt:  MGGTSDVTARFRVEPSSSGV-----------------ARSNFL----------LGVFPQAFVASIQSALAGKAELDEREVLAAREKKEFSAALEAASSTM

Query:  KDELLKAHSEVEILKAEVETKAELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKDDVLQALEAKDEKLKHATAELETAKERLSNGVLLEESFRQHPD
        K ELLKA  EV IL+AEV+ KAELLKKE  + KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q LE KD  +   TAEL+  KERL+NG LLEESFRQH D
Subjt:  KDELLKAHSEVEILKAEVETKAELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKDDVLQALEAKDEKLKHATAELETAKERLSNGVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPDGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAP
        FDGFAKDFSDAGFKFLMKGI +DMP LQIDLS LKK+Y+E+WASGP+GTPGPQ+LV KYVR+LDSDYSD+EE+        ++GTTQE  P
Subjt:  FDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPDGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAP

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]2.6e-14358.11Show/hide
Query:  MCASKGADGIVKGPTSIKGWVRKWFYASGEWLAKDESDRSFFDVPTRFGNLV-----PELTQASFDTLKYYNEHFPRGTK--------------------
        MCA KG  GIVKGPTSIKGWV KWF+ASGEWLAKDES R+FFDVPTRFGNLV     PEL QA+FDTLK+Y +HFPR  K                    
Subjt:  MCASKGADGIVKGPTSIKGWVRKWFYASGEWLAKDESDRSFFDVPTRFGNLV-----PELTQASFDTLKYYNEHFPRGTK--------------------

Query:  -----------------PWFAGNVKRKSKRRAHALEAAQSSKPVTPAV--------VGPASEDPAPVIELESSRGPSREKRPRDQTEAVDVSPLGEEVRE
                           F G+VKRKSK RAHAL+    ++PVTP V         GP+S  P PVIEL+ S G S EKR R+++EA+DVSPL  EVR 
Subjt:  -----------------PWFAGNVKRKSKRRAHALEAAQSSKPVTPAV--------VGPASEDPAPVIELESSRGPSREKRPRDQTEAVDVSPLGEEVRE

Query:  EVSLKRRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGV-----------------ARSNFL----------LGVFPQAF
        E  L+RRRKKKKT+S  E  ARG LP S AD VDDPEARM GTS+V  RF +EPSSSGV                   S F+          +    +AF
Subjt:  EVSLKRRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGV-----------------ARSNFL----------LGVFPQAF

Query:  VASIQSALAGKAELDEREVLAAREKKEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKDDVL
        +ASI  A+  KAELD RE LAA+E++   AALEAA +T+K ELLKA  EV+IL+AEV+ K +LLKKE  + KA LRAAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAGKAELDEREVLAAREKKEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKDDVL

Query:  QALEAKDEKLKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPDGTPGPQALVDKYVR
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGI +DMP LQIDL+GLKK+Y+E+WASGP+GTP PQ+LVDKYVR
Subjt:  QALEAKDEKLKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPDGTPGPQALVDKYVR

Query:  DLDSDYSDLEED--------QVGTTQEGAP
        +LDSDYSD+EE+        +VGTTQE  P
Subjt:  DLDSDYSDLEED--------QVGTTQEGAP

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092987.9e-8574.8Show/hide
Query:  MCASKGADGIVKGPTSIKGWVRKWFYASGEWLAKDESDRSFFDVPTRFGNLVPELTQASFDTLKYYNEHFPRGTKP------------------------
        MCA KGA GIVKGPTSIKGWVRKWFYASGEWLAKDES      V  R    VPELTQASFDTLKYY EHFPRG K                         
Subjt:  MCASKGADGIVKGPTSIKGWVRKWFYASGEWLAKDESDRSFFDVPTRFGNLVPELTQASFDTLKYYNEHFPRGTKP------------------------

Query:  -------------WFAGNVKRKSKRRAHALEAAQSSKPVTPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAVDVSPLGEEVREEVSLKRRRKKKKT
                      FA NVKRKSK +AHALEAAQSSKPVTPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAVDVSPLGEEVREEV LKRRRKKKKT
Subjt:  -------------WFAGNVKRKSKRRAHALEAAQSSKPVTPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAVDVSPLGEEVREEVSLKRRRKKKKT

Query:  TSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGV
        TSPLEV ARGVLPASFADRVDDPEARMGGT DVT RFRVEPSSSGV
Subjt:  TSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGV

A0A6J1CLV1 uncharacterized protein LOC1110124671.9e-8344.4Show/hide
Query:  RRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGV-----------------ARSNFL----------LGVFPQAFVASIQ
        +RRKKKK  S  EV A  VLPA FADRVDDP ARMGGTSDVTARFR+EPSSSGV                   S F+          +    +AFVASIQ
Subjt:  RRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGV-----------------ARSNFL----------LGVFPQAFVASIQ

Query:  SALAGKAELDEREVLAAREKKEFSAALEAASSTMKDELLKAHSEVEILKAEVET----------------------------------------------
        SALA KAELD REVLAAREK+EFSAALEAASSTMKDELLKAHSEVE LKAEVE+                                              
Subjt:  SALAGKAELDEREVLAAREKKEFSAALEAASSTMKDELLKAHSEVEILKAEVET----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------KAELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKDDVLQALEAKDEKLKHATAELETAK
                                              KAELLK+E+ R KA LRAAHAIT+GLEKEKFQLLKEKDD+LQALE KD  +    AEL+  K
Subjt:  --------------------------------------KAELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKDDVLQALEAKDEKLKHATAELETAK

Query:  ERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPDGTPGPQALVDKYVRDLDSDYSDLEED--------Q
        ERL+NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGI +D+P L++DL  LKKRYAE+WASGP+GT GP +LVDKYVRDLDSDYSDL+ED        +
Subjt:  ERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPDGTPGPQALVDKYVRDLDSDYSDLEED--------Q

Query:  VGTTQEGAP
        VGTTQEG P
Subjt:  VGTTQEGAP

A0A6J1D971 uncharacterized protein LOC1110185382.1e-10175.09Show/hide
Query:  GTSDVTARFRVEPSSSGV-----------------ARSNFL----------LGVFPQAFVASIQSALAGKAELDEREVLAAREKKEFSAALEAASSTMKD
        G   + A+ R+EPSSSGV                   S F+          +    +AFVASIQSALA KAELD REVLAAREK+EFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGV-----------------ARSNFL----------LGVFPQAFVASIQSALAGKAELDEREVLAAREKKEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVETKAELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKDDVLQALEAKDEKLKHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEE+RR+AQLRAAHAITRGLE+EKFQLLKEKDD+LQALEAKD++L+HATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVETKAELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKDDVLQALEAKDEKLKHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPDGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQADS
        GFAKDFSDAGFKFLMKGI SDMPDLQIDLSGLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD EEDQVG+TQEGA    S
Subjt:  GFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPDGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQADS

A0A6J1DF31 uncharacterized protein LOC1110199092.6e-8062.54Show/hide
Query:  MGGTSDVTARFRVEPSSSGV-----------------ARSNFL----------LGVFPQAFVASIQSALAGKAELDEREVLAAREKKEFSAALEAASSTM
        MGGT DV  RFR+EPSSSGV                   S F+          +    +AFVASI SA+  KAELD RE LAA+E++  SAALEAA +T+
Subjt:  MGGTSDVTARFRVEPSSSGV-----------------ARSNFL----------LGVFPQAFVASIQSALAGKAELDEREVLAAREKKEFSAALEAASSTM

Query:  KDELLKAHSEVEILKAEVETKAELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKDDVLQALEAKDEKLKHATAELETAKERLSNGVLLEESFRQHPD
        K ELLKA  EV IL+AEV+ KAELLKKE  + KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q LE KD  +   TAEL+  KERL+NG LLEESFRQH D
Subjt:  KDELLKAHSEVEILKAEVETKAELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKDDVLQALEAKDEKLKHATAELETAKERLSNGVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPDGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAP
        FDGFAKDFSDAGFKFLMKGI +DMP LQIDLS LKK+Y+E+WASGP+GTPGPQ+LV KYVR+LDSDYSD+EE+        ++GTTQE  P
Subjt:  FDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPDGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAP

A0A6J1DZB3 uncharacterized protein LOC1110256651.3e-14358.11Show/hide
Query:  MCASKGADGIVKGPTSIKGWVRKWFYASGEWLAKDESDRSFFDVPTRFGNLV-----PELTQASFDTLKYYNEHFPRGTK--------------------
        MCA KG  GIVKGPTSIKGWV KWF+ASGEWLAKDES R+FFDVPTRFGNLV     PEL QA+FDTLK+Y +HFPR  K                    
Subjt:  MCASKGADGIVKGPTSIKGWVRKWFYASGEWLAKDESDRSFFDVPTRFGNLV-----PELTQASFDTLKYYNEHFPRGTK--------------------

Query:  -----------------PWFAGNVKRKSKRRAHALEAAQSSKPVTPAV--------VGPASEDPAPVIELESSRGPSREKRPRDQTEAVDVSPLGEEVRE
                           F G+VKRKSK RAHAL+    ++PVTP V         GP+S  P PVIEL+ S G S EKR R+++EA+DVSPL  EVR 
Subjt:  -----------------PWFAGNVKRKSKRRAHALEAAQSSKPVTPAV--------VGPASEDPAPVIELESSRGPSREKRPRDQTEAVDVSPLGEEVRE

Query:  EVSLKRRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGV-----------------ARSNFL----------LGVFPQAF
        E  L+RRRKKKKT+S  E  ARG LP S AD VDDPEARM GTS+V  RF +EPSSSGV                   S F+          +    +AF
Subjt:  EVSLKRRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGV-----------------ARSNFL----------LGVFPQAF

Query:  VASIQSALAGKAELDEREVLAAREKKEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKDDVL
        +ASI  A+  KAELD RE LAA+E++   AALEAA +T+K ELLKA  EV+IL+AEV+ K +LLKKE  + KA LRAAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAGKAELDEREVLAAREKKEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKDDVL

Query:  QALEAKDEKLKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPDGTPGPQALVDKYVR
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGI +DMP LQIDL+GLKK+Y+E+WASGP+GTP PQ+LVDKYVR
Subjt:  QALEAKDEKLKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPDGTPGPQALVDKYVR

Query:  DLDSDYSDLEED--------QVGTTQEGAP
        +LDSDYSD+EE+        +VGTTQE  P
Subjt:  DLDSDYSDLEED--------QVGTTQEGAP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCGCAAGTAAAGGCGCAGACGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTC
AGATCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACATTGAAATATTACAATGAGCATTTTCCGAGGGGTA
CGAAGCCATGGTTTGCGGGTAACGTGAAACGCAAGTCCAAGCGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGTCACTCCTGCTGTGGTAGGGCCAGCC
TCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTAGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGA
GGTGAGGGAGGAAGTCTCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCAGAGCTCGTGGGGTCCTCCCTGCAAGCTTCGCAGATCGGGTGG
ACGATCCTGAAGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTAGCTCGGTCTAATTTTCTTCTTGGTGTTTTTCCT
CAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGGAAAGGCCGAGCTGGATGAGAGGGAAGTTCTGGCAGCGAGGGAGAAAAAGGAGTTCTCTGCTGCCTTGGAGGC
TGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAGAACAGAC
GCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCCTGAAGGAGAAGGACGACGTGCTCCAGGCGCTTGAAGCGAAGGAT
GAGAAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGTCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGC
CAAAGACTTCTCTGACGCGGGCTTTAAGTTCCTCATGAAAGGCATTACTTCCGACATGCCTGACCTTCAGATCGATCTCAGTGGTCTAAAAAAGAGGTATGCCGAGCAGT
GGGCGTCTGGGCCTGACGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCACCACT
CAGGAGGGCGCTCCTCAGGCAGACTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGCGCAAGTAAAGGCGCAGACGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTC
AGATCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACATTGAAATATTACAATGAGCATTTTCCGAGGGGTA
CGAAGCCATGGTTTGCGGGTAACGTGAAACGCAAGTCCAAGCGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGTCACTCCTGCTGTGGTAGGGCCAGCC
TCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTAGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGA
GGTGAGGGAGGAAGTCTCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCAGAGCTCGTGGGGTCCTCCCTGCAAGCTTCGCAGATCGGGTGG
ACGATCCTGAAGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTAGCTCGGTCTAATTTTCTTCTTGGTGTTTTTCCT
CAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGGAAAGGCCGAGCTGGATGAGAGGGAAGTTCTGGCAGCGAGGGAGAAAAAGGAGTTCTCTGCTGCCTTGGAGGC
TGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAGAACAGAC
GCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCCTGAAGGAGAAGGACGACGTGCTCCAGGCGCTTGAAGCGAAGGAT
GAGAAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGTCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGC
CAAAGACTTCTCTGACGCGGGCTTTAAGTTCCTCATGAAAGGCATTACTTCCGACATGCCTGACCTTCAGATCGATCTCAGTGGTCTAAAAAAGAGGTATGCCGAGCAGT
GGGCGTCTGGGCCTGACGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCACCACT
CAGGAGGGCGCTCCTCAGGCAGACTCTTAG
Protein sequenceShow/hide protein sequence
MCASKGADGIVKGPTSIKGWVRKWFYASGEWLAKDESDRSFFDVPTRFGNLVPELTQASFDTLKYYNEHFPRGTKPWFAGNVKRKSKRRAHALEAAQSSKPVTPAVVGPA
SEDPAPVIELESSRGPSREKRPRDQTEAVDVSPLGEEVREEVSLKRRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVARSNFLLGVFP
QAFVASIQSALAGKAELDEREVLAAREKKEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEENRRKAQLRAAHAITRGLEKEKFQLLKEKDDVLQALEAKD
EKLKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKKRYAEQWASGPDGTPGPQALVDKYVRDLDSDYSDLEEDQVGTT
QEGAPQADS