CuGenDBv2

Moc04g11310 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g11310
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr4:8520104..8521910
RNA-Seq Expression: Moc04g11310
Synteny: Moc04g11310
Gene Ontology terms: NA
InterPro domains: NA
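
The genome location field can be split into its parts programmatically. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. The record itself does not state the convention.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr4:8520104..8521910")
print(chrom, start, end, span)
```

Under that assumption the gene spans 1807 bp on chr4.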


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]; e-value: 1.7e-117; identity: 88.58%
Query:  MCARKGAGGIVKGPTSIKEWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVSELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIK WVRKWFYASGEWLAKDES              V+IRPV ELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKEWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVSELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPDSELAMVCGFASNVKRKSKGRVHALEAAQSSKPATPAVVGPASEDPAQVIELESSGGPSREKRPRDQTEAVDISPLGEEVREEAPLKRRR
        AVRPIESSRP+SELAMVCGFASNVKRKSKG+ HALEAAQSSKP TPAVVGPASEDPA VIELESS GPSREKRPRDQTEAVD+SPLGEEVREE PLKRRR
Subjt:  AVRPIESSRPDSELAMVCGFASNVKRKSKGRVHALEAAQSSKPATPAVVGPASEDPAQVIELESSGGPSREKRPRDQTEAVDISPLGEEVREEAPLKRRR

Query:  KKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQ
        KKKKTTSPLEVGARG LPASFADRVDDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQ

XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]; e-value: 5.7e-97; identity: 48.72%
Query:  RRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISAASLDRCLSRASKFVARSNFLL----GVFSQTFVASIQ
        +RRKKKK  S  EVGA   LPA FADRVDDP ARMGGTSDVT RFR+EPSSSGVRDQVSRISAASLDRCL RASKFV+    +L       ++ FVASIQ
Subjt:  RRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISAASLDRCLSRASKFVARSNFLL----GVFSQTFVASIQ

Query:  SALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEVLKAEV------------------------------------------------
        SALAVKAELDGRE LAAREKEEFSAALEAASSTMKDELLKAHSEVE LKAEV                                                
Subjt:  SALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEVLKAEV------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------EAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHVTAELETVK
                                            EAKAELLK+E++R KA LRAAHAITKGLEKEKFQLLKEKDDMLQALE K+  +  + AEL+  K
Subjt:  ------------------------------------EAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHVTAELETVK

Query:  ERLSNEVLLEESFRQHLDFDGFAKDFSDAGFKFLMKGIAFDMPDLQIDLGGLKKRYAEQWASGPGGTPGPQALVDKYVRYLDSDYSDLEED--------Q
        ERL+N  LLE +FRQH DFDGFAKDFSDAGFKFLMKGIA D+P L++DLG LKKRYAE+WASGP GT GP +LVDKYVR LDSDYSDL+ED        +
Subjt:  ERLSNEVLLEESFRQHLDFDGFAKDFSDAGFKFLMKGIAFDMPDLQIDLGGLKKRYAEQWASGPGGTPGPQALVDKYVRYLDSDYSDLEED--------Q

Query:  VGTTQEGAP
        VGTTQEG P
Subjt:  VGTTQEGAP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]; e-value: 2.6e-113; identity: 81.4%
Query:  GTSDVTVRFRVEPSSSGVRDQVSRISAASLDRCLSRASKFVARSNFLL----GVFSQTFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD
        G   +  + R+EPSSSGVRDQVSRISAASLDRCL RASKFV+    +L       ++ FVASIQSALAVKAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTVRFRVEPSSSGVRDQVSRISAASLDRCLSRASKFVARSNFLL----GVFSQTFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEVLKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHVTAELETVKERLSNEVLLEESFRQHLDFD
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+H TAELET KERLSN VLLEE+FRQH DFD
Subjt:  ELLKAHSEVEVLKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHVTAELETVKERLSNEVLLEESFRQHLDFD

Query:  GFAKDFSDAGFKFLMKGIAFDMPDLQIDLGGLKKRYAEQWASGPGGTPGPQALVDKYVRYLDSDYSDLEEDQVGTTQEGAPQAGS
        GFAKDFSDAGFKFLMKGIA DMPDLQIDL GLK+RYAE+WASGPGGTPGPQALVD+YVR LDSDYSD EEDQVG+TQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGIAFDMPDLQIDLGGLKKRYAEQWASGPGGTPGPQALVDKYVRYLDSDYSDLEEDQVGTTQEGAPQAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]; e-value: 1.1e-97; identity: 95.81%
Query:  IAKKPGRFYMCARKGAGGIVKGPTSIKEWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVSELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLL
        IAKKPGRFYMCARKGAGGIVKGPTSIK WVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPV ELTQASFDTLKYYKE FPRGRKVGTLVTD+LLL
Subjt:  IAKKPGRFYMCARKGAGGIVKGPTSIKEWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVSELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLL

Query:  ESGLLDYNPAVRPIESSRPDSELAMVCGFASNVKRKSKGRVHALEAAQSSKPATPAVVGPASEDPAQVIELESSGGPSREKRPRDQTEAVD
        ESGLLDYNPAVRPIESSRP+SELAMVCGFAS VKRKSKGR HALEAAQSSKPATPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  ESGLLDYNPAVRPIESSRPDSELAMVCGFASNVKRKSKGRVHALEAAQSSKPATPAVVGPASEDPAQVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]; e-value: 3.5e-187; identity: 68.47%
Query:  MCARKGAGGIVKGPTSIKEWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVSELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIK WV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ + EL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKEWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVSELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPDSELAMVCGFASNVKRKSKGRVHALEAAQSSKPATPAV--------VGPASEDPAQVIELESSGGPSREKRPRDQTEAVDISPLGEEVRE
         VR IE+SRP+SELAMVCGF  +VKRKSKGR HAL+    ++P TP V         GP+S  P  VIEL+ SGG S EKR R+++EA+D+SPL  EVR 
Subjt:  AVRPIESSRPDSELAMVCGFASNVKRKSKGRVHALEAAQSSKPATPAV--------VGPASEDPAQVIELESSGGPSREKRPRDQTEAVDISPLGEEVRE

Query:  EAPLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISAASLDRCLSRASKFVARSNFLL----GVFSQTF
        E+PL+RRRKKKKT+S  E GARG LP S AD VDDPEARM GTS+V +RF +EPSSSGV+DQVSRISA  LDR L RASKFV+    +L       ++ F
Subjt:  EAPLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISAASLDRCLSRASKFVARSNFLL----GVFSQTF

Query:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEVLKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV++L+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEVLKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML

Query:  QALEAKEEELKHVTAELETVKERLSNEVLLEESFRQHLDFDGFAKDFSDAGFKFLMKGIAFDMPDLQIDLGGLKKRYAEQWASGPGGTPGPQALVDKYVR
        Q LE K+  +  +T EL+ +KERL+N  LLEESFRQH DFDGFAKDFSDAGFKFLMKGIA DMP LQIDL GLKK+Y+E+WASGP GTP PQ+LVDKYVR
Subjt:  QALEAKEEELKHVTAELETVKERLSNEVLLEESFRQHLDFDGFAKDFSDAGFKFLMKGIAFDMPDLQIDLGGLKKRYAEQWASGPGGTPGPQALVDKYVR

Query:  YLDSDYSDLEED--------QVGTTQEGAP--QAGS
         LDSDYSD+EE+        +VGTTQE  P  Q GS
Subjt:  YLDSDYSDLEED--------QVGTTQEGAP--QAGS
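
The alignment blocks above look like BLAST-style pairwise output, where the middle line carries a letter for an identical residue, `+` for a similar one, and a blank for a mismatch or gap. Assuming that convention holds here, the identities in a single displayed block can be counted as sketched below. The function name `block_identity` is hypothetical, and the figure it gives covers one block only, so it need not equal the hit-level % identity reported by the page.

```python
def block_identity(query, midline):
    """Count identical columns in one alignment block.

    Assumption: BLAST-style midline (letter = identical, '+' = similar,
    blank = mismatch or gap). The midline is padded to the query length
    in case trailing blanks were stripped.
    """
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline[:len(query)] if ch.isalpha())
    return identical, len(query)


# Last block of the first GenBank hit (XP_022138041.1).
query = "KKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQ"
midline = "KKKKTTSPLEVGARG LPASFADRVDDPEARMGGT DVT RFRVEPSSSGVRDQ"

identical, columns = block_identity(query, midline)
print(f"{identical}/{columns} identical ({100 * identical / columns:.2f}%)")
```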

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298; e-value: 8.3e-118; identity: 88.58%
Query:  MCARKGAGGIVKGPTSIKEWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVSELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIK WVRKWFYASGEWLAKDES              V+IRPV ELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKEWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVSELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPDSELAMVCGFASNVKRKSKGRVHALEAAQSSKPATPAVVGPASEDPAQVIELESSGGPSREKRPRDQTEAVDISPLGEEVREEAPLKRRR
        AVRPIESSRP+SELAMVCGFASNVKRKSKG+ HALEAAQSSKP TPAVVGPASEDPA VIELESS GPSREKRPRDQTEAVD+SPLGEEVREE PLKRRR
Subjt:  AVRPIESSRPDSELAMVCGFASNVKRKSKGRVHALEAAQSSKPATPAVVGPASEDPAQVIELESSGGPSREKRPRDQTEAVDISPLGEEVREEAPLKRRR

Query:  KKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQ
        KKKKTTSPLEVGARG LPASFADRVDDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQ

A0A6J1CLV1 uncharacterized protein LOC111012467; e-value: 2.8e-97; identity: 48.72%
Query:  RRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISAASLDRCLSRASKFVARSNFLL----GVFSQTFVASIQ
        +RRKKKK  S  EVGA   LPA FADRVDDP ARMGGTSDVT RFR+EPSSSGVRDQVSRISAASLDRCL RASKFV+    +L       ++ FVASIQ
Subjt:  RRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISAASLDRCLSRASKFVARSNFLL----GVFSQTFVASIQ

Query:  SALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEVLKAEV------------------------------------------------
        SALAVKAELDGRE LAAREKEEFSAALEAASSTMKDELLKAHSEVE LKAEV                                                
Subjt:  SALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEVLKAEV------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------EAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHVTAELETVK
                                            EAKAELLK+E++R KA LRAAHAITKGLEKEKFQLLKEKDDMLQALE K+  +  + AEL+  K
Subjt:  ------------------------------------EAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHVTAELETVK

Query:  ERLSNEVLLEESFRQHLDFDGFAKDFSDAGFKFLMKGIAFDMPDLQIDLGGLKKRYAEQWASGPGGTPGPQALVDKYVRYLDSDYSDLEED--------Q
        ERL+N  LLE +FRQH DFDGFAKDFSDAGFKFLMKGIA D+P L++DLG LKKRYAE+WASGP GT GP +LVDKYVR LDSDYSDL+ED        +
Subjt:  ERLSNEVLLEESFRQHLDFDGFAKDFSDAGFKFLMKGIAFDMPDLQIDLGGLKKRYAEQWASGPGGTPGPQALVDKYVRYLDSDYSDLEED--------Q

Query:  VGTTQEGAP
        VGTTQEG P
Subjt:  VGTTQEGAP

A0A6J1D971 uncharacterized protein LOC111018538; e-value: 1.2e-113; identity: 81.4%
Query:  GTSDVTVRFRVEPSSSGVRDQVSRISAASLDRCLSRASKFVARSNFLL----GVFSQTFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD
        G   +  + R+EPSSSGVRDQVSRISAASLDRCL RASKFV+    +L       ++ FVASIQSALAVKAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTVRFRVEPSSSGVRDQVSRISAASLDRCLSRASKFVARSNFLL----GVFSQTFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEVLKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHVTAELETVKERLSNEVLLEESFRQHLDFD
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+H TAELET KERLSN VLLEE+FRQH DFD
Subjt:  ELLKAHSEVEVLKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHVTAELETVKERLSNEVLLEESFRQHLDFD

Query:  GFAKDFSDAGFKFLMKGIAFDMPDLQIDLGGLKKRYAEQWASGPGGTPGPQALVDKYVRYLDSDYSDLEEDQVGTTQEGAPQAGS
        GFAKDFSDAGFKFLMKGIA DMPDLQIDL GLK+RYAE+WASGPGGTPGPQALVD+YVR LDSDYSD EEDQVG+TQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGIAFDMPDLQIDLGGLKKRYAEQWASGPGGTPGPQALVDKYVRYLDSDYSDLEEDQVGTTQEGAPQAGS

A0A6J1DXS5 uncharacterized protein LOC111025502; e-value: 5.6e-98; identity: 95.81%
Query:  IAKKPGRFYMCARKGAGGIVKGPTSIKEWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVSELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLL
        IAKKPGRFYMCARKGAGGIVKGPTSIK WVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPV ELTQASFDTLKYYKE FPRGRKVGTLVTD+LLL
Subjt:  IAKKPGRFYMCARKGAGGIVKGPTSIKEWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVSELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLL

Query:  ESGLLDYNPAVRPIESSRPDSELAMVCGFASNVKRKSKGRVHALEAAQSSKPATPAVVGPASEDPAQVIELESSGGPSREKRPRDQTEAVD
        ESGLLDYNPAVRPIESSRP+SELAMVCGFAS VKRKSKGR HALEAAQSSKPATPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  ESGLLDYNPAVRPIESSRPDSELAMVCGFASNVKRKSKGRVHALEAAQSSKPATPAVVGPASEDPAQVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC111025665; e-value: 1.7e-187; identity: 68.47%
Query:  MCARKGAGGIVKGPTSIKEWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVSELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIK WV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ + EL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKEWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVSELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPDSELAMVCGFASNVKRKSKGRVHALEAAQSSKPATPAV--------VGPASEDPAQVIELESSGGPSREKRPRDQTEAVDISPLGEEVRE
         VR IE+SRP+SELAMVCGF  +VKRKSKGR HAL+    ++P TP V         GP+S  P  VIEL+ SGG S EKR R+++EA+D+SPL  EVR 
Subjt:  AVRPIESSRPDSELAMVCGFASNVKRKSKGRVHALEAAQSSKPATPAV--------VGPASEDPAQVIELESSGGPSREKRPRDQTEAVDISPLGEEVRE

Query:  EAPLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISAASLDRCLSRASKFVARSNFLL----GVFSQTF
        E+PL+RRRKKKKT+S  E GARG LP S AD VDDPEARM GTS+V +RF +EPSSSGV+DQVSRISA  LDR L RASKFV+    +L       ++ F
Subjt:  EAPLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISAASLDRCLSRASKFVARSNFLL----GVFSQTF

Query:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEVLKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV++L+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEVLKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML

Query:  QALEAKEEELKHVTAELETVKERLSNEVLLEESFRQHLDFDGFAKDFSDAGFKFLMKGIAFDMPDLQIDLGGLKKRYAEQWASGPGGTPGPQALVDKYVR
        Q LE K+  +  +T EL+ +KERL+N  LLEESFRQH DFDGFAKDFSDAGFKFLMKGIA DMP LQIDL GLKK+Y+E+WASGP GTP PQ+LVDKYVR
Subjt:  QALEAKEEELKHVTAELETVKERLSNEVLLEESFRQHLDFDGFAKDFSDAGFKFLMKGIAFDMPDLQIDLGGLKKRYAEQWASGPGGTPGPQALVDKYVR

Query:  YLDSDYSDLEED--------QVGTTQEGAP--QAGS
         LDSDYSD+EE+        +VGTTQE  P  Q GS
Subjt:  YLDSDYSDLEED--------QVGTTQEGAP--QAGS

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGGATAGTTAAGGGGCCGACCTCCATCAAGGAATGGGTGAGGAAGTGGTTCTACGC
TTCTGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCTCCGAGCTTACGCAAGCCT
CCTTCGACACGTTGAAATATTATAAGGAGCATTTTCCGAGGGGTAGAAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCT
GCAGTTCGTCCCATTGAATCCTCAAGGCCGGACTCCGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGTTCATGCTCTTGAGGC
CGCCCAGAGTTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCAAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCC
CCAGGGATCAGACCGAGGCGGTGGACATCTCGCCCTTGGGCGAGGAGGTGAGGGAGGAGGCCCCCCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAG
GTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGACAGTACGGTTTAGAGTCGAGCCGTC
AAGTTCTGGGGTGAGGGATCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTAAGCAGGGCGTCCAAATTTGTAGCTCGGTCTAATTTTCTTCTTGGTGTTT
TTTCTCAGACGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGAAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTG
GAGGCTGCTTCTTCCACCATGAAGGATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAGTTTTGAAGGCCGAGGTAGAGGCCAAGGCCGAGCTGCTGAAGAAAGAAGAGGA
CAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGA
AGGAGGAGGAGCTGAAGCATGTGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCTCAGCAATGAAGTCCTATTGGAGGAATCGTTTAGGCAACATCTTGACTTCGATGGA
TTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTTCGACATGCCTGACCTTCAGATCGATCTCGGTGGTCTTAAGAAGAGGTATGCTGA
GCAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGTCAGATATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCA
CCACTCAGGAGGGCGCTCCTCAGGCAGGCTCTTAG
mRNA sequence
ATGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGGATAGTTAAGGGGCCGACCTCCATCAAGGAATGGGTGAGGAAGTGGTTCTACGC
TTCTGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCTCCGAGCTTACGCAAGCCT
CCTTCGACACGTTGAAATATTATAAGGAGCATTTTCCGAGGGGTAGAAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCT
GCAGTTCGTCCCATTGAATCCTCAAGGCCGGACTCCGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGTTCATGCTCTTGAGGC
CGCCCAGAGTTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCAAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCC
CCAGGGATCAGACCGAGGCGGTGGACATCTCGCCCTTGGGCGAGGAGGTGAGGGAGGAGGCCCCCCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAG
GTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGACAGTACGGTTTAGAGTCGAGCCGTC
AAGTTCTGGGGTGAGGGATCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTAAGCAGGGCGTCCAAATTTGTAGCTCGGTCTAATTTTCTTCTTGGTGTTT
TTTCTCAGACGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGAAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTG
GAGGCTGCTTCTTCCACCATGAAGGATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAGTTTTGAAGGCCGAGGTAGAGGCCAAGGCCGAGCTGCTGAAGAAAGAAGAGGA
CAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGA
AGGAGGAGGAGCTGAAGCATGTGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCTCAGCAATGAAGTCCTATTGGAGGAATCGTTTAGGCAACATCTTGACTTCGATGGA
TTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTTCGACATGCCTGACCTTCAGATCGATCTCGGTGGTCTTAAGAAGAGGTATGCTGA
GCAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGTCAGATATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCA
CCACTCAGGAGGGCGCTCCTCAGGCAGGCTCTTAG
Protein sequence
MIAKKPGRFYMCARKGAGGIVKGPTSIKEWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVSELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
AVRPIESSRPDSELAMVCGFASNVKRKSKGRVHALEAAQSSKPATPAVVGPASEDPAQVIELESSGGPSREKRPRDQTEAVDISPLGEEVREEAPLKRRRKKKKTTSPLE
VGARGALPASFADRVDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISAASLDRCLSRASKFVARSNFLLGVFSQTFVASIQSALAVKAELDGREALAAREKEEFSAAL
EAASSTMKDELLKAHSEVEVLKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHVTAELETVKERLSNEVLLEESFRQHLDFDG
FAKDFSDAGFKFLMKGIAFDMPDLQIDLGGLKKRYAEQWASGPGGTPGPQALVDKYVRYLDSDYSDLEEDQVGTTQEGAPQAGS
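
As a consistency check, the CDS can be translated and compared with the protein sequence listed above. The sketch below is a minimal illustration, not the database's own pipeline: it assumes the standard genetic code (NCBI translation table 1), and the `translate` helper is hypothetical. Only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last five codons of the CDS shown above.
print(translate("ATGATAGCTAAGAAGCCTGGTCGGTTCTAT"))
print(translate("CAGGCAGGCTCTTAG"))
```

The two fragments translate to the start (`MIAKKPGRFY`) and end (`QAGS`) of the listed protein. Passing the full CDS with line breaks removed applies the same check to the whole sequence.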