; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g25060 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g25060
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:18191986..18195631
RNA-Seq ExpressionMoc04g25060
SyntenyMoc04g25060
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]1.4e-11789.2Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYA GEWLAKDES              V+IRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPVESSRPNSELAMVCGFASNVKRKSKGRAHAPEVAQSSKPATPAVVGPASKDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRR
        AVRP+ESSRPNSELAMVCGFASNVKRKSKG+AHA E AQSSKP TPAVVGPAS+DPAPVIELESS GPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRR
Subjt:  AVRPVESSRPNSELAMVCGFASNVKRKSKGRAHAPEVAQSSKPATPAVVGPASKDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRR

Query:  KKKKTTSPLEVGARGVLPASFADRVDDPEARIGGTPDVTARFRIEPSSSG
        KKKKTTSPLEVGARGVLPASFADRVDDPEAR+GGTPDVT RFR+EPSSSG
Subjt:  KKKKTTSPLEVGARGVLPASFADRVDDPEARIGGTPDVTARFRIEPSSSG

XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]1.0e-10451.78Show/hide
Query:  VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPVESSRPNSELAMVCGFASNVKRKSKGRAHAPEVAQSSKPATPAV---
        +SI+P+PEL QA+FDTLK+YK++FPRGRK+GTLVTDKLLLESGLLDYNP VRP+E+SRPNSELAMVCGF S+VKRKSKGRAHA ++ QSS P TPAV   
Subjt:  VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPVESSRPNSELAMVCGFASNVKRKSKGRAHAPEVAQSSKPATPAV---

Query:  -----VGPASKDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARIGGTPDVTARFR
              GP+S  P PVIEL+S+G  SREKR R ++EA+DVSPL  EVR                                                    
Subjt:  -----VGPASKDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARIGGTPDVTARFR

Query:  IEPSSSGRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLNKEEDRRKAQLRAAHAITK
                                                                                  EAKAELL +E++R KA LRAAHAITK
Subjt:  IEPSSSGRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLNKEEDRRKAQLRAAHAITK

Query:  GLEKEKFQLLKEKDDMLQALEAKKEELKHATAELETVKERLSNGALLEESFRQHPDFDGVAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYPEQWAS
        GLEKEKFQLLKEKDDMLQALE K   +    AEL+  KERL+NGALLE +FRQHPDFDG AKDFSDAGFKFLMKGIA+D+P L++DLG LKKRY E+WAS
Subjt:  GLEKEKFQLLKEKDDMLQALEAKKEELKHATAELETVKERLSNGALLEESFRQHPDFDGVAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYPEQWAS

Query:  GPSGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAPQVQ
        GP+GT GP +LVDKYVRDLDSDYSDL+ED        +VGTTQEG P  Q
Subjt:  GPSGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAPQVQ

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]3.9e-10477.5Show/hide
Query:  GTPDVTARFRIEPSSSG--------------------------------RTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSSSG                                RTIDYAAEAFVASIQSALAVKAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTPDVTARFRIEPSSSG--------------------------------RTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVEAKAELLNKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKKEELKHATAELETVKERLSNGALLEESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELL KEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK +EL+HATAELET KERLSNG LLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVEAKAELLNKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKKEELKHATAELETVKERLSNGALLEESFRQHPDFD

Query:  GVAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYPEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGA
        G AKDFSDAGFKFLMKGIASDMPDLQIDL GLK+RY E+WASGP GTPGPQALVD+YVRDLDSDYSD EEDQVG+TQEGA
Subjt:  GVAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYPEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGA

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]2.1e-9795.29Show/hide
Query:  IAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLL
        IAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA GEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLL
Subjt:  IAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLL

Query:  ESGLLDYNPAVRPVESSRPNSELAMVCGFASNVKRKSKGRAHAPEVAQSSKPATPAVVGPASKDPAPVIELESSGGPSREKRPRDQTEAVD
        ESGLLDYNPAVRP+ESSRPNSELAMVCGFAS VKRKSKGRAHA E AQSSKPATPAVVGPAS+DPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  ESGLLDYNPAVRPVESSRPNSELAMVCGFASNVKRKSKGRAHAPEVAQSSKPATPAVVGPASKDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.5e-18066.42Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+A GEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPVESSRPNSELAMVCGFASNVKRKSKGRAHAPEVAQSSKPATPAV--------VGPASKDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE
         VR +E+SRPNSELAMVCGF  +VKRKSKGRAHA +    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR 
Subjt:  AVRPVESSRPNSELAMVCGFASNVKRKSKGRAHAPEVAQSSKPATPAV--------VGPASKDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARIGGTPDVTARFRIEPSSSG--------------------------------RTIDYAAEAF
        E PL+RRRKKKKT+S  E GARG LP S AD VDDPEAR+ GT +V  RF +EPSSSG                                RTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARIGGTPDVTARFRIEPSSSG--------------------------------RTIDYAAEAF

Query:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLNKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+AK +LL KE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLNKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML

Query:  QALEAKKEELKHATAELETVKERLSNGALLEESFRQHPDFDGVAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYPEQWASGPSGTPGPQALVDKYVR
        Q LE K   +   T EL+ +KERL+NG LLEESFRQHPDFDG AKDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y E+WASGP+GTP PQ+LVDKYVR
Subjt:  QALEAKKEELKHATAELETVKERLSNGALLEESFRQHPDFDGVAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYPEQWASGPSGTPGPQALVDKYVR

Query:  DLDSDYSDLEED--------QVGTTQEGAPQVQ
        +LDSDYSD+EE+        +VGTTQE  P  Q
Subjt:  DLDSDYSDLEED--------QVGTTQEGAPQVQ

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092986.6e-11889.2Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYA GEWLAKDES              V+IRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPVESSRPNSELAMVCGFASNVKRKSKGRAHAPEVAQSSKPATPAVVGPASKDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRR
        AVRP+ESSRPNSELAMVCGFASNVKRKSKG+AHA E AQSSKP TPAVVGPAS+DPAPVIELESS GPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRR
Subjt:  AVRPVESSRPNSELAMVCGFASNVKRKSKGRAHAPEVAQSSKPATPAVVGPASKDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRR

Query:  KKKKTTSPLEVGARGVLPASFADRVDDPEARIGGTPDVTARFRIEPSSSG
        KKKKTTSPLEVGARGVLPASFADRVDDPEAR+GGTPDVT RFR+EPSSSG
Subjt:  KKKKTTSPLEVGARGVLPASFADRVDDPEARIGGTPDVTARFRIEPSSSG

A0A6J1CLV1 uncharacterized protein LOC1110124674.9e-10551.78Show/hide
Query:  VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPVESSRPNSELAMVCGFASNVKRKSKGRAHAPEVAQSSKPATPAV---
        +SI+P+PEL QA+FDTLK+YK++FPRGRK+GTLVTDKLLLESGLLDYNP VRP+E+SRPNSELAMVCGF S+VKRKSKGRAHA ++ QSS P TPAV   
Subjt:  VSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPVESSRPNSELAMVCGFASNVKRKSKGRAHAPEVAQSSKPATPAV---

Query:  -----VGPASKDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARIGGTPDVTARFR
              GP+S  P PVIEL+S+G  SREKR R ++EA+DVSPL  EVR                                                    
Subjt:  -----VGPASKDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARIGGTPDVTARFR

Query:  IEPSSSGRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLNKEEDRRKAQLRAAHAITK
                                                                                  EAKAELL +E++R KA LRAAHAITK
Subjt:  IEPSSSGRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLNKEEDRRKAQLRAAHAITK

Query:  GLEKEKFQLLKEKDDMLQALEAKKEELKHATAELETVKERLSNGALLEESFRQHPDFDGVAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYPEQWAS
        GLEKEKFQLLKEKDDMLQALE K   +    AEL+  KERL+NGALLE +FRQHPDFDG AKDFSDAGFKFLMKGIA+D+P L++DLG LKKRY E+WAS
Subjt:  GLEKEKFQLLKEKDDMLQALEAKKEELKHATAELETVKERLSNGALLEESFRQHPDFDGVAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYPEQWAS

Query:  GPSGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAPQVQ
        GP+GT GP +LVDKYVRDLDSDYSDL+ED        +VGTTQEG P  Q
Subjt:  GPSGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAPQVQ

A0A6J1D971 uncharacterized protein LOC1110185381.9e-10477.5Show/hide
Query:  GTPDVTARFRIEPSSSG--------------------------------RTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSSSG                                RTIDYAAEAFVASIQSALAVKAELDGRE LAAREKEEFSAALE ASSTMKD
Subjt:  GTPDVTARFRIEPSSSG--------------------------------RTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVEAKAELLNKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKKEELKHATAELETVKERLSNGALLEESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELL KEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK +EL+HATAELET KERLSNG LLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVEAKAELLNKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKKEELKHATAELETVKERLSNGALLEESFRQHPDFD

Query:  GVAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYPEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGA
        G AKDFSDAGFKFLMKGIASDMPDLQIDL GLK+RY E+WASGP GTPGPQALVD+YVRDLDSDYSD EEDQVG+TQEGA
Subjt:  GVAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYPEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGA

A0A6J1DXS5 uncharacterized protein LOC1110255029.9e-9895.29Show/hide
Query:  IAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLL
        IAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA GEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLL
Subjt:  IAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLL

Query:  ESGLLDYNPAVRPVESSRPNSELAMVCGFASNVKRKSKGRAHAPEVAQSSKPATPAVVGPASKDPAPVIELESSGGPSREKRPRDQTEAVD
        ESGLLDYNPAVRP+ESSRPNSELAMVCGFAS VKRKSKGRAHA E AQSSKPATPAVVGPAS+DPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  ESGLLDYNPAVRPVESSRPNSELAMVCGFASNVKRKSKGRAHAPEVAQSSKPATPAVVGPASKDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256657.2e-18166.42Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+A GEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPVESSRPNSELAMVCGFASNVKRKSKGRAHAPEVAQSSKPATPAV--------VGPASKDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE
         VR +E+SRPNSELAMVCGF  +VKRKSKGRAHA +    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR 
Subjt:  AVRPVESSRPNSELAMVCGFASNVKRKSKGRAHAPEVAQSSKPATPAV--------VGPASKDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARIGGTPDVTARFRIEPSSSG--------------------------------RTIDYAAEAF
        E PL+RRRKKKKT+S  E GARG LP S AD VDDPEAR+ GT +V  RF +EPSSSG                                RTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARIGGTPDVTARFRIEPSSSG--------------------------------RTIDYAAEAF

Query:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLNKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+AK +LL KE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAKAELLNKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML

Query:  QALEAKKEELKHATAELETVKERLSNGALLEESFRQHPDFDGVAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYPEQWASGPSGTPGPQALVDKYVR
        Q LE K   +   T EL+ +KERL+NG LLEESFRQHPDFDG AKDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y E+WASGP+GTP PQ+LVDKYVR
Subjt:  QALEAKKEELKHATAELETVKERLSNGALLEESFRQHPDFDGVAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYPEQWASGPSGTPGPQALVDKYVR

Query:  DLDSDYSDLEED--------QVGTTQEGAPQVQ
        +LDSDYSD+EE+        +VGTTQE  P  Q
Subjt:  DLDSDYSDLEED--------QVGTTQEGAPQVQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGC
TTTTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACACAAGCCT
CCTTCGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCT
GCAGTTCGTCCCGTTGAATCCTCAAGGCCGAACTCCGAATTAGCCATGGTTTGCGGGTTTGCGAGTAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCCTGAGGT
CGCCCAGAGTTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGAAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCC
CCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGAGGTGAGAGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAG
GTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATAGGCGGGACGCCCGACGTGACAGCACGGTTCAGAATCGAGCCGTC
AAGTTCTGGGAGGACCATCGACTACGCCGCTGAGGCATTTGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGG
AGAAAGAGGAGTTCTCTGCTGCCTTAGAGGCTGCCTCTTCCACCATGAAGGATGAACTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAGGTGGAAGCCAAG
GCCGAGCTGCTGAATAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGA
CGACATGCTCCAGGCGCTTGAAGCGAAGAAGGAGGAGCTGAAGCATGCGACTGCCGAGTTAGAGACGGTGAAGGAGCGTCTCAGCAATGGAGCCCTATTGGAGGAATCGT
TTAGGCAACATCCTGACTTCGATGGAGTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTC
GGTGGTCTGAAGAAGAGGTATCCTGAGCAATGGGCGTCTGGGCCTAGCGGCACCCCTGGTCCTCAAGCGTTGGTGGATAAGTACGTTAGAGATCTGGACTCTGACTACTC
CGACCTCGAAGAGGATCAGGTCGGCACCACTCAAGAGGGCGCTCCTCAAGTTCAGAGGCTTGATCATTTTGAACTTTTCATATTGTCTCTTTACCTTGAAGGCCCTCTTC
AGGGGTTAGGCATTTCAATAGAGGCAGGGAAAAGCCGCGTTGGTACAACGCTCCACTTCGGACCACGAACCGAGTTGCTTGCCTTGCCAACCTTCTGCGCTCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGC
TTTTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACACAAGCCT
CCTTCGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCT
GCAGTTCGTCCCGTTGAATCCTCAAGGCCGAACTCCGAATTAGCCATGGTTTGCGGGTTTGCGAGTAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCCTGAGGT
CGCCCAGAGTTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGAAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCC
CCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGAGGTGAGAGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAG
GTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATAGGCGGGACGCCCGACGTGACAGCACGGTTCAGAATCGAGCCGTC
AAGTTCTGGGAGGACCATCGACTACGCCGCTGAGGCATTTGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGG
AGAAAGAGGAGTTCTCTGCTGCCTTAGAGGCTGCCTCTTCCACCATGAAGGATGAACTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAGGTGGAAGCCAAG
GCCGAGCTGCTGAATAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGA
CGACATGCTCCAGGCGCTTGAAGCGAAGAAGGAGGAGCTGAAGCATGCGACTGCCGAGTTAGAGACGGTGAAGGAGCGTCTCAGCAATGGAGCCCTATTGGAGGAATCGT
TTAGGCAACATCCTGACTTCGATGGAGTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTC
GGTGGTCTGAAGAAGAGGTATCCTGAGCAATGGGCGTCTGGGCCTAGCGGCACCCCTGGTCCTCAAGCGTTGGTGGATAAGTACGTTAGAGATCTGGACTCTGACTACTC
CGACCTCGAAGAGGATCAGGTCGGCACCACTCAAGAGGGCGCTCCTCAAGTTCAGAGGCTTGATCATTTTGAACTTTTCATATTGTCTCTTTACCTTGAAGGCCCTCTTC
AGGGGTTAGGCATTTCAATAGAGGCAGGGAAAAGCCGCGTTGGTACAACGCTCCACTTCGGACCACGAACCGAGTTGCTTGCCTTGCCAACCTTCTGCGCTCCTTAG
Protein sequenceShow/hide protein sequence
MIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYAFGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
AVRPVESSRPNSELAMVCGFASNVKRKSKGRAHAPEVAQSSKPATPAVVGPASKDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLE
VGARGVLPASFADRVDDPEARIGGTPDVTARFRIEPSSSGRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVEAK
AELLNKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKKEELKHATAELETVKERLSNGALLEESFRQHPDFDGVAKDFSDAGFKFLMKGIASDMPDLQIDL
GGLKKRYPEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQVQRLDHFELFILSLYLEGPLQGLGISIEAGKSRVGTTLHFGPRTELLALPTFCAP