; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g02410 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g02410
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:1944667..1946949
RNA-Seq ExpressionMoc07g02410
SyntenyMoc07g02410
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]1.3e-10256.07Show/hide
Query:  AVSIRPVPELTQASFDTLKYYKEHFPRGMKVGTLVTDKLLLESGLLDYNPAVRPIEASRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATPAV--
        A+SI+P+PEL QA+FDTLK+YK++FPRG K+GTLVTDKLLLESGLLDYNP VRPIEASRPNSELAMVCGF S+VKRKSKGRAHAL+  QSS+P TPAV  
Subjt:  AVSIRPVPELTQASFDTLKYYKEHFPRGMKVGTLVTDKLLLESGLLDYNPAVRPIEASRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATPAV--

Query:  ------AGPASEDPIPVIELEVLHGRSAQEVGACGVLPASFADRVDDPKARMGGTSDVTARFRIEPSSSGVRDQRTIDYAAEAFVASIQSALAVKAELDG
              AGP+S  P PVIEL+                                              S+G R                            
Subjt:  ------AGPASEDPIPVIELEVLHGRSAQEVGACGVLPASFADRVDDPKARMGGTSDVTARFRIEPSSSGVRDQRTIDYAAEAFVASIQSALAVKAELDG

Query:  REVLAAREKEEFSAALEAASSTMKDELLRAHSEVDILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATAE
             +REK           S  + E L    +V  L+   EAKAELLK+E++R KA LRAAHAITKGLEKEKFQLLKEKDDMLQALE KD  + R  AE
Subjt:  REVLAAREKEEFSAALEAASSTMKDELLRAHSEVDILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATAE

Query:  LETAKERLGNGALLEESFRQHPDFDGFAKDFFDAGFKFLMKGITSDMPDLQIDLSDLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED----
        L+  KERL NGALLE +FRQHPDFDGFAKDF DAGFKFLMKGI +D+P L++DL DLKKRYAE+WASGPNGT GP +LV+KYVRDLDSDYSDL+ED    
Subjt:  LETAKERLGNGALLEESFRQHPDFDGFAKDFFDAGFKFLMKGITSDMPDLQIDLSDLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED----

Query:  ----QVGTTQEG
            +VGTTQEG
Subjt:  ----QVGTTQEG

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]5.1e-9474.4Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIFFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRYYICARKGAGGLTR----------
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI FWLRARD+EEAELLDVDQLLACFEAKRIAKKPGR+Y+CARKGAGG+ +          
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIFFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRYYICARKGAGGLTR----------

Query:  ------------------------ILSRAVSIRPVPELTQASFDTLKYYKEHFPRGMKVGTLVTDKLLLESGLLDYNPAVRPIEASRPNSELAMVCGFAS
                                     VSIRPVPELTQASFDTLKYYKE FPRG KVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  ------------------------ILSRAVSIRPVPELTQASFDTLKYYKEHFPRGMKVGTLVTDKLLLESGLLDYNPAVRPIEASRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPIPVIELEVLHGRSAQE
         VKRKSKGRAHALEAAQSS+P TPAV GPASEDP PVIELE   G S ++
Subjt:  NVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPIPVIELEVLHGRSAQE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]6.8e-10778.25Show/hide
Query:  GTSDVTARFRIEPSSSGVRD----------------------------QRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSSSGVRD                            QRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSSGVRD----------------------------QRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLRAHSEVDILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFD
        ELL+AHSEV+ LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAKD+EL+ ATAELETAKERL NG LLEE+FRQHPDFD
Subjt:  ELLRAHSEVDILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFD

Query:  GFAKDFFDAGFKFLMKGITSDMPDLQIDLSDLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEEDQVGTTQEGAQ-TGS
        GFAKDF DAGFKFLMKGI SDMPDLQIDLS LK+RYAE+WASGP GTPGPQALV++YVRDLDSDYSD EEDQVG+TQEGA  TGS
Subjt:  GFAKDFFDAGFKFLMKGITSDMPDLQIDLSDLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEEDQVGTTQEGAQ-TGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.9e-14177.39Show/hide
Query:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRLPEHYFGPLRRGFAIPENILLRILEEGERADNPPDGWVTLYFKMFEYG
        MSSS SS+L  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSR+PEHY G LRRGFAIPENILLR+ EEGERADNPP+GWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRLPEHYFGPLRRGFAIPENILLRILEEGERADNPPDGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIFFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRYYICARKGAGGLTR---------------
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI FWLRARD+EEAEL DVDQLLACFEAKRIAKKPGR+Y+CARKGAGG+ +               
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIFFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRYYICARKGAGGLTR---------------

Query:  -------------------ILSRAVSIRPVPELTQASFDTLKYYKEHFPRGMKVGTLVTDKLLLESGLLDYNPAVRPIEASRPNSELAMVCGFASNVKRK
                                VSIRPVPELTQASFDTLKYYKE FPRG KVGTLVTD+LLLESGLLDYNPAVRPIE+SRPNSELAMVCGFAS VKRK
Subjt:  -------------------ILSRAVSIRPVPELTQASFDTLKYYKEHFPRGMKVGTLVTDKLLLESGLLDYNPAVRPIEASRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSEPATPAVAGPASEDPIPVIELEVLHGRSAQE
        SKGRAHALEAAQSS+PATPAV GPASEDP  VIELE   G S ++
Subjt:  SKGRAHALEAAQSSEPATPAVAGPASEDPIPVIELEVLHGRSAQE

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]2.7e-13560Show/hide
Query:  VSIRPVPELTQASFDTLKYYKEHFPRGMKVGTLVTDKLLLESGLLDYNPAVRPIEASRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATPAV---
        VSI+ +PEL QA+FDTLK+YK+HFPR  K+ TLVTDKLLLESGLLDYNP VR IEASRPNSELAMVCGF  +VKRKSKGRAHAL+    +EP TP V   
Subjt:  VSIRPVPELTQASFDTLKYYKEHFPRGMKVGTLVTDKLLLESGLLDYNPAVRPIEASRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATPAV---

Query:  -----AGPASEDPIPVIELEVLHGRSAQ-------------------------------------EVGACGVLPASFADRVDDPKARMGGTSDVTARFRI
             +GP+S  P PVIEL++  GRS +                                     E GA G LP S AD VDDP+ARM GTS+V  RF +
Subjt:  -----AGPASEDPIPVIELEVLHGRSAQ-------------------------------------EVGACGVLPASFADRVDDPKARMGGTSDVTARFRI

Query:  EPSSSGVRD----------------------------QRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLRAHSEVDI
        EPSSSGV+D                            QRTID  AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELL+A  EVDI
Subjt:  EPSSSGVRD----------------------------QRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLRAHSEVDI

Query:  LKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFAKDFFDAGF
        L+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE KD  + R T EL+  KERL NG LLEESFRQHPDFDGFAKDF DAGF
Subjt:  LKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFAKDFFDAGF

Query:  KFLMKGITSDMPDLQIDLSDLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED--------QVGTTQE
        KFLMKGI +DMP LQIDL+ LKK+Y+E+WASGPNGTP PQ+LV+KYVR+LDSDYSD+EE+        +VGTTQE
Subjt:  KFLMKGITSDMPDLQIDLSDLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED--------QVGTTQE

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124676.5e-10356.07Show/hide
Query:  AVSIRPVPELTQASFDTLKYYKEHFPRGMKVGTLVTDKLLLESGLLDYNPAVRPIEASRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATPAV--
        A+SI+P+PEL QA+FDTLK+YK++FPRG K+GTLVTDKLLLESGLLDYNP VRPIEASRPNSELAMVCGF S+VKRKSKGRAHAL+  QSS+P TPAV  
Subjt:  AVSIRPVPELTQASFDTLKYYKEHFPRGMKVGTLVTDKLLLESGLLDYNPAVRPIEASRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATPAV--

Query:  ------AGPASEDPIPVIELEVLHGRSAQEVGACGVLPASFADRVDDPKARMGGTSDVTARFRIEPSSSGVRDQRTIDYAAEAFVASIQSALAVKAELDG
              AGP+S  P PVIEL+                                              S+G R                            
Subjt:  ------AGPASEDPIPVIELEVLHGRSAQEVGACGVLPASFADRVDDPKARMGGTSDVTARFRIEPSSSGVRDQRTIDYAAEAFVASIQSALAVKAELDG

Query:  REVLAAREKEEFSAALEAASSTMKDELLRAHSEVDILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATAE
             +REK           S  + E L    +V  L+   EAKAELLK+E++R KA LRAAHAITKGLEKEKFQLLKEKDDMLQALE KD  + R  AE
Subjt:  REVLAAREKEEFSAALEAASSTMKDELLRAHSEVDILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATAE

Query:  LETAKERLGNGALLEESFRQHPDFDGFAKDFFDAGFKFLMKGITSDMPDLQIDLSDLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED----
        L+  KERL NGALLE +FRQHPDFDGFAKDF DAGFKFLMKGI +D+P L++DL DLKKRYAE+WASGPNGT GP +LV+KYVRDLDSDYSDL+ED    
Subjt:  LETAKERLGNGALLEESFRQHPDFDGFAKDFFDAGFKFLMKGITSDMPDLQIDLSDLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED----

Query:  ----QVGTTQEG
            +VGTTQEG
Subjt:  ----QVGTTQEG

A0A6J1CR42 uncharacterized protein LOC1110138262.5e-9474.4Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIFFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRYYICARKGAGGLTR----------
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI FWLRARD+EEAELLDVDQLLACFEAKRIAKKPGR+Y+CARKGAGG+ +          
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIFFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRYYICARKGAGGLTR----------

Query:  ------------------------ILSRAVSIRPVPELTQASFDTLKYYKEHFPRGMKVGTLVTDKLLLESGLLDYNPAVRPIEASRPNSELAMVCGFAS
                                     VSIRPVPELTQASFDTLKYYKE FPRG KVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  ------------------------ILSRAVSIRPVPELTQASFDTLKYYKEHFPRGMKVGTLVTDKLLLESGLLDYNPAVRPIEASRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPIPVIELEVLHGRSAQE
         VKRKSKGRAHALEAAQSS+P TPAV GPASEDP PVIELE   G S ++
Subjt:  NVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPIPVIELEVLHGRSAQE

A0A6J1D971 uncharacterized protein LOC1110185383.3e-10778.25Show/hide
Query:  GTSDVTARFRIEPSSSGVRD----------------------------QRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSSSGVRD                            QRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSSGVRD----------------------------QRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLRAHSEVDILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFD
        ELL+AHSEV+ LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAKD+EL+ ATAELETAKERL NG LLEE+FRQHPDFD
Subjt:  ELLRAHSEVDILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFD

Query:  GFAKDFFDAGFKFLMKGITSDMPDLQIDLSDLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEEDQVGTTQEGAQ-TGS
        GFAKDF DAGFKFLMKGI SDMPDLQIDLS LK+RYAE+WASGP GTPGPQALV++YVRDLDSDYSD EEDQVG+TQEGA  TGS
Subjt:  GFAKDFFDAGFKFLMKGITSDMPDLQIDLSDLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEEDQVGTTQEGAQ-TGS

A0A6J1DXS5 uncharacterized protein LOC1110255029.2e-14277.39Show/hide
Query:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRLPEHYFGPLRRGFAIPENILLRILEEGERADNPPDGWVTLYFKMFEYG
        MSSS SS+L  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSR+PEHY G LRRGFAIPENILLR+ EEGERADNPP+GWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRLPEHYFGPLRRGFAIPENILLRILEEGERADNPPDGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIFFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRYYICARKGAGGLTR---------------
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAI FWLRARD+EEAEL DVDQLLACFEAKRIAKKPGR+Y+CARKGAGG+ +               
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAIFFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRYYICARKGAGGLTR---------------

Query:  -------------------ILSRAVSIRPVPELTQASFDTLKYYKEHFPRGMKVGTLVTDKLLLESGLLDYNPAVRPIEASRPNSELAMVCGFASNVKRK
                                VSIRPVPELTQASFDTLKYYKE FPRG KVGTLVTD+LLLESGLLDYNPAVRPIE+SRPNSELAMVCGFAS VKRK
Subjt:  -------------------ILSRAVSIRPVPELTQASFDTLKYYKEHFPRGMKVGTLVTDKLLLESGLLDYNPAVRPIEASRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSEPATPAVAGPASEDPIPVIELEVLHGRSAQE
        SKGRAHALEAAQSS+PATPAV GPASEDP  VIELE   G S ++
Subjt:  SKGRAHALEAAQSSEPATPAVAGPASEDPIPVIELEVLHGRSAQE

A0A6J1DZB3 uncharacterized protein LOC1110256651.3e-13560Show/hide
Query:  VSIRPVPELTQASFDTLKYYKEHFPRGMKVGTLVTDKLLLESGLLDYNPAVRPIEASRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATPAV---
        VSI+ +PEL QA+FDTLK+YK+HFPR  K+ TLVTDKLLLESGLLDYNP VR IEASRPNSELAMVCGF  +VKRKSKGRAHAL+    +EP TP V   
Subjt:  VSIRPVPELTQASFDTLKYYKEHFPRGMKVGTLVTDKLLLESGLLDYNPAVRPIEASRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATPAV---

Query:  -----AGPASEDPIPVIELEVLHGRSAQ-------------------------------------EVGACGVLPASFADRVDDPKARMGGTSDVTARFRI
             +GP+S  P PVIEL++  GRS +                                     E GA G LP S AD VDDP+ARM GTS+V  RF +
Subjt:  -----AGPASEDPIPVIELEVLHGRSAQ-------------------------------------EVGACGVLPASFADRVDDPKARMGGTSDVTARFRI

Query:  EPSSSGVRD----------------------------QRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLRAHSEVDI
        EPSSSGV+D                            QRTID  AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELL+A  EVDI
Subjt:  EPSSSGVRD----------------------------QRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLRAHSEVDI

Query:  LKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFAKDFFDAGF
        L+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ Q LE KD  + R T EL+  KERL NG LLEESFRQHPDFDGFAKDF DAGF
Subjt:  LKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFAKDFFDAGF

Query:  KFLMKGITSDMPDLQIDLSDLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED--------QVGTTQE
        KFLMKGI +DMP LQIDL+ LKK+Y+E+WASGPNGTP PQ+LV+KYVR+LDSDYSD+EE+        +VGTTQE
Subjt:  KFLMKGITSDMPDLQIDLSDLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED--------QVGTTQE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAGTAGCGACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGACGACGGGGAGGA
TAGTGATGCTTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGTTACCTGAGCACTACTTTGGACCCCTTCGTAGGGGGTTCGCTATCCCTGAAAACATCCTCCTTA
GGATCCTGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGATGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTCGTCCAA
GAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTGGCCATTTTTTTTTGGTTACGAGCTCGGGACAATGAAGA
GGCCGAGCTATTGGATGTTGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCCAAGAAGCCTGGTCGGTACTATATATGCGCAAGGAAAGGCGCAGGAGGTCTAA
CTCGGATTTTGTCTCGTGCAGTATCAATCAGGCCAGTTCCCGAGCTTACTCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGCACTTTCCGAGGGGCATGAAGGTC
GGAACCTTGGTGACCGACAAGCTGCTGCTCGAGTCCGGGCTGTTAGATTACAACCCTGCAGTTCGTCCCATTGAAGCTTCAAGGCCGAACTCCGAACTAGCCATGGTTTG
CGGATTTGCGAGTAACGTAAAGCGCAAGTCAAAGGGCCGAGCCCATGCTCTCGAGGCCGCCCAGAGTTCGGAACCTGCAACTCCTGCTGTGGCAGGGCCAGCCTCAGAAG
ATCCAATCCCAGTGATCGAGCTGGAGGTCCTTCACGGGAGAAGCGCCCAAGAGGTCGGAGCTTGTGGGGTCCTGCCCGCGAGCTTCGCAGACCGGGTGGACGATCCTAAA
GCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAATTGAACCGTCAAGTTCTGGGGTGAGGGACCAGAGGACCATTGACTACGCTGCTGAGGCGTTTGTTGC
TTCCATTCAATCGGCCCTAGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAATTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCA
TGAAGGATGAGCTGCTAAGGGCTCACTCTGAGGTGGACATTCTGAAGGCCGAGGTGGAGGCCAAGGCCGAGCTGTTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTC
CGAGCTGCCCACGCTATCACCAAGGGCCTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTTCAGGCGCTTGAAGCGAAGGACGAGGAGCTGAAGCG
TGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGTCTCGGCAACGGAGCCCTGCTGGAGGAGTCTTTCAGGCAACATCCTGACTTCGACGGATTTGCCAAAGACTTTTTTG
ACGCGGGCTTCAAGTTCCTCATGAAGGGCATTACTTCCGACATGCCTGACCTTCAGATCGATCTCAGTGATCTGAAGAAGAGGTATGCCGAGCAGTGGGCTTCTGGGCCT
AACGGCACCCCTGGCCCCCAAGCGTTGGTGAATAAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCACCACTCAGGAGGGCGCTCA
AACAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAGTAGCGACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGACGACGGGGAGGA
TAGTGATGCTTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGTTACCTGAGCACTACTTTGGACCCCTTCGTAGGGGGTTCGCTATCCCTGAAAACATCCTCCTTA
GGATCCTGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGATGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTCGTCCAA
GAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTTGCTTTGGCCATTTTTTTTTGGTTACGAGCTCGGGACAATGAAGA
GGCCGAGCTATTGGATGTTGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCCAAGAAGCCTGGTCGGTACTATATATGCGCAAGGAAAGGCGCAGGAGGTCTAA
CTCGGATTTTGTCTCGTGCAGTATCAATCAGGCCAGTTCCCGAGCTTACTCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGCACTTTCCGAGGGGCATGAAGGTC
GGAACCTTGGTGACCGACAAGCTGCTGCTCGAGTCCGGGCTGTTAGATTACAACCCTGCAGTTCGTCCCATTGAAGCTTCAAGGCCGAACTCCGAACTAGCCATGGTTTG
CGGATTTGCGAGTAACGTAAAGCGCAAGTCAAAGGGCCGAGCCCATGCTCTCGAGGCCGCCCAGAGTTCGGAACCTGCAACTCCTGCTGTGGCAGGGCCAGCCTCAGAAG
ATCCAATCCCAGTGATCGAGCTGGAGGTCCTTCACGGGAGAAGCGCCCAAGAGGTCGGAGCTTGTGGGGTCCTGCCCGCGAGCTTCGCAGACCGGGTGGACGATCCTAAA
GCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAATTGAACCGTCAAGTTCTGGGGTGAGGGACCAGAGGACCATTGACTACGCTGCTGAGGCGTTTGTTGC
TTCCATTCAATCGGCCCTAGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAATTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCA
TGAAGGATGAGCTGCTAAGGGCTCACTCTGAGGTGGACATTCTGAAGGCCGAGGTGGAGGCCAAGGCCGAGCTGTTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTC
CGAGCTGCCCACGCTATCACCAAGGGCCTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTTCAGGCGCTTGAAGCGAAGGACGAGGAGCTGAAGCG
TGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGTCTCGGCAACGGAGCCCTGCTGGAGGAGTCTTTCAGGCAACATCCTGACTTCGACGGATTTGCCAAAGACTTTTTTG
ACGCGGGCTTCAAGTTCCTCATGAAGGGCATTACTTCCGACATGCCTGACCTTCAGATCGATCTCAGTGATCTGAAGAAGAGGTATGCCGAGCAGTGGGCTTCTGGGCCT
AACGGCACCCCTGGCCCCCAAGCGTTGGTGAATAAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCACCACTCAGGAGGGCGCTCA
AACAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRLPEHYFGPLRRGFAIPENILLRILEEGERADNPPDGWVTLYFKMFEYGLRLPLHPFVQ
EFLFRTGLAPAQVAPNGWGVIFALAIFFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRYYICARKGAGGLTRILSRAVSIRPVPELTQASFDTLKYYKEHFPRGMKV
GTLVTDKLLLESGLLDYNPAVRPIEASRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPIPVIELEVLHGRSAQEVGACGVLPASFADRVDDPK
ARMGGTSDVTARFRIEPSSSGVRDQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLRAHSEVDILKAEVEAKAELLKKEEDRRKAQL
RAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATAELETAKERLGNGALLEESFRQHPDFDGFAKDFFDAGFKFLMKGITSDMPDLQIDLSDLKKRYAEQWASGP
NGTPGPQALVNKYVRDLDSDYSDLEEDQVGTTQEGAQTGS