CuGenDBv2

Moc09g26660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g26660
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr9:19879521..19883424
RNA-Seq Expression: Moc09g26660
Synteny: Moc09g26660
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
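
The genome location field follows a `sequence:start..end` pattern. The following is a minimal Python sketch for splitting it and computing the span. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a string such as 'chr9:19879521..19883424' into (name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (not confirmed by the page).
    return end - start + 1


name, start, end = parse_location("chr9:19879521..19883424")
print(name, start, end, span_length(start, end))
```

Under that assumption the gene spans 3,904 bp on chr9.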


Homology
GenBank top hits (e value, % identity, alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia] (e value 5.5e-102, 84.94% identity)
Query:  MCARKGACGIVRGPTFIKGCVGKWFYASGEWLVKDESGRSFFDVPTRFGNLASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPN
        MCARKGACGIV+GPT IKG V KWFYASGEWL KDES  +   VP      ASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPN
Subjt:  MCARKGACGIVRGPTFIKGCVGKWFYASGEWLVKDESGRSFFDVPTRFGNLASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPN

Query:  SELAMVCGFASSVKRKSKGQAHALEAAQSSKPTTPAVAGPASEDPAPVIELEYSGGPSREKRPRDQTEAVDALPLGEEVREEDPLKRRRKKKKAISPSEV
        SELAMVCGFAS+VKRKSKGQAHALEAAQSSKP TPAV GPASEDPAPVIELE S GPSREKRPRDQTEAVD  PLGEEVREE PLKRRRKKKK  SP EV
Subjt:  SELAMVCGFASSVKRKSKGQAHALEAAQSSKPTTPAVAGPASEDPAPVIELEYSGGPSREKRPRDQTEAVDALPLGEEVREEDPLKRRRKKKKAISPSEV

Query:  GACGVLPASFADRVDDPAARMGGTSDVTARFRVQPSSAG
        GA GVLPASFADRVDDP ARMGGT DVT RFRV+PSS+G
Subjt:  GACGVLPASFADRVDDPAARMGGTSDVTARFRVQPSSAG

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] (e value 3.0e-108, 86.07% identity)
Query:  PHGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVRGPTFIKGCVGKWFYASGEWLVKDESGRSFFDVPTRFGNL
        P+GWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIV+GPT IKG V KWFYASGEWL KDESGRSFFDVPTRFGNL
Subjt:  PHGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVRGPTFIKGCVGKWFYASGEWLVKDESGRSFFDVPTRFGNL

Query:  -----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGQAHALEAAQSSKPTTPAVAGP
                   ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS VKRKSKG+AHALEAAQSSKP TPAV GP
Subjt:  -----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGQAHALEAAQSSKPTTPAVAGP

Query:  ASEDPAPVIELEYSGGPSREKRPRDQTEAVDAL-------PLGE
        ASEDPAPVIELE SGGPSREKRPRDQTEAVDA        PLGE
Subjt:  ASEDPAPVIELEYSGGPSREKRPRDQTEAVDAL-------PLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] (e value 3.5e-88, 92.89% identity)
Query:  SAGAFVASIQSALAVKAELDGREVLATGEKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLQAAYAITRGLEKEKFQLLKE
        +A AFVASIQSALAVKAELDGREVLA  EKEEFSAALE ASSTMKDELLKAHSEVE LKAEVESQAELLKKEEDRR+AQL+AA+AITRGLE+EKFQLLKE
Subjt:  SAGAFVASIQSALAVKAELDGREVLATGEKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLQAAYAITRGLEKEKFQLLKE

Query:  KDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTLAPK
        KDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGT  P+
Subjt:  KDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTLAPK

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] (e value 3.4e-128, 72.96% identity)
Query:  MSSSFSSSLGSDEDLARRLESELEKIENFRFSDDGEDSDAS----GLEYPSRIPEHYLGSLRRGISLPD-------------------WV----------
        MSSS SS+L  + DLARRLES+LE+IEN R SDDGEDSDAS    GLEYPSRIPEHYLGSLRRG ++P+                   WV          
Subjt:  MSSSFSSSLGSDEDLARRLESELEKIENFRFSDDGEDSDAS----GLEYPSRIPEHYLGSLRRGISLPD-------------------WV----------

Query:  ------------------GSGSSGPHGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVRGPTFIKGCVGKWFYA
                                P+GWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGA GIV+GPT IKG V KWFYA
Subjt:  ------------------GSGSSGPHGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVRGPTFIKGCVGKWFYA

Query:  SGEWLVKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRK
        SGEWL KDESGRSFFDVPTRFGNL           ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLVKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRK

Query:  SKGQAHALEAAQSSKPTTPAVAGPASEDPAPVIELEYSGGPSREKRPRDQTEAVD
        SKG+AHALEAAQSSKP TPAV GPASEDPA VIELE SGGPSREKRPRDQTEAVD
Subjt:  SKGQAHALEAAQSSKPTTPAVAGPASEDPAPVIELEYSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] (e value 4.6e-149, 61.79% identity)
Query:  MCARKGACGIVRGPTFIKGCVGKWFYASGEWLVKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG  GIV+GPT IKG VGKWF+ASGEWL KDESGR+FFDVPTRFGNL           A+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGACGIVRGPTFIKGCVGKWFYASGEWLVKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSVKRKSKGQAHALEAAQSSKPTTPAV--------AGPASEDPAPVIELEYSGGPSREKRPRDQTEAVDALPLGEEVRE
         VR IE+SRPNSELAMVCGF  SVKRKSKG+AHAL+    ++P TP V        +GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASSVKRKSKGQAHALEAAQSSKPTTPAV--------AGPASEDPAPVIELEYSGGPSREKRPRDQTEAVDALPLGEEVRE

Query:  EDPLKRRRKKKKAISPSEVGACGVLPASFADRVDDPAARMGGTSDVTARFRVQPSSAG----------------------------------------AF
        E PL+RRRKKKK  S SE GA G LP S AD VDDP ARM GTS+V  RF ++PSS+G                                        AF
Subjt:  EDPLKRRRKKKKAISPSEVGACGVLPASFADRVDDPAARMGGTSDVTARFRVQPSSAG----------------------------------------AF

Query:  VASIQSALAVKAELDGREVLATGEKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLQAAYAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LA  E+E   AALEAA +T+K ELLKA  EV+IL+AEV+++ +LLKKE ++ KA L+AA+AIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLATGEKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLQAAYAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTLAPK
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GT  P+
Subjt:  QALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTLAPK
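
The % identity reported for each hit can be cross-checked against the alignment rows. The sketch below assumes the usual BLAST-style convention that the middle row shows a letter where query and subject are identical, `+` for a conservative substitution, and a space otherwise, and that identity is the number of identical columns divided by the total number of alignment columns (gaps included). The function name `percent_identity` is hypothetical.

```python
def percent_identity(query_rows, midline_rows):
    """Percent identity from BLAST-style alignment rows.

    query_rows   -- the residue strings from the 'Query:' rows (gaps as '-')
    midline_rows -- the matching middle rows; letters mark identical columns
    """
    aligned = 0
    identical = 0
    for query, midline in zip(query_rows, midline_rows):
        aligned += len(query)  # gap columns count toward the alignment length
        # Trailing spaces of the middle row may have been stripped, so only
        # the characters that are present are inspected.
        identical += sum(1 for ch in midline[:len(query)] if ch.isalpha())
    if aligned == 0:
        raise ValueError("empty alignment")
    return 100.0 * identical / aligned


# Toy example: 5 identical columns out of 7.
print(round(percent_identity(["ACDEFGH"], ["ACD+F H"]), 2))
```

Applied to the XP_022150343.1 alignment (197 columns), this should give a value consistent with the reported 92.89%.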

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298 (e value 2.7e-102, 84.94% identity)
Query:  MCARKGACGIVRGPTFIKGCVGKWFYASGEWLVKDESGRSFFDVPTRFGNLASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPN
        MCARKGACGIV+GPT IKG V KWFYASGEWL KDES  +   VP      ASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPN
Subjt:  MCARKGACGIVRGPTFIKGCVGKWFYASGEWLVKDESGRSFFDVPTRFGNLASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPN

Query:  SELAMVCGFASSVKRKSKGQAHALEAAQSSKPTTPAVAGPASEDPAPVIELEYSGGPSREKRPRDQTEAVDALPLGEEVREEDPLKRRRKKKKAISPSEV
        SELAMVCGFAS+VKRKSKGQAHALEAAQSSKP TPAV GPASEDPAPVIELE S GPSREKRPRDQTEAVD  PLGEEVREE PLKRRRKKKK  SP EV
Subjt:  SELAMVCGFASSVKRKSKGQAHALEAAQSSKPTTPAVAGPASEDPAPVIELEYSGGPSREKRPRDQTEAVDALPLGEEVREEDPLKRRRKKKKAISPSEV

Query:  GACGVLPASFADRVDDPAARMGGTSDVTARFRVQPSSAG
        GA GVLPASFADRVDDP ARMGGT DVT RFRV+PSS+G
Subjt:  GACGVLPASFADRVDDPAARMGGTSDVTARFRVQPSSAG

A0A6J1CR42 uncharacterized protein LOC111013826 (e value 1.5e-108, 86.07% identity)
Query:  PHGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVRGPTFIKGCVGKWFYASGEWLVKDESGRSFFDVPTRFGNL
        P+GWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGA GIV+GPT IKG V KWFYASGEWL KDESGRSFFDVPTRFGNL
Subjt:  PHGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVRGPTFIKGCVGKWFYASGEWLVKDESGRSFFDVPTRFGNL

Query:  -----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGQAHALEAAQSSKPTTPAVAGP
                   ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS VKRKSKG+AHALEAAQSSKP TPAV GP
Subjt:  -----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGQAHALEAAQSSKPTTPAVAGP

Query:  ASEDPAPVIELEYSGGPSREKRPRDQTEAVDAL-------PLGE
        ASEDPAPVIELE SGGPSREKRPRDQTEAVDA        PLGE
Subjt:  ASEDPAPVIELEYSGGPSREKRPRDQTEAVDAL-------PLGE

A0A6J1D971 uncharacterized protein LOC111018538 (e value 1.7e-88, 92.89% identity)
Query:  SAGAFVASIQSALAVKAELDGREVLATGEKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLQAAYAITRGLEKEKFQLLKE
        +A AFVASIQSALAVKAELDGREVLA  EKEEFSAALE ASSTMKDELLKAHSEVE LKAEVESQAELLKKEEDRR+AQL+AA+AITRGLE+EKFQLLKE
Subjt:  SAGAFVASIQSALAVKAELDGREVLATGEKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLQAAYAITRGLEKEKFQLLKE

Query:  KDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTLAPK
        KDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGT  P+
Subjt:  KDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTLAPK

A0A6J1DXS5 uncharacterized protein LOC111025502 (e value 1.7e-128, 72.96% identity)
Query:  MSSSFSSSLGSDEDLARRLESELEKIENFRFSDDGEDSDAS----GLEYPSRIPEHYLGSLRRGISLPD-------------------WV----------
        MSSS SS+L  + DLARRLES+LE+IEN R SDDGEDSDAS    GLEYPSRIPEHYLGSLRRG ++P+                   WV          
Subjt:  MSSSFSSSLGSDEDLARRLESELEKIENFRFSDDGEDSDAS----GLEYPSRIPEHYLGSLRRGISLPD-------------------WV----------

Query:  ------------------GSGSSGPHGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVRGPTFIKGCVGKWFYA
                                P+GWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGA GIV+GPT IKG V KWFYA
Subjt:  ------------------GSGSSGPHGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGACGIVRGPTFIKGCVGKWFYA

Query:  SGEWLVKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRK
        SGEWL KDESGRSFFDVPTRFGNL           ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLVKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRK

Query:  SKGQAHALEAAQSSKPTTPAVAGPASEDPAPVIELEYSGGPSREKRPRDQTEAVD
        SKG+AHALEAAQSSKP TPAV GPASEDPA VIELE SGGPSREKRPRDQTEAVD
Subjt:  SKGQAHALEAAQSSKPTTPAVAGPASEDPAPVIELEYSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC111025665 (e value 2.2e-149, 61.79% identity)
Query:  MCARKGACGIVRGPTFIKGCVGKWFYASGEWLVKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG  GIV+GPT IKG VGKWF+ASGEWL KDESGR+FFDVPTRFGNL           A+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGACGIVRGPTFIKGCVGKWFYASGEWLVKDESGRSFFDVPTRFGNL-----------ASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSVKRKSKGQAHALEAAQSSKPTTPAV--------AGPASEDPAPVIELEYSGGPSREKRPRDQTEAVDALPLGEEVRE
         VR IE+SRPNSELAMVCGF  SVKRKSKG+AHAL+    ++P TP V        +GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASSVKRKSKGQAHALEAAQSSKPTTPAV--------AGPASEDPAPVIELEYSGGPSREKRPRDQTEAVDALPLGEEVRE

Query:  EDPLKRRRKKKKAISPSEVGACGVLPASFADRVDDPAARMGGTSDVTARFRVQPSSAG----------------------------------------AF
        E PL+RRRKKKK  S SE GA G LP S AD VDDP ARM GTS+V  RF ++PSS+G                                        AF
Subjt:  EDPLKRRRKKKKAISPSEVGACGVLPASFADRVDDPAARMGGTSDVTARFRVQPSSAG----------------------------------------AF

Query:  VASIQSALAVKAELDGREVLATGEKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLQAAYAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LA  E+E   AALEAA +T+K ELLKA  EV+IL+AEV+++ +LLKKE ++ KA L+AA+AIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLATGEKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLQAAYAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTLAPK
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GT  P+
Subjt:  QALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTLAPK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGGGTTGCTAGAGTTTGCACCATCCGTAGGTTATTGCCGACAAGTTCGTGGGTCGCTAGATAACTCTTTAGATCTTGGGTGTCAAGCTTCTATTGCAATGATACTGCA
GATCTATGTGTCAAAGTTTGTCATGCCTGTAGGATCATTGGAGATCTCATTGCTCGTTGCTGTTCAAGGACGTGCCGGTGCAAGTTTCTATCGTAGGTTGCCGACGCTTG
CTCTCGTTGGAGGAGGGAGGAAAGGAGTGGAGACTGGTTTTAGTGGGAGAGAAAGAGAGAGATTAGGGTTTGATGGCGTGAATAGGAGAAAAAGAAAGGGCGTATTTCAG
ATTGCAGCTCGAACTCGGCCTGCAGACCGACCTGAACACTTGAGCAGACCTGCACAAAAAGGTAAGCACTCCGACGATCAAGTCAGTATAGGGTATTCTCTTCCCCAAAC
ATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGATTTGCTTTGGACGCGTGGCGACTTCCTATTCGTGGGAAAATATAACCGTCGCGGAA
GATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTTAGAGAGGATCCTAGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCA
CTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAGCTTAGGATCCGATGAGGACTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGAAGATAGAAAACTT
TAGATTCTCCGATGACGGGGAGGATAGTGACGCCTCGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGAATTTCTCTTCCGGACT
GGGTTGGCTCCGGCTCAAGTGGCCCCCATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACGTA
GACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGTGCAAGGAAAGGCGCATGCGGTATAGTTAGAGGGCCGACCTTCAT
CAAGGGATGTGTGGGGAAGTGGTTCTACGCTTCTGGGGAATGGCTCGTAAAGGACGAGTCAGGTCGTTCCTTCTTCGACGTCCCCACTAGGTTTGGGAACCTAGCCTCCT
TCGACACGTTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTCGATTACAACCCTGCA
GTTCGTCCCATCGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCCAAGGGCCAAGCCCATGCTCTTGAGGCCGC
CCAGAGTTCGAAACCTACCACCCCTGCCGTGGCAGGGCCTGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTATTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCA
GGGATCAGACCGAGGCGGTGGACGCCCTGCCTTTGGGCGAGGAGGTGAGGGAGGAAGACCCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTC
GGAGCTTGCGGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGTTTCAGAGTTCAGCCGTCGAG
TGCTGGGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAACGGGGGAGAAAGAGGAGTTCTCTGCTGCCTTGG
AGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGATTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAGGAAGAGGAC
AGGCGCAAGGCCCAACTCCAAGCTGCCTACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAA
GGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAACGGAGTCCTACTGGAGGAATCGTTTAGGCAGCATCCTGACTTCGATGGAT
TTGCCAAGGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAG
AAGTGGGCGTCTGGGCCTGGCGGCACCCTGGCCCCCAAACGTTGGTGGATCGGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAGGAGGACCAGGTCGGCGCC
GCACAGGAGGGCACTCCTCAGGCGGACTCTTAGGCGATCATCCTTCATGAGGCTTTTCTCTGTCTCTCTTCTCTTCCTTTTTTGTTTGTAA
mRNA sequence
ATGGGGTTGCTAGAGTTTGCACCATCCGTAGGTTATTGCCGACAAGTTCGTGGGTCGCTAGATAACTCTTTAGATCTTGGGTGTCAAGCTTCTATTGCAATGATACTGCA
GATCTATGTGTCAAAGTTTGTCATGCCTGTAGGATCATTGGAGATCTCATTGCTCGTTGCTGTTCAAGGACGTGCCGGTGCAAGTTTCTATCGTAGGTTGCCGACGCTTG
CTCTCGTTGGAGGAGGGAGGAAAGGAGTGGAGACTGGTTTTAGTGGGAGAGAAAGAGAGAGATTAGGGTTTGATGGCGTGAATAGGAGAAAAAGAAAGGGCGTATTTCAG
ATTGCAGCTCGAACTCGGCCTGCAGACCGACCTGAACACTTGAGCAGACCTGCACAAAAAGGTAAGCACTCCGACGATCAAGTCAGTATAGGGTATTCTCTTCCCCAAAC
ATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGATTTGCTTTGGACGCGTGGCGACTTCCTATTCGTGGGAAAATATAACCGTCGCGGAA
GATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTTAGAGAGGATCCTAGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCA
CTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAGCTTAGGATCCGATGAGGACTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGAAGATAGAAAACTT
TAGATTCTCCGATGACGGGGAGGATAGTGACGCCTCGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGAATTTCTCTTCCGGACT
GGGTTGGCTCCGGCTCAAGTGGCCCCCATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACGTA
GACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGTGCAAGGAAAGGCGCATGCGGTATAGTTAGAGGGCCGACCTTCAT
CAAGGGATGTGTGGGGAAGTGGTTCTACGCTTCTGGGGAATGGCTCGTAAAGGACGAGTCAGGTCGTTCCTTCTTCGACGTCCCCACTAGGTTTGGGAACCTAGCCTCCT
TCGACACGTTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTCGATTACAACCCTGCA
GTTCGTCCCATCGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCCAAGGGCCAAGCCCATGCTCTTGAGGCCGC
CCAGAGTTCGAAACCTACCACCCCTGCCGTGGCAGGGCCTGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTATTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCA
GGGATCAGACCGAGGCGGTGGACGCCCTGCCTTTGGGCGAGGAGGTGAGGGAGGAAGACCCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTC
GGAGCTTGCGGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGTTTCAGAGTTCAGCCGTCGAG
TGCTGGGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAACGGGGGAGAAAGAGGAGTTCTCTGCTGCCTTGG
AGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGATTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAGGAAGAGGAC
AGGCGCAAGGCCCAACTCCAAGCTGCCTACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAA
GGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAACGGAGTCCTACTGGAGGAATCGTTTAGGCAGCATCCTGACTTCGATGGAT
TTGCCAAGGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAG
AAGTGGGCGTCTGGGCCTGGCGGCACCCTGGCCCCCAAACGTTGGTGGATCGGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAGGAGGACCAGGTCGGCGCC
GCACAGGAGGGCACTCCTCAGGCGGACTCTTAGGCGATCATCCTTCATGAGGCTTTTCTCTGTCTCTCTTCTCTTCCTTTTTTGTTTGTAA
Protein sequence
MGLLEFAPSVGYCRQVRGSLDNSLDLGCQASIAMILQIYVSKFVMPVGSLEISLLVAVQGRAGASFYRRLPTLALVGGGRKGVETGFSGRERERLGFDGVNRRKRKGVFQ
IAARTRPADRPEHLSRPAQKGKHSDDQVSIGYSLPQTLAPSLSGPISTWQRSSFDLLWTRGDFLFVGKYNRRGRFIVGIFKYSDASDLREDPSRSLITRLEPLVGRSLPS
LSLSNVVAMSSSFSSSLGSDEDLARRLESELEKIENFRFSDDGEDSDASGLEYPSRIPEHYLGSLRRGISLPDWVGSGSSGPHGWGVIFALAILFWLRARDSEEAELLDV
DQLLACFEAKRIAKKPGRFYMCARKGACGIVRGPTFIKGCVGKWFYASGEWLVKDESGRSFFDVPTRFGNLASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPA
VRPIESSRPNSELAMVCGFASSVKRKSKGQAHALEAAQSSKPTTPAVAGPASEDPAPVIELEYSGGPSREKRPRDQTEAVDALPLGEEVREEDPLKRRRKKKKAISPSEV
GACGVLPASFADRVDDPAARMGGTSDVTARFRVQPSSAGAFVASIQSALAVKAELDGREVLATGEKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEED
RRKAQLQAAYAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAE
KWASGPGGTLAPKRWWIGMSEIWTLTTPISRRTRSAPHRRALLRRTLRRSSFMRLFSVSLLFLFCL
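
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code and a CDS starting in frame at its first base. The names `CODON_TABLE` and `translate` are illustrative, not part of any CuGenDB tool.

```python
# Standard genetic code, laid out with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS string; whitespace and line breaks are ignored.

    Stop codons are written as '*'; codons with unexpected characters as 'X'.
    Any trailing partial codon is dropped.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        protein.append(CODON_TABLE.get(cds[pos:pos + 3], "X"))
    return "".join(protein)


# First 18 bases of the CDS sequence listed above.
print(translate("ATGGGGTTGCTAGAGTTT"))
```

The first 18 bases translate to MGLLEF, which matches the start of the listed protein sequence. The final codons of the CDS (TGT TTG TAA) translate to C, L and a stop, which matches its end.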