; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g27080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g27080
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:19954114..19956237
RNA-Seq ExpressionMoc04g27080
SyntenyMoc04g27080
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]2.4e-10653.03Show/hide
Query:  LSIKPIPELAQATFDTLKHYKDHFPRGWKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSSVKRKSKGRAHALKAVRSTEPTTPVGAQP
        LSIKPIPELAQATFDTLK YKD+FPRG KIGTLVTDKLLLESGLLDYNPLVRP+EASRPNSELAMVCGF+SSVKRKSKGRAHALK V+S++P TP   Q 
Subjt:  LSIKPIPELAQATFDTLKHYKDHFPRGWKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSSVKRKSKGRAHALKAVRSTEPTTPVGAQP

Query:  AAQDTAGACSEVPTPVVELESAGEHSREKRPRDESEALDVPPLNAVRGESPLKKRTKKKKTTSSSEIGTRGPLPASHVELVDNPEARMGGTSDVKMRFKI
        AAQD AG  S  PTPV+EL+S GE SREKR R ESEALDV PL  VR                                                     
Subjt:  AAQDTAGACSEVPTPVVELESAGEHSREKRPRDESEALDVPPLNAVRGESPLKKRTKKKKTTSSSEIGTRGPLPASHVELVDNPEARMGGTSDVKMRFKI

Query:  KPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALEAATMLKDELLKTQSEVDIL
                                                                                                            
Subjt:  KPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALEAATMLKDELLKTQSEVDIL

Query:  KAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDFDGFAKDFSDAGLK
            EAKAELLK EDERHK HLRAAHAITKGL+KEKFQLLKEKD+MLQALE  DA IGRL  +LK EKERLTNG LLE AFRQHPDFDGFAKDFSDAG K
Subjt:  KAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDFDGFAKDFSDAGLK

Query:  FLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY
        FLMK IA D+PHL++DLGDLKKRY EKWASGP+GT  P SLVDKY
Subjt:  FLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY

XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia]1.8e-10178Show/hide
Query:  MRFKIKPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALEAATMLKDELLKTQS
        MRF+++ SSSGVKDQVSRIS  CLDRCLRR S+FVSDPGS+LQRTID+A EAFIASIHSAVM+K ELDGR+AL AKER+N S  LEAAT LK ELLK Q 
Subjt:  MRFKIKPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALEAATMLKDELLKTQS

Query:  EVDILKAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDFDGFAKDFS
        EVDIL+AEV+AK +LLK E E+HK HLRAAHAITKGL+KEKFQLLKEKD++ Q LE  DA IGRLT +LK  KERLT+G LLEE+FRQHP+FDGFAKDFS
Subjt:  EVDILKAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDFDGFAKDFS

Query:  DAGLKFLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY
        DAG KFLMK IA DMPHLQIDL DLKKRY E WASGP+GTP PQSLVDKY
Subjt:  DAGLKFLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]1.2e-8969.26Show/hide
Query:  GTSDVKMRFKIKPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALE-AATMLKD
        G   +  + +I+PSSSGV+DQVSRIS A LDRCLRR SKFVS PGS+LQRTID+A EAF+ASI SA+ +K ELDGR+ LAA+E++  SAALE A++ +KD
Subjt:  GTSDVKMRFKIKPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALE-AATMLKD

Query:  ELLKTQSEVDILKAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDFD
        ELLK  SEV+ LKAEVE++AELLK E++R +  LRAAHAIT+GL++EKFQLLKEKD+MLQALE  D  +   T +L+  KERL+NGVLLEEAFRQHPDFD
Subjt:  ELLKTQSEVDILKAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDFD

Query:  GFAKDFSDAGLKFLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY
        GFAKDFSDAG KFLMK IA DMP LQIDL  LK+RY EKWASGP GTP PQ+LVD+Y
Subjt:  GFAKDFSDAGLKFLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]6.0e-10578.29Show/hide
Query:  MGGTSDVKMRFKIKPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALEAATMLK
        MGGT DV+ RF+++PSSSGVKDQVSRIS  CLDRCL+R SKFVSDPGS+LQRTID+A EAF+ASIHSA+M+K ELDGR+ALAAKER+NSSAALEAAT LK
Subjt:  MGGTSDVKMRFKIKPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALEAATMLK

Query:  DELLKTQSEVDILKAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDF
         ELLK Q EV IL+AEV+AKAELLK E E+HK HLRAAHAITKGL+KEKFQLLKEKD++ Q LE  D  IGRLT +LK  KERLTNG LLEE+FRQH DF
Subjt:  DELLKTQSEVDILKAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDF

Query:  DGFAKDFSDAGLKFLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY
        DGFAKDFSDAG KFLMK IA DMPHLQIDL +LKK+Y EKWASGP+GTP PQSLV KY
Subjt:  DGFAKDFSDAGLKFLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.4e-18376.96Show/hide
Query:  NPLSIKPIPELAQATFDTLKHYKDHFPRGWKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSSVKRKSKGRAHALKAVRSTEPTTPVGA
        N +SIK IPELAQATFDTLKHYKDHFPR  KI TLVTDKLLLESGLLDYNPLVR +EASRPNSELAMVCGF+ SVKRKSKGRAHALK V  TEP TP   
Subjt:  NPLSIKPIPELAQATFDTLKHYKDHFPRGWKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSSVKRKSKGRAHALKAVRSTEPTTPVGA

Query:  QPAAQDTAGACSEVPTPVVELESAGEHSREKRPRDESEALDVPPLNAVRGESPLKKRTKKKKTTSSSEIGTRGPLPASHVELVDNPEARMGGTSDVKMRF
        +  AQ  +G  S VPTPV+EL+ +G  S EKR R+ESEALDV PLN VRGESPL++R KKKKT+SSSE G RG LP SH +LVD+PEARM GTS+V+MRF
Subjt:  QPAAQDTAGACSEVPTPVVELESAGEHSREKRPRDESEALDVPPLNAVRGESPLKKRTKKKKTTSSSEIGTRGPLPASHVELVDNPEARMGGTSDVKMRF

Query:  KIKPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALEAATMLKDELLKTQSEVD
         ++PSSSGVKDQVSRIS  CLDR LRR SKFVSDPGS+LQRTID+  EAFIASIH AVM+K ELDGR+ALAAKER+NS AALEAAT LK ELLK Q EVD
Subjt:  KIKPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALEAATMLKDELLKTQSEVD

Query:  ILKAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDFDGFAKDFSDAG
        IL+AEV+AK +LLK E E+HK HLRAAHAITKGL+KEKFQLLKEKD++ Q LE  DA IGRLT +LK  KERLTNG LLEE+FRQHPDFDGFAKDFSDAG
Subjt:  ILKAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDFDGFAKDFSDAG

Query:  LKFLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY
         KFLMK IA DMPHLQIDL  LKK+Y EKWASGP+GTP PQSLVDKY
Subjt:  LKFLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124671.2e-10653.03Show/hide
Query:  LSIKPIPELAQATFDTLKHYKDHFPRGWKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSSVKRKSKGRAHALKAVRSTEPTTPVGAQP
        LSIKPIPELAQATFDTLK YKD+FPRG KIGTLVTDKLLLESGLLDYNPLVRP+EASRPNSELAMVCGF+SSVKRKSKGRAHALK V+S++P TP   Q 
Subjt:  LSIKPIPELAQATFDTLKHYKDHFPRGWKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSSVKRKSKGRAHALKAVRSTEPTTPVGAQP

Query:  AAQDTAGACSEVPTPVVELESAGEHSREKRPRDESEALDVPPLNAVRGESPLKKRTKKKKTTSSSEIGTRGPLPASHVELVDNPEARMGGTSDVKMRFKI
        AAQD AG  S  PTPV+EL+S GE SREKR R ESEALDV PL  VR                                                     
Subjt:  AAQDTAGACSEVPTPVVELESAGEHSREKRPRDESEALDVPPLNAVRGESPLKKRTKKKKTTSSSEIGTRGPLPASHVELVDNPEARMGGTSDVKMRFKI

Query:  KPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALEAATMLKDELLKTQSEVDIL
                                                                                                            
Subjt:  KPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALEAATMLKDELLKTQSEVDIL

Query:  KAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDFDGFAKDFSDAGLK
            EAKAELLK EDERHK HLRAAHAITKGL+KEKFQLLKEKD+MLQALE  DA IGRL  +LK EKERLTNG LLE AFRQHPDFDGFAKDFSDAG K
Subjt:  KAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDFDGFAKDFSDAGLK

Query:  FLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY
        FLMK IA D+PHL++DLGDLKKRY EKWASGP+GT  P SLVDKY
Subjt:  FLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY

A0A6J1D1N9 uncharacterized protein LOC1110161938.7e-10278Show/hide
Query:  MRFKIKPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALEAATMLKDELLKTQS
        MRF+++ SSSGVKDQVSRIS  CLDRCLRR S+FVSDPGS+LQRTID+A EAFIASIHSAVM+K ELDGR+AL AKER+N S  LEAAT LK ELLK Q 
Subjt:  MRFKIKPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALEAATMLKDELLKTQS

Query:  EVDILKAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDFDGFAKDFS
        EVDIL+AEV+AK +LLK E E+HK HLRAAHAITKGL+KEKFQLLKEKD++ Q LE  DA IGRLT +LK  KERLT+G LLEE+FRQHP+FDGFAKDFS
Subjt:  EVDILKAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDFDGFAKDFS

Query:  DAGLKFLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY
        DAG KFLMK IA DMPHLQIDL DLKKRY E WASGP+GTP PQSLVDKY
Subjt:  DAGLKFLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY

A0A6J1D971 uncharacterized protein LOC1110185385.9e-9069.26Show/hide
Query:  GTSDVKMRFKIKPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALE-AATMLKD
        G   +  + +I+PSSSGV+DQVSRIS A LDRCLRR SKFVS PGS+LQRTID+A EAF+ASI SA+ +K ELDGR+ LAA+E++  SAALE A++ +KD
Subjt:  GTSDVKMRFKIKPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALE-AATMLKD

Query:  ELLKTQSEVDILKAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDFD
        ELLK  SEV+ LKAEVE++AELLK E++R +  LRAAHAIT+GL++EKFQLLKEKD+MLQALE  D  +   T +L+  KERL+NGVLLEEAFRQHPDFD
Subjt:  ELLKTQSEVDILKAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDFD

Query:  GFAKDFSDAGLKFLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY
        GFAKDFSDAG KFLMK IA DMP LQIDL  LK+RY EKWASGP GTP PQ+LVD+Y
Subjt:  GFAKDFSDAGLKFLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY

A0A6J1DF31 uncharacterized protein LOC1110199092.9e-10578.29Show/hide
Query:  MGGTSDVKMRFKIKPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALEAATMLK
        MGGT DV+ RF+++PSSSGVKDQVSRIS  CLDRCL+R SKFVSDPGS+LQRTID+A EAF+ASIHSA+M+K ELDGR+ALAAKER+NSSAALEAAT LK
Subjt:  MGGTSDVKMRFKIKPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALEAATMLK

Query:  DELLKTQSEVDILKAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDF
         ELLK Q EV IL+AEV+AKAELLK E E+HK HLRAAHAITKGL+KEKFQLLKEKD++ Q LE  D  IGRLT +LK  KERLTNG LLEE+FRQH DF
Subjt:  DELLKTQSEVDILKAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDF

Query:  DGFAKDFSDAGLKFLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY
        DGFAKDFSDAG KFLMK IA DMPHLQIDL +LKK+Y EKWASGP+GTP PQSLV KY
Subjt:  DGFAKDFSDAGLKFLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY

A0A6J1DZB3 uncharacterized protein LOC1110256657.0e-18476.96Show/hide
Query:  NPLSIKPIPELAQATFDTLKHYKDHFPRGWKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSSVKRKSKGRAHALKAVRSTEPTTPVGA
        N +SIK IPELAQATFDTLKHYKDHFPR  KI TLVTDKLLLESGLLDYNPLVR +EASRPNSELAMVCGF+ SVKRKSKGRAHALK V  TEP TP   
Subjt:  NPLSIKPIPELAQATFDTLKHYKDHFPRGWKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSSVKRKSKGRAHALKAVRSTEPTTPVGA

Query:  QPAAQDTAGACSEVPTPVVELESAGEHSREKRPRDESEALDVPPLNAVRGESPLKKRTKKKKTTSSSEIGTRGPLPASHVELVDNPEARMGGTSDVKMRF
        +  AQ  +G  S VPTPV+EL+ +G  S EKR R+ESEALDV PLN VRGESPL++R KKKKT+SSSE G RG LP SH +LVD+PEARM GTS+V+MRF
Subjt:  QPAAQDTAGACSEVPTPVVELESAGEHSREKRPRDESEALDVPPLNAVRGESPLKKRTKKKKTTSSSEIGTRGPLPASHVELVDNPEARMGGTSDVKMRF

Query:  KIKPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALEAATMLKDELLKTQSEVD
         ++PSSSGVKDQVSRIS  CLDR LRR SKFVSDPGS+LQRTID+  EAFIASIH AVM+K ELDGR+ALAAKER+NS AALEAAT LK ELLK Q EVD
Subjt:  KIKPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDGRKALAAKERDNSSAALEAATMLKDELLKTQSEVD

Query:  ILKAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDFDGFAKDFSDAG
        IL+AEV+AK +LLK E E+HK HLRAAHAITKGL+KEKFQLLKEKD++ Q LE  DA IGRLT +LK  KERLTNG LLEE+FRQHPDFDGFAKDFSDAG
Subjt:  ILKAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNGVLLEEAFRQHPDFDGFAKDFSDAG

Query:  LKFLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY
         KFLMK IA DMPHLQIDL  LKK+Y EKWASGP+GTP PQSLVDKY
Subjt:  LKFLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGTTGTGGAAGACGTTTCGATCTGTCAGGCTGTCGGAGTACTCAAGTATTCCGTCGTTACGGATCTCGAGATGATCCTAGCCGCTTGTTCATTACACGTGTCGGA
CTATAAGCAGTTCGCCCCCCAACCCAACGATTCTGGGGAGGACTTAGCTCTTAGGTTAGAGTCCGAGCTTGAAGAGATAGAGAACCCATTATCAATCAAACCGATTCCCG
AGCTCGCTCAAGCCACCTTCGACACCCTCAAGCATTATAAGGATCACTTCCCAAGGGGCTGGAAAATCGGGACCTTGGTAACTGACAAACTGCTCCTCGAATCAGGGTTG
TTGGACTACAACCCCTTGGTGCGTCCGGTTGAAGCTTCAAGGCCAAATTCCGAACTCGCAATGGTGTGTGGATTCTCCAGCAGCGTGAAACGCAAATCCAAGGGTCGTGC
ACACGCCCTCAAGGCTGTTCGGAGCACGGAGCCAACGACCCCCGTCGGGGCTCAGCCTGCGGCTCAAGACACTGCTGGGGCATGTTCCGAAGTCCCAACTCCGGTGGTTG
AGTTGGAATCTGCTGGGGAGCACTCCAGAGAGAAGCGTCCAAGGGATGAGTCGGAGGCGCTGGACGTGCCTCCTCTGAACGCGGTGAGGGGAGAGTCCCCTTTGAAGAAA
AGAACGAAGAAGAAAAAGACTACCTCCTCCTCGGAGATTGGAACTCGTGGGCCCCTGCCTGCAAGTCATGTCGAATTGGTGGATAACCCCGAAGCTCGGATGGGGGGGAC
GTCTGATGTAAAGATGCGGTTCAAAATTAAACCGTCAAGCTCCGGGGTGAAGGACCAGGTGTCCCGCATCTCAACTGCATGCTTGGACCGCTGTCTGAGGAGAACGTCCA
AGTTTGTAAGCGATCCAGGGTCTATGCTGCAAAGGACCATTGACCACGCCGTCGAGGCGTTCATCGCTTCCATCCATTCAGCGGTTATGATCAAGGTCGAACTGGATGGA
AGAAAAGCTTTGGCAGCAAAGGAGAGGGATAACTCCTCTGCTGCCTTAGAGGCTGCCACCATGCTGAAGGACGAACTGCTGAAGACCCAGAGCGAGGTGGATATCTTGAA
GGCTGAGGTGGAAGCCAAGGCCGAGCTACTGAAGATGGAGGATGAGAGGCATAAGACCCACCTCCGAGCTGCCCACGCTATCACTAAAGGGCTGAAGAAGGAGAAGTTCC
AACTCCTAAAGGAGAAGGACGAAATGCTCCAGGCCCTCGAGACGATGGACGCTATGATAGGGCGCCTTACTGTCGACCTCAAGGTGGAGAAAGAACGCCTCACCAACGGA
GTTCTTCTGGAGGAAGCGTTCAGGCAGCACCCTGACTTTGATGGGTTTGCCAAGGACTTCAGCGACGCAGGCTTAAAATTTCTAATGAAAGACATTGCTGTTGATATGCC
CCACCTCCAGATCGACCTCGGCGATCTGAAGAAGAGGTATGTTGAGAAATGGGCCTCCGGTCCTGACGGTACTCCTAGCCCCCAGTCCCTGGTGGACAAGTACGCCGGGA
GCTGGACTCTGACTACTCCGACACGGAGGATGAAGACGCTCCAAGTCGAGAGCTCGCTAATGTTGGGATCACGCAAGAGGAAGTCCCTTCACAACAGGGCGGATCTCAGG
AGATGA
mRNA sequenceShow/hide mRNA sequence
ATGACCGTTGTGGAAGACGTTTCGATCTGTCAGGCTGTCGGAGTACTCAAGTATTCCGTCGTTACGGATCTCGAGATGATCCTAGCCGCTTGTTCATTACACGTGTCGGA
CTATAAGCAGTTCGCCCCCCAACCCAACGATTCTGGGGAGGACTTAGCTCTTAGGTTAGAGTCCGAGCTTGAAGAGATAGAGAACCCATTATCAATCAAACCGATTCCCG
AGCTCGCTCAAGCCACCTTCGACACCCTCAAGCATTATAAGGATCACTTCCCAAGGGGCTGGAAAATCGGGACCTTGGTAACTGACAAACTGCTCCTCGAATCAGGGTTG
TTGGACTACAACCCCTTGGTGCGTCCGGTTGAAGCTTCAAGGCCAAATTCCGAACTCGCAATGGTGTGTGGATTCTCCAGCAGCGTGAAACGCAAATCCAAGGGTCGTGC
ACACGCCCTCAAGGCTGTTCGGAGCACGGAGCCAACGACCCCCGTCGGGGCTCAGCCTGCGGCTCAAGACACTGCTGGGGCATGTTCCGAAGTCCCAACTCCGGTGGTTG
AGTTGGAATCTGCTGGGGAGCACTCCAGAGAGAAGCGTCCAAGGGATGAGTCGGAGGCGCTGGACGTGCCTCCTCTGAACGCGGTGAGGGGAGAGTCCCCTTTGAAGAAA
AGAACGAAGAAGAAAAAGACTACCTCCTCCTCGGAGATTGGAACTCGTGGGCCCCTGCCTGCAAGTCATGTCGAATTGGTGGATAACCCCGAAGCTCGGATGGGGGGGAC
GTCTGATGTAAAGATGCGGTTCAAAATTAAACCGTCAAGCTCCGGGGTGAAGGACCAGGTGTCCCGCATCTCAACTGCATGCTTGGACCGCTGTCTGAGGAGAACGTCCA
AGTTTGTAAGCGATCCAGGGTCTATGCTGCAAAGGACCATTGACCACGCCGTCGAGGCGTTCATCGCTTCCATCCATTCAGCGGTTATGATCAAGGTCGAACTGGATGGA
AGAAAAGCTTTGGCAGCAAAGGAGAGGGATAACTCCTCTGCTGCCTTAGAGGCTGCCACCATGCTGAAGGACGAACTGCTGAAGACCCAGAGCGAGGTGGATATCTTGAA
GGCTGAGGTGGAAGCCAAGGCCGAGCTACTGAAGATGGAGGATGAGAGGCATAAGACCCACCTCCGAGCTGCCCACGCTATCACTAAAGGGCTGAAGAAGGAGAAGTTCC
AACTCCTAAAGGAGAAGGACGAAATGCTCCAGGCCCTCGAGACGATGGACGCTATGATAGGGCGCCTTACTGTCGACCTCAAGGTGGAGAAAGAACGCCTCACCAACGGA
GTTCTTCTGGAGGAAGCGTTCAGGCAGCACCCTGACTTTGATGGGTTTGCCAAGGACTTCAGCGACGCAGGCTTAAAATTTCTAATGAAAGACATTGCTGTTGATATGCC
CCACCTCCAGATCGACCTCGGCGATCTGAAGAAGAGGTATGTTGAGAAATGGGCCTCCGGTCCTGACGGTACTCCTAGCCCCCAGTCCCTGGTGGACAAGTACGCCGGGA
GCTGGACTCTGACTACTCCGACACGGAGGATGAAGACGCTCCAAGTCGAGAGCTCGCTAATGTTGGGATCACGCAAGAGGAAGTCCCTTCACAACAGGGCGGATCTCAGG
AGATGA
Protein sequenceShow/hide protein sequence
MTVVEDVSICQAVGVLKYSVVTDLEMILAACSLHVSDYKQFAPQPNDSGEDLALRLESELEEIENPLSIKPIPELAQATFDTLKHYKDHFPRGWKIGTLVTDKLLLESGL
LDYNPLVRPVEASRPNSELAMVCGFSSSVKRKSKGRAHALKAVRSTEPTTPVGAQPAAQDTAGACSEVPTPVVELESAGEHSREKRPRDESEALDVPPLNAVRGESPLKK
RTKKKKTTSSSEIGTRGPLPASHVELVDNPEARMGGTSDVKMRFKIKPSSSGVKDQVSRISTACLDRCLRRTSKFVSDPGSMLQRTIDHAVEAFIASIHSAVMIKVELDG
RKALAAKERDNSSAALEAATMLKDELLKTQSEVDILKAEVEAKAELLKMEDERHKTHLRAAHAITKGLKKEKFQLLKEKDEMLQALETMDAMIGRLTVDLKVEKERLTNG
VLLEEAFRQHPDFDGFAKDFSDAGLKFLMKDIAVDMPHLQIDLGDLKKRYVEKWASGPDGTPSPQSLVDKYAGSWTLTTPTRRMKTLQVESSLMLGSRKRKSLHNRADLR
R