; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g23710 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g23710
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr3:16775976..16779349
RNA-Seq ExpressionMoc03g23710
SyntenyMoc03g23710
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]4.4e-10551.84Show/hide
Query:  EAPLKQRRKKKKAISPSEVGACRVLPANFPDRVDDPAARMGGTSDVTARFRVEPSSSGVREQVSRISAASLNRCLRRASKSLSDPGSVLQRTIDYAAEAF
        E P K RRKKKKAIS SEVGACRVLPA F DRVDDPAARMGGTSDVTARFR+EPSSSGVR+QVSRISAASL+RCLRRASK +S PGSVL R IDYAAEAF
Subjt:  EAPLKQRRKKKKAISPSEVGACRVLPANFPDRVDDPAARMGGTSDVTARFRVEPSSSGVREQVSRISAASLNRCLRRASKSLSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQ----------------------------------------
        VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVE LKAEVESQ                                        
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQ----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALDAKDKELEHATAE
                                                    AELLK+E++R KA LRAAHAIT+GLEKEKFQLLKEKDDMLQAL+ KD  +    AE
Subjt:  --------------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALDAKDKELEHATAE

Query:  LETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDS
        L+  KERL+NG LLE +FRQHPDFDGFAKDFSDAGFKF MKGIA+D+P L++DL  LK+RYAEKWASGP GT GP +LVD+YVRDLDS
Subjt:  LETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDS

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]8.0e-12392.4Show/hide
Query:  GTSDVTARFRVEPSSSGVREQVSRISAASLNRCLRRASKSLSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVR+QVSRISAASL+RCLRRASK +S PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVREQVSRISAASLNRCLRRASKSLSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALDAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVESQAELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQAL+AKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALDAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDS
        GFAKDFSDAGFKF MKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDS
Subjt:  GFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDS

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]5.7e-9774.72Show/hide
Query:  MGGTSDVTARFRVEPSSSGVREQVSRISAASLNRCLRRASKSLSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSSSGV++QVSRISA  L+RCL+RASK +SDPGSVLQRTID AAEAFVASI SA+ VKAELDGRE LAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRVEPSSSGVREQVSRISAASLNRCLRRASKSLSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALDAKDKELEHATAELETAKERLSNGVLLEESFRQHPD
        K ELLKA  EV IL+AEV+++AELLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q L+ KD  +   TAEL+  KERL+NG LLEESFRQH D
Subjt:  KDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALDAKDKELEHATAELETAKERLSNGVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDS
        FDGFAKDFSDAGFKF MKGIA+DMP LQIDLS LK++Y+EKWASGP GTPGPQ+LV +YVR+LDS
Subjt:  FDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]4.8e-12069.03Show/hide
Query:  TLGSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYIGSLRRGFAIPEDILLRLLEEGERADNPPEGWVTLYFKMFEYGLRL
        ++ SNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHY+GSLRRGFAIPE+ILLRL EEGERADNPPEGWVTLYFKMFEYGLRL
Subjt:  TLGSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYIGSLRRGFAIPEDILLRLLEEGERADNPPEGWVTLYFKMFEYGLRL

Query:  PLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGE
        PLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGE
Subjt:  PLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGE

Query:  WLAKDESGRSFFDVPTSF--------------------------------------------------NPT-----SPRAYTSL----------------
        WLAKDESGRSFFDVPT F                                                  NP      S R  + L                
Subjt:  WLAKDESGRSFFDVPTSF--------------------------------------------------NPT-----SPRAYTSL----------------

Query:  ----LRHAEILEGALPEGHGPASEDPAPGIELESSGGPSREKRPRDQTEAVD
            L  A+  + A P   GPASEDPA  IELESSGGPSREKRPRDQTEAVD
Subjt:  ----LRHAEILEGALPEGHGPASEDPAPGIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.5e-13456.75Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTSFN--------PTSPRAYTSLLRH------------------------------
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPT F         P   +A    L+H                              
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTSFN--------PTSPRAYTSLLRH------------------------------

Query:  -AEILEGALPEGH--------------------------------------------GPASEDPAPGIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE
           ++E + P                                               GP+S  P P IEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  -AEILEGALPEGH--------------------------------------------GPASEDPAPGIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE

Query:  EAPLKQRRKKKKAISPSEVGACRVLPANFPDRVDDPAARMGGTSDVTARFRVEPSSSGVREQVSRISAASLNRCLRRASKSLSDPGSVLQRTIDYAAEAF
        E+PL++RRKKKK  S SE GA   LP +  D VDDP ARM GTS+V  RF +EPSSSGV++QVSRISA  L+R LRRASK +SDPGSVLQRTID  AEAF
Subjt:  EAPLKQRRKKKKAISPSEVGACRVLPANFPDRVDDPAARMGGTSDVTARFRVEPSSSGVREQVSRISAASLNRCLRRASKSLSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+++ +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML

Query:  QALDAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVR
        Q L+ KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKF MKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR
Subjt:  QALDAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVR

Query:  DLDS
        +LDS
Subjt:  DLDS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124672.1e-10551.84Show/hide
Query:  EAPLKQRRKKKKAISPSEVGACRVLPANFPDRVDDPAARMGGTSDVTARFRVEPSSSGVREQVSRISAASLNRCLRRASKSLSDPGSVLQRTIDYAAEAF
        E P K RRKKKKAIS SEVGACRVLPA F DRVDDPAARMGGTSDVTARFR+EPSSSGVR+QVSRISAASL+RCLRRASK +S PGSVL R IDYAAEAF
Subjt:  EAPLKQRRKKKKAISPSEVGACRVLPANFPDRVDDPAARMGGTSDVTARFRVEPSSSGVREQVSRISAASLNRCLRRASKSLSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQ----------------------------------------
        VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVE LKAEVESQ                                        
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQ----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALDAKDKELEHATAE
                                                    AELLK+E++R KA LRAAHAIT+GLEKEKFQLLKEKDDMLQAL+ KD  +    AE
Subjt:  --------------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALDAKDKELEHATAE

Query:  LETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDS
        L+  KERL+NG LLE +FRQHPDFDGFAKDFSDAGFKF MKGIA+D+P L++DL  LK+RYAEKWASGP GT GP +LVD+YVRDLDS
Subjt:  LETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDS

A0A6J1D971 uncharacterized protein LOC1110185383.8e-12392.4Show/hide
Query:  GTSDVTARFRVEPSSSGVREQVSRISAASLNRCLRRASKSLSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVR+QVSRISAASL+RCLRRASK +S PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVREQVSRISAASLNRCLRRASKSLSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALDAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVESQAELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQAL+AKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALDAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDS
        GFAKDFSDAGFKF MKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDS
Subjt:  GFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDS

A0A6J1DF31 uncharacterized protein LOC1110199092.8e-9774.72Show/hide
Query:  MGGTSDVTARFRVEPSSSGVREQVSRISAASLNRCLRRASKSLSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSSSGV++QVSRISA  L+RCL+RASK +SDPGSVLQRTID AAEAFVASI SA+ VKAELDGRE LAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRVEPSSSGVREQVSRISAASLNRCLRRASKSLSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALDAKDKELEHATAELETAKERLSNGVLLEESFRQHPD
        K ELLKA  EV IL+AEV+++AELLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q L+ KD  +   TAEL+  KERL+NG LLEESFRQH D
Subjt:  KDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALDAKDKELEHATAELETAKERLSNGVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDS
        FDGFAKDFSDAGFKF MKGIA+DMP LQIDLS LK++Y+EKWASGP GTPGPQ+LV +YVR+LDS
Subjt:  FDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDS

A0A6J1DXS5 uncharacterized protein LOC1110255022.3e-12069.03Show/hide
Query:  TLGSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYIGSLRRGFAIPEDILLRLLEEGERADNPPEGWVTLYFKMFEYGLRL
        ++ SNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHY+GSLRRGFAIPE+ILLRL EEGERADNPPEGWVTLYFKMFEYGLRL
Subjt:  TLGSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYIGSLRRGFAIPEDILLRLLEEGERADNPPEGWVTLYFKMFEYGLRL

Query:  PLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGE
        PLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGE
Subjt:  PLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGE

Query:  WLAKDESGRSFFDVPTSF--------------------------------------------------NPT-----SPRAYTSL----------------
        WLAKDESGRSFFDVPT F                                                  NP      S R  + L                
Subjt:  WLAKDESGRSFFDVPTSF--------------------------------------------------NPT-----SPRAYTSL----------------

Query:  ----LRHAEILEGALPEGHGPASEDPAPGIELESSGGPSREKRPRDQTEAVD
            L  A+  + A P   GPASEDPA  IELESSGGPSREKRPRDQTEAVD
Subjt:  ----LRHAEILEGALPEGHGPASEDPAPGIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256657.5e-13556.75Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTSFN--------PTSPRAYTSLLRH------------------------------
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPT F         P   +A    L+H                              
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTSFN--------PTSPRAYTSLLRH------------------------------

Query:  -AEILEGALPEGH--------------------------------------------GPASEDPAPGIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE
           ++E + P                                               GP+S  P P IEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  -AEILEGALPEGH--------------------------------------------GPASEDPAPGIELESSGGPSREKRPRDQTEAVDAPPLGEEVRE

Query:  EAPLKQRRKKKKAISPSEVGACRVLPANFPDRVDDPAARMGGTSDVTARFRVEPSSSGVREQVSRISAASLNRCLRRASKSLSDPGSVLQRTIDYAAEAF
        E+PL++RRKKKK  S SE GA   LP +  D VDDP ARM GTS+V  RF +EPSSSGV++QVSRISA  L+R LRRASK +SDPGSVLQRTID  AEAF
Subjt:  EAPLKQRRKKKKAISPSEVGACRVLPANFPDRVDDPAARMGGTSDVTARFRVEPSSSGVREQVSRISAASLNRCLRRASKSLSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+++ +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML

Query:  QALDAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVR
        Q L+ KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKF MKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR
Subjt:  QALDAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVR

Query:  DLDS
        +LDS
Subjt:  DLDS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAAGCGCTCTGAACCGAAAAAATTTCATTCTTATCAAGTCCCCAGATAACCTCAGTCATCGGAATCACTACGAGGCAGCGGGTGTATTTCTGATTGCAGCTCGAAC
TCGGCCTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCCAGTTTAGTTCGAGGTCAGAAAT
CATCGTACCTGATCGTGGAGTCATACCTTACGTTCCCTGAATTCTTGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGCAACTTAGGATCCGATGAGGACCTAGCT
CGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGATTCTCCGATGACGGGGAGGATAGTGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGAT
ACCTGAGCACTACATCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGGACATCCTCCTCAGGCTTCTGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGG
TCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTCGTCCAAGAATTTCTCTTCCGGACGGGGTTGGCTCCGGCTCAAGTGGCCCCCAAT
GGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGACAGTGAGGAAGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAA
AAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGAAAAGGCGCAGGCGGTATAGTTAAAGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACG
CTTCTGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTCGACGTCCCCACTAGTTTCAATCCGACCAGTCCCCGAGCTTACACAAGTCTCCTTCGACACGCT
GAAATACTAGAAGGAGCGCTTCCCGAGGGCCATGGGCCTGCCTCGGAAGATCCAGCCCCAGGGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAG
GGATCAGACCGAGGCGGTGGACGCCCCGCCTTTGGGCGAGGAGGTGAGGGAGGAAGCCCCTCTGAAGCAAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCG
GAGCTTGCAGGGTCTTGCCTGCAAATTTCCCAGATCGGGTGGACGATCCCGCGGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGT
TCCGGGGTGAGGGAGCAGGTGTCCCGCATATCAGCTGCAAGTTTGAACCGCTGCCTAAGAAGGGCGTCCAAATCTTTGAGCGACCCTGGGTCCGTTCTGCAGAGGACCAT
CGACTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTTTGGCAGCAAGGGAGAAAGAGGAGTTTTCTG
CTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGATTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAG
GAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCT
TGATGCGAAGGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACT
TCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCTTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGG
TATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTACGTCAGAGATCTGGACTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGATAAGCGCTCTGAACCGAAAAAATTTCATTCTTATCAAGTCCCCAGATAACCTCAGTCATCGGAATCACTACGAGGCAGCGGGTGTATTTCTGATTGCAGCTCGAAC
TCGGCCTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCCAGTTTAGTTCGAGGTCAGAAAT
CATCGTACCTGATCGTGGAGTCATACCTTACGTTCCCTGAATTCTTGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGCAACTTAGGATCCGATGAGGACCTAGCT
CGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGATTCTCCGATGACGGGGAGGATAGTGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGAT
ACCTGAGCACTACATCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGGACATCCTCCTCAGGCTTCTGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGG
TCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTCGTCCAAGAATTTCTCTTCCGGACGGGGTTGGCTCCGGCTCAAGTGGCCCCCAAT
GGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGACAGTGAGGAAGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAA
AAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGAAAAGGCGCAGGCGGTATAGTTAAAGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACG
CTTCTGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTCGACGTCCCCACTAGTTTCAATCCGACCAGTCCCCGAGCTTACACAAGTCTCCTTCGACACGCT
GAAATACTAGAAGGAGCGCTTCCCGAGGGCCATGGGCCTGCCTCGGAAGATCCAGCCCCAGGGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAG
GGATCAGACCGAGGCGGTGGACGCCCCGCCTTTGGGCGAGGAGGTGAGGGAGGAAGCCCCTCTGAAGCAAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCG
GAGCTTGCAGGGTCTTGCCTGCAAATTTCCCAGATCGGGTGGACGATCCCGCGGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGT
TCCGGGGTGAGGGAGCAGGTGTCCCGCATATCAGCTGCAAGTTTGAACCGCTGCCTAAGAAGGGCGTCCAAATCTTTGAGCGACCCTGGGTCCGTTCTGCAGAGGACCAT
CGACTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTTTGGCAGCAAGGGAGAAAGAGGAGTTTTCTG
CTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGATTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAG
GAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCT
TGATGCGAAGGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACT
TCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCTTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGG
TATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTACGTCAGAGATCTGGACTCATGA
Protein sequenceShow/hide protein sequence
MISALNRKNFILIKSPDNLSHRNHYEAAGVFLIAARTRPPDRSEYLGGPAQKGEHSDDQVSIGRIPSLVRGQKSSYLIVESYLTFPEFLEFDLKAARTLGSNLGSDEDLA
RRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYIGSLRRGFAIPEDILLRLLEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPN
GWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTSFNPTSPRAYTSLLRHA
EILEGALPEGHGPASEDPAPGIELESSGGPSREKRPRDQTEAVDAPPLGEEVREEAPLKQRRKKKKAISPSEVGACRVLPANFPDRVDDPAARMGGTSDVTARFRVEPSS
SGVREQVSRISAASLNRCLRRASKSLSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKK
EEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALDAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGIASDMPDLQIDLSGLKRR
YAEKWASGPGGTPGPQALVDQYVRDLDS