CuGenDBv2

Moc08g03640 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g03640
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr8:2676050..2678391
RNA-Seq Expression: Moc08g03640
Synteny: Moc08g03640
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
                     GO:0043167 - ion binding (molecular function)
InterPro domains: NA
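
The genome location above is written as chromosome:start..end. The following is a minimal sketch of splitting such a string and computing the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state, and the helper name parse_location is hypothetical rather than part of any CuGenDB tool.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end, length)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    # Assumption: 1-based, inclusive coordinates, so both ends are counted.
    length = end - start + 1
    return chrom, start, end, length


chrom, start, end, length = parse_location("chr8:2676050..2678391")
print(chrom, start, end, length)  # chr8 2676050 2678391 2342
```

If the coordinates turn out to be 0-based or half-open, only the length line needs to change.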


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia] | e-value: 6.0e-108 | identity: 51.73%
Query:  RRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAFVASIQ
        +RRKKKKAIS SEVGACRVLPA +ADRVDDPAARMGGTSDVTARFRIEPSS GVR+QV+RISAASLDRC+RRASKFVS PGSVL R IDYAAEAFVASIQ
Subjt:  RRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAFVASIQ

Query:  SALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVETLKAEVESQ---------------------------------------------
        SAL +KAELDGRE LAAREKEEFSAALEAASST+KDELLKAHSEVETLKAEVESQ                                             
Subjt:  SALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVETLKAEVESQ---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAN
                                               AELLK+E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQALE KD  +    AEL+   
Subjt:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAN

Query:  ERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEKD
        ERL+NG LLE AFRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DL  LK+RYAEKWASGP+GT GP +LVD+YVRDLDSDYSD ++D
Subjt:  ERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEKD

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] | e-value: 6.8e-128 | identity: 92.62%
Query:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKD
        G   + A+ RIEPSS GVR+QV+RISAASLDRC+RRASKFVSAPGSVLQRTIDYAAEAFVASIQSAL +KAELDGRE LAAREKEEFSAALE ASST+KD
Subjt:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETANERLSNGVLLEEAFRQHPDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETA ERLSNGVLLEEAFRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETANERLSNGVLLEEAFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEKD
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGP GTPGPQALVDQYVRDLDSDYSDPE+D
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEKD

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia] | e-value: 6.6e-99 | identity: 73.26%
Query:  MGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTV
        MGGT DV  RFR+EPSS GV++QV+RISA  LDRC++RASKFVS PGSVLQRTID AAEAFVASI SA+++KAELDGREALAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTV

Query:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETANERLSNGVLLEEAFRQHPD
        K ELLKA  EV  L+AEV+++AELLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD  +   TAEL+   ERL+NG LLEE+FRQH D
Subjt:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETANERLSNGVLLEEAFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEKD
        FDGFAKDFSDAGFKFLMKGIA+DMP LQIDLS LK++Y+EKWASGP+GTPGPQ+LV +YVR+LDSDYSD E++
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEKD

XP_022158203.1 uncharacterized protein LOC111024740 [Momordica charantia] | e-value: 3.2e-93 | identity: 78.08%
Query:  EPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVET
        EPSS GVR+QV+RISAASLDRC+RRASKFVS PGSVLQRTIDYAAEAFVASIQSAL +KAELDGRE LAAREKEEFSAALEAA  T+KDELLKAHSEVET
Subjt:  EPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVET

Query:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETANERLSNGVLLEEAFRQHPDFDGFAKDFSDAGF
        LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALE KDKELEHATAELETA ERLSN                          
Subjt:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETANERLSNGVLLEEAFRQHPDFDGFAKDFSDAGF

Query:  KFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEKD
                      +IDLSGLKRRYAEKWASGP GTPGPQALVDQYVRDLDSDYSDP++D
Subjt:  KFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEKD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | e-value: 4.0e-152 | identity: 60.55%
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPR-----------------------
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR                       
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPR-----------------------

Query:  --------------AMVCGFASGVKRKSKGRAQALEAT----------------------------------------QRRSAPETEAADAPPLGEEARE
                      AMVCGF   VKRKSKGRA AL+                                          ++RS  E+EA D  PL  E R 
Subjt:  --------------AMVCGFASGVKRKSKGRAQALEAT----------------------------------------QRRSAPETEAADAPPLGEEARE

Query:  EAPLKRRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAF
        E+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSS GV++QV+RISA  LDR +RRASKFVS PGSVLQRTID  AEAF
Subjt:  EAPLKRRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAF

Query:  VASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDML
        +ASI  A+++KAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ 
Subjt:  VASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDML

Query:  QALEAKDKELEHATAELETANERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVR
        Q LE KD  +   T EL+   ERL+NG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP+GTP PQ+LVD+YVR
Subjt:  QALEAKDKELEHATAELETANERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVR

Query:  DLDSDYSDPEKD
        +LDSDYSD E++
Subjt:  DLDSDYSDPEKD

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1CLV1 uncharacterized protein LOC111012467 | e-value: 2.9e-108 | identity: 51.73%
Query:  RRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAFVASIQ
        +RRKKKKAIS SEVGACRVLPA +ADRVDDPAARMGGTSDVTARFRIEPSS GVR+QV+RISAASLDRC+RRASKFVS PGSVL R IDYAAEAFVASIQ
Subjt:  RRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAFVASIQ

Query:  SALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVETLKAEVESQ---------------------------------------------
        SAL +KAELDGRE LAAREKEEFSAALEAASST+KDELLKAHSEVETLKAEVESQ                                             
Subjt:  SALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVETLKAEVESQ---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAN
                                               AELLK+E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQALE KD  +    AEL+   
Subjt:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAN

Query:  ERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEKD
        ERL+NG LLE AFRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DL  LK+RYAEKWASGP+GT GP +LVD+YVRDLDSDYSD ++D
Subjt:  ERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEKD

A0A6J1D971 uncharacterized protein LOC111018538 | e-value: 3.3e-128 | identity: 92.62%
Query:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKD
        G   + A+ RIEPSS GVR+QV+RISAASLDRC+RRASKFVSAPGSVLQRTIDYAAEAFVASIQSAL +KAELDGRE LAAREKEEFSAALE ASST+KD
Subjt:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETANERLSNGVLLEEAFRQHPDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETA ERLSNGVLLEEAFRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETANERLSNGVLLEEAFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEKD
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGP GTPGPQALVDQYVRDLDSDYSDPE+D
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEKD

A0A6J1DF31 uncharacterized protein LOC111019909 | e-value: 3.2e-99 | identity: 73.26%
Query:  MGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTV
        MGGT DV  RFR+EPSS GV++QV+RISA  LDRC++RASKFVS PGSVLQRTID AAEAFVASI SA+++KAELDGREALAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTV

Query:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETANERLSNGVLLEEAFRQHPD
        K ELLKA  EV  L+AEV+++AELLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD  +   TAEL+   ERL+NG LLEE+FRQH D
Subjt:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETANERLSNGVLLEEAFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEKD
        FDGFAKDFSDAGFKFLMKGIA+DMP LQIDLS LK++Y+EKWASGP+GTPGPQ+LV +YVR+LDSDYSD E++
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEKD

A0A6J1DVF6 uncharacterized protein LOC111024740 | e-value: 1.5e-93 | identity: 78.08%
Query:  EPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVET
        EPSS GVR+QV+RISAASLDRC+RRASKFVS PGSVLQRTIDYAAEAFVASIQSAL +KAELDGRE LAAREKEEFSAALEAA  T+KDELLKAHSEVET
Subjt:  EPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVET

Query:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETANERLSNGVLLEEAFRQHPDFDGFAKDFSDAGF
        LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALE KDKELEHATAELETA ERLSN                          
Subjt:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETANERLSNGVLLEEAFRQHPDFDGFAKDFSDAGF

Query:  KFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEKD
                      +IDLSGLKRRYAEKWASGP GTPGPQALVDQYVRDLDSDYSDP++D
Subjt:  KFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVRDLDSDYSDPEKD

A0A6J1DZB3 uncharacterized protein LOC111025665 | e-value: 1.9e-152 | identity: 60.55%
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPR-----------------------
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR                       
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPR-----------------------

Query:  --------------AMVCGFASGVKRKSKGRAQALEAT----------------------------------------QRRSAPETEAADAPPLGEEARE
                      AMVCGF   VKRKSKGRA AL+                                          ++RS  E+EA D  PL  E R 
Subjt:  --------------AMVCGFASGVKRKSKGRAQALEAT----------------------------------------QRRSAPETEAADAPPLGEEARE

Query:  EAPLKRRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAF
        E+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSS GV++QV+RISA  LDR +RRASKFVS PGSVLQRTID  AEAF
Subjt:  EAPLKRRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCIRRASKFVSAPGSVLQRTIDYAAEAF

Query:  VASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDML
        +ASI  A+++KAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ 
Subjt:  VASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDML

Query:  QALEAKDKELEHATAELETANERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVR
        Q LE KD  +   T EL+   ERL+NG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP+GTP PQ+LVD+YVR
Subjt:  QALEAKDKELEHATAELETANERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQALVDQYVR

Query:  DLDSDYSDPEKD
        +LDSDYSD E++
Subjt:  DLDSDYSDPEKD

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGCCCCCAACCTTCAGGTCATACCTTACGTTCCTTGAATTCTTGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAACA
CTTTAGAATCTCCGATGACGGGGAGGATAGCGATGCTTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTCCGTAGGGGAA
TTCCTCCTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGTGTCATTTTCGCTTTGGCCATACTCTTTTGGCTTCGAGCTCGGGATAGTGAGGAGGCC
GAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCGGGCGGTATAGTTAA
GGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTCGACGTCCCCACTAGGTTTG
GGAACCTTGTTTCAATCCGACCGGTCCCCGAGCTTACGCAGGCCTCCTTCGATACTCTGAAATACTACAAGGAGCGCTTTCCGAGAGCCATGGTTTGCGGATTTGCAAGC
GGCGTGAAGCGCAAGTCCAAGGGCCGAGCTCAGGCTCTTGAGGCTACCCAGAGGAGAAGCGCCCCAGAGACCGAGGCGGCGGACGCCCCGCCTTTGGGCGAGGAGGCAAG
GGAGGAAGCCCCTCTAAAGCGCAGAAGGAAGAAAAAGAAGGCGATTTCTCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGCAAGTTGGGCTGATCGGGTGGACGATC
CTGCGGCCAGGATGGGCGGGACGTCCGATGTGACGGCGCGGTTCAGAATTGAGCCGTCAAGTCTCGGGGTGAGGGAGCAGGTGACTCGCATCTCGGCTGCGAGTTTGGAC
CGCTGCATAAGGAGGGCGTCCAAATTTGTGAGCGCCCCTGGGTCCGTTCTGCAGAGGACCATTGACTACGCCGCCGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGT
TATAAAGGCCGAGCTGGATGGGAGGGAAGCTTTGGCAGCGAGGGAGAAAGAGGAGTTCTCCGCTGCCCTGGAGGCTGCTTCCTCCACCGTGAAGGATGAACTGCTGAAGG
CTCACTCCGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTACTGAAGAAGGAGGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCCATCACC
AGGGGCTTGGAGAGGGAGAAATTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGAC
GGCGAACGAGCGCCTCAGCAATGGAGTCCTGCTGGAGGAAGCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCA
TGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAA
GCGTTGGTGGATCAGTATGTCAGGGATCTGGACTCTGATTACTCCGATCCTGAAAAGGACTAG
mRNA sequence
ATGCCCCCAACCTTCAGGTCATACCTTACGTTCCTTGAATTCTTGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAACA
CTTTAGAATCTCCGATGACGGGGAGGATAGCGATGCTTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTCCGTAGGGGAA
TTCCTCCTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGTGTCATTTTCGCTTTGGCCATACTCTTTTGGCTTCGAGCTCGGGATAGTGAGGAGGCC
GAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCGGGCGGTATAGTTAA
GGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTCGACGTCCCCACTAGGTTTG
GGAACCTTGTTTCAATCCGACCGGTCCCCGAGCTTACGCAGGCCTCCTTCGATACTCTGAAATACTACAAGGAGCGCTTTCCGAGAGCCATGGTTTGCGGATTTGCAAGC
GGCGTGAAGCGCAAGTCCAAGGGCCGAGCTCAGGCTCTTGAGGCTACCCAGAGGAGAAGCGCCCCAGAGACCGAGGCGGCGGACGCCCCGCCTTTGGGCGAGGAGGCAAG
GGAGGAAGCCCCTCTAAAGCGCAGAAGGAAGAAAAAGAAGGCGATTTCTCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGCAAGTTGGGCTGATCGGGTGGACGATC
CTGCGGCCAGGATGGGCGGGACGTCCGATGTGACGGCGCGGTTCAGAATTGAGCCGTCAAGTCTCGGGGTGAGGGAGCAGGTGACTCGCATCTCGGCTGCGAGTTTGGAC
CGCTGCATAAGGAGGGCGTCCAAATTTGTGAGCGCCCCTGGGTCCGTTCTGCAGAGGACCATTGACTACGCCGCCGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGT
TATAAAGGCCGAGCTGGATGGGAGGGAAGCTTTGGCAGCGAGGGAGAAAGAGGAGTTCTCCGCTGCCCTGGAGGCTGCTTCCTCCACCGTGAAGGATGAACTGCTGAAGG
CTCACTCCGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTACTGAAGAAGGAGGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCCATCACC
AGGGGCTTGGAGAGGGAGAAATTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGAC
GGCGAACGAGCGCCTCAGCAATGGAGTCCTGCTGGAGGAAGCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCA
TGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAA
GCGTTGGTGGATCAGTATGTCAGGGATCTGGACTCTGATTACTCCGATCCTGAAAAGGACTAG
Protein sequence
MPPTFRSYLTFLEFLEFDLKAARTLGRLESELEEIEHFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGIPPPDWVGSGSSGPQWVGVIFALAILFWLRARDSEEA
ELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRAMVCGFAS
GVKRKSKGRAQALEATQRRSAPETEAADAPPLGEEAREEAPLKRRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLD
RCIRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAIT
RGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETANERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPSGTPGPQ
ALVDQYVRDLDSDYSDPEKD
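
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly. The names CODON_TABLE and translate are hypothetical helpers, and the two test strings are the first 18 and last 12 nucleotides of the CDS as listed.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the CDS listed in this record.
print(translate("ATGCCCCCAACCTTCAGG"))  # MPPTFR
print(translate("GAAAAGGACTAG"))        # EKD
```

Both fragments agree with the corresponding ends of the protein sequence shown above (MPPTFR... and ...EKD). Translating the full CDS requires joining its lines into one string first.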