; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g02170 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g02170
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr1:1436856..1439502
RNA-Seq ExpressionMoc01g02170
SyntenyMoc01g02170
Gene Ontology termsGO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]1.1e-10750.2Show/hide
Query:  RRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSVGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQ
        +RRKKKKAIS SEVGACRVLPA +ADRVDDPAARMGGTSDVTARFRIEPSS GVR+QV+RISAASLDRCLRRASKFVS PGSVL R IDYAAEAFVASIQ
Subjt:  RRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSVGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQ

Query:  SALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVETLKAEVESQ---------------------------------------------
        SAL +KAELDGRE LAAREKEEFSAALEAASST+KDELLKAHSEVETLKAEVESQ                                             
Subjt:  SALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVETLKAEVESQ---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATSELETAK
                                               AELLK+E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQALE KD  +    +EL+  K
Subjt:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATSELETAK

Query:  ERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPGGTPGPQALVDQYARDLDSDYSDPEEDQGRAARSIS
        ERL+NG LLE AFRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DL  LK+RYAEKWAS P GT GP +LVD+Y RDLDSDYSD +ED+  +     
Subjt:  ERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPGGTPGPQALVDQYARDLDSDYSDPEEDQGRAARSIS

Query:  LGSAPHSI
        +G+    +
Subjt:  LGSAPHSI

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]1.5e-12893.01Show/hide
Query:  GTSDVTARFRIEPSSVGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKD
        G   + A+ RIEPSS GVR+QV+RISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSAL +KAELDGRE LAAREKEEFSAALE ASST+KD
Subjt:  GTSDVTARFRIEPSSVGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATSELETAKERLSNGVLLEEAFRQHPDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHAT+ELETAKERLSNGVLLEEAFRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATSELETAKERLSNGVLLEEAFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPGGTPGPQALVDQYARDLDSDYSDPEEDQ
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWAS PGGTPGPQALVDQY RDLDSDYSDPEEDQ
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPGGTPGPQALVDQYARDLDSDYSDPEEDQ

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]1.6e-9869.31Show/hide
Query:  MGGTSDVTARFRIEPSSVGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTV
        MGGT DV  RFR+EPSS GV++QV+RISA  LDRCL+RASKFVS PGSVLQRTID AAEAFVASI SA+++KAELDGREALAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRIEPSSVGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTV

Query:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATSELETAKERLSNGVLLEEAFRQHPD
        K ELLKA  EV  L+AEV+++AELLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD  +   T+EL+  KERL+NG LLEE+FRQH D
Subjt:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATSELETAKERLSNGVLLEEAFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPGGTPGPQALVDQYARDLDSDYSDPEEDQGRAARSISLGSAPHSI
        FDGFAKDFSDAGFKFLMKGIA+DMP LQIDLS LK++Y+EKWAS P GTPGPQ+LV +Y R+LDSDYSD EE+   +     +G+    +
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPGGTPGPQALVDQYARDLDSDYSDPEEDQGRAARSISLGSAPHSI

XP_022158203.1 uncharacterized protein LOC111024740 [Momordica charantia]9.2e-9478.54Show/hide
Query:  EPSSVGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVET
        EPSS GVR+QV+RISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSAL +KAELDGRE LAAREKEEFSAALEAA  T+KDELLKAHSEVET
Subjt:  EPSSVGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVET

Query:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATSELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGF
        LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALE KDKELEHAT+ELETAKERLSN                          
Subjt:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATSELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGF

Query:  KFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPGGTPGPQALVDQYARDLDSDYSDPEEDQ
                      +IDLSGLKRRYAEKWAS PGGTPGPQALVDQY RDLDSDYSDP+EDQ
Subjt:  KFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPGGTPGPQALVDQYARDLDSDYSDPEEDQ

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]2.1e-16260.77Show/hide
Query:  MCARKDAGGIVKGPTSIKGWVRKWFYASGERLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPR-----------------------
        MCARK  GGIVKGPTSIKGWV KWF+ASGE LAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR                       
Subjt:  MCARKDAGGIVKGPTSIKGWVRKWFYASGERLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPR-----------------------

Query:  --------------AMVCGFASGVKRKSKGRAHALEAAQSSKPPTPT--------EVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQAEAADAQT
                      AMVCGF   VKRKSKGRAHAL+    ++P TPT          GP+S  P PVIEL+ SGG S EKR R+++EA+D          
Subjt:  --------------AMVCGFASGVKRKSKGRAHALEAAQSSKPPTPT--------EVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQAEAADAQT

Query:  EAADAPPLGEEARGEAPLKRRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSVGVREQVTRISAASLDRCLRRASKFVSAPG
              PL  E RGE+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSS GV++QV+RISA  LDR LRRASKFVS PG
Subjt:  EAADAPPLGEEARGEAPLKRRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSVGVREQVTRISAASLDRCLRRASKFVSAPG

Query:  SVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE
        SVLQRTID  AEAF+ASI  A+++KAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE
Subjt:  SVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE

Query:  REKFQLLKEKDDMLQALEAKDKELEHATSELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPG
        +EKFQLLKEKDD+ Q LE KD  +   T+EL+  KERL+NG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWAS P 
Subjt:  REKFQLLKEKDDMLQALEAKDKELEHATSELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPG

Query:  GTPGPQALVDQYARDLDSDYSDPEEDQGRAARSISLGSAPHSI
        GTP PQ+LVD+Y R+LDSDYSD EE+   +     +G+    +
Subjt:  GTPGPQALVDQYARDLDSDYSDPEEDQGRAARSISLGSAPHSI

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124675.4e-10850.2Show/hide
Query:  RRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSVGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQ
        +RRKKKKAIS SEVGACRVLPA +ADRVDDPAARMGGTSDVTARFRIEPSS GVR+QV+RISAASLDRCLRRASKFVS PGSVL R IDYAAEAFVASIQ
Subjt:  RRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSVGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQ

Query:  SALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVETLKAEVESQ---------------------------------------------
        SAL +KAELDGRE LAAREKEEFSAALEAASST+KDELLKAHSEVETLKAEVESQ                                             
Subjt:  SALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVETLKAEVESQ---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATSELETAK
                                               AELLK+E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQALE KD  +    +EL+  K
Subjt:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATSELETAK

Query:  ERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPGGTPGPQALVDQYARDLDSDYSDPEEDQGRAARSIS
        ERL+NG LLE AFRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DL  LK+RYAEKWAS P GT GP +LVD+Y RDLDSDYSD +ED+  +     
Subjt:  ERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPGGTPGPQALVDQYARDLDSDYSDPEEDQGRAARSIS

Query:  LGSAPHSI
        +G+    +
Subjt:  LGSAPHSI

A0A6J1D971 uncharacterized protein LOC1110185387.3e-12993.01Show/hide
Query:  GTSDVTARFRIEPSSVGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKD
        G   + A+ RIEPSS GVR+QV+RISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSAL +KAELDGRE LAAREKEEFSAALE ASST+KD
Subjt:  GTSDVTARFRIEPSSVGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATSELETAKERLSNGVLLEEAFRQHPDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHAT+ELETAKERLSNGVLLEEAFRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATSELETAKERLSNGVLLEEAFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPGGTPGPQALVDQYARDLDSDYSDPEEDQ
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWAS PGGTPGPQALVDQY RDLDSDYSDPEEDQ
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPGGTPGPQALVDQYARDLDSDYSDPEEDQ

A0A6J1DF31 uncharacterized protein LOC1110199097.8e-9969.31Show/hide
Query:  MGGTSDVTARFRIEPSSVGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTV
        MGGT DV  RFR+EPSS GV++QV+RISA  LDRCL+RASKFVS PGSVLQRTID AAEAFVASI SA+++KAELDGREALAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRIEPSSVGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTV

Query:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATSELETAKERLSNGVLLEEAFRQHPD
        K ELLKA  EV  L+AEV+++AELLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD  +   T+EL+  KERL+NG LLEE+FRQH D
Subjt:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATSELETAKERLSNGVLLEEAFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPGGTPGPQALVDQYARDLDSDYSDPEEDQGRAARSISLGSAPHSI
        FDGFAKDFSDAGFKFLMKGIA+DMP LQIDLS LK++Y+EKWAS P GTPGPQ+LV +Y R+LDSDYSD EE+   +     +G+    +
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPGGTPGPQALVDQYARDLDSDYSDPEEDQGRAARSISLGSAPHSI

A0A6J1DVF6 uncharacterized protein LOC1110247404.4e-9478.54Show/hide
Query:  EPSSVGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVET
        EPSS GVR+QV+RISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSAL +KAELDGRE LAAREKEEFSAALEAA  T+KDELLKAHSEVET
Subjt:  EPSSVGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVET

Query:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATSELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGF
        LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALE KDKELEHAT+ELETAKERLSN                          
Subjt:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATSELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGF

Query:  KFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPGGTPGPQALVDQYARDLDSDYSDPEEDQ
                      +IDLSGLKRRYAEKWAS PGGTPGPQALVDQY RDLDSDYSDP+EDQ
Subjt:  KFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPGGTPGPQALVDQYARDLDSDYSDPEEDQ

A0A6J1DZB3 uncharacterized protein LOC1110256651.0e-16260.77Show/hide
Query:  MCARKDAGGIVKGPTSIKGWVRKWFYASGERLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPR-----------------------
        MCARK  GGIVKGPTSIKGWV KWF+ASGE LAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR                       
Subjt:  MCARKDAGGIVKGPTSIKGWVRKWFYASGERLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPR-----------------------

Query:  --------------AMVCGFASGVKRKSKGRAHALEAAQSSKPPTPT--------EVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQAEAADAQT
                      AMVCGF   VKRKSKGRAHAL+    ++P TPT          GP+S  P PVIEL+ SGG S EKR R+++EA+D          
Subjt:  --------------AMVCGFASGVKRKSKGRAHALEAAQSSKPPTPT--------EVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQAEAADAQT

Query:  EAADAPPLGEEARGEAPLKRRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSVGVREQVTRISAASLDRCLRRASKFVSAPG
              PL  E RGE+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSS GV++QV+RISA  LDR LRRASKFVS PG
Subjt:  EAADAPPLGEEARGEAPLKRRRKKKKAISPSEVGACRVLPASWADRVDDPAARMGGTSDVTARFRIEPSSVGVREQVTRISAASLDRCLRRASKFVSAPG

Query:  SVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE
        SVLQRTID  AEAF+ASI  A+++KAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE
Subjt:  SVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAASSTVKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLE

Query:  REKFQLLKEKDDMLQALEAKDKELEHATSELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPG
        +EKFQLLKEKDD+ Q LE KD  +   T+EL+  KERL+NG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWAS P 
Subjt:  REKFQLLKEKDDMLQALEAKDKELEHATSELETAKERLSNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPG

Query:  GTPGPQALVDQYARDLDSDYSDPEEDQGRAARSISLGSAPHSI
        GTP PQ+LVD+Y R+LDSDYSD EE+   +     +G+    +
Subjt:  GTPGPQALVDQYARDLDSDYSDPEEDQGRAARSISLGSAPHSI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGACGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTC
TACGCTTCCGGGGAACGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTCGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCAGACCAGTCCCCGAGCTT
ACGCAGGCCTCCTTCGATACTCTGAAATACTACAAGGAGCGCTTCCCGAGGGCCATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAGCC
CATGCTCTTGAGGCTGCCCAGAGTTCGAAACCACCCACCCCTACCGAGGTTGGGCCCGCCTCGGAAGATCCGGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGT
CCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCAGGCCGAGGCGGCAGACGCTCAGACCGAGGCGGCAGATGCCCCGCCTTTGGGCGAG
GAGGCGAGGGGGGAAGCCCCTCTAAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCTCCCTCGGAGGTCGGAGCTTGCAGGGTCCTGCCTGCAAGTTGGGCTGAT
CGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACTTCCGATGTGACGGCGCGGTTCAGAATCGAGCCGTCGAGTGTCGGGGTGAGGGAGCAGGTGACCCGCATC
TCAGCTGCGAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTCGTGAGCGCCCCTGGGTCCGTTCTGCAGAGGACCATTGATTACGCCGCCGAGGCGTTTGTT
GCTTCCATTCAATCGGCTCTGGTTATAAAGGCCGAGCTGGATGGGAGGGAGGCTTTGGCAGCGAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCC
TCCACCGTGAAGGATGAGCTGCTGAAGGCTCACTCCGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTACTGAAGAAGGAGGAGGACAGGCGC
AAGGCCCAACTCCGAGCTGCCCACGCCATCACCAGGGGCTTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAG
GATAAGGAGCTGGAGCATGCGACTTCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTGCTGGAGGAAGCGTTTAGGCAACATCCTGACTTCGAT
GGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGG
TATGCCGAGAAGTGGGCGTCTAGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGCCAGGGATCTGGACTCTGATTACTCCGATCCCGAAGAG
GACCAGGGCAGAGCTGCAAGGTCTATAAGCCTTGGCTCTGCTCCTCATTCAATAAAGAGACTCTCATTCGCTTCTACTTTGTTGTTAGTGACCTCTTTTCTTTGC
TTTTCCTTTGAACTGCAACCAGTGTCACATCGCACCTCTTTACTTTTGAGGATAATAACGCTTCAGGTGTTCCGCGTTCCACGGGTGCGCGAGGACATCTCCTTT
CAGATCGGCCAACACGTACGTCCCAGGTCGGACTATGCCCTTGATCTCAAAGGGGCCTTCCCAGGCCGGATCAAGGGCACCCACATGCGTTTGGACCCTCCTCAA
GACCAGATGTCCGACCTAAAAGGCCCGAGGTCGAATGCGGGCATTGTAATGTCTGGCCATCCTGCCCTGATATTCCGCCAGGCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGACGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTC
TACGCTTCCGGGGAACGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTCGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCAGACCAGTCCCCGAGCTT
ACGCAGGCCTCCTTCGATACTCTGAAATACTACAAGGAGCGCTTCCCGAGGGCCATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAGCC
CATGCTCTTGAGGCTGCCCAGAGTTCGAAACCACCCACCCCTACCGAGGTTGGGCCCGCCTCGGAAGATCCGGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGT
CCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCAGGCCGAGGCGGCAGACGCTCAGACCGAGGCGGCAGATGCCCCGCCTTTGGGCGAG
GAGGCGAGGGGGGAAGCCCCTCTAAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCTCCCTCGGAGGTCGGAGCTTGCAGGGTCCTGCCTGCAAGTTGGGCTGAT
CGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACTTCCGATGTGACGGCGCGGTTCAGAATCGAGCCGTCGAGTGTCGGGGTGAGGGAGCAGGTGACCCGCATC
TCAGCTGCGAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTCGTGAGCGCCCCTGGGTCCGTTCTGCAGAGGACCATTGATTACGCCGCCGAGGCGTTTGTT
GCTTCCATTCAATCGGCTCTGGTTATAAAGGCCGAGCTGGATGGGAGGGAGGCTTTGGCAGCGAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCC
TCCACCGTGAAGGATGAGCTGCTGAAGGCTCACTCCGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTACTGAAGAAGGAGGAGGACAGGCGC
AAGGCCCAACTCCGAGCTGCCCACGCCATCACCAGGGGCTTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAG
GATAAGGAGCTGGAGCATGCGACTTCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTGCTGGAGGAAGCGTTTAGGCAACATCCTGACTTCGAT
GGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGG
TATGCCGAGAAGTGGGCGTCTAGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGCCAGGGATCTGGACTCTGATTACTCCGATCCCGAAGAG
GACCAGGGCAGAGCTGCAAGGTCTATAAGCCTTGGCTCTGCTCCTCATTCAATAAAGAGACTCTCATTCGCTTCTACTTTGTTGTTAGTGACCTCTTTTCTTTGC
TTTTCCTTTGAACTGCAACCAGTGTCACATCGCACCTCTTTACTTTTGAGGATAATAACGCTTCAGGTGTTCCGCGTTCCACGGGTGCGCGAGGACATCTCCTTT
CAGATCGGCCAACACGTACGTCCCAGGTCGGACTATGCCCTTGATCTCAAAGGGGCCTTCCCAGGCCGGATCAAGGGCACCCACATGCGTTTGGACCCTCCTCAA
GACCAGATGTCCGACCTAAAAGGCCCGAGGTCGAATGCGGGCATTGTAATGTCTGGCCATCCTGCCCTGATATTCCGCCAGGCGTAG
Protein sequenceShow/hide protein sequence
MIAKKPGRFYMCARKDAGGIVKGPTSIKGWVRKWFYASGERLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRAMVCGFASGVKRKSKGRA
HALEAAQSSKPPTPTEVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQAEAADAQTEAADAPPLGEEARGEAPLKRRRKKKKAISPSEVGACRVLPASWAD
RVDDPAARMGGTSDVTARFRIEPSSVGVREQVTRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALVIKAELDGREALAAREKEEFSAALEAAS
STVKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATSELETAKERLSNGVLLEEAFRQHPDFD
GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASRPGGTPGPQALVDQYARDLDSDYSDPEEDQGRAARSISLGSAPHSIKRLSFASTLLLVTSFLC
FSFELQPVSHRTSLLLRIITLQVFRVPRVREDISFQIGQHVRPRSDYALDLKGAFPGRIKGTHMRLDPPQDQMSDLKGPRSNAGIVMSGHPALIFRQA