CuGenDBv2

Moc08g14660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g14660
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr8:11260715..11263013
RNA-Seq Expression: Moc08g14660
Synteny: Moc08g14660
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
                     GO:0043167 - ion binding (molecular function)
InterPro domains: NA
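The genome location above is given as a `seqid:start..end` string. A minimal sketch for parsing it and computing the span follows. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names `Location` and `parse_location` are hypothetical and not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so add 1 to the difference.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as 'chr8:11260715..11263013'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"not a seqid:start..end location: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(seqid, start, end)


loc = parse_location("chr8:11260715..11263013")
print(loc.seqid, loc.start, loc.end, loc.length)  # chr8 11260715 11263013 2299
```

Under the inclusive-coordinate assumption, the annotated gene spans 2299 bp on chr8.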


Homology
GenBank top hits (e value, % identity, alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]; e value 5.9e-115; % identity 87.4
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLGKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWL KDES              V+IRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLGKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEAATPAVAGPASKDPAPVIELESSEGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRR
        AVRPIESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSS+  TPAV GPAS+DPAPVIELESS GPSREKRPR QTEA DVS LGEEVREE PLKRRR
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEAATPAVAGPASKDPAPVIELESSEGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRR

Query:  KKKKTTSPLEVGARGALPASFADRMDDPEARMGGTSDVTVRFRVEPSSSGVRDQ
        KKKKTTSPLEVGARG LPASFADR+DDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGARGALPASFADRMDDPEARMGGTSDVTVRFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]; e value 3.2e-113; % identity 80.95
Query:  MFEYGLRLPLHPFVQEFIFRTG----------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEF+FRTG                            EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFIFRTG----------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLGKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWL KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLGKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSEAATPAVAGPASKDPAPVIELESSEGPSREKRPR-------GQTEAADVSSLGE
         VKRKSKGRAHALEAAQSS+  TPAV GPAS+DPAPVIELESS GPSREKRPR        QTEAADV  LGE
Subjt:  NVKRKSKGRAHALEAAQSSEAATPAVAGPASKDPAPVIELESSEGPSREKRPR-------GQTEAADVSSLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]; e value 1.4e-121; % identity 84.91
Query:  GTSDVTVRFRVEPSSSGVRDQVSRISVASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVMAELDGREALAAREKEEFSAALEAASSNMKD
        G   +  + R+EPSSSGVRDQVSRIS ASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAV AELDGRE LAAREKEEFSAALE ASS MKD
Subjt:  GTSDVTVRFRVEPSSSGVRDQVSRISVASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVMAELDGREALAAREKEEFSAALEAASSNMKD

Query:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKD-FQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGALLKESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE++ FQLLKEKDDMLQALEAK++EL+HATAELE  KERLSNG LL+E+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKD-FQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGALLKESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDKVGTTQEGAPQAGS
        GFAKDFSDAGFKFLMKGIASDM DLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD EED+VG+TQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDKVGTTQEGAPQAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]; e value 1.1e-161; % identity 84.79
Query:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDREDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPQGWVTLYFKMFEYG
        MSSS SS+L  + DLARRLES+LEEIEN R SDD EDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPP+GWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDREDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPQGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFIFRTG----------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEF+FRTG                            EEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFIFRTG----------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLGKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWL KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLGKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSEAATPAVAGPASKDPAPVIELESSEGPSREKRPRGQTEAAD
        SKGRAHALEAAQSS+ ATPAV GPAS+DPA VIELESS GPSREKRPR QTEA D
Subjt:  SKGRAHALEAAQSSEAATPAVAGPASKDPAPVIELESSEGPSREKRPRGQTEAAD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]; e value 5.7e-195; % identity 70.9
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLGKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWL KDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLGKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEAATPAV--------AGPASKDPAPVIELESSEGPSREKRPRGQTEAADVSSLGEEVRE
         VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    +E  TP V        +GP+S  P PVIEL+ S G S EKR R ++EA DVS L  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEAATPAV--------AGPASKDPAPVIELESSEGPSREKRPRGQTEAADVSSLGEEVRE

Query:  EAPLKRRRKKKKTTSPLEVGARGALPASFADRMDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISVASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E+PL+RRRKKKKT+S  E GARG LP S AD +DDPEARM GTS+V +RF +EPSSSGV+DQVSRIS   LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EAPLKRRRKKKKTTSPLEVGARGALPASFADRMDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISVASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVMAELDGREALAAREKEEFSAALEAASSNMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKD-FQLLKEKDDML
        +ASI  A+ V AELDGREALAA+E+E   AALEAA++ +K ELLKA  EV+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEK+ FQLLKEKDD+ 
Subjt:  VASIQSALAVMAELDGREALAAREKEEFSAALEAASSNMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKD-FQLLKEKDDML

Query:  QALEAKEEELKHATAELEMVKERLSNGALLKESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVR
        Q LE K+  +   T EL+ +KERL+NG LL+ESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DM  LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR
Subjt:  QALEAKEEELKHATAELEMVKERLSNGALLKESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVR

Query:  DLDSDYSDLEED--------KVGTTQEGAP--QAGS
        +LDSDYSD+EE+        +VGTTQE  P  Q GS
Subjt:  DLDSDYSDLEED--------KVGTTQEGAP--QAGS
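In the alignment blocks above, the unlabeled middle line marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. A rough sketch of counting identities and positives from one block is shown below. The function name `midline_stats` is hypothetical, and it assumes the query and midline strings have already been stripped of the `Query:` prefix and its padding. Because the % identity reported on this page covers the whole alignment, a single block will not necessarily reproduce it.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_columns) for one alignment block.

    ASSUMPTION: `query` and `midline` start at the same column, i.e. the
    'Query:  ' label and the matching indentation were removed beforehand.
    """
    # Trailing spaces of the midline are often lost in extraction; pad it back.
    midline = midline.ljust(len(query))
    aligned = len(query)  # gap columns count toward the alignment length
    identities = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    positives = identities + sum(1 for m in midline[:aligned] if m == "+")
    return identities, positives, aligned


# Last block of the first GenBank hit, copied from the alignment above.
query = "KKKKTTSPLEVGARGALPASFADRMDDPEARMGGTSDVTVRFRVEPSSSGVRDQ"
midline = "KKKKTTSPLEVGARG LPASFADR+DDPEARMGGT DVT RFRVEPSSSGVRDQ"

identities, positives, aligned = midline_stats(query, midline)
print(identities, positives, aligned)  # 50 51 54
print(f"{100 * identities / aligned:.1f}% identity in this block")
```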

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298; e value 2.8e-115; % identity 87.4
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLGKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWL KDES              V+IRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLGKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEAATPAVAGPASKDPAPVIELESSEGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRR
        AVRPIESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSS+  TPAV GPAS+DPAPVIELESS GPSREKRPR QTEA DVS LGEEVREE PLKRRR
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEAATPAVAGPASKDPAPVIELESSEGPSREKRPRGQTEAADVSSLGEEVREEAPLKRRR

Query:  KKKKTTSPLEVGARGALPASFADRMDDPEARMGGTSDVTVRFRVEPSSSGVRDQ
        KKKKTTSPLEVGARG LPASFADR+DDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGARGALPASFADRMDDPEARMGGTSDVTVRFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC111013826; e value 1.6e-113; % identity 80.95
Query:  MFEYGLRLPLHPFVQEFIFRTG----------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEF+FRTG                            EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFIFRTG----------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLGKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWL KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLGKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSEAATPAVAGPASKDPAPVIELESSEGPSREKRPR-------GQTEAADVSSLGE
         VKRKSKGRAHALEAAQSS+  TPAV GPAS+DPAPVIELESS GPSREKRPR        QTEAADV  LGE
Subjt:  NVKRKSKGRAHALEAAQSSEAATPAVAGPASKDPAPVIELESSEGPSREKRPR-------GQTEAADVSSLGE

A0A6J1D971 uncharacterized protein LOC111018538; e value 7.0e-122; % identity 84.91
Query:  GTSDVTVRFRVEPSSSGVRDQVSRISVASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVMAELDGREALAAREKEEFSAALEAASSNMKD
        G   +  + R+EPSSSGVRDQVSRIS ASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAV AELDGRE LAAREKEEFSAALE ASS MKD
Subjt:  GTSDVTVRFRVEPSSSGVRDQVSRISVASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVMAELDGREALAAREKEEFSAALEAASSNMKD

Query:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKD-FQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGALLKESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAIT+GLE++ FQLLKEKDDMLQALEAK++EL+HATAELE  KERLSNG LL+E+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKD-FQLLKEKDDMLQALEAKEEELKHATAELEMVKERLSNGALLKESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDKVGTTQEGAPQAGS
        GFAKDFSDAGFKFLMKGIASDM DLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD EED+VG+TQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDKVGTTQEGAPQAGS

A0A6J1DXS5 uncharacterized protein LOC111025502; e value 5.3e-162; % identity 84.79
Query:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDREDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPQGWVTLYFKMFEYG
        MSSS SS+L  + DLARRLES+LEEIEN R SDD EDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPP+GWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDREDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPQGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFIFRTG----------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEF+FRTG                            EEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFIFRTG----------------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLGKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWL KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLGKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSEAATPAVAGPASKDPAPVIELESSEGPSREKRPRGQTEAAD
        SKGRAHALEAAQSS+ ATPAV GPAS+DPA VIELESS GPSREKRPR QTEA D
Subjt:  SKGRAHALEAAQSSEAATPAVAGPASKDPAPVIELESSEGPSREKRPRGQTEAAD

A0A6J1DZB3 uncharacterized protein LOC111025665; e value 2.8e-195; % identity 70.9
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLGKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWL KDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLGKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEAATPAV--------AGPASKDPAPVIELESSEGPSREKRPRGQTEAADVSSLGEEVRE
         VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    +E  TP V        +GP+S  P PVIEL+ S G S EKR R ++EA DVS L  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEAATPAV--------AGPASKDPAPVIELESSEGPSREKRPRGQTEAADVSSLGEEVRE

Query:  EAPLKRRRKKKKTTSPLEVGARGALPASFADRMDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISVASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E+PL+RRRKKKKT+S  E GARG LP S AD +DDPEARM GTS+V +RF +EPSSSGV+DQVSRIS   LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EAPLKRRRKKKKTTSPLEVGARGALPASFADRMDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISVASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVMAELDGREALAAREKEEFSAALEAASSNMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKD-FQLLKEKDDML
        +ASI  A+ V AELDGREALAA+E+E   AALEAA++ +K ELLKA  EV+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEK+ FQLLKEKDD+ 
Subjt:  VASIQSALAVMAELDGREALAAREKEEFSAALEAASSNMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKD-FQLLKEKDDML

Query:  QALEAKEEELKHATAELEMVKERLSNGALLKESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVR
        Q LE K+  +   T EL+ +KERL+NG LL+ESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DM  LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR
Subjt:  QALEAKEEELKHATAELEMVKERLSNGALLKESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVR

Query:  DLDSDYSDLEED--------KVGTTQEGAP--QAGS
        +LDSDYSD+EE+        +VGTTQE  P  Q GS
Subjt:  DLDSDYSDLEED--------KVGTTQEGAP--QAGS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCGTCCTCTTTTAGCAGCGACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACAGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTA
GGATTCCGGAAGAGGGGGAGAGAGCTGACAATCCTCCACAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGGCTTCCCCTTCACCCTTTTGTCCAA
GAATTTATCTTCCGAACTGGTGAAGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAAAAGCCTGGTCGGTTTTATATGTG
CGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAAGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTTGGAAAGGACGAGTCGGGTC
GTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGCATTTT
CCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCAGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTC
CGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGGAAGCTGCCACTCCTGCCGTGG
CAGGGCCAGCCTCGAAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGAGGGTCCTTCGAGGGAGAAGCGCCCCAGGGGTCAGACCGAGGCGGCGGACGTCTCGTCC
TTGGGCGAGGAGGTGAGGGAGGAGGCCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGC
AGATCGGATGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGACAGTACGGTTCAGAGTCGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCT
CGGTTGCAAGTTTGGACCGTTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCGGGGTCCGTCCTGCAGAGAACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCC
ATTCAATCGGCTCTGGCCGTGATGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCCTCTTCCAACATGAA
GGATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAGGTGGAGGCCAAGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAACTCCGAG
CTGCCCATGCTATCACCAAGGGCTTGGAGAAGGATTTCCAACTCCTCAAGGAGAAGGACGACATGCTTCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCACGCGACT
GCCGAGCTGGAGATGGTGAAGGAGCGTCTCAGCAATGGAGCCCTATTGAAGGAATCGTTCAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGG
CTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGTCTGACCTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCA
CCCCTGGCCCCCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGGACTCTGACTACTCCGACCTCGAAGAGGATAAGGTCGGCACCACTCAAGAGGGCGCTCCTCAAGCA
GGCTCTTAG
mRNA sequence
ATGTCGTCCTCTTTTAGCAGCGACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACAGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTA
GGATTCCGGAAGAGGGGGAGAGAGCTGACAATCCTCCACAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGGCTTCCCCTTCACCCTTTTGTCCAA
GAATTTATCTTCCGAACTGGTGAAGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAAAAGCCTGGTCGGTTTTATATGTG
CGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAAGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTTGGAAAGGACGAGTCGGGTC
GTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGCATTTT
CCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCAGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTC
CGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGGAAGCTGCCACTCCTGCCGTGG
CAGGGCCAGCCTCGAAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGAGGGTCCTTCGAGGGAGAAGCGCCCCAGGGGTCAGACCGAGGCGGCGGACGTCTCGTCC
TTGGGCGAGGAGGTGAGGGAGGAGGCCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGC
AGATCGGATGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGACAGTACGGTTCAGAGTCGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCT
CGGTTGCAAGTTTGGACCGTTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCGGGGTCCGTCCTGCAGAGAACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCC
ATTCAATCGGCTCTGGCCGTGATGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCCTCTTCCAACATGAA
GGATGAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAGGTGGAGGCCAAGGCCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAACTCCGAG
CTGCCCATGCTATCACCAAGGGCTTGGAGAAGGATTTCCAACTCCTCAAGGAGAAGGACGACATGCTTCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCACGCGACT
GCCGAGCTGGAGATGGTGAAGGAGCGTCTCAGCAATGGAGCCCTATTGAAGGAATCGTTCAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGG
CTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGTCTGACCTTCAGATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCA
CCCCTGGCCCCCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGGACTCTGACTACTCCGACCTCGAAGAGGATAAGGTCGGCACCACTCAAGAGGGCGCTCCTCAAGCA
GGCTCTTAG
Protein sequence
MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDREDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPQGWVTLYFKMFEYGLRLPLHPFVQ
EFIFRTGEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLGKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHF
PRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSEAATPAVAGPASKDPAPVIELESSEGPSREKRPRGQTEAADVSS
LGEEVREEAPLKRRRKKKKTTSPLEVGARGALPASFADRMDDPEARMGGTSDVTVRFRVEPSSSGVRDQVSRISVASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVAS
IQSALAVMAELDGREALAAREKEEFSAALEAASSNMKDELLKAHSEVEILKAEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKDFQLLKEKDDMLQALEAKEEELKHAT
AELEMVKERLSNGALLKESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDKVGTTQEGAPQA
GS
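The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below is one way to do that, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` and `CODON_TABLE` are illustrative names, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; '*' marks a stop codon, 'X' an unknown codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the wrapped listing
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))


# First 33 nt and last 9 nt of the CDS listed above.
print(translate("ATGTCGTCCTCTTTTAGCAGCGACTTAGGATCC"))  # MSSSFSSDLGS
print(translate("GGCTCTTAG"))                          # GS*
```

The translated start (`MSSSFSSDLGS`) and end (`GS`, followed by the stop codon) agree with the first and last residues of the protein sequence listed above. Passing the full wrapped CDS text to `translate` would check the rest.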