; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g14460 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g14460
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr6:11361552..11364668
RNA-Seq ExpressionMoc06g14460
SyntenyMoc06g14460
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]5.7e-11688.19Show/hide
Query:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNP
        MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDE               V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTDKLL ES LLDYNP
Subjt:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSESRAHALEAAQSSKPATPAVVGPASEDPALVIELESYGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR
        AVRPIESSRPNSELAMVCGFASNVKRKS+ +AHALEAAQSSKP TPAVVGPASEDPA VIELES  GPSREKRPRDQTEAVD  PLGEEVREEVPLKRRR
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSESRAHALEAAQSSKPATPAVVGPASEDPALVIELESYGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR

Query:  KKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTFDVTARFRVEPSSSGVRDQ
        KKKKTTSPLEVGARGVLPASFADRVDDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTFDVTARFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]1.6e-13491.21Show/hide
Query:  MFEYGLRLPLHPFVQELLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGACGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQE LFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAK+IAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQELLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGACGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDE GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LL ES LLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSESRAHALEAAQSSKPATPAVVGPASEDPALVIELESYGGPSREKRPRDQTEAVDAL-------PLGE
         VKRKS+ RAHALEAAQSSKP TPAVVGPASEDPA VIELES GGPSREKRPRDQTEAVDA        PLGE
Subjt:  NVKRKSESRAHALEAAQSSKPATPAVVGPASEDPALVIELESYGGPSREKRPRDQTEAVDAL-------PLGE

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]2.1e-10295.88Show/hide
Query:  MFEYGLRLPLHPFVQELLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGACGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQE LFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAK+IAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQELLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGACGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNPAVRPIESSRPNSELAM
        KWFYASGEWLAKDE GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTDKLL ES LLDYNPAVRPIESSRPNSEL M
Subjt:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNPAVRPIESSRPNSELAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]9.8e-18592.96Show/hide
Query:  MSSSFRSNIGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFDIPENILLRIPEEGEGADNPPEGWVTLYFKMFEYG
        MSSS  SN+  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGF IPENILLR+PEEGE ADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFRSNIGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFDIPENILLRIPEEGEGADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQELLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQE LFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAK+IAKKPGRFYMCARKGA GIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQELLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDE GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LL ES LLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SESRAHALEAAQSSKPATPAVVGPASEDPALVIELESYGGPSREKRPRDQTEAVD
        S+ RAHALEAAQSSKPATPAVVGPASEDPALVIELES GGPSREKRPRDQTEAVD
Subjt:  SESRAHALEAAQSSKPATPAVVGPASEDPALVIELESYGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]4.1e-16764.88Show/hide
Query:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNP
        MCARKG  GIVKGPTSIKGWV KWF+ASGEWLAKDE GR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTDKLL ES LLDYNP
Subjt:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSESRAHALEAAQSSKPATPAV--------VGPASEDPALVIELESYGGPSREKRPRDQTEAVDALPLGEEVRE
         VR IE+SRPNSELAMVCGF  +VKRKS+ RAHAL+    ++P TP V         GP+S  P  VIEL+  GG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSESRAHALEAAQSSKPATPAV--------VGPASEDPALVIELESYGGPSREKRPRDQTEAVDALPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTFDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE--
        E PL+RRRKKKKT+S  E GARG LP S AD VDDPEARM GT +V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AE  
Subjt:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTFDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE--

Query:  -----------AELDGREVLAAREKEEFSAALEAASFTMKDELLKAHSEVEILKAEVETKAELLKKEEGRRKAQLRAAHAITRGLEKEKFQLLKEKDDML
                   AELDGRE LAA+E+E   AALEAA+ T+K ELLKA  EV+IL+AEV+ K +LLKKE  + KA LRAAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  -----------AELDGREVLAAREKEEFSAALEAASFTMKDELLKAHSEVEILKAEVETKAELLKKEEGRRKAQLRAAHAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDEELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF-----------------------KYAEQSASGPGDTTGPQALVDKYVR
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGF                       KY+E+ ASGP  T  PQ+LVDKYVR
Subjt:  QALEAKDEELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF-----------------------KYAEQSASGPGDTTGPQALVDKYVR

Query:  DLDSDYSDLEEDQGRAARSIE
        +LDSDYSD+EE+   +    E
Subjt:  DLDSDYSDLEEDQGRAARSIE

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092982.8e-11688.19Show/hide
Query:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNP
        MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDE               V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTDKLL ES LLDYNP
Subjt:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSESRAHALEAAQSSKPATPAVVGPASEDPALVIELESYGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR
        AVRPIESSRPNSELAMVCGFASNVKRKS+ +AHALEAAQSSKP TPAVVGPASEDPA VIELES  GPSREKRPRDQTEAVD  PLGEEVREEVPLKRRR
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSESRAHALEAAQSSKPATPAVVGPASEDPALVIELESYGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR

Query:  KKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTFDVTARFRVEPSSSGVRDQ
        KKKKTTSPLEVGARGVLPASFADRVDDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTFDVTARFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138267.7e-13591.21Show/hide
Query:  MFEYGLRLPLHPFVQELLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGACGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQE LFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAK+IAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQELLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGACGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDE GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LL ES LLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSESRAHALEAAQSSKPATPAVVGPASEDPALVIELESYGGPSREKRPRDQTEAVDAL-------PLGE
         VKRKS+ RAHALEAAQSSKP TPAVVGPASEDPA VIELES GGPSREKRPRDQTEAVDA        PLGE
Subjt:  NVKRKSESRAHALEAAQSSKPATPAVVGPASEDPALVIELESYGGPSREKRPRDQTEAVDAL-------PLGE

A0A6J1DWF1 uncharacterized protein LOC1110251081.0e-10295.88Show/hide
Query:  MFEYGLRLPLHPFVQELLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGACGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQE LFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAK+IAKKPGRFYMCARKGA GIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQELLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGACGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNPAVRPIESSRPNSELAM
        KWFYASGEWLAKDE GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTDKLL ES LLDYNPAVRPIESSRPNSEL M
Subjt:  KWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNPAVRPIESSRPNSELAM

A0A6J1DXS5 uncharacterized protein LOC1110255024.8e-18592.96Show/hide
Query:  MSSSFRSNIGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFDIPENILLRIPEEGEGADNPPEGWVTLYFKMFEYG
        MSSS  SN+  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGF IPENILLR+PEEGE ADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFRSNIGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFDIPENILLRIPEEGEGADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQELLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQE LFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAK+IAKKPGRFYMCARKGA GIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQELLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDE GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LL ES LLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SESRAHALEAAQSSKPATPAVVGPASEDPALVIELESYGGPSREKRPRDQTEAVD
        S+ RAHALEAAQSSKPATPAVVGPASEDPALVIELES GGPSREKRPRDQTEAVD
Subjt:  SESRAHALEAAQSSKPATPAVVGPASEDPALVIELESYGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256652.0e-16764.88Show/hide
Query:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNP
        MCARKG  GIVKGPTSIKGWV KWF+ASGEWLAKDE GR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTDKLL ES LLDYNP
Subjt:  MCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDELGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSESRAHALEAAQSSKPATPAV--------VGPASEDPALVIELESYGGPSREKRPRDQTEAVDALPLGEEVRE
         VR IE+SRPNSELAMVCGF  +VKRKS+ RAHAL+    ++P TP V         GP+S  P  VIEL+  GG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSESRAHALEAAQSSKPATPAV--------VGPASEDPALVIELESYGGPSREKRPRDQTEAVDALPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTFDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE--
        E PL+RRRKKKKT+S  E GARG LP S AD VDDPEARM GT +V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AE  
Subjt:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTFDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE--

Query:  -----------AELDGREVLAAREKEEFSAALEAASFTMKDELLKAHSEVEILKAEVETKAELLKKEEGRRKAQLRAAHAITRGLEKEKFQLLKEKDDML
                   AELDGRE LAA+E+E   AALEAA+ T+K ELLKA  EV+IL+AEV+ K +LLKKE  + KA LRAAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  -----------AELDGREVLAAREKEEFSAALEAASFTMKDELLKAHSEVEILKAEVETKAELLKKEEGRRKAQLRAAHAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDEELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF-----------------------KYAEQSASGPGDTTGPQALVDKYVR
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGF                       KY+E+ ASGP  T  PQ+LVDKYVR
Subjt:  QALEAKDEELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF-----------------------KYAEQSASGPGDTTGPQALVDKYVR

Query:  DLDSDYSDLEEDQGRAARSIE
        +LDSDYSD+EE+   +    E
Subjt:  DLDSDYSDLEEDQGRAARSIE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAGGAGCAACATAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGATATCCCTGAGAACATCCTCCTCA
GGATTCCGGAGGAGGGGGAGGGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAA
GAATTGCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGATGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGA
GGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAAGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCATGCGGTATAG
TTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTTAGGTCGTTCCTTCTTTGACGTCCCCACTAGG
TTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGGAACCTT
GGTGACCGACAAGCTGCTGCCTGAGTCCAGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAATTAGCCATGGTTTGCGGGTTTG
CGAGTAACGTGAAACGCAAGTCCGAGAGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCAAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCC
CTAGTGATCGAGCTGGAGTCTTATGGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCAGTGGACGCCTTGCCCTTGGGCGAGGAGGTGAGGGAGGAAGT
CCCTTTGAAGCGAAGGAGGAAGAAGAAGAAGACGACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCA
GGATGGGCGGGACGTTCGATGTGACGGCACGGTTTAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTA
AGAAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCCGAGCTGGATGGGAGGGAAGTTTTGGCAGCGAGGGAGAA
AGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTTCACCATGAAGGATGAGCTACTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGACCAAGGCCG
AGCTGCTGAAGAAGGAAGAGGGCAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTTCTGAAGGAGAAGGACGAC
ATGCTCCAGGCGCTTGAAGCGAAGGATGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGTCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAG
GCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTATGCCGAGCAGTCGGCGTCTGGGCCTGGCGACACCACTGGCCCCCAAGCGTTGG
TGGATAAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGGCAGAGCTGCAAGATCCATTGAAAACCCTTTTGGCATTTCAAGGATAATAACG
CTTCAGGTGCTCCGCGTTCCACGGGTGCGCGAGGACGTCTCCTTTCAGATCGGCCAATATGTACGTCCCAGGTCGGACTATGCTCTTGACCTTAAATGGGCCTTCCCAGG
TCGGATCAAAGGCACCCACATGGGTTTGGACCCTCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAGGAGCAACATAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGATATCCCTGAGAACATCCTCCTCA
GGATTCCGGAGGAGGGGGAGGGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAA
GAATTGCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGATGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGA
GGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAAGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCATGCGGTATAG
TTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTTAGGTCGTTCCTTCTTTGACGTCCCCACTAGG
TTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGGAACCTT
GGTGACCGACAAGCTGCTGCCTGAGTCCAGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAATTAGCCATGGTTTGCGGGTTTG
CGAGTAACGTGAAACGCAAGTCCGAGAGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCAAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCC
CTAGTGATCGAGCTGGAGTCTTATGGGGGTCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCAGTGGACGCCTTGCCCTTGGGCGAGGAGGTGAGGGAGGAAGT
CCCTTTGAAGCGAAGGAGGAAGAAGAAGAAGACGACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCA
GGATGGGCGGGACGTTCGATGTGACGGCACGGTTTAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTA
AGAAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCCGAGCTGGATGGGAGGGAAGTTTTGGCAGCGAGGGAGAA
AGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTTCACCATGAAGGATGAGCTACTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGACCAAGGCCG
AGCTGCTGAAGAAGGAAGAGGGCAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTTCTGAAGGAGAAGGACGAC
ATGCTCCAGGCGCTTGAAGCGAAGGATGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGTCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAG
GCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTATGCCGAGCAGTCGGCGTCTGGGCCTGGCGACACCACTGGCCCCCAAGCGTTGG
TGGATAAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGGCAGAGCTGCAAGATCCATTGAAAACCCTTTTGGCATTTCAAGGATAATAACG
CTTCAGGTGCTCCGCGTTCCACGGGTGCGCGAGGACGTCTCCTTTCAGATCGGCCAATATGTACGTCCCAGGTCGGACTATGCTCTTGACCTTAAATGGGCCTTCCCAGG
TCGGATCAAAGGCACCCACATGGGTTTGGACCCTCCTTAA
Protein sequenceShow/hide protein sequence
MSSSFRSNIGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFDIPENILLRIPEEGEGADNPPEGWVTLYFKMFEYGLRLPLHPFVQ
ELLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGRFYMCARKGACGIVKGPTSIKGWVRKWFYASGEWLAKDELGRSFFDVPTR
FGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLPESRLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSESRAHALEAAQSSKPATPAVVGPASEDPA
LVIELESYGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTFDVTARFRVEPSSSGVRDQVSRISAASLDRCL
RRASKFVSDPGSVLQRTIDYAAEAELDGREVLAAREKEEFSAALEAASFTMKDELLKAHSEVEILKAEVETKAELLKKEEGRRKAQLRAAHAITRGLEKEKFQLLKEKDD
MLQALEAKDEELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKYAEQSASGPGDTTGPQALVDKYVRDLDSDYSDLEEDQGRAARSIENPFGISRIIT
LQVLRVPRVREDVSFQIGQYVRPRSDYALDLKWAFPGRIKGTHMGLDPP