; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g14360 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g14360
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr2:10631331..10633681
RNA-Seq ExpressionMoc02g14360
SyntenyMoc02g14360
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]6.3e-11788.58Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSEGPSREKRPRDQTEAVDISPLGEKVREEVPLKRRR
        AVR IESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSSKP TP VVGPASEDPAPVIELESS GPSREKRPRDQTEAVD+SPLGE+VREEVPLKRRR
Subjt:  AVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSEGPSREKRPRDQTEAVDISPLGEKVREEVPLKRRR

Query:  KKKKTTSPLEVRARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGMRDQ
        KKKKTTSPLEV ARG LP SFADRVDDPEARMGGT DVT RFRVEPSSSG+RDQ
Subjt:  KKKKTTSPLEVRARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGMRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]2.1e-12892.25Show/hide
Query:  EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDES
        EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEE ELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDES
Subjt:  EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDES

Query:  GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEA
        GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVR IE SRPNS LAMVC FAS VKRKSKGRAHALEA
Subjt:  GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEA

Query:  AQSSKPATPVVVGPASEDPAPVIELESSEGPSREKRPRDQTEAV-------DISPLGE
        AQSSKP TP VVGPASEDPAPVIELESS GPSREKRPRDQTEAV       D+ PLGE
Subjt:  AQSSKPATPVVVGPASEDPAPVIELESSEGPSREKRPRDQTEAV-------DISPLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]4.1e-10886.85Show/hide
Query:  GTSDVTARFRVEPSSSGMRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSTALEAASSTMKD
        G   + A+ R+EPSSSG+RDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGRE LAAREKEEFS ALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGMRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSTALEAASSTMKD

Query:  ELLKAHSEVEILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATVELETVKERLSNEVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVE++ ELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+HAT ELET KERLSN VLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATVELETVKERLSNEVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPK
        GFAKDFSDAGFKFLMKGIASDM DLQIDL GLK+RYAE+WASGP GTPGP+
Subjt:  GFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPK

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]5.6e-13475.49Show/hide
Query:  MSSSFNSDLGSDEDLARRLESELEEIENFRFSDDGR------------------------------------IVMPPPRVRVWNTLLGY-----------
        MSSS +S+L  + DLARRLES+LEEIEN R SDDG                                     + +P    R  N   G+           
Subjt:  MSSSFNSDLGSDEDLARRLESELEEIENFRFSDDGR------------------------------------IVMPPPRVRVWNTLLGY-----------

Query:  LSTTSDPF-GEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        L     PF  EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEE EL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LSTTSDPF-GEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVR IESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSEGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATP VVGPASEDPA VIELESS GPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSEGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]4.4e-18772.97Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPVV--------VGPASEDPAPVIELESSEGPSREKRPRDQTEAVDISPLGEKVRE
         VRLIE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ S G S EKR R+++EA+D+SPL E VR 
Subjt:  AVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPVV--------VGPASEDPAPVIELESSEGPSREKRPRDQTEAVDISPLGEKVRE

Query:  EVPLKRRRKKKKTTSPLEVRARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGMRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E  ARG LPTS AD VDDPEARM GTS+V  RF +EPSSSG++DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVRARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGMRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREALAAREKEEFSTALEAASSTMKDELLKAHSEVEILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGREALAA+E+E    ALEAA +T+K ELLKA  EV+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREALAAREKEEFSTALEAASSTMKDELLKAHSEVEILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML

Query:  QALEAKEEELKHATVELETVKERLSNEVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPK
        Q LE K+  +   T EL+ +KERL+N  LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DM  LQIDL GLKK+Y+E+WASGP+GTP P+
Subjt:  QALEAKEEELKHATVELETVKERLSNEVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPK

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092983.0e-11788.58Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSEGPSREKRPRDQTEAVDISPLGEKVREEVPLKRRR
        AVR IESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSSKP TP VVGPASEDPAPVIELESS GPSREKRPRDQTEAVD+SPLGE+VREEVPLKRRR
Subjt:  AVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSEGPSREKRPRDQTEAVDISPLGEKVREEVPLKRRR

Query:  KKKKTTSPLEVRARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGMRDQ
        KKKKTTSPLEV ARG LP SFADRVDDPEARMGGT DVT RFRVEPSSSG+RDQ
Subjt:  KKKKTTSPLEVRARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGMRDQ

A0A6J1CR42 uncharacterized protein LOC1110138261.0e-12892.25Show/hide
Query:  EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDES
        EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEE ELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDES
Subjt:  EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDES

Query:  GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEA
        GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVR IE SRPNS LAMVC FAS VKRKSKGRAHALEA
Subjt:  GRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEA

Query:  AQSSKPATPVVVGPASEDPAPVIELESSEGPSREKRPRDQTEAV-------DISPLGE
        AQSSKP TP VVGPASEDPAPVIELESS GPSREKRPRDQTEAV       D+ PLGE
Subjt:  AQSSKPATPVVVGPASEDPAPVIELESSEGPSREKRPRDQTEAV-------DISPLGE

A0A6J1D971 uncharacterized protein LOC1110185382.0e-10886.85Show/hide
Query:  GTSDVTARFRVEPSSSGMRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSTALEAASSTMKD
        G   + A+ R+EPSSSG+RDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGRE LAAREKEEFS ALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGMRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAREKEEFSTALEAASSTMKD

Query:  ELLKAHSEVEILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATVELETVKERLSNEVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVE++ ELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+HAT ELET KERLSN VLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATVELETVKERLSNEVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPK
        GFAKDFSDAGFKFLMKGIASDM DLQIDL GLK+RYAE+WASGP GTPGP+
Subjt:  GFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPK

A0A6J1DXS5 uncharacterized protein LOC1110255022.7e-13475.49Show/hide
Query:  MSSSFNSDLGSDEDLARRLESELEEIENFRFSDDGR------------------------------------IVMPPPRVRVWNTLLGY-----------
        MSSS +S+L  + DLARRLES+LEEIEN R SDDG                                     + +P    R  N   G+           
Subjt:  MSSSFNSDLGSDEDLARRLESELEEIENFRFSDDGR------------------------------------IVMPPPRVRVWNTLLGY-----------

Query:  LSTTSDPF-GEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        L     PF  EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEE EL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LSTTSDPF-GEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVR IESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRLIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSEGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATP VVGPASEDPA VIELESS GPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSEGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256652.1e-18772.97Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPVV--------VGPASEDPAPVIELESSEGPSREKRPRDQTEAVDISPLGEKVRE
         VRLIE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ S G S EKR R+++EA+D+SPL E VR 
Subjt:  AVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPVV--------VGPASEDPAPVIELESSEGPSREKRPRDQTEAVDISPLGEKVRE

Query:  EVPLKRRRKKKKTTSPLEVRARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGMRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E  ARG LPTS AD VDDPEARM GTS+V  RF +EPSSSG++DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVRARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGMRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREALAAREKEEFSTALEAASSTMKDELLKAHSEVEILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGREALAA+E+E    ALEAA +T+K ELLKA  EV+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREALAAREKEEFSTALEAASSTMKDELLKAHSEVEILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML

Query:  QALEAKEEELKHATVELETVKERLSNEVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPK
        Q LE K+  +   T EL+ +KERL+N  LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DM  LQIDL GLKK+Y+E+WASGP+GTP P+
Subjt:  QALEAKEEELKHATVELETVKERLSNEVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAACAGCGACTTAGGGTCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGAGGAT
AGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGGGGAATTTCTCTTCCGAACTGGGTTGGCTCCCGCTC
AAGTGGCCCCCAATGGGTGGGGTGTCATCTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGTCGAGCTGTTGGATGTAGACCAGCTCCTCGCG
TGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGTGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAG
GAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTATCAATCCGACCAGTCCCCG
AGCTCACGCAAGCCTCCTTTGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCTGGGCTG
CTAGATTACAACCCTGCAGTTCGTCTCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGC
CCATGCTCTTGAGGCCGCCCAAAGTTCGAAACCTGCCACTCCTGTCGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGAGGGTCCTT
CGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACATCTCGCCCTTGGGCGAGAAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACC
ACCTCCCCCTTGGAGGTCAGAGCTCGTGGGGCCCTGCCTACGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGACAGCACGGTT
CAGAGTCGAGCCGTCAAGTTCTGGGATGAGGGATCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCGGGGT
CCGTCCTGCAGAGGACCATTGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGG
GAGAAAGAGGAGTTCTCTACTGCCTTGGAGGCTGCCTCTTCCACCATGAAGGATGAGCTGCTAAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAGGTAGAGGCCAA
GACCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGG
ACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGTCGAGCTGGAGACGGTGAAGGAGCGTCTCAGTAATGAAGTCCTATTGGAGGAATCG
TTCAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGTCTGACCTCCAGATCGATCT
CGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGGACTCTGACTACTC
CGACCTCGAAGAGGATCGGGTCGGCACCACTCAAGAGGGCGCTCCTCAAGCAGGCTCTTAGGCGATCACCTTTCATGAGGCCTTTCTCTGTCTTTCTTCTCTTCTTTTGT
AAGTGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAACAGCGACTTAGGGTCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGAGGAT
AGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGGGGAATTTCTCTTCCGAACTGGGTTGGCTCCCGCTC
AAGTGGCCCCCAATGGGTGGGGTGTCATCTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGTCGAGCTGTTGGATGTAGACCAGCTCCTCGCG
TGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGTGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAG
GAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTATCAATCCGACCAGTCCCCG
AGCTCACGCAAGCCTCCTTTGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCTGGGCTG
CTAGATTACAACCCTGCAGTTCGTCTCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGC
CCATGCTCTTGAGGCCGCCCAAAGTTCGAAACCTGCCACTCCTGTCGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGAGGGTCCTT
CGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACATCTCGCCCTTGGGCGAGAAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACC
ACCTCCCCCTTGGAGGTCAGAGCTCGTGGGGCCCTGCCTACGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGACAGCACGGTT
CAGAGTCGAGCCGTCAAGTTCTGGGATGAGGGATCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCGGGGT
CCGTCCTGCAGAGGACCATTGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGG
GAGAAAGAGGAGTTCTCTACTGCCTTGGAGGCTGCCTCTTCCACCATGAAGGATGAGCTGCTAAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCTGAGGTAGAGGCCAA
GACCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAAGGAGAAGG
ACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGTCGAGCTGGAGACGGTGAAGGAGCGTCTCAGTAATGAAGTCCTATTGGAGGAATCG
TTCAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGTCTGACCTCCAGATCGATCT
CGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGGACTCTGACTACTC
CGACCTCGAAGAGGATCGGGTCGGCACCACTCAAGAGGGCGCTCCTCAAGCAGGCTCTTAGGCGATCACCTTTCATGAGGCCTTTCTCTGTCTTTCTTCTCTTCTTTTGT
AAGTGTTAG
Protein sequenceShow/hide protein sequence
MSSSFNSDLGSDEDLARRLESELEEIENFRFSDDGRIVMPPPRVRVWNTLLGYLSTTSDPFGEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEVELLDVDQLLA
CFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPRGRKVGTLVTDKLLLESGL
LDYNPAVRLIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPVVVGPASEDPAPVIELESSEGPSREKRPRDQTEAVDISPLGEKVREEVPLKRRRKKKKT
TSPLEVRARGALPTSFADRVDDPEARMGGTSDVTARFRVEPSSSGMRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREALAAR
EKEEFSTALEAASSTMKDELLKAHSEVEILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATVELETVKERLSNEVLLEES
FRQHPDFDGFAKDFSDAGFKFLMKGIASDMSDLQIDLGGLKKRYAEQWASGPSGTPGPKRWWISTSEIWTLTTPTSKRIGSAPLKRALLKQALRRSPFMRPFSVFLLFFC
KC