; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g13260 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g13260
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:11252802..11256347
RNA-Seq ExpressionMoc09g13260
SyntenyMoc09g13260
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]7.1e-14495.6Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRIRLAPAQVAPNGWGVIFALAILFWLRARDSEQAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFR  LAPAQVAPNGWGVIFALAILFWLRARDSE+AELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRIRLAPAQVAPNGWGVIFALAILFWLRARDSEQAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAVDVLPLGE
         VKRKSKGRAHALEAAQSSKP TPAV GPASEDPAPVIELESSGGPSREKRPRDQ EAVDAQTEA DV PLGE
Subjt:  NVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAVDVLPLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]3.1e-13191.9Show/hide
Query:  ESDVTARFRVEPSSSEVRDQVSRISAASLDRCLRRTSKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDE
        E  + A+ R+EPSSS VRDQVSRISAASLDRCLRR SKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKDE
Subjt:  ESDVTARFRVEPSSSEVRDQVSRISAASLDRCLRRTSKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDE

Query:  LPKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDG
        L KAHSEVE LKAEVESQAELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFDG
Subjt:  LPKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDG

Query:  FAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYVEKWASGPGGTPGPQALGDQYVRDLDSDYSDPEEDQVSSTQEGAPQAGS
        FAKDFSDAGFKFLMKGIASDMPDLQIDLSGLK+RY EKWASGPGGTPGPQAL DQYVRDLDSDYSDPEEDQV STQEGA   GS
Subjt:  FAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYVEKWASGPGGTPGPQALGDQYVRDLDSDYSDPEEDQVSSTQEGAPQAGS

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]6.5e-10598.44Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRIRLAPAQVAPNGWGVIFALAILFWLRARDSEQAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFR  LAPAQVAPNGWGVIFALAILFWLRARDSE+AELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRIRLAPAQVAPNGWGVIFALAILFWLRARDSEQAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]2.9e-19096.32Show/hide
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLHRGFAIPENILLRLPEEGKRADNPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPSRIPEHYLGSL RGFAIPENILLRLPEEG+RADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLHRGFAIPENILLRLPEEGKRADNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRIRLAPAQVAPNGWGVIFALAILFWLRARDSEQAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFR  LAPAQVAPNGWGVIFALAILFWLRARDSE+AEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRIRLAPAQVAPNGWGVIFALAILFWLRARDSEQAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSK

Query:  GRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVD
        GRAHALEAAQSSKPATPAV GPASEDPA VIELESSGGPSREKRPRDQ EAVD
Subjt:  GRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]6.2e-17264.76Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAVDVLP
         VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V        +GP+S  P PVIEL+ SGG S EKR R+       ++EA+DV P
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAVDVLP

Query:  LGEEGLACKFRR------------SGGRSY----------DQDGR---ESDVTARFRVEPSSSEVRDQVSRISAASLDRCLRRTSKFVSDPGSVLQRTID
        L E       RR            +G R            D + R    S+V  RF +EPSSS V+DQVSRISA  LDR LRR SKFVSDPGSVLQRTID
Subjt:  LGEEGLACKFRR------------SGGRSY----------DQDGR---ESDVTARFRVEPSSSEVRDQVSRISAASLDRCLRRTSKFVSDPGSVLQRTID

Query:  YAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELPKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLK
          AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K EL KA  EV+IL+AEV+++ +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLK
Subjt:  YAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELPKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLK

Query:  EKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYVEKWASGPGGTPGPQAL
        EKDD+ Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLKK+Y EKWASGP GTP PQ+L
Subjt:  EKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYVEKWASGPGGTPGPQAL

Query:  GDQYVRDLDSDYSDPEED--------QVSSTQEGAP--QAGS
         D+YVR+LDSDYSD EE+        +V +TQE  P  Q GS
Subjt:  GDQYVRDLDSDYSDPEED--------QVSSTQEGAP--QAGS

TrEMBL top hitse value%identityAlignment
A0A6J1CR42 uncharacterized protein LOC1110138263.4e-14495.6Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRIRLAPAQVAPNGWGVIFALAILFWLRARDSEQAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFR  LAPAQVAPNGWGVIFALAILFWLRARDSE+AELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRIRLAPAQVAPNGWGVIFALAILFWLRARDSEQAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAVDVLPLGE
         VKRKSKGRAHALEAAQSSKP TPAV GPASEDPAPVIELESSGGPSREKRPRDQ EAVDAQTEA DV PLGE
Subjt:  NVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAVDVLPLGE

A0A6J1D971 uncharacterized protein LOC1110185381.5e-13191.9Show/hide
Query:  ESDVTARFRVEPSSSEVRDQVSRISAASLDRCLRRTSKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDE
        E  + A+ R+EPSSS VRDQVSRISAASLDRCLRR SKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKDE
Subjt:  ESDVTARFRVEPSSSEVRDQVSRISAASLDRCLRRTSKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDE

Query:  LPKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDG
        L KAHSEVE LKAEVESQAELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFDG
Subjt:  LPKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDG

Query:  FAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYVEKWASGPGGTPGPQALGDQYVRDLDSDYSDPEEDQVSSTQEGAPQAGS
        FAKDFSDAGFKFLMKGIASDMPDLQIDLSGLK+RY EKWASGPGGTPGPQAL DQYVRDLDSDYSDPEEDQV STQEGA   GS
Subjt:  FAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYVEKWASGPGGTPGPQALGDQYVRDLDSDYSDPEEDQVSSTQEGAPQAGS

A0A6J1DWD2 uncharacterized protein LOC1110246803.1e-10598.44Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRIRLAPAQVAPNGWGVIFALAILFWLRARDSEQAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFR  LAPAQVAPNGWGVIFALAILFWLRARDSE+AELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRIRLAPAQVAPNGWGVIFALAILFWLRARDSEQAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL

A0A6J1DXS5 uncharacterized protein LOC1110255021.4e-19096.32Show/hide
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLHRGFAIPENILLRLPEEGKRADNPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPSRIPEHYLGSL RGFAIPENILLRLPEEG+RADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLHRGFAIPENILLRLPEEGKRADNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRIRLAPAQVAPNGWGVIFALAILFWLRARDSEQAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFR  LAPAQVAPNGWGVIFALAILFWLRARDSE+AEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRIRLAPAQVAPNGWGVIFALAILFWLRARDSEQAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSK

Query:  GRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVD
        GRAHALEAAQSSKPATPAV GPASEDPA VIELESSGGPSREKRPRDQ EAVD
Subjt:  GRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256653.0e-17264.76Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAVDVLP
         VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V        +GP+S  P PVIEL+ SGG S EKR R+       ++EA+DV P
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAVDVLP

Query:  LGEEGLACKFRR------------SGGRSY----------DQDGR---ESDVTARFRVEPSSSEVRDQVSRISAASLDRCLRRTSKFVSDPGSVLQRTID
        L E       RR            +G R            D + R    S+V  RF +EPSSS V+DQVSRISA  LDR LRR SKFVSDPGSVLQRTID
Subjt:  LGEEGLACKFRR------------SGGRSY----------DQDGR---ESDVTARFRVEPSSSEVRDQVSRISAASLDRCLRRTSKFVSDPGSVLQRTID

Query:  YAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELPKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLK
          AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K EL KA  EV+IL+AEV+++ +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLK
Subjt:  YAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELPKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLK

Query:  EKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYVEKWASGPGGTPGPQAL
        EKDD+ Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLKK+Y EKWASGP GTP PQ+L
Subjt:  EKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYVEKWASGPGGTPGPQAL

Query:  GDQYVRDLDSDYSDPEED--------QVSSTQEGAP--QAGS
         D+YVR+LDSDYSD EE+        +V +TQE  P  Q GS
Subjt:  GDQYVRDLDSDYSDPEED--------QVSSTQEGAP--QAGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTGCGGGCATCTGATTTAGATGATCGCAACTCTACTTCGGGTTTCCGTGTTCTATTCGGTGGCAATCTAATCATTGTGTTCCAGATTGTAGCTCGAACT
CGGCCTCCGGACCGACATGAACACTTGGGCAGACCTGCACAAAAAGGTGAGCACTTCGACGATCAAGTCAGTATAGGTCGGATTCCAAGTTTAGTTCGAGGCCAG
AAATCGTCGTACCTGATCGGGGAATCATACCTTACGTTCCCTGAATTCTTGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCT
CTTTCGAACGTAGTTGCCATGTCGTCCTCTATTAGCAGCAACCTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATC
TCCGATGACGGGGAGGATAGTGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCATAGGGGGTTCGCT
ATCCCTGAGAACATCCTCCTCAGGCTGCCGGAGGAGGGGAAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTTTACTTCAAAATGTTTGAGTACGGCCTC
AGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGATTAGATTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATC
CTCTTTTGGCTACGAGCTCGGGATAGTGAGCAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGG
TTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCA
AAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTG
AAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGT
CCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGGTTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCC
CAGAGTTCGAAACCTGCCACCCCTGCCGTGGCAGGGCCTGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGC
CCCAGGGATCAGGCCGAGGCGGTGGACGCCCAGACCGAGGCGGTGGACGTCCTGCCTTTGGGCGAGGAGGGTCTTGCCTGCAAGTTTCGTAGATCGGGTGGACGA
TCCTACGACCAGGATGGGCGGGAGTCTGACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCCGAGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGT
TTGGACCGCTGCCTAAGGAGGACGTCCAAATTTGTGAGCGACCCTGGGTCCGTTTTGCAGAGGACCATCGATTACGCCGCTGAGGCGTTCGTTGCTTCCATTCAG
TCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAG
GATGAGCTGCCGAAGGCTCACTCTGAGGTGGAGATTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACTC
CGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAAGCGCTTGAAGCGAAGGATAAGGAGCTG
GAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTATTGGAGGAGTCGTTTAGGCAACATCCTGACTTCGACGGATTTGCCAAA
GACTTTTCTGATGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAAGAGGTATGTCGAGAAG
TGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGGGGATCAGTATGTCAGAGATTTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCAGC
TCCACTCAAGAGGGCGCTCCTCAAGCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATTTGCGGGCATCTGATTTAGATGATCGCAACTCTACTTCGGGTTTCCGTGTTCTATTCGGTGGCAATCTAATCATTGTGTTCCAGATTGTAGCTCGAACT
CGGCCTCCGGACCGACATGAACACTTGGGCAGACCTGCACAAAAAGGTGAGCACTTCGACGATCAAGTCAGTATAGGTCGGATTCCAAGTTTAGTTCGAGGCCAG
AAATCGTCGTACCTGATCGGGGAATCATACCTTACGTTCCCTGAATTCTTGGAGTTCGATCTGAAGGCAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCT
CTTTCGAACGTAGTTGCCATGTCGTCCTCTATTAGCAGCAACCTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATC
TCCGATGACGGGGAGGATAGTGACGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCATAGGGGGTTCGCT
ATCCCTGAGAACATCCTCCTCAGGCTGCCGGAGGAGGGGAAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTTTACTTCAAAATGTTTGAGTACGGCCTC
AGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGATTAGATTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATC
CTCTTTTGGCTACGAGCTCGGGATAGTGAGCAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGG
TTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCA
AAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTG
AAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGT
CCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGGTTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCC
CAGAGTTCGAAACCTGCCACCCCTGCCGTGGCAGGGCCTGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGC
CCCAGGGATCAGGCCGAGGCGGTGGACGCCCAGACCGAGGCGGTGGACGTCCTGCCTTTGGGCGAGGAGGGTCTTGCCTGCAAGTTTCGTAGATCGGGTGGACGA
TCCTACGACCAGGATGGGCGGGAGTCTGACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCCGAGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGT
TTGGACCGCTGCCTAAGGAGGACGTCCAAATTTGTGAGCGACCCTGGGTCCGTTTTGCAGAGGACCATCGATTACGCCGCTGAGGCGTTCGTTGCTTCCATTCAG
TCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAG
GATGAGCTGCCGAAGGCTCACTCTGAGGTGGAGATTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACTC
CGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAAGCGCTTGAAGCGAAGGATAAGGAGCTG
GAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTATTGGAGGAGTCGTTTAGGCAACATCCTGACTTCGACGGATTTGCCAAA
GACTTTTCTGATGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAAGAGGTATGTCGAGAAG
TGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGGGGATCAGTATGTCAGAGATTTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCAGC
TCCACTCAAGAGGGCGCTCCTCAAGCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MDLRASDLDDRNSTSGFRVLFGGNLIIVFQIVARTRPPDRHEHLGRPAQKGEHFDDQVSIGRIPSLVRGQKSSYLIGESYLTFPEFLEFDLKAARTLGRSVSSLS
LSNVVAMSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLHRGFAIPENILLRLPEEGKRADNPPEGWVTLYFKMFEYGL
RLPLHPFVQEFLFRIRLAPAQVAPNGWGVIFALAILFWLRARDSEQAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLA
KDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAA
QSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQAEAVDAQTEAVDVLPLGEEGLACKFRRSGGRSYDQDGRESDVTARFRVEPSSSEVRDQVSRISAAS
LDRCLRRTSKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELPKAHSEVEILKAEVESQAELLKKEEDRRKAQL
RAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYVEK
WASGPGGTPGPQALGDQYVRDLDSDYSDPEEDQVSSTQEGAPQAGS