; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g01210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g01210
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:998871..1001117
RNA-Seq ExpressionMoc07g01210
SyntenyMoc07g01210
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]1.7e-10983.52Show/hide
Query:  MCARKGAGGIVKGPTSIKRWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIK WVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKRWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASDVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVGAQTEAVDAPPLGEEVREE
        AVRPIESSRPNSELAMVCGFAS+VKRKSKG+AHALEAAQSSKP TPAV GPASEDPAPVIELESS GPSREKRPRD       QTEAVD  PLGEEVREE
Subjt:  AVRPIESSRPNSELAMVCGFASDVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVGAQTEAVDAPPLGEEVREE

Query:  APLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQ
         PLKRRRKKKK  SP EVGA  VLPASFADRVDDP ARMGGT DVT RFR+EPSSSGVRDQ
Subjt:  APLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]3.3e-14596.34Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKRWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIK WVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKRWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  DVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVGAQTEAVDAPPLGE
         VKRKSKGRAHALEAAQSSKP TPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAV AQTEA D PPLGE
Subjt:  DVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVGAQTEAVDAPPLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]1.8e-12795.45Show/hide
Query:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKTQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+ QLRAAHAITRGLEREKFQLLKEKDDMLQ LEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKTQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSD
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSD
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSD

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.6e-18996.88Show/hide
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRWFAIPENILLRLLEEGERADNPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPSRIPEHYL SLRR FAIPENILLRL EEGERADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRWFAIPENILLRLLEEGERADNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKRWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIK WVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKRWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASDVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASDVKRKSK

Query:  GRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAV
        GRAHALEAAQSSKPATPAV GPASEDPA VIELESSGGPSREKRPRDQTEAV
Subjt:  GRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAV

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]2.0e-18770.51Show/hide
Query:  MCARKGAGGIVKGPTSIKRWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIK WV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKRWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASDVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVGAQTEAVDAPP
         VR IE+SRPNSELAMVCGF   VKRKSKGRAHAL+    ++P TP V        +GP+S  P PVIEL+ SGG S EKR R+       ++EA+D  P
Subjt:  AVRPIESSRPNSELAMVCGFASDVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVGAQTEAVDAPP

Query:  LGEEVREEAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTI
        L  EVR E+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVS PGSVLQRTI
Subjt:  LGEEVREEAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTI

Query:  DYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKTQLRAAHAITRGLEREKFQLL
        D  AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ K  LRAAHAIT+GLE+EKFQLL
Subjt:  DYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKTQLRAAHAITRGLEREKFQLL

Query:  KEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQA
        KEKDD+ QVLE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+
Subjt:  KEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQA

Query:  LVDQYVRDLDSD
        LVD+YVR+LDSD
Subjt:  LVDQYVRDLDSD

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092988.2e-11083.52Show/hide
Query:  MCARKGAGGIVKGPTSIKRWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKGA GIVKGPTSIK WVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKRWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASDVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVGAQTEAVDAPPLGEEVREE
        AVRPIESSRPNSELAMVCGFAS+VKRKSKG+AHALEAAQSSKP TPAV GPASEDPAPVIELESS GPSREKRPRD       QTEAVD  PLGEEVREE
Subjt:  AVRPIESSRPNSELAMVCGFASDVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVGAQTEAVDAPPLGEEVREE

Query:  APLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQ
         PLKRRRKKKK  SP EVGA  VLPASFADRVDDP ARMGGT DVT RFR+EPSSSGVRDQ
Subjt:  APLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138261.6e-14596.34Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKRWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIK WVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKRWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  DVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVGAQTEAVDAPPLGE
         VKRKSKGRAHALEAAQSSKP TPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAV AQTEA D PPLGE
Subjt:  DVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVGAQTEAVDAPPLGE

A0A6J1D971 uncharacterized protein LOC1110185388.7e-12895.45Show/hide
Query:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKTQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+ QLRAAHAITRGLEREKFQLLKEKDDMLQ LEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKTQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSD
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSD
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSD

A0A6J1DXS5 uncharacterized protein LOC1110255028.0e-19096.88Show/hide
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRWFAIPENILLRLLEEGERADNPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPSRIPEHYL SLRR FAIPENILLRL EEGERADNPPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRWFAIPENILLRLLEEGERADNPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKRWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIK WVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKRWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASDVKRKSK
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRKSK
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASDVKRKSK

Query:  GRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAV
        GRAHALEAAQSSKPATPAV GPASEDPA VIELESSGGPSREKRPRDQTEAV
Subjt:  GRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAV

A0A6J1DZB3 uncharacterized protein LOC1110256659.8e-18870.51Show/hide
Query:  MCARKGAGGIVKGPTSIKRWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIK WV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKRWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASDVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVGAQTEAVDAPP
         VR IE+SRPNSELAMVCGF   VKRKSKGRAHAL+    ++P TP V        +GP+S  P PVIEL+ SGG S EKR R+       ++EA+D  P
Subjt:  AVRPIESSRPNSELAMVCGFASDVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVGAQTEAVDAPP

Query:  LGEEVREEAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTI
        L  EVR E+PL+RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVS PGSVLQRTI
Subjt:  LGEEVREEAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRTI

Query:  DYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKTQLRAAHAITRGLEREKFQLL
        D  AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ K  LRAAHAIT+GLE+EKFQLL
Subjt:  DYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKTQLRAAHAITRGLEREKFQLL

Query:  KEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQA
        KEKDD+ QVLE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+
Subjt:  KEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQA

Query:  LVDQYVRDLDSD
        LVD+YVR+LDSD
Subjt:  LVDQYVRDLDSD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTATTAGCAGCAACCTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATCTCCGATGACGGGGAGGATAGTGA
CGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCAGATCCCTTCGTAGGTGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTC
TGGAAGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTT
CTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGA
GCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGG
GGCCGACCTCCATCAAGAGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGG
AACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGATACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGAC
TGACGAACTGCTGCTTGAGTCCGGACTGCTAGATTACAACCCTGCAGTTCGACCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCG
ACGTGAAGCGCAAGTCTAAGGGCCGAGCCCATGCTCTTGAGGCCGCTCAGAGTTCGAAACCTGCCACCCCTGCAGTGGCAGGGCCTGCCTCGGAAGATCCAGCCCCGGTG
ATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGGCGCCCAGACCGAGGCGGTTGACGCCCCGCCTTTGGGCGAGGA
GGTGAGGGAGGAAGCCCCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGG
ACGATCCTGCGGCCAGGATGGGCGGGACATCCGACGTGACGGCACGGTTCAGAATTGAGCCGTCAAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGT
TTGGACCGCTGCCTAAGGAGGGCATCCAAATTTGTGAGCGCCCCTGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGC
TCTGGCTGTCAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGT
TGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAAACCCAACTCCGAGCTGCCCACGCT
ATCACCAGGGGCTTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGTGCTTGAAGCGAAGGACAAGGAGCTGGAGCATGCGACTGCCGAGCT
GGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTACTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGT
TCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGC
CCCCAAGCGTTGGTGGATCAGTATGTCAGAGATCTGGACTCTGACTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTATTAGCAGCAACCTAGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATCTCCGATGACGGGGAGGATAGTGA
CGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCAGATCCCTTCGTAGGTGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTC
TGGAAGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTT
CTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGA
GCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAGGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGG
GGCCGACCTCCATCAAGAGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGG
AACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGATACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGAC
TGACGAACTGCTGCTTGAGTCCGGACTGCTAGATTACAACCCTGCAGTTCGACCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCG
ACGTGAAGCGCAAGTCTAAGGGCCGAGCCCATGCTCTTGAGGCCGCTCAGAGTTCGAAACCTGCCACCCCTGCAGTGGCAGGGCCTGCCTCGGAAGATCCAGCCCCGGTG
ATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGGCGCCCAGACCGAGGCGGTTGACGCCCCGCCTTTGGGCGAGGA
GGTGAGGGAGGAAGCCCCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGG
ACGATCCTGCGGCCAGGATGGGCGGGACATCCGACGTGACGGCACGGTTCAGAATTGAGCCGTCAAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGT
TTGGACCGCTGCCTAAGGAGGGCATCCAAATTTGTGAGCGCCCCTGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGC
TCTGGCTGTCAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGT
TGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAAACCCAACTCCGAGCTGCCCACGCT
ATCACCAGGGGCTTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGTGCTTGAAGCGAAGGACAAGGAGCTGGAGCATGCGACTGCCGAGCT
GGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTACTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGT
TCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGC
CCCCAAGCGTTGGTGGATCAGTATGTCAGAGATCTGGACTCTGACTAA
Protein sequenceShow/hide protein sequence
MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRWFAIPENILLRLLEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEF
LFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKRWVRKWFYASGEWLAKDESGRSFFDVPTRFG
NLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASDVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPV
IELESSGGPSREKRPRDQTEAVGAQTEAVDAPPLGEEVREEAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAAS
LDRCLRRASKFVSAPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKTQLRAAHA
ITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPG
PQALVDQYVRDLDSD