; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g26740 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g26740
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:19331285..19333492
RNA-Seq ExpressionMoc08g26740
SyntenyMoc08g26740
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]1.2e-10986.4Show/hide
Query:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRAVPELTQASFDTLKYYKERFPRGRKVGTLVTDVLLRESGLLDYNPAVRP
        KGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IR VPELTQASFDTLKYYKE FPRGRKVGTLVTD LL ESGLLDYNPAVRP
Subjt:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRAVPELTQASFDTLKYYKERFPRGRKVGTLVTDVLLRESGLLDYNPAVRP

Query:  IESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVVGPASKDPAPVIELESSRGPSREKRPRDQTEAVDALPLGEEVREEVPQKRRRKKKK
        IESSRPNSELAMVCGFAS+VKRKSKG+AHALEAAQSSKP TPAVVGPAS+DPAPVIELESSRGPSREKRPRDQTEAVD  PLGEEVREEVP KRRRKKKK
Subjt:  IESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVVGPASKDPAPVIELESSRGPSREKRPRDQTEAVDALPLGEEVREEVPQKRRRKKKK

Query:  TTSPLEVGACRVLPASFADRVNDPAARMGGTSDVTAWFRVEPSSSGVRDQ
        TTSPLEVGA  VLPASFADRV+DP ARMGGT DVT  FRVEPSSSGVRDQ
Subjt:  TTSPLEVGACRVLPASFADRVNDPAARMGGTSDVTAWFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]2.9e-11683.15Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVA NGWGVIFALAILFWLRARDSEEAEL                           KGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRAVPELTQASFDTLKYYKERFPRGRKVGTLVTDVLLRESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTD LL ESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRAVPELTQASFDTLKYYKERFPRGRKVGTLVTDVLLRESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  SVKRKSKGRAHALEAAQSSKPATPAVVGPASKDPAPVIELESSRGPSREKRPRDQTEAVDAL-------PLGE
         VKRKSKGRAHALEAAQSSKP TPAVVGPAS+DPAPVIELESS GPSREKRPRDQTEAVDA        PLGE
Subjt:  SVKRKSKGRAHALEAAQSSKPATPAVVGPASKDPAPVIELESSRGPSREKRPRDQTEAVDAL-------PLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]3.8e-10887.06Show/hide
Query:  GTSDVTAWFRVEPSSSGVRDQVSCISAASLDRYLRRASKFVSDPGSVLQRTIDYAAE-------------AELDGREVLAVREKEEFSAALEAASSTMKD
        G   + A  R+EPSSSGVRDQVS ISAASLDR LRRASKFVS PGSVLQRTIDYAAE             AELDGREVLA REKEEFSAALE ASSTMKD
Subjt:  GTSDVTAWFRVEPSSSGVRDQVSCISAASLDRYLRRASKFVSDPGSVLQRTIDYAAE-------------AELDGREVLAVREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKKLEHATTELEAAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVESQAELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKDK+LEHAT ELE AKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKKLEHATTELEAAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVD
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVD
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVD

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]9.0e-16687.32Show/hide
Query:  MSSSFSSNLGSDEDLARKLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLAR+LES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYL SLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARKLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVA NGWGVIFALAILFWLRARDSEEAEL                           KGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRAVPELTQASFDTLKYYKERFPRGRKVGTLVTDVLLRESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTD LL ESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRAVPELTQASFDTLKYYKERFPRGRKVGTLVTDVLLRESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRK

Query:  SKGRAHALEAAQSSKPATPAVVGPASKDPAPVIELESSRGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAVVGPAS+DPA VIELESS GPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVVGPASKDPAPVIELESSRGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.2e-17569.51Show/hide
Query:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRAVPELTQASFDTLKYYKERFPRGRKVGTLVTDVLLRESGLLDYNPAVRP
        KG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD LL ESGLLDYNP VR 
Subjt:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRAVPELTQASFDTLKYYKERFPRGRKVGTLVTDVLLRESGLLDYNPAVRP

Query:  IESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAV--------VGPASKDPAPVIELESSRGPSREKRPRDQTEAVDALPLGEEVREEVPQ
        IE+SRPNSELAMVCGF  SVKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ S G S EKR R+++EA+D  PL  EVR E P 
Subjt:  IESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAV--------VGPASKDPAPVIELESSRGPSREKRPRDQTEAVDALPLGEEVREEVPQ

Query:  KRRRKKKKTTSPLEVGACRVLPASFADRVNDPAARMGGTSDVTAWFRVEPSSSGVRDQVSCISAASLDRYLRRASKFVSDPGSVLQRTIDYAAE------
        +RRRKKKKT+S  E GA   LP S AD V+DP ARM GTS+V   F +EPSSSGV+DQVS ISA  LDRYLRRASKFVSDPGSVLQRTID  AE      
Subjt:  KRRRKKKKTTSPLEVGACRVLPASFADRVNDPAARMGGTSDVTAWFRVEPSSSGVRDQVSCISAASLDRYLRRASKFVSDPGSVLQRTIDYAAE------

Query:  -------AELDGREVLAVREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALE
               AELDGRE LA +E+E   AALEAA +T+K ELLKA  EV+IL+AEV+++ +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q LE
Subjt:  -------AELDGREVLAVREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALE

Query:  AKDKKLEHATTELEAAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVD
         KD  +   TTEL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD
Subjt:  AKDKKLEHATTELEAAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVD

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092985.8e-11086.4Show/hide
Query:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRAVPELTQASFDTLKYYKERFPRGRKVGTLVTDVLLRESGLLDYNPAVRP
        KGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IR VPELTQASFDTLKYYKE FPRGRKVGTLVTD LL ESGLLDYNPAVRP
Subjt:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRAVPELTQASFDTLKYYKERFPRGRKVGTLVTDVLLRESGLLDYNPAVRP

Query:  IESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVVGPASKDPAPVIELESSRGPSREKRPRDQTEAVDALPLGEEVREEVPQKRRRKKKK
        IESSRPNSELAMVCGFAS+VKRKSKG+AHALEAAQSSKP TPAVVGPAS+DPAPVIELESSRGPSREKRPRDQTEAVD  PLGEEVREEVP KRRRKKKK
Subjt:  IESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVVGPASKDPAPVIELESSRGPSREKRPRDQTEAVDALPLGEEVREEVPQKRRRKKKK

Query:  TTSPLEVGACRVLPASFADRVNDPAARMGGTSDVTAWFRVEPSSSGVRDQ
        TTSPLEVGA  VLPASFADRV+DP ARMGGT DVT  FRVEPSSSGVRDQ
Subjt:  TTSPLEVGACRVLPASFADRVNDPAARMGGTSDVTAWFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138261.4e-11683.15Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVA NGWGVIFALAILFWLRARDSEEAEL                           KGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRAVPELTQASFDTLKYYKERFPRGRKVGTLVTDVLLRESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTD LL ESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRAVPELTQASFDTLKYYKERFPRGRKVGTLVTDVLLRESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  SVKRKSKGRAHALEAAQSSKPATPAVVGPASKDPAPVIELESSRGPSREKRPRDQTEAVDAL-------PLGE
         VKRKSKGRAHALEAAQSSKP TPAVVGPAS+DPAPVIELESS GPSREKRPRDQTEAVDA        PLGE
Subjt:  SVKRKSKGRAHALEAAQSSKPATPAVVGPASKDPAPVIELESSRGPSREKRPRDQTEAVDAL-------PLGE

A0A6J1D971 uncharacterized protein LOC1110185381.9e-10887.06Show/hide
Query:  GTSDVTAWFRVEPSSSGVRDQVSCISAASLDRYLRRASKFVSDPGSVLQRTIDYAAE-------------AELDGREVLAVREKEEFSAALEAASSTMKD
        G   + A  R+EPSSSGVRDQVS ISAASLDR LRRASKFVS PGSVLQRTIDYAAE             AELDGREVLA REKEEFSAALE ASSTMKD
Subjt:  GTSDVTAWFRVEPSSSGVRDQVSCISAASLDRYLRRASKFVSDPGSVLQRTIDYAAE-------------AELDGREVLAVREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKKLEHATTELEAAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVESQAELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKDK+LEHAT ELE AKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKKLEHATTELEAAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVD
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVD
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVD

A0A6J1DXS5 uncharacterized protein LOC1110255024.3e-16687.32Show/hide
Query:  MSSSFSSNLGSDEDLARKLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLAR+LES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYL SLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARKLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVA NGWGVIFALAILFWLRARDSEEAEL                           KGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRAVPELTQASFDTLKYYKERFPRGRKVGTLVTDVLLRESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTD LL ESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRAVPELTQASFDTLKYYKERFPRGRKVGTLVTDVLLRESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRK

Query:  SKGRAHALEAAQSSKPATPAVVGPASKDPAPVIELESSRGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAVVGPAS+DPA VIELESS GPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVVGPASKDPAPVIELESSRGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256656.0e-17669.51Show/hide
Query:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRAVPELTQASFDTLKYYKERFPRGRKVGTLVTDVLLRESGLLDYNPAVRP
        KG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD LL ESGLLDYNP VR 
Subjt:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRAVPELTQASFDTLKYYKERFPRGRKVGTLVTDVLLRESGLLDYNPAVRP

Query:  IESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAV--------VGPASKDPAPVIELESSRGPSREKRPRDQTEAVDALPLGEEVREEVPQ
        IE+SRPNSELAMVCGF  SVKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ S G S EKR R+++EA+D  PL  EVR E P 
Subjt:  IESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAV--------VGPASKDPAPVIELESSRGPSREKRPRDQTEAVDALPLGEEVREEVPQ

Query:  KRRRKKKKTTSPLEVGACRVLPASFADRVNDPAARMGGTSDVTAWFRVEPSSSGVRDQVSCISAASLDRYLRRASKFVSDPGSVLQRTIDYAAE------
        +RRRKKKKT+S  E GA   LP S AD V+DP ARM GTS+V   F +EPSSSGV+DQVS ISA  LDRYLRRASKFVSDPGSVLQRTID  AE      
Subjt:  KRRRKKKKTTSPLEVGACRVLPASFADRVNDPAARMGGTSDVTAWFRVEPSSSGVRDQVSCISAASLDRYLRRASKFVSDPGSVLQRTIDYAAE------

Query:  -------AELDGREVLAVREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALE
               AELDGRE LA +E+E   AALEAA +T+K ELLKA  EV+IL+AEV+++ +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q LE
Subjt:  -------AELDGREVLAVREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALE

Query:  AKDKKLEHATTELEAAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVD
         KD  +   TTEL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD
Subjt:  AKDKKLEHATTELEAAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGACTTAGCTCGTAAGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGATTCTCCGATGACGGG
GAGGATAGTGACGCCTCCACATCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCAGATCTCTTCGTAGGGGGTTCGCTATCCCTGAGAAC
ATCCTCCTCAGGCTCCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCGGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTT
CACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCTCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTA
CGAGCTCGGGATAGTGAGGAAGCCGAGCTAAAAGGCGCAGGCGGTATAGTTAAAGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGG
GAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTCGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGAGCAGTCCCCGAGCTTACGCAAGCCTCC
TTCGACACGTTGAAATATTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACCGACGTGCTACTGCGCGAGTCCGGGCTGCTAGATTACAAC
CCTGCAGTTCGTCCCATTGAATCCTCGAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCCAAAGGCCGAGCCCATGCT
CTTGAGGCCGCCCAGAGTTCGAAACCCGCCACCCCTGCCGTGGTAGGGCCTGCCTCGAAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTAGGGGTCCCTCG
AGGGAGAAGCGCCCAAGGGATCAGACCGAGGCGGTGGACGCCCTGCCTTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCAGAAGCGAAGGAGGAAGAAAAAGAAG
ACGACCTCCCCCTTGGAGGTCGGAGCTTGTAGGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGAACGATCCCGCAGCCAGGATGGGCGGGACGTCCGACGTGACG
GCATGGTTCAGAGTTGAGCCGTCAAGCTCCGGGGTGAGGGACCAGGTGTCCTGCATCTCAGCTGCAAGTTTGGACCGCTACCTAAGGAGGGCGTCCAAATTTGTG
AGCGACCCTGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCCGAGGCCGAGCTGGATGGGAGGGAAGTTTTGGCAGTAAGGGAGAAAGAGGAGTTCTCTGCT
GCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGATTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAG
AAGGAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCCTGAAAGAGAAGGACGACATGCTC
CAGGCGCTTGAAGCGAAGGATAAGAAGCTGGAGCATGCGACTACCGAGCTGGAGGCGGCGAAGGAGCGCCTCAGCAATGGAGTCCTACTGGAGGAATCGTTCAGG
CAGCATCCTGACTTTGATGGATTTGCCAAGGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTC
AGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGACTTAGCTCGTAAGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGATTCTCCGATGACGGG
GAGGATAGTGACGCCTCCACATCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCAGATCTCTTCGTAGGGGGTTCGCTATCCCTGAGAAC
ATCCTCCTCAGGCTCCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCGGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTT
CACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCTCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTA
CGAGCTCGGGATAGTGAGGAAGCCGAGCTAAAAGGCGCAGGCGGTATAGTTAAAGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGG
GAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTCGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGAGCAGTCCCCGAGCTTACGCAAGCCTCC
TTCGACACGTTGAAATATTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACCGACGTGCTACTGCGCGAGTCCGGGCTGCTAGATTACAAC
CCTGCAGTTCGTCCCATTGAATCCTCGAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCCAAAGGCCGAGCCCATGCT
CTTGAGGCCGCCCAGAGTTCGAAACCCGCCACCCCTGCCGTGGTAGGGCCTGCCTCGAAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTAGGGGTCCCTCG
AGGGAGAAGCGCCCAAGGGATCAGACCGAGGCGGTGGACGCCCTGCCTTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCAGAAGCGAAGGAGGAAGAAAAAGAAG
ACGACCTCCCCCTTGGAGGTCGGAGCTTGTAGGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGAACGATCCCGCAGCCAGGATGGGCGGGACGTCCGACGTGACG
GCATGGTTCAGAGTTGAGCCGTCAAGCTCCGGGGTGAGGGACCAGGTGTCCTGCATCTCAGCTGCAAGTTTGGACCGCTACCTAAGGAGGGCGTCCAAATTTGTG
AGCGACCCTGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCCGAGGCCGAGCTGGATGGGAGGGAAGTTTTGGCAGTAAGGGAGAAAGAGGAGTTCTCTGCT
GCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGATTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAG
AAGGAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCCTGAAAGAGAAGGACGACATGCTC
CAGGCGCTTGAAGCGAAGGATAAGAAGCTGGAGCATGCGACTACCGAGCTGGAGGCGGCGAAGGAGCGCCTCAGCAATGGAGTCCTACTGGAGGAATCGTTCAGG
CAGCATCCTGACTTTGATGGATTTGCCAAGGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTC
AGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATTAG
Protein sequenceShow/hide protein sequence
MSSSFSSNLGSDEDLARKLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKMFEYGLRLPL
HPFVQEFLFRTGLAPAQVASNGWGVIFALAILFWLRARDSEEAELKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRAVPELTQAS
FDTLKYYKERFPRGRKVGTLVTDVLLRESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVVGPASKDPAPVIELESSRGPS
REKRPRDQTEAVDALPLGEEVREEVPQKRRRKKKKTTSPLEVGACRVLPASFADRVNDPAARMGGTSDVTAWFRVEPSSSGVRDQVSCISAASLDRYLRRASKFV
SDPGSVLQRTIDYAAEAELDGREVLAVREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML
QALEAKDKKLEHATTELEAAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVD