CuGenDBv2

Moc06g11090 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g11090
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr6:8432166..8435593
RNA-Seq Expression: Moc06g11090
Synteny: Moc06g11090
Gene Ontology terms:
GO:0016021 - integral component of membrane (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: NA
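
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span can be parsed and its length computed as follows. `parse_location` is a hypothetical helper name used only for illustration; it is not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a string such as 'chr6:8432166..8435593' into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr6:8432166..8435593")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 3,428 bp on chr6.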


Homology
GenBank top hits (e value, % identity, alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia] | e value: 1.5e-116 | % identity: 89.6
Query:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRP
        KGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTDKLLLESGLLDYNPAVRP
Subjt:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRP

Query:  IESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQNSKPATPTVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKK
        IES RPNSELAMVCGFASNVKRKSKG+AHALEAAQ+SKP TP VVGPASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREEVPLKRRRKKKK
Subjt:  IESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQNSKPATPTVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKK

Query:  TTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQ
        TTSPLEVGARGVLPASFADRVDDPEARMGGT DVTTRFRVEPSSSGVRDQ
Subjt:  TTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | e value: 1.5e-124 | % identity: 86.45
Query:  MFEYGLKLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLA-------------------KGAGGIVKGPTSIKGWVR
        MFEYGL+LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLA                   KGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLKLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLA-------------------KGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESLRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE  RPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESLRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQNSKPATPTVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE
         VKRKSKGRAHALEAAQ+SKP TP VVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA        PLGE
Subjt:  NVKRKSKGRAHALEAAQNSKPATPTVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] | e value: 3.4e-116 | % identity: 83.16
Query:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEKFSAALEAASSTMKD
        G   +  + R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKE+FSAALE ASSTMKD
Subjt:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEKFSAALEAASSTMKD

Query:  ELLKAHSKVEILKAEVETKAELLK------------------GLEKEKFQLLKENDDMLQALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHS+VE LKAEVE++AELLK                  GLE+EKFQLLKE DDMLQALEAKDKEL+HATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSKVEILKAEVETKAELLK------------------GLEKEKFQLLKENDDMLQALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMLDLQIDLSGLKKRYAEQWASGPGDTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAGS
        GFAKDFSDAGFKFLMKGIASDM DLQIDLSGLK+RYAE+WASGPG TPGPQALVD+YVRDLDSDYSD EEDQVG+TQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMLDLQIDLSGLKKRYAEQWASGPGDTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | e value: 3.4e-148 | % identity: 80.85
Query:  MSSSFSSNLGSDEDLARRLESELEEIKNFRFSDDGEDSDASTSG----------------FRR--------------RGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEI+N R SDDGEDSDASTSG                 RR               GERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIKNFRFSDDGEDSDASTSG----------------FRR--------------RGERADNPPEGWVTLYFKMFEYG

Query:  LKLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLA-------------------KGAGGIVKGPTSIKGWVRKWFYA
        L+LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLA                   KGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LKLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLA-------------------KGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESLRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIES RPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESLRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQNSKPATPTVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQ+SKPATP VVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQNSKPATPTVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | e value: 1.8e-186 | % identity: 68.8
Query:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRP
        KG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP VR 
Subjt:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRP

Query:  IESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQNSKPATPTV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPL
        IE+ RPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TPTV         GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR E PL
Subjt:  IESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQNSKPATPTV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPL

Query:  KRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASI
        +RRRKKKKT+S  E GARG LP S AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF+ASI
Subjt:  KRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASI

Query:  QSALAVKAELDGREVLAAREKEKFSAALEAASSTMKDELLKAHSKVEILKAEVETKAELL------------------KGLEKEKFQLLKENDDMLQALE
          A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  +V+IL+AEV+ K +LL                  KGLEKEKFQLLKE DD+ Q LE
Subjt:  QSALAVKAELDGREVLAAREKEKFSAALEAASSTMKDELLKAHSKVEILKAEVETKAELL------------------KGLEKEKFQLLKENDDMLQALE

Query:  AKDKELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMLDLQIDLSGLKKRYAEQWASGPGDTPGPQALVDKYVRDLDS
         KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DM  LQIDL+GLKK+Y+E+WASGP  TP PQ+LVDKYVR+LDS
Subjt:  AKDKELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMLDLQIDLSGLKKRYAEQWASGPGDTPGPQALVDKYVRDLDS

Query:  DYSDLEED--------QVGTTQEGAP--QAGS
        DYSD+EE+        +VGTTQE  P  Q GS
Subjt:  DYSDLEED--------QVGTTQEGAP--QAGS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298 | e value: 7.4e-117 | % identity: 89.6
Query:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRP
        KGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTDKLLLESGLLDYNPAVRP
Subjt:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRP

Query:  IESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQNSKPATPTVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKK
        IES RPNSELAMVCGFASNVKRKSKG+AHALEAAQ+SKP TP VVGPASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREEVPLKRRRKKKK
Subjt:  IESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQNSKPATPTVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKK

Query:  TTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQ
        TTSPLEVGARGVLPASFADRVDDPEARMGGT DVTTRFRVEPSSSGVRDQ
Subjt:  TTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC111013826 | e value: 7.4e-125 | % identity: 86.45
Query:  MFEYGLKLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLA-------------------KGAGGIVKGPTSIKGWVR
        MFEYGL+LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLA                   KGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLKLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLA-------------------KGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESLRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE  RPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESLRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQNSKPATPTVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE
         VKRKSKGRAHALEAAQ+SKP TP VVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA        PLGE
Subjt:  NVKRKSKGRAHALEAAQNSKPATPTVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE

A0A6J1D971 uncharacterized protein LOC111018538 | e value: 1.6e-116 | % identity: 83.16
Query:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEKFSAALEAASSTMKD
        G   +  + R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKE+FSAALE ASSTMKD
Subjt:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEKFSAALEAASSTMKD

Query:  ELLKAHSKVEILKAEVETKAELLK------------------GLEKEKFQLLKENDDMLQALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHS+VE LKAEVE++AELLK                  GLE+EKFQLLKE DDMLQALEAKDKEL+HATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSKVEILKAEVETKAELLK------------------GLEKEKFQLLKENDDMLQALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMLDLQIDLSGLKKRYAEQWASGPGDTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAGS
        GFAKDFSDAGFKFLMKGIASDM DLQIDLSGLK+RYAE+WASGPG TPGPQALVD+YVRDLDSDYSD EEDQVG+TQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMLDLQIDLSGLKKRYAEQWASGPGDTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAGS

A0A6J1DXS5 uncharacterized protein LOC111025502 | e value: 1.6e-148 | % identity: 80.85
Query:  MSSSFSSNLGSDEDLARRLESELEEIKNFRFSDDGEDSDASTSG----------------FRR--------------RGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEI+N R SDDGEDSDASTSG                 RR               GERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIKNFRFSDDGEDSDASTSG----------------FRR--------------RGERADNPPEGWVTLYFKMFEYG

Query:  LKLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLA-------------------KGAGGIVKGPTSIKGWVRKWFYA
        L+LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLA                   KGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LKLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLA-------------------KGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESLRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIES RPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESLRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQNSKPATPTVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQ+SKPATP VVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQNSKPATPTVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC111025665 | e value: 8.9e-187 | % identity: 68.8
Query:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRP
        KG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP VR 
Subjt:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRP

Query:  IESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQNSKPATPTV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPL
        IE+ RPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TPTV         GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR E PL
Subjt:  IESLRPNSELAMVCGFASNVKRKSKGRAHALEAAQNSKPATPTV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPL

Query:  KRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASI
        +RRRKKKKT+S  E GARG LP S AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF+ASI
Subjt:  KRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASI

Query:  QSALAVKAELDGREVLAAREKEKFSAALEAASSTMKDELLKAHSKVEILKAEVETKAELL------------------KGLEKEKFQLLKENDDMLQALE
          A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  +V+IL+AEV+ K +LL                  KGLEKEKFQLLKE DD+ Q LE
Subjt:  QSALAVKAELDGREVLAAREKEKFSAALEAASSTMKDELLKAHSKVEILKAEVETKAELL------------------KGLEKEKFQLLKENDDMLQALE

Query:  AKDKELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMLDLQIDLSGLKKRYAEQWASGPGDTPGPQALVDKYVRDLDS
         KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DM  LQIDL+GLKK+Y+E+WASGP  TP PQ+LVDKYVR+LDS
Subjt:  AKDKELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMLDLQIDLSGLKKRYAEQWASGPGDTPGPQALVDKYVRDLDS

Query:  DYSDLEED--------QVGTTQEGAP--QAGS
        DYSD+EE+        +VGTTQE  P  Q GS
Subjt:  DYSDLEED--------QVGTTQEGAP--QAGS

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
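
The alignments above follow the usual BLAST pairwise layout, in which the middle line carries a letter where query and subject residues are identical, a `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch of how a percent identity can be read off such a block, the function below counts the letters in the middle line and divides by the number of alignment columns. `midline_identity` is a hypothetical name, and the values reported on this page were computed by the search tool over the full alignment, so a figure recomputed from the excerpt shown here need not match them exactly.

```python
def midline_identity(query, midline):
    """Percent identity of one alignment block, judged from its middle line.

    query   -- the aligned query row (its length gives the number of columns)
    midline -- the row between query and subject; letters mark identical residues
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment block")
    # Only letters count as identities; '+' and spaces do not.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100 * identical / columns


# First ten columns of the first GenBank alignment on this page.
print(midline_identity("KGAGGIVKGP", "KGA GIVKGP"))
```

The middle line may be shorter than the query row if trailing spaces were trimmed, which is why the column count is taken from the query row.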

Sequences
CDS sequence
ATGTCAAGGGAGTGTTATGCCTCTACACTCAAGGGGTCATCGATATGCGCCCTTGAAAAACAAGCCAAATGTGCTGAGAAGCAAGAGTCAGAGGCCGACCTACCCCGAGA
AGGCAAAAAGGAGTTCTCTGCACCAACAGACGAGCTTGAGCTTGTCGCAGCTCGAACTCGGCCTCCGGACCGATCTGGATACTTGGGCGGACCTGCACAAAAAGGTGAGC
ACTCCGACGATCAAGTCAGTATAGCTCGAACCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGA
TCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAAAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCGGGATTCCGGAG
GAGGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAAACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCT
TCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGCCGAGCTG
TTGGACGTAGACCAGCTCCTCGCGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGAGAATGGCTTGC
AAAAGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAAT
ATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCTGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAA
TCCTTAAGGCCGAACTCCGAATTAGCCATGGTTTGCGGGTTTGCGAGTAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAATTCGAAACC
TGCCACTCCTACTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGG
CGGTGGACGCCTTGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACGACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTC
TTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGATGTGACGACACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGGA
CCAGGTGTCCCGCATCTCAGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTG
AGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGAAGTTCTCTGCTGCCTTGGAAGCT
GCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTAAGGTGGAAATTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGGGCTTGGAGAAGGAGAA
GTTCCAACTCCTGAAGGAGAATGACGACATGCTCCAGGCACTTGAAGCGAAGGATAAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGTCTCAGCA
ATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGAC
ATGCTTGACCTTCAGATCGATCTCAGTGGTCTGAAAAAGAGGTATGCCGAGCAGTGGGCGTCTGGGCCTGGCGACACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGT
CAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTTGGCACCACTCAGGAGGGCGCTCCTCAGGCAGGCTCTTAG
mRNA sequence
ATGTCAAGGGAGTGTTATGCCTCTACACTCAAGGGGTCATCGATATGCGCCCTTGAAAAACAAGCCAAATGTGCTGAGAAGCAAGAGTCAGAGGCCGACCTACCCCGAGA
AGGCAAAAAGGAGTTCTCTGCACCAACAGACGAGCTTGAGCTTGTCGCAGCTCGAACTCGGCCTCCGGACCGATCTGGATACTTGGGCGGACCTGCACAAAAAGGTGAGC
ACTCCGACGATCAAGTCAGTATAGCTCGAACCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGA
TCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAAAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCGGGATTCCGGAG
GAGGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAAACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCT
TCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGCCGAGCTG
TTGGACGTAGACCAGCTCCTCGCGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGAGAATGGCTTGC
AAAAGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAAT
ATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCTGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAA
TCCTTAAGGCCGAACTCCGAATTAGCCATGGTTTGCGGGTTTGCGAGTAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAATTCGAAACC
TGCCACTCCTACTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGG
CGGTGGACGCCTTGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACGACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTC
TTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGATGTGACGACACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGGA
CCAGGTGTCCCGCATCTCAGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTG
AGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGAAGTTCTCTGCTGCCTTGGAAGCT
GCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTAAGGTGGAAATTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGGGCTTGGAGAAGGAGAA
GTTCCAACTCCTGAAGGAGAATGACGACATGCTCCAGGCACTTGAAGCGAAGGATAAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGTCTCAGCA
ATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGAC
ATGCTTGACCTTCAGATCGATCTCAGTGGTCTGAAAAAGAGGTATGCCGAGCAGTGGGCGTCTGGGCCTGGCGACACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGT
CAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTTGGCACCACTCAGGAGGGCGCTCCTCAGGCAGGCTCTTAG
Protein sequence
MSRECYASTLKGSSICALEKQAKCAEKQESEADLPREGKKEFSAPTDELELVAARTRPPDRSGYLGGPAQKGEHSDDQVSIARTLVGRSLPSLSLSNVVAMSSSFSSNLG
SDEDLARRLESELEEIKNFRFSDDGEDSDASTSGFRRRGERADNPPEGWVTLYFKMFEYGLKLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL
LDVDQLLAKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIE
SLRPNSELAMVCGFASNVKRKSKGRAHALEAAQNSKPATPTVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKKTTSPLEVGARGV
LPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEKFSAALEA
ASSTMKDELLKAHSKVEILKAEVETKAELLKGLEKEKFQLLKENDDMLQALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASD
MLDLQIDLSGLKKRYAEQWASGPGDTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAGS
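
The protein sequence above corresponds to the CDS read codon by codon. A minimal sketch of that translation, assuming the standard nuclear genetic code applies and that the CDS starts in frame, is shown below. `translate` and `CODON_TABLE` are illustrative names and are not taken from the CuGenDB site.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# at each position (last position varies fastest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 nucleotides of the CDS listed above.
print(translate("ATGTCAAGGGAGTGTTATGCCTCTACACTCAAGGGG"))  # MSRECYASTLKG
```

Passing the full CDS text, line breaks included, should give a sequence that can be compared directly with the protein sequence listed on this page.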