; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g10700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g10700
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr2:7586609..7589370
RNA-Seq ExpressionMoc02g10700
SyntenyMoc02g10700
Gene Ontology termsGO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]3.3e-10684.13Show/hide
Query:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSR
        KGA GIVKGPTSIKGWVRKWFYASGEWLAKDES         V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSR
Subjt:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSR

Query:  PNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPASVIELESSGDPSREKRPRDQTEVVDAQIEAVDAPPLGEEAREEAPLKRRRKK
        PNSELAMVCGFAS+VKRKSKG+AHALEAAQSSKP TPAV GPASEDPA VIELESS  PSREKRPRDQT       EAVD  PLGEE REE PLKRRRKK
Subjt:  PNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPASVIELESSGDPSREKRPRDQTEVVDAQIEAVDAPPLGEEAREEAPLKRRRKK

Query:  KKAISPLEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIKPSSSGVRDQ
        KK  SPLEVGA  VLPASFADRVDDP ARMGGT DVT RFR++PSSSGVRDQ
Subjt:  KKAISPLEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIKPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]7.2e-10181.45Show/hide
Query:  MAPNGWGVIFALAILFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVP----
        +APNGWGVIFALAILFWLRARDSEEAEL                           KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVP    
Subjt:  MAPNGWGVIFALAILFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVP----

Query:  -LVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVA
         LVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS VKRKSKGRAHALEAAQSSKP TPAV 
Subjt:  -LVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVA

Query:  GPASEDPASVIELESSGDPSREKRPRDQTEVVDAQIEAVDAPPLGEEA
        GPASEDPA VIELESSG PSREKRPRDQTE VDAQ EA D PPLGE A
Subjt:  GPASEDPASVIELESSGDPSREKRPRDQTEVVDAQIEAVDAPPLGEEA

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]1.5e-11491.63Show/hide
Query:  GTSDVTARFRIKPSSSGVRDQVSRISTASLDRCLRRASKFVSDLGSVMQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ RI+PSSSGVRDQVSRIS ASLDRCLRRASKFVS  GSV+QRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIKPSSSGVRDQVSRISTASLDRCLRRASKFVSDLGSVMQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHFEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLREKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAH EVE LKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLL+EKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHFEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLREKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQINLSGLKRRYVEKWASGPGGTPGPK
        GFAKDFSDAGFKFLMKGIASDMPDLQI+LSGLKRRY EKWASGPGGTPGP+
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQINLSGLKRRYVEKWASGPGGTPGPK

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]2.7e-12473.88Show/hide
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG-----SLSLRTSSSGFR----------------RRGRE
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG     ++ LR    G R                  G  
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG-----SLSLRTSSSGFR----------------RRGRE

Query:  LTI-------LQRDGSLSTSKCLMAPNGWGVIFALAILFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVRKWFY
        L +       L R G    +   +APNGWGVIFALAILFWLRARDSEEAEL                           KGAGGIVKGPTSIKGWVRKWFY
Subjt:  LTI-------LQRDGSLSTSKCLMAPNGWGVIFALAILFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVRKWFY

Query:  ASGEWLAKDESGRSFFDVP-----LVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKR
        ASGEWLAKDESGRSFFDVP     LVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKR
Subjt:  ASGEWLAKDESGRSFFDVP-----LVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKR

Query:  KSKGRAHALEAAQSSKPATPAVAGPASEDPASVIELESSGDPSREKRPRDQTEVVD
        KSKGRAHALEAAQSSKPATPAV GPASEDPA VIELESSG PSREKRPRDQTE VD
Subjt:  KSKGRAHALEAAQSSKPATPAVAGPASEDPASVIELESSGDPSREKRPRDQTEVVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]6.9e-16867.47Show/hide
Query:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVP-----LVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRP
        KG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVP     LVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP VR 
Subjt:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVP-----LVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRP

Query:  IESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPASVIELESSGDPSREKRPRDQTEVVDAQIEAVDAPPLGEE
        IE+SRPNSELAMVCGF  SVKRKSKGRAHAL+    ++P TP V        +GP+S  P  VIEL+ SG  S EKR R+++       EA+D  PL  E
Subjt:  IESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPASVIELESSGDPSREKRPRDQTEVVDAQIEAVDAPPLGEE

Query:  AREEAPLKRRRKKKKAISPLEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIKPSSSGVRDQVSRISTASLDRCLRRASKFVSDLGSVMQRTIDYAA
         R E+PL+RRRKKKK  S  E GA   LP S AD VDDP ARM GTS+V  RF ++PSSSGV+DQVSRIS   LDR LRRASKFVSD GSV+QRTID  A
Subjt:  AREEAPLKRRRKKKKAISPLEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIKPSSSGVRDQVSRISTASLDRCLRRASKFVSDLGSVMQRTIDYAA

Query:  EAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHFEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLREKD
        EAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLL+EKD
Subjt:  EAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHFEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLREKD

Query:  DMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQINLSGLKRRYVEKWASGPGGTPGPK
        D+ Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQI+L+GLK++Y EKWASGP GTP P+
Subjt:  DMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQINLSGLKRRYVEKWASGPGGTPGPK

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092981.6e-10684.13Show/hide
Query:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSR
        KGA GIVKGPTSIKGWVRKWFYASGEWLAKDES         V+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSR
Subjt:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSR

Query:  PNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPASVIELESSGDPSREKRPRDQTEVVDAQIEAVDAPPLGEEAREEAPLKRRRKK
        PNSELAMVCGFAS+VKRKSKG+AHALEAAQSSKP TPAV GPASEDPA VIELESS  PSREKRPRDQT       EAVD  PLGEE REE PLKRRRKK
Subjt:  PNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPASVIELESSGDPSREKRPRDQTEVVDAQIEAVDAPPLGEEAREEAPLKRRRKK

Query:  KKAISPLEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIKPSSSGVRDQ
        KK  SPLEVGA  VLPASFADRVDDP ARMGGT DVT RFR++PSSSGVRDQ
Subjt:  KKAISPLEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIKPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138263.5e-10181.45Show/hide
Query:  MAPNGWGVIFALAILFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVP----
        +APNGWGVIFALAILFWLRARDSEEAEL                           KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVP    
Subjt:  MAPNGWGVIFALAILFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVP----

Query:  -LVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVA
         LVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS VKRKSKGRAHALEAAQSSKP TPAV 
Subjt:  -LVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVA

Query:  GPASEDPASVIELESSGDPSREKRPRDQTEVVDAQIEAVDAPPLGEEA
        GPASEDPA VIELESSG PSREKRPRDQTE VDAQ EA D PPLGE A
Subjt:  GPASEDPASVIELESSGDPSREKRPRDQTEVVDAQIEAVDAPPLGEEA

A0A6J1D971 uncharacterized protein LOC1110185387.3e-11591.63Show/hide
Query:  GTSDVTARFRIKPSSSGVRDQVSRISTASLDRCLRRASKFVSDLGSVMQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ RI+PSSSGVRDQVSRIS ASLDRCLRRASKFVS  GSV+QRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIKPSSSGVRDQVSRISTASLDRCLRRASKFVSDLGSVMQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHFEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLREKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAH EVE LKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLL+EKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHFEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLREKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQINLSGLKRRYVEKWASGPGGTPGPK
        GFAKDFSDAGFKFLMKGIASDMPDLQI+LSGLKRRY EKWASGPGGTPGP+
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQINLSGLKRRYVEKWASGPGGTPGPK

A0A6J1DXS5 uncharacterized protein LOC1110255021.3e-12473.88Show/hide
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG-----SLSLRTSSSGFR----------------RRGRE
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG     ++ LR    G R                  G  
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRG-----SLSLRTSSSGFR----------------RRGRE

Query:  LTI-------LQRDGSLSTSKCLMAPNGWGVIFALAILFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVRKWFY
        L +       L R G    +   +APNGWGVIFALAILFWLRARDSEEAEL                           KGAGGIVKGPTSIKGWVRKWFY
Subjt:  LTI-------LQRDGSLSTSKCLMAPNGWGVIFALAILFWLRARDSEEAEL---------------------------KGAGGIVKGPTSIKGWVRKWFY

Query:  ASGEWLAKDESGRSFFDVP-----LVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKR
        ASGEWLAKDESGRSFFDVP     LVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKR
Subjt:  ASGEWLAKDESGRSFFDVP-----LVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKR

Query:  KSKGRAHALEAAQSSKPATPAVAGPASEDPASVIELESSGDPSREKRPRDQTEVVD
        KSKGRAHALEAAQSSKPATPAV GPASEDPA VIELESSG PSREKRPRDQTE VD
Subjt:  KSKGRAHALEAAQSSKPATPAVAGPASEDPASVIELESSGDPSREKRPRDQTEVVD

A0A6J1DZB3 uncharacterized protein LOC1110256653.3e-16867.47Show/hide
Query:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVP-----LVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRP
        KG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVP     LVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP VR 
Subjt:  KGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVP-----LVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRP

Query:  IESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPASVIELESSGDPSREKRPRDQTEVVDAQIEAVDAPPLGEE
        IE+SRPNSELAMVCGF  SVKRKSKGRAHAL+    ++P TP V        +GP+S  P  VIEL+ SG  S EKR R+++       EA+D  PL  E
Subjt:  IESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPASVIELESSGDPSREKRPRDQTEVVDAQIEAVDAPPLGEE

Query:  AREEAPLKRRRKKKKAISPLEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIKPSSSGVRDQVSRISTASLDRCLRRASKFVSDLGSVMQRTIDYAA
         R E+PL+RRRKKKK  S  E GA   LP S AD VDDP ARM GTS+V  RF ++PSSSGV+DQVSRIS   LDR LRRASKFVSD GSV+QRTID  A
Subjt:  AREEAPLKRRRKKKKAISPLEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIKPSSSGVRDQVSRISTASLDRCLRRASKFVSDLGSVMQRTIDYAA

Query:  EAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHFEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLREKD
        EAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLL+EKD
Subjt:  EAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHFEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLREKD

Query:  DMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQINLSGLKRRYVEKWASGPGGTPGPK
        D+ Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQI+L+GLK++Y EKWASGP GTP P+
Subjt:  DMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQINLSGLKRRYVEKWASGPGGTPGPK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCATCTTGGAGCATCAATGGGGGTCCTCCACGTGTCAAGGGTATTCCCCTCCCCAAACATTGGCCCCTTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTT
CATTCGACTTGCTTTGGACACGTGGCGACTTCCTATTCGTGGGAAAATACAACCGTTGCGGAAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTCAGA
GAGGATCCTTGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACGTAATTGCCATGTCGTCCTCTATTAGCAGCAACCTA
GGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATCTCCGATGACGGGGAGGATAGTGACGCCTCCACTTCAGGTCAGGGTTTGGA
ATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACAATCC
TCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGATGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTACGAGCTCGGGATAGTGAG
GAGGCCGAGCTGAAAGGCGCAGGCGGCATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCGAAGGACGAGTC
AGGTCGTTCCTTCTTTGACGTCCCACTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGATACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTA
GGAAGGTCGGAACCCTGGTGACTGACGAACTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCC
ATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCTAAGGGCCGAGCTCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGCAGGGCCTGC
CTCGGAAGATCCAGCCTCGGTGATCGAGCTGGAGTCTTCTGGGGATCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGTGGTGGACGCCCAGATCGAGGCGGTGG
ACGCCCCGCCTTTGGGCGAGGAGGCGAGGGAGGAAGCCCCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTTGGAGGTCGGGGCTTGCAGGGTCTTGCCT
GCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAATTAAGCCGTCAAGTTCTGGGGTGAGGGACCAGGT
GTCCCGCATCTCAACTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCTTGGGTCCGTTATGCAGAGGACCATCGACTACGCCGCCGAGGCGT
TCGTGGCTTCCATTCAATCGGCTCTGGCTGTCAAGGCCGAGCTGGATGGAAGGGAAGTTCTGGCAGCAAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCC
TCCACCATGAAGGATGAGCTGCTGAAGGCTCACTTTGAGGTGGAGGCTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAGGAGGAGGACAGGCGCAAGGC
CCAACTCCGAGCTGCCCACGCTATTACCAGGGGCCTGGAGAGGGAGAAGTTCCAGCTCCTGAGGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGC
TGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTACTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGAT
TTTTCTGACGCGGGCTTCAAGTTCCTCATGAAAGGCATTGCTTCCGACATGCCCGACCTTCAGATCAATCTCAGCGGTCTGAAAAGGAGGTATGTCGAGAAGTGGGCGTC
TGGTCCTGGCGGCACCCCTGGCCCCAAGCGTTGGTGGATCAGTATGTCAGAGATCTGGACTCCGACTACTCTGATCCCGAAGAGGACCAGGTCGTCCCCACTCAAGAGGG
CGCTCCTCAAGCGGGCTCTTAGGCAACCACCCTTCACGAGGCTTTTCGCTGATCTTCCTCCCTTTTCTTTGGTTTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCCATCTTGGAGCATCAATGGGGGTCCTCCACGTGTCAAGGGTATTCCCCTCCCCAAACATTGGCCCCTTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTT
CATTCGACTTGCTTTGGACACGTGGCGACTTCCTATTCGTGGGAAAATACAACCGTTGCGGAAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTCAGA
GAGGATCCTTGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACGTAATTGCCATGTCGTCCTCTATTAGCAGCAACCTA
GGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATCTCCGATGACGGGGAGGATAGTGACGCCTCCACTTCAGGTCAGGGTTTGGA
ATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACAATCC
TCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGATGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTACGAGCTCGGGATAGTGAG
GAGGCCGAGCTGAAAGGCGCAGGCGGCATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTCGCGAAGGACGAGTC
AGGTCGTTCCTTCTTTGACGTCCCACTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGATACGCTGAAATACTACAAGGAGCGCTTTCCGAGGGGTA
GGAAGGTCGGAACCCTGGTGACTGACGAACTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCTGAACTTGCC
ATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCTAAGGGCCGAGCTCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGCAGGGCCTGC
CTCGGAAGATCCAGCCTCGGTGATCGAGCTGGAGTCTTCTGGGGATCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGTGGTGGACGCCCAGATCGAGGCGGTGG
ACGCCCCGCCTTTGGGCGAGGAGGCGAGGGAGGAAGCCCCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTTGGAGGTCGGGGCTTGCAGGGTCTTGCCT
GCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAATTAAGCCGTCAAGTTCTGGGGTGAGGGACCAGGT
GTCCCGCATCTCAACTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCTTGGGTCCGTTATGCAGAGGACCATCGACTACGCCGCCGAGGCGT
TCGTGGCTTCCATTCAATCGGCTCTGGCTGTCAAGGCCGAGCTGGATGGAAGGGAAGTTCTGGCAGCAAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCC
TCCACCATGAAGGATGAGCTGCTGAAGGCTCACTTTGAGGTGGAGGCTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAGGAGGAGGACAGGCGCAAGGC
CCAACTCCGAGCTGCCCACGCTATTACCAGGGGCCTGGAGAGGGAGAAGTTCCAGCTCCTGAGGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGC
TGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTACTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGAT
TTTTCTGACGCGGGCTTCAAGTTCCTCATGAAAGGCATTGCTTCCGACATGCCCGACCTTCAGATCAATCTCAGCGGTCTGAAAAGGAGGTATGTCGAGAAGTGGGCGTC
TGGTCCTGGCGGCACCCCTGGCCCCAAGCGTTGGTGGATCAGTATGTCAGAGATCTGGACTCCGACTACTCTGATCCCGAAGAGGACCAGGTCGTCCCCACTCAAGAGGG
CGCTCCTCAAGCGGGCTCTTAGGCAACCACCCTTCACGAGGCTTTTCGCTGATCTTCCTCCCTTTTCTTTGGTTTTGTAA
Protein sequenceShow/hide protein sequence
MSHLGASMGVLHVSRVFPSPNIGPFSVWSDLDLAEKFIRLALDTWRLPIRGKIQPLRKIYRRNIQIFRRFGSQRGSLPLVDYTSRTLGRSVSSLSLSNVIAMSSSISSNL
GSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGSLSLRTSSSGFRRRGRELTILQRDGSLSTSKCLMAPNGWGVIFALAILFWLRARDSE
EAELKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELA
MVCGFASSVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPASVIELESSGDPSREKRPRDQTEVVDAQIEAVDAPPLGEEAREEAPLKRRRKKKKAISPLEVGACRVLP
ASFADRVDDPAARMGGTSDVTARFRIKPSSSGVRDQVSRISTASLDRCLRRASKFVSDLGSVMQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAAS
STMKDELLKAHFEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLREKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKD
FSDAGFKFLMKGIASDMPDLQINLSGLKRRYVEKWASGPGGTPGPKRWWISMSEIWTPTTLIPKRTRSSPLKRALLKRALRQPPFTRLFADLPPFSLVL