CuGenDBv2

Moc11g10510 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc11g10510
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: myosin heavy chain-related
Genome location: chr11:7586428..7588728
RNA-Seq Expression: Moc11g10510
Synteny: Moc11g10510
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA
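
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, it can be split into its parts in Python as shown here. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr11:7586428..7588728")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under a 0-based, half-open convention the span would instead be `end - start`, so the convention should be confirmed before relying on the length.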


Homology
GenBank top hits (e value, % identity, alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia] (e value: 4.2e-115; % identity: 87.8)
Query:  MCARKGAGGIVKGPTSIKGWMRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKERFPRGRKVRTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGW+RKWFYASGEWLAKDES              V+IRPVPELTQA FDTLKYYKE FPRGRKV TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWMRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKERFPRGRKVRTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKSATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEVREEVPLKRRR
        AVRPIESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSSK  TPAVVGPASEDP PVIELESS GPSREKRPRDQTEAVD  P GEEVREEVPLKRRR
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKSATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEVREEVPLKRRR

Query:  KKKKTTSPLEVGARGVLPASYADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQ
        KKKKTTSPLEVGARGVLPAS+ADRVDDPEARMGGT DV TRFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGARGVLPASYADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQ

XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia] (e value: 7.2e-83; % identity: 47.37)
Query:  VSIRPVPELTQAFFDTLKYYKERFPRGRKVRTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKSATPAV---
        +SI+P+PEL QA FDTLK+YK+ FPRGRK+ TLVTDKLLLESGLLDYNP VRPIE+SRPNSELAMVCGF S+VKRKSKGRAHAL+  QSS   TPAV   
Subjt:  VSIRPVPELTQAFFDTLKYYKERFPRGRKVRTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKSATPAV---

Query:  -----VGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGE--EVREEVPLKRRRKKKKTTSPLEVGARGVLPASYADRVDDPEARMGGTSDVMTR
              GP+S  PTPVIEL+S+G  SREKR R ++EA+D  P  E  E + E+ LKR  ++ K                                     
Subjt:  -----VGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGE--EVREEVPLKRRRKKKKTTSPLEVGARGVLPASYADRVDDPEARMGGTSDVMTR

Query:  FRVEPSSSGVRDQGVPHLGRKFGPLPKEGVQICERPRVARSNSLLCTFSQAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHS
                                                             A +++A A+   L         EKE+F    E      KD++L    
Subjt:  FRVEPSSSGVRDQGVPHLGRKFGPLPKEGVQICERPRVARSNSLLCTFSQAFVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHS

Query:  EVEILKAEALEAKEEELKRATAELETVKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKKYAPQWASGPSGTPGPQA
               +ALE K+  + R  AEL+  KERL+NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DLG LKK+YA +WASGP+GT GP +
Subjt:  EVEILKAEALEAKEEELKRATAELETVKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKKYAPQWASGPSGTPGPQA

Query:  LVDKYVRDLDSDYSDLEED--------QVGTTQEDAP
        LVDKYVRDLDSDYSDL+ED        +VGTTQE  P
Subjt:  LVDKYVRDLDSDYSDLEED--------QVGTTQEDAP

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] (e value: 3.9e-113; % identity: 81.72)
Query:  MFEYGLRLPLHPFVQE--------------------------------DSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWMR
        MFEYGLRLPLHPFVQE                                DSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGW+R
Subjt:  MFEYGLRLPLHPFVQE--------------------------------DSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWMR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKERFPRGRKVRTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKERFPRGRKV TLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKERFPRGRKVRTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSKSATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEV
         VKRKSKGRAHALEAAQSSK  TPAVVGPASEDP PVIELESSGGPSREKRPRDQTEAVDA     +V
Subjt:  NVKRKSKGRAHALEAAQSSKSATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEV

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] (e value: 3.2e-163; % identity: 85.63)
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRLPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSR+PEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRLPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQE--------------------------------DSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWMRKWFYA
        LRLPLHPFVQE                                DSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGW+RKWFYA
Subjt:  LRLPLHPFVQE--------------------------------DSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWMRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKERFPRGRKVRTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKERFPRGRKV TLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKERFPRGRKVRTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKSATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSK ATPAVVGPASEDP  VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKSATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] (e value: 4.9e-156; % identity: 60.08)
Query:  MCARKGAGGIVKGPTSIKGWMRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKERFPRGRKVRTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGW+ KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWMRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKERFPRGRKVRTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKSATPAV--------VGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEVRE
         VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++  TP V         GP+S  PTPVIEL+ SGG S EKR R+++EA+D  P   EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKSATPAV--------VGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGVLPASYADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQGVPHLGRKFGPLPKEGVQICERPRVARSNSLLCTFSQA
        E PL+RRRKKKKT+S  E GARG LP S+AD VDDPEARM GTS+V  RF +EPSSSGV+DQ             +   +    P      + +   ++A
Subjt:  EVPLKRRRKKKKTTSPLEVGARGVLPASYADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQGVPHLGRKFGPLPKEGVQICERPRVARSNSLLCTFSQA

Query:  FVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAE-------------------------------------------
        F+ASI  A+ VKAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AE                                           
Subjt:  FVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAE-------------------------------------------

Query:  --ALEAKEEELKRATAELETVKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKKYAPQWASGPSGTPGPQALVDKYV
           LE K+  + R T EL+ +KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKKKY+ +WASGP+GTP PQ+LVDKYV
Subjt:  --ALEAKEEELKRATAELETVKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKKYAPQWASGPSGTPGPQALVDKYV

Query:  RDLDSDYSDLEED--------QVGTTQEDAP
        R+LDSDYSD+EE+        +VGTTQE+ P
Subjt:  RDLDSDYSDLEED--------QVGTTQEDAP

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298 (e value: 2.0e-115; % identity: 87.8)
Query:  MCARKGAGGIVKGPTSIKGWMRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKERFPRGRKVRTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGW+RKWFYASGEWLAKDES              V+IRPVPELTQA FDTLKYYKE FPRGRKV TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWMRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKERFPRGRKVRTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKSATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEVREEVPLKRRR
        AVRPIESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSSK  TPAVVGPASEDP PVIELESS GPSREKRPRDQTEAVD  P GEEVREEVPLKRRR
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKSATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEVREEVPLKRRR

Query:  KKKKTTSPLEVGARGVLPASYADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQ
        KKKKTTSPLEVGARGVLPAS+ADRVDDPEARMGGT DV TRFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGARGVLPASYADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC111013826 (e value: 1.9e-113; % identity: 81.72)
Query:  MFEYGLRLPLHPFVQE--------------------------------DSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWMR
        MFEYGLRLPLHPFVQE                                DSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGW+R
Subjt:  MFEYGLRLPLHPFVQE--------------------------------DSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWMR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKERFPRGRKVRTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKERFPRGRKV TLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKERFPRGRKVRTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSKSATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEV
         VKRKSKGRAHALEAAQSSK  TPAVVGPASEDP PVIELESSGGPSREKRPRDQTEAVDA     +V
Subjt:  NVKRKSKGRAHALEAAQSSKSATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEV

A0A6J1DXS5 uncharacterized protein LOC111025502 (e value: 1.5e-163; % identity: 85.63)
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRLPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSR+PEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRLPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQE--------------------------------DSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWMRKWFYA
        LRLPLHPFVQE                                DSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGW+RKWFYA
Subjt:  LRLPLHPFVQE--------------------------------DSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWMRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKERFPRGRKVRTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA FDTLKYYKERFPRGRKV TLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKERFPRGRKVRTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKSATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSK ATPAVVGPASEDP  VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKSATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DXZ1 uncharacterized protein LOC111025606 (e value: 5.6e-81; % identity: 69.4)
Query:  MVCGFASNVKRKSKGRAHALEAAQSSKSATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEVREEVPLKRRRKKKKTTSPLEVGARG
        MVCGFAS+VKRKSKGRAHA EAAQSSK ATPAV GPASEDP PVIELESSGGPSREKRPRDQTEAVDALP GEEVREEVPLKRRRKKKKT SPLEVGA G
Subjt:  MVCGFASNVKRKSKGRAHALEAAQSSKSATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEVREEVPLKRRRKKKKTTSPLEVGARG

Query:  VLPASYADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQGVPHLGRKFGPLPKEGVQICERPRVARSNSLLCTFSQAFVASIQSALAVKAELDGREALAA
        VLPAS+ADRVDDPEARMGGTSDV  RFRV+PSS+GVRDQ             +   +    P      ++    ++AFVASIQSALAVKAELDGRE LAA
Subjt:  VLPASYADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQGVPHLGRKFGPLPKEGVQICERPRVARSNSLLCTFSQAFVASIQSALAVKAELDGREALAA

Query:  REKEEFSAALEAASSTMKDELLKAHSEVEILKAEALEAKEEELKRATAELETVKERLSNGVLLEESFR
        REKEEFS                           ALEAK++EL+ ATAELET KERLSNGVLLEESFR
Subjt:  REKEEFSAALEAASSTMKDELLKAHSEVEILKAEALEAKEEELKRATAELETVKERLSNGVLLEESFR

A0A6J1DZB3 uncharacterized protein LOC111025665 (e value: 2.4e-156; % identity: 60.08)
Query:  MCARKGAGGIVKGPTSIKGWMRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKERFPRGRKVRTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGW+ KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWMRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKERFPRGRKVRTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKSATPAV--------VGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEVRE
         VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++  TP V         GP+S  PTPVIEL+ SGG S EKR R+++EA+D  P   EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKSATPAV--------VGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGVLPASYADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQGVPHLGRKFGPLPKEGVQICERPRVARSNSLLCTFSQA
        E PL+RRRKKKKT+S  E GARG LP S+AD VDDPEARM GTS+V  RF +EPSSSGV+DQ             +   +    P      + +   ++A
Subjt:  EVPLKRRRKKKKTTSPLEVGARGVLPASYADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQGVPHLGRKFGPLPKEGVQICERPRVARSNSLLCTFSQA

Query:  FVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAE-------------------------------------------
        F+ASI  A+ VKAELDGREALAA+E+E   AALEAA +T+K ELLKA  EV+IL+AE                                           
Subjt:  FVASIQSALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAE-------------------------------------------

Query:  --ALEAKEEELKRATAELETVKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKKYAPQWASGPSGTPGPQALVDKYV
           LE K+  + R T EL+ +KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKKKY+ +WASGP+GTP PQ+LVDKYV
Subjt:  --ALEAKEEELKRATAELETVKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKKYAPQWASGPSGTPGPQALVDKYV

Query:  RDLDSDYSDLEED--------QVGTTQEDAP
        R+LDSDYSD+EE+        +VGTTQE+ P
Subjt:  RDLDSDYSDLEED--------QVGTTQEDAP

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G32010.1 myosin heavy chain-related (e value: 6.4e-05; % identity: 20.11)
Query:  RLESELEEIENFRFSDDGEDSDASTSGQGLEY------PSRLPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQ
        R+ ++ +   N    D+ E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F+ 
Subjt:  RLESELEEIENFRFSDDGEDSDASTSGQGLEY------PSRLPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQ

Query:  EDSEEAEL--------------------------LDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWMRKWFYA
              ++                          L V+ +       ++  K G+ Y+ + +G   +  GP+  + W+  +FYA
Subjt:  EDSEEAEL--------------------------LDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWMRKWFYA
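
The alignments above are shown in a BLAST-style three-line layout, where the middle line marks how each column matches. As a rough sketch, the counts behind an identity figure can be taken from that middle line in Python. The helper name `midline_stats` is hypothetical, and the code assumes the usual pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sample string is only the first 37 columns of the first GenBank alignment, so it is not expected to reproduce the reported % identity, which covers the full alignment.

```python
def midline_stats(midline):
    """Return (identities, positives, columns) for a BLAST-style midline."""
    # Assumption: letters are identical residues, '+' is a conservative
    # substitution, and anything else (spaces) is a mismatch or gap.
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# First 37 columns of the midline from the XP_022138041.1 alignment.
midline = "MCARKGA GIVKGPTSIKGW+RKWFYASGEWLAKDES"
identities, positives, columns = midline_stats(midline)
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
```

When applying this to whole alignments, the fixed-width `Query:` prefix must be stripped from each block by the same number of characters so the midline columns stay aligned.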


Sequences
CDS sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTGGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGCTACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTTCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTCGTCCAA
GAGGATAGTGAAGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGG
CGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGATGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTG
ACGTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTTCTTTGACACGCTGAAATACTACAAGGAGCGTTTTCCGAGGGGTAGG
AAGGTCAGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGACTACAACCCCGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCAT
GGTTTGCGGATTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAAAGTTCGAAATCTGCCACTCCCGCTGTGGTAGGGCCAGCCT
CGGAAGATCCAACCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCTTGCCTTCGGGCGAGGAG
GTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACAACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTACGCAGATCGGGTGGA
CGATCCTGAGGCCAGGATGGGCGGGACGTCCGATGTGATGACACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGGTGTCCCGCATCTCGGTCGCAAGT
TTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTAGCTCGGTCTAATTCTCTTCTTTGTACCTTTTCTCAGGCGTTTGTTGCTTCCATTCAATCA
GCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCT
GCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCGTGCGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCTCA
GCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCCTCC
GACATGCCTGACCTTCAGATCGATCTCGGTGGTCTGAAGAAGAAGTATGCTCCGCAGTGGGCGTCTGGGCCTAGTGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTA
CGTCAGAGATCTGGACTCTGACTACTCCGACCTCGAAGAGGATCAGGTCGGCACCACTCAAGAGGACGCTCCTCAAGCAGGCGCTTAG
mRNA sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTGGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGCTACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTTCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTCGTCCAA
GAGGATAGTGAAGAGGCCGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGG
CGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGATGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTG
ACGTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTTCTTTGACACGCTGAAATACTACAAGGAGCGTTTTCCGAGGGGTAGG
AAGGTCAGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGACTACAACCCCGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCAT
GGTTTGCGGATTTGCAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAAAGTTCGAAATCTGCCACTCCCGCTGTGGTAGGGCCAGCCT
CGGAAGATCCAACCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCTTGCCTTCGGGCGAGGAG
GTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACAACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTACGCAGATCGGGTGGA
CGATCCTGAGGCCAGGATGGGCGGGACGTCCGATGTGATGACACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGGTGTCCCGCATCTCGGTCGCAAGT
TTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTAGCTCGGTCTAATTCTCTTCTTTGTACCTTTTCTCAGGCGTTTGTTGCTTCCATTCAATCA
GCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCT
GCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCGTGCGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCTCA
GCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCCTCC
GACATGCCTGACCTTCAGATCGATCTCGGTGGTCTGAAGAAGAAGTATGCTCCGCAGTGGGCGTCTGGGCCTAGTGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTA
CGTCAGAGATCTGGACTCTGACTACTCCGACCTCGAAGAGGATCAGGTCGGCACCACTCAAGAGGACGCTCCTCAAGCAGGCGCTTAG
Protein sequence
MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRLPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQ
EDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWMRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQAFFDTLKYYKERFPRGR
KVRTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKSATPAVVGPASEDPTPVIELESSGGPSREKRPRDQTEAVDALPSGEE
VREEVPLKRRRKKKKTTSPLEVGARGVLPASYADRVDDPEARMGGTSDVMTRFRVEPSSSGVRDQGVPHLGRKFGPLPKEGVQICERPRVARSNSLLCTFSQAFVASIQS
ALAVKAELDGREALAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEALEAKEEELKRATAELETVKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIAS
DMPDLQIDLGGLKKKYAPQWASGPSGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEDAPQAGA
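
The CDS and protein sequences above can be cross-checked by translating the CDS. The following Python sketch does this for short fragments, assuming the standard genetic code (an assumption; the page does not name the translation table). The names `CODON_TABLE` and `translate` are illustrative only.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS listed above.
cds_prefix = "ATGTCGTCCTCTTTTAGCAGCAACTTAGGA"
cds_suffix = "CAAGCAGGCGCTTAG"
print(translate(cds_prefix))
print(translate(cds_suffix))
```

To check the whole record, the wrapped CDS lines would be joined into one string before calling `translate`, and the result compared with the joined protein lines.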