; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g14830 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g14830
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr2:10989109..10991778
RNA-Seq ExpressionMoc02g14830
SyntenyMoc02g14830
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]4.9e-11389.17Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDESV+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL

Query:  AMVCGFASSVKRKSKGRAHALEAAQSSKPAAPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGDEVREEAPLKRRRKKKKVISPLEAGAC
        AMVCGFAS+VKRKSKG+AHALEAAQSSKP  PAV GPASEDPAPVIELESS GPSREKRPRDQTEAVD  PLG+EVREE PLKRRRKKKK  SPLE GA 
Subjt:  AMVCGFASSVKRKSKGRAHALEAAQSSKPAAPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGDEVREEAPLKRRRKKKKVISPLEAGAC

Query:  GVLLASFADWVDDPAARMGGTSDVTARFRVQPSSAGLRDQ
        GVL ASFAD VDDP ARMGGT DVT RFRV+PSS+G+RDQ
Subjt:  GVLLASFADWVDDPAARMGGTSDVTARFRVQPSSAGLRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]6.0e-12791.19Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGWFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAK+IAKKPG FYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGWFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDES              VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  SVKRKSKGRAHALEAAQSSKPAAPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA
         VKRKSKGRAHALEAAQSSKP  PAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVDA
Subjt:  SVKRKSKGRAHALEAAQSSKPAAPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]7.0e-12891.27Show/hide
Query:  GTSDVTARFRVQPSSAGLRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ R++PSS+G+RDQVSRISAASL+RCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVQPSSAGLRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVETQAELLKKEEDGRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLKAEVE+QAELLKKEED R+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAEVETQAELLKKEEDGRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFPDSGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTSGPQALVDQYVRDLDSDYSDLEEDQVGT
        GFAKDF D+GFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGT GPQALVDQYVRDLDSDYSD EEDQVG+
Subjt:  GFAKDFPDSGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTSGPQALVDQYVRDLDSDYSDLEEDQVGT

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]2.0e-17591.27Show/hide
Query:  MSSSFSSNLRSDEDLAHRLESELEEIENFRFSDDGEDSDASTSGQVLEYPSRIPEHYLGSLRRGFAISENILLRLPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL S  DLA RLES+LEEIEN R SDDGEDSDASTSGQ LEYPSRIPEHYLGSLRRGFAI ENILLRLPEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLRSDEDLAHRLESELEEIENFRFSDDGEDSDASTSGQVLEYPSRIPEHYLGSLRRGFAISENILLRLPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGWFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAK+IAKKPG FYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGWFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRK
        SGEWLAKDES              VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRK

Query:  SKGRAHALEAAQSSKPAAPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPA PAV GPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPAAPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]3.0e-17968.16Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDES              VSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPAAPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGDEVRE
         VR IE+SRPNSELAMVCGF  SVKRKSKGRAHAL+    ++P  P V        +GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL +EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPAAPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGDEVRE

Query:  EAPLKRRRKKKKVISPLEAGACGVLLASFADWVDDPAARMGGTSDVTARFRVQPSSAGLRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E+PL+RRRKKKK  S  EAGA G L  S AD VDDP ARM GTS+V  RF ++PSS+G++DQVSRISA  L+R LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EAPLKRRRKKKKVISPLEAGACGVLLASFADWVDDPAARMGGTSDVTARFRVQPSSAGLRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVETQAELLKKEEDGRKAQLRAAHAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+ + +LLKKE +  KA LRAAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVETQAELLKKEEDGRKAQLRAAHAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFPDSGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTSGPQALVDQYVR
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDF D+GFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GT  PQ+LVD+YVR
Subjt:  QALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFPDSGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTSGPQALVDQYVR

Query:  DLDSDYSDLEED
        +LDSDYSD+EE+
Subjt:  DLDSDYSDLEED

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092982.4e-11389.17Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDESV+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSEL

Query:  AMVCGFASSVKRKSKGRAHALEAAQSSKPAAPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGDEVREEAPLKRRRKKKKVISPLEAGAC
        AMVCGFAS+VKRKSKG+AHALEAAQSSKP  PAV GPASEDPAPVIELESS GPSREKRPRDQTEAVD  PLG+EVREE PLKRRRKKKK  SPLE GA 
Subjt:  AMVCGFASSVKRKSKGRAHALEAAQSSKPAAPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGDEVREEAPLKRRRKKKKVISPLEAGAC

Query:  GVLLASFADWVDDPAARMGGTSDVTARFRVQPSSAGLRDQ
        GVL ASFAD VDDP ARMGGT DVT RFRV+PSS+G+RDQ
Subjt:  GVLLASFADWVDDPAARMGGTSDVTARFRVQPSSAGLRDQ

A0A6J1CR42 uncharacterized protein LOC1110138262.9e-12791.19Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGWFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAK+IAKKPG FYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGWFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDES              VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  SVKRKSKGRAHALEAAQSSKPAAPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA
         VKRKSKGRAHALEAAQSSKP  PAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVDA
Subjt:  SVKRKSKGRAHALEAAQSSKPAAPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA

A0A6J1D971 uncharacterized protein LOC1110185383.4e-12891.27Show/hide
Query:  GTSDVTARFRVQPSSAGLRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ R++PSS+G+RDQVSRISAASL+RCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVQPSSAGLRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVETQAELLKKEEDGRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLKAEVE+QAELLKKEED R+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAEVETQAELLKKEEDGRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFPDSGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTSGPQALVDQYVRDLDSDYSDLEEDQVGT
        GFAKDF D+GFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGT GPQALVDQYVRDLDSDYSD EEDQVG+
Subjt:  GFAKDFPDSGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTSGPQALVDQYVRDLDSDYSDLEEDQVGT

A0A6J1DXS5 uncharacterized protein LOC1110255029.8e-17691.27Show/hide
Query:  MSSSFSSNLRSDEDLAHRLESELEEIENFRFSDDGEDSDASTSGQVLEYPSRIPEHYLGSLRRGFAISENILLRLPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL S  DLA RLES+LEEIEN R SDDGEDSDASTSGQ LEYPSRIPEHYLGSLRRGFAI ENILLRLPEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLRSDEDLAHRLESELEEIENFRFSDDGEDSDASTSGQVLEYPSRIPEHYLGSLRRGFAISENILLRLPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGWFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAK+IAKKPG FYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGWFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRK
        SGEWLAKDES              VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRK

Query:  SKGRAHALEAAQSSKPAAPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPA PAV GPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPAAPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256651.5e-17968.16Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDES              VSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPAAPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGDEVRE
         VR IE+SRPNSELAMVCGF  SVKRKSKGRAHAL+    ++P  P V        +GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL +EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPAAPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGDEVRE

Query:  EAPLKRRRKKKKVISPLEAGACGVLLASFADWVDDPAARMGGTSDVTARFRVQPSSAGLRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E+PL+RRRKKKK  S  EAGA G L  S AD VDDP ARM GTS+V  RF ++PSS+G++DQVSRISA  L+R LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EAPLKRRRKKKKVISPLEAGACGVLLASFADWVDDPAARMGGTSDVTARFRVQPSSAGLRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVETQAELLKKEEDGRKAQLRAAHAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+ + +LLKKE +  KA LRAAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVETQAELLKKEEDGRKAQLRAAHAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFPDSGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTSGPQALVDQYVR
        Q LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDF D+GFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GT  PQ+LVD+YVR
Subjt:  QALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFPDSGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTSGPQALVDQYVR

Query:  DLDSDYSDLEED
        +LDSDYSD+EE+
Subjt:  DLDSDYSDLEED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15420.1 myosin heavy chain-related1.9e-0631.3Show/hide
Query:  ILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKP
        I L  P+  +R   PPEG++ LY   F   GL  PL  F+ E+  R  +A +Q+          LAIL       +E    +D D         ++ + P
Subjt:  ILLRLPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKP

Query:  GWFYMCARKGAGGIVKGPTS-IKGWVRKWFY
        G +Y  A K    IV G  S I GW R++F+
Subjt:  GWFYMCARKGAGGIVKGPTS-IKGWVRKWFY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCATCTTGGAGCACCAATAGGGGTCCTCCACGTGTCTAGGGTATTCTCTTCCCCAAACATTGGCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTC
ATTCGATTCGCTTTGGACGCGTGGCGACTTCCTATTCGTGGGAAAATATAACCGTCGCAGAAGATTTATCGTCGGAATATTCAAATATTCCGACGTTTCGGATCTTAGGG
AGGATCCTAGCCGCTCGTTGATTACACGTGCAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGC
AACTTAAGATCCGATGAGGACTTAGCTCATAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGATTCTCCGATGACGGGGAGGATAGTGACGCCTCCACTTCAGG
TCAGGTTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCTCTGAGAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGA
GAGCTGACAATCCTCCGGAGGGATGGGTCACTCTATACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGG
TTGGCTCCGGCCCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACGTAGA
CCAGCTCCTCGCGTGCTTCGAAGCGAAAAAGATAGCTAAGAAGCCTGGTTGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAAGGGCCGACCTCCATCA
AGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTCGCAAAGGATGAGTCAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTG
AAATATTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCAT
CGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGA
AACCTGCCGCCCCTGCTGTGGCAGGGCCTGCTTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACC
GAGGCGGTGGACGCCCTGCCTTTGGGCGATGAGGTGAGGGAGGAAGCCCCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGTGATCTCCCCCTTGGAGGCCGGAGCTTGCGG
GGTCTTGCTTGCAAGTTTCGCAGATTGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGATGTGACGGCACGGTTCAGAGTTCAGCCGTCAAGTGCCGGGTTGA
GGGACCAGGTGTCCCGCATTTCGGCTGCAAGTTTGAACCGCTGCTTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCC
GCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGA
GGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGTGGAGACCCAGGCCGAGCTGCTGAAGAAGGAAGAGGACG
GGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAG
GATAAGGAGCTGGAGCATGCGACTGCAGAGCTGGAGACGGCGAAGGAGCGTCTCAGCAATGGAGTCCTACTGGAGGAGTCGTTTAGGCAGCATCCTGACTTCGATGGATT
TGCCAAGGACTTCCCTGACTCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGA
AGTGGGCGTCTGGGCCTGGCGGCACCTCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCACC
GCATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTCATCTTGGAGCACCAATAGGGGTCCTCCACGTGTCTAGGGTATTCTCTTCCCCAAACATTGGCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTC
ATTCGATTCGCTTTGGACGCGTGGCGACTTCCTATTCGTGGGAAAATATAACCGTCGCAGAAGATTTATCGTCGGAATATTCAAATATTCCGACGTTTCGGATCTTAGGG
AGGATCCTAGCCGCTCGTTGATTACACGTGCAGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGC
AACTTAAGATCCGATGAGGACTTAGCTCATAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGATTCTCCGATGACGGGGAGGATAGTGACGCCTCCACTTCAGG
TCAGGTTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCTCTGAGAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGA
GAGCTGACAATCCTCCGGAGGGATGGGTCACTCTATACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGG
TTGGCTCCGGCCCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACGTAGA
CCAGCTCCTCGCGTGCTTCGAAGCGAAAAAGATAGCTAAGAAGCCTGGTTGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAAGGGCCGACCTCCATCA
AGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTCGCAAAGGATGAGTCAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTG
AAATATTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCAT
CGAATCCTCAAGGCCGAACTCTGAACTTGCCATGGTTTGCGGATTTGCAAGCAGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGA
AACCTGCCGCCCCTGCTGTGGCAGGGCCTGCTTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACC
GAGGCGGTGGACGCCCTGCCTTTGGGCGATGAGGTGAGGGAGGAAGCCCCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGTGATCTCCCCCTTGGAGGCCGGAGCTTGCGG
GGTCTTGCTTGCAAGTTTCGCAGATTGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGATGTGACGGCACGGTTCAGAGTTCAGCCGTCAAGTGCCGGGTTGA
GGGACCAGGTGTCCCGCATTTCGGCTGCAAGTTTGAACCGCTGCTTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCC
GCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGA
GGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGTGGAGACCCAGGCCGAGCTGCTGAAGAAGGAAGAGGACG
GGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAG
GATAAGGAGCTGGAGCATGCGACTGCAGAGCTGGAGACGGCGAAGGAGCGTCTCAGCAATGGAGTCCTACTGGAGGAGTCGTTTAGGCAGCATCCTGACTTCGATGGATT
TGCCAAGGACTTCCCTGACTCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGA
AGTGGGCGTCTGGGCCTGGCGGCACCTCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCACC
GCATAG
Protein sequenceShow/hide protein sequence
MSHLGAPIGVLHVSRVFSSPNIGPSLSGPISTWQRSSFDSLWTRGDFLFVGKYNRRRRFIVGIFKYSDVSDLREDPSRSLITRAARTLGRSVSSLTSLSNVVAMSSSFSS
NLRSDEDLAHRLESELEEIENFRFSDDGEDSDASTSGQVLEYPSRIPEHYLGSLRRGFAISENILLRLPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTG
LAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKKIAKKPGWFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESVSIRPVPELTQASFDTL
KYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRPIESSRPNSELAMVCGFASSVKRKSKGRAHALEAAQSSKPAAPAVAGPASEDPAPVIELESSGGPSREKRPRDQT
EAVDALPLGDEVREEAPLKRRRKKKKVISPLEAGACGVLLASFADWVDDPAARMGGTSDVTARFRVQPSSAGLRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYA
AEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVETQAELLKKEEDGRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAK
DKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFPDSGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTSGPQALVDQYVRDLDSDYSDLEEDQVGT
A