; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g13280 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g13280
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr4:10315125..10318107
RNA-Seq ExpressionMoc04g13280
SyntenyMoc04g13280
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]2.8e-11380.59Show/hide
Query:  MFEYGLRLPLHPIVQEFLFQTGLAPAQVAPNRWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHP VQEFLF+TGLAPAQVAPN WGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPIVQEFLFQTGLAPAQVAPNRWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPR-------------------------------------AMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPR                                     AMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPR-------------------------------------AMVCGFAS

Query:  NVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE
         VKRKSKGRAHALEAAQ+SKP TPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV       DV PLGE
Subjt:  NVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]6.7e-12386.32Show/hide
Query:  GTSDVTARFRVEPSSSGVRDQVSRISSASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVSRIS+ASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISSASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVETKVELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHVTAELETAKERLSNGVLLEESFKQHPDFD
        ELLKAHSEVE LKAEVE++ ELLKKEEDRR+AQLRAAHAITRGLE       KEKDDMLQALEAKD+EL+H TAELETAKERLSNGVLLEE+F+QHPDFD
Subjt:  ELLKAHSEVEILKAEVETKVELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHVTAELETAKERLSNGVLLEESFKQHPDFD

Query:  GFAKDFSDAAFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQGGALQASS
        GFAKDFSDA FKFLMKGIASDMPDLQIDLSGLK+RYAE+WASGPGGTPGPQALVD+YVRDLDSDYSD EEDQVG+TQ GA    S
Subjt:  GFAKDFSDAAFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQGGALQASS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]2.6e-15983.99Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENILLRIPEEGERADNPPDGWVTLYFKMFEY
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDS DASTSGQGLEYPSRIPEHYL SLRRGFAIPENILLR+PEEGERADNPP+GWVTLYFKMFEY
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENILLRIPEEGERADNPPDGWVTLYFKMFEY

Query:  GLRLPLHPIVQEFLFQTGLAPAQVAPNRWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFY
        GLRLPLHP VQEFLF+TGLAPAQVAPN WGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFY
Subjt:  GLRLPLHPIVQEFLFQTGLAPAQVAPNRWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFY

Query:  ASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPR-------------------------------------AMVCGFASNVKR
        ASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPR                                     AMVCGFAS VKR
Subjt:  ASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPR-------------------------------------AMVCGFASNVKR

Query:  KSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        KSKGRAHALEAAQ+SKPATPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  KSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159185.1 uncharacterized protein LOC111025606 [Momordica charantia]5.0e-10273.44Show/hide
Query:  MVCGFASNVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVRARG
        MVCGFAS+VKRKSKGRAHA EAAQ+SKPATPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVD  PLGEEVREEVPLKRRRKKKKT SPLEV A G
Subjt:  MVCGFASNVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVRARG

Query:  VLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISSASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR
        VLPASFADRVDDPEARMGGTSDVTARFRV+PSS+GVRDQVSRIS+ASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR
Subjt:  VLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISSASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR

Query:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKVELLKKEEDRRKAQLRAAHAITRGLEKEKDDMLQALEAKDEELKHVTAELETAKERLSNGVLL
        EKEEFS                                                                 ALEAKD+EL+H TAELETAKERLSNGVLL
Subjt:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKVELLKKEEDRRKAQLRAAHAITRGLEKEKDDMLQALEAKDEELKHVTAELETAKERLSNGVLL

Query:  EESFK
        EESF+
Subjt:  EESFK

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.3e-17165.21Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPR-----------------------
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+HFPR                       
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPR-----------------------

Query:  --------------AMVCGFASNVKRKSKGRAHALEAAQNSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE
                      AMVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR 
Subjt:  --------------AMVCGFASNVKRKSKGRAHALEAAQNSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISSASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E  ARG LP S AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRIS+  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISSASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKVELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+ KV+LLKKE ++ KA LRAAHAIT+GLE       KEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKVELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDML

Query:  QALEAKDEELKHVTAELETAKERLSNGVLLEESFKQHPDFDGFAKDFSDAAFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDKYVR
        Q LE KD  +  +T EL+  KERL+NG LLEESF+QHPDFDGFAKDFSDA FKFLMKGIA+DMP LQIDL+GLKK+Y+E+WASGP GTP PQ+LVDKYVR
Subjt:  QALEAKDEELKHVTAELETAKERLSNGVLLEESFKQHPDFDGFAKDFSDAAFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDKYVR

Query:  DLDSDYSDLEED--------QVGTTQ
        +LDSDYSD+EE+        +VGTTQ
Subjt:  DLDSDYSDLEED--------QVGTTQ

TrEMBL top hitse value%identityAlignment
A0A6J1CR42 uncharacterized protein LOC1110138261.4e-11380.59Show/hide
Query:  MFEYGLRLPLHPIVQEFLFQTGLAPAQVAPNRWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHP VQEFLF+TGLAPAQVAPN WGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPIVQEFLFQTGLAPAQVAPNRWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPR-------------------------------------AMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPR                                     AMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPR-------------------------------------AMVCGFAS

Query:  NVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE
         VKRKSKGRAHALEAAQ+SKP TPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV       DV PLGE
Subjt:  NVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE

A0A6J1D971 uncharacterized protein LOC1110185383.2e-12386.32Show/hide
Query:  GTSDVTARFRVEPSSSGVRDQVSRISSASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVSRIS+ASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISSASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVETKVELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHVTAELETAKERLSNGVLLEESFKQHPDFD
        ELLKAHSEVE LKAEVE++ ELLKKEEDRR+AQLRAAHAITRGLE       KEKDDMLQALEAKD+EL+H TAELETAKERLSNGVLLEE+F+QHPDFD
Subjt:  ELLKAHSEVEILKAEVETKVELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHVTAELETAKERLSNGVLLEESFKQHPDFD

Query:  GFAKDFSDAAFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQGGALQASS
        GFAKDFSDA FKFLMKGIASDMPDLQIDLSGLK+RYAE+WASGPGGTPGPQALVD+YVRDLDSDYSD EEDQVG+TQ GA    S
Subjt:  GFAKDFSDAAFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQGGALQASS

A0A6J1DXS5 uncharacterized protein LOC1110255021.3e-15983.99Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENILLRIPEEGERADNPPDGWVTLYFKMFEY
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDS DASTSGQGLEYPSRIPEHYL SLRRGFAIPENILLR+PEEGERADNPP+GWVTLYFKMFEY
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENILLRIPEEGERADNPPDGWVTLYFKMFEY

Query:  GLRLPLHPIVQEFLFQTGLAPAQVAPNRWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFY
        GLRLPLHP VQEFLF+TGLAPAQVAPN WGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFY
Subjt:  GLRLPLHPIVQEFLFQTGLAPAQVAPNRWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFY

Query:  ASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPR-------------------------------------AMVCGFASNVKR
        ASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKE FPR                                     AMVCGFAS VKR
Subjt:  ASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPR-------------------------------------AMVCGFASNVKR

Query:  KSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        KSKGRAHALEAAQ+SKPATPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  KSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DXZ1 uncharacterized protein LOC1110256062.4e-10273.44Show/hide
Query:  MVCGFASNVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVRARG
        MVCGFAS+VKRKSKGRAHA EAAQ+SKPATPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVD  PLGEEVREEVPLKRRRKKKKT SPLEV A G
Subjt:  MVCGFASNVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVRARG

Query:  VLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISSASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR
        VLPASFADRVDDPEARMGGTSDVTARFRV+PSS+GVRDQVSRIS+ASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR
Subjt:  VLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISSASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR

Query:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKVELLKKEEDRRKAQLRAAHAITRGLEKEKDDMLQALEAKDEELKHVTAELETAKERLSNGVLL
        EKEEFS                                                                 ALEAKD+EL+H TAELETAKERLSNGVLL
Subjt:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKVELLKKEEDRRKAQLRAAHAITRGLEKEKDDMLQALEAKDEELKHVTAELETAKERLSNGVLL

Query:  EESFK
        EESF+
Subjt:  EESFK

A0A6J1DZB3 uncharacterized protein LOC1110256656.5e-17265.21Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPR-----------------------
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+HFPR                       
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKEHFPR-----------------------

Query:  --------------AMVCGFASNVKRKSKGRAHALEAAQNSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE
                      AMVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR 
Subjt:  --------------AMVCGFASNVKRKSKGRAHALEAAQNSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISSASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E  ARG LP S AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRIS+  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISSASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKVELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+ KV+LLKKE ++ KA LRAAHAIT+GLE       KEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKVELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDML

Query:  QALEAKDEELKHVTAELETAKERLSNGVLLEESFKQHPDFDGFAKDFSDAAFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDKYVR
        Q LE KD  +  +T EL+  KERL+NG LLEESF+QHPDFDGFAKDFSDA FKFLMKGIA+DMP LQIDL+GLKK+Y+E+WASGP GTP PQ+LVDKYVR
Subjt:  QALEAKDEELKHVTAELETAKERLSNGVLLEESFKQHPDFDGFAKDFSDAAFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDKYVR

Query:  DLDSDYSDLEED--------QVGTTQ
        +LDSDYSD+EE+        +VGTTQ
Subjt:  DLDSDYSDLEED--------QVGTTQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15420.1 myosin heavy chain-related6.7e-0424.53Show/hide
Query:  PENILLRIPEEGERADNPPDGWVTLYFKMF-EYGLRLPLHPIVQEFLFQTGLAPAQVAPNRWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIA
        P  I L  P+  +R   PP+G++ LY   F   GL  PL   + E+  +  +A +Q+          LAIL       +E    +D D         R+ 
Subjt:  PENILLRIPEEGERADNPPDGWVTLYFKMF-EYGLRLPLHPIVQEFLFQTGLAPAQVAPNRWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIA

Query:  KKPGRFYMCARKGAGGIVKGPTS-IKGWVRKWFYAS--------------GEWLAKDESGRSFFDVPTRF-GNLVSIRPV----------PELTQAS---
        + PG +Y  A K    IV G  S I GW R++F+                 +W    E      D P  F  N+  IR +          PE  Q     
Subjt:  KKPGRFYMCARKGAGGIVKGPTS-IKGWVRKWFYAS--------------GEWLAKDESGRSFFDVPTRF-GNLVSIRPV----------PELTQAS---

Query:  ---FDTLKYYKEHF-----------PRAMVCGFASNVKRK---SKGRAHALEAAQNSKPATPAVVGPASED---PAPVIE---LESSGG--PSREKRPRD
              L  + + F           P  +V     +V+ +   S GR  A E+A        +   P +ED      V+    L S GG  PS+++  RD
Subjt:  ---FDTLKYYKEHF-----------PRAMVCGFASNVKRK---SKGRAHALEAAQNSKPATPAVVGPASED---PAPVIE---LESSGG--PSREKRPRD

Query:  QTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVRARGVLPASFA--DRVDDPEARMGGTS--DVTARFRVEPSSSGVRDQVSRISSASLDRCLRR---
          E             +VP   RR         E   RG +   F+   +  D       TS  D+ +R R      G  D  S     S+DR + R   
Subjt:  QTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVRARGVLPASFA--DRVDDPEARMGGTS--DVTARFRVEPSSSGVRDQVSRISSASLDRCLRR---

Query:  ---ASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR--EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKVELLKKEEDRRK
           A K     G+  + +     +A V++ + A    AE +  + LA     + E SA LE  SS + +++    S V+    E   ++E L K      
Subjt:  ---ASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR--EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKVELLKKEEDRRK

Query:  AQLRAAHAITRGLEKEKDD--------MLQALEAKDEELKHVT-AELETAKERLSNGV-LLEESFKQHPDFDGFAKDFSDAAFKFLMKGIA
        A+LR +       E++K D         L+ L  K   +   T  ELE  +  L NGV  LE +     D D F +  + A    L+ GI+
Subjt:  AQLRAAHAITRGLEKEKDD--------MLQALEAKDEELKHVT-AELETAKERLSNGV-LLEESFKQHPDFDGFAKDFSDAAFKFLMKGIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGGTAGTCCCGCTAATCTAGCGACGGTTACACCCGGTAATCTCGGGACCGACGGTTACACCCGTGGTGGTGATCTCGGCGGTCCGAGCTGGGGCATGACTCATGG
TCACCTTGGGGCACCAATGGTTGTCCTCCACGTGTCCAGGACACGTGGCGACTCCCTATTCGTGGGAAAACATAACCGTTGCGGTGGATTTATCATCGGAATATTCAAAT
ATTCCGACGCTTCGGATCTCCGGGAGGATCCTAGCCGCTCGTTGATTACACATTTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCAAACGTAGTTGCC
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCAGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCC
TTAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGATGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCCGGCTTCCCCTTCACCCTATTGTC
CAAGAATTTCTCTTCCAGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATAGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGA
AGAGGCAGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTA
TAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCGGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACT
AGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGCCATGGTTTGCGGGTT
TGCGAGTAACGTGAAACGCAAGTCCAAGGGTCGAGCCCATGCTCTTGAGGCCGCCCAGAATTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAG
CCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCTAGGGATCAGACCGAGGCGGTGGACGTCTCGCCGTTGGGCGAGGAGGTAAGAGAGGAA
GTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCAGAGCTCGTGGGGTCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAAGC
CAGGATGGGCGGGACGTCCGACGTGACAGCACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGTCTGCAAGTTTGGACCGCTGCC
TAAGGAGGGCGTCCAAATTTGTAAGTGACCCAGGGTCCGTTCTACAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAG
GCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTC
TGAGGTGGAAATTTTGAAGGCCGAGGTGGAGACCAAGGTCGAGCTGCTGAAGAAGGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAGGGGCT
TGGAGAAGGAGAAGGATGACATGCTCCAGGCGCTTGAAGCGAAGGATGAGGAGCTGAAGCATGTGACTGCCGAGCTGGAGACGGCGAAGGAGCGTCTCAGCAATGGAGTC
CTATTGGAGGAATCGTTTAAGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGCCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCTGA
CCTTCAGATCGATCTCAGTGGTCTAAAAAAGAGGTATGCCGAGCAGTGGGCATCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGTCAGAGATC
TGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCACCACTCAGGGGGGCGCTCTTCAGGCAAGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTGGTAGTCCCGCTAATCTAGCGACGGTTACACCCGGTAATCTCGGGACCGACGGTTACACCCGTGGTGGTGATCTCGGCGGTCCGAGCTGGGGCATGACTCATGG
TCACCTTGGGGCACCAATGGTTGTCCTCCACGTGTCCAGGACACGTGGCGACTCCCTATTCGTGGGAAAACATAACCGTTGCGGTGGATTTATCATCGGAATATTCAAAT
ATTCCGACGCTTCGGATCTCCGGGAGGATCCTAGCCGCTCGTTGATTACACATTTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCAAACGTAGTTGCC
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCAGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCC
TTAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGATGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCCGGCTTCCCCTTCACCCTATTGTC
CAAGAATTTCTCTTCCAGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATAGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGA
AGAGGCAGAGCTGTTGGACGTAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTA
TAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCGGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACT
AGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGCCATGGTTTGCGGGTT
TGCGAGTAACGTGAAACGCAAGTCCAAGGGTCGAGCCCATGCTCTTGAGGCCGCCCAGAATTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAG
CCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCTAGGGATCAGACCGAGGCGGTGGACGTCTCGCCGTTGGGCGAGGAGGTAAGAGAGGAA
GTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCAGAGCTCGTGGGGTCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAAGC
CAGGATGGGCGGGACGTCCGACGTGACAGCACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGTCTGCAAGTTTGGACCGCTGCC
TAAGGAGGGCGTCCAAATTTGTAAGTGACCCAGGGTCCGTTCTACAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAG
GCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTC
TGAGGTGGAAATTTTGAAGGCCGAGGTGGAGACCAAGGTCGAGCTGCTGAAGAAGGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAGGGGCT
TGGAGAAGGAGAAGGATGACATGCTCCAGGCGCTTGAAGCGAAGGATGAGGAGCTGAAGCATGTGACTGCCGAGCTGGAGACGGCGAAGGAGCGTCTCAGCAATGGAGTC
CTATTGGAGGAATCGTTTAAGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGCCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCTGA
CCTTCAGATCGATCTCAGTGGTCTAAAAAAGAGGTATGCCGAGCAGTGGGCATCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGTCAGAGATC
TGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCACCACTCAGGGGGGCGCTCTTCAGGCAAGCTCTTAG
Protein sequenceShow/hide protein sequence
MGGSPANLATVTPGNLGTDGYTRGGDLGGPSWGMTHGHLGAPMVVLHVSRTRGDSLFVGKHNRCGGFIIGIFKYSDASDLREDPSRSLITHFEPLVGRSLPSLSLSNVVA
MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDDASTSGQGLEYPSRIPEHYLRSLRRGFAIPENILLRIPEEGERADNPPDGWVTLYFKMFEYGLRLPLHPIV
QEFLFQTGLAPAQVAPNRWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPT
RFGNLVSIRPVPELTQASFDTLKYYKEHFPRAMVCGFASNVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREE
VPLKRRRKKKKTTSPLEVRARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISSASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVK
AELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKVELLKKEEDRRKAQLRAAHAITRGLEKEKDDMLQALEAKDEELKHVTAELETAKERLSNGV
LLEESFKQHPDFDGFAKDFSDAAFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQGGALQASS