CuGenDBv2

Moc08g44700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g44700
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: myosin heavy chain-related
Genome location: chr8:34321598..34324286
RNA-Seq Expression: Moc08g44700
Synteny: Moc08g44700
Gene Ontology terms:
GO:0016021 - integral component of membrane (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: NA
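
The genome location field above can be split into its parts programmatically. The following is a minimal sketch, assuming the `chromosome:start..end` layout shown on this page and assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError("unrecognized location string: " + repr(location))
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr8:34321598..34324286")
print(chrom, start, end, span_length(start, end))
```

If the database instead uses 0-based half-open coordinates, the `+ 1` in `span_length` would have to be dropped.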


Homology
GenBank top hits | e value | % identity | Alignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | 2.3e-126 | 92.46
Query:  VFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        +FEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDV+QLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  VFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESDRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSELVMVCGFAS
        KWFYASGEWLAKDES RSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVR IE SRPNS L MVC FAS
Subjt:  KWFYASGEWLAKDESDRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSELVMVCGFAS

Query:  SVKRKSKGRAHALEAAQSSKPATPAVAGPASEDAAPVIELESSGVPRGRSAP
         VKRKSKGRAHALEAAQSSKP TPAV GPASED APVIELESSG P     P
Subjt:  SVKRKSKGRAHALEAAQSSKPATPAVAGPASEDAAPVIELESSGVPRGRSAP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] | 7.7e-130 | 91.23
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEDASSTIKD
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASST+KD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEDASSTIKD

Query:  ELLKAHSEVETLKVEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLK EVESQAELLKKEEDRR+AQLRAAHAITRGLE       KEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKVEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSSLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGALQAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLS LKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSSLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGALQAGS

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia] | 6.2e-103 | 97.4
Query:  VFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        +FEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDV+QLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  VFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESDRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSEL
        KWFYASGEWLAKDES RSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVR IESSRPNSEL
Subjt:  KWFYASGEWLAKDESDRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSEL

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | 4.5e-178 | 92.75
Query:  MSSSFSSNLGFDLARRLESELEEVENFRLSDDGEDSDASTSGHGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKVFEYGLR
        MSSS SSNL  DLARRLES+LEE+EN R+SDDGEDSDASTSG GLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFK+FEYGLR
Subjt:  MSSSFSSNLGFDLARRLESELEEVENFRLSDDGEDSDASTSGHGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKVFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DV+QLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESDRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSELVMVCGFASSVKRKSK
        EWLAKDES RSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVR IESSRPNSEL MVCGFAS VKRKSK
Subjt:  EWLAKDESDRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSELVMVCGFASSVKRKSK

Query:  GRAHALEAAQSSKPATPAVAGPASEDAAPVIELESSGVPRGRSAP
        GRAHALEAAQSSKPATPAV GPASED A VIELESSG P     P
Subjt:  GRAHALEAAQSSKPATPAVAGPASEDAAPVIELESSGVPRGRSAP

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | 5.5e-168 | 64.45
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESDRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDES R+FFDVPTRFGNLVSI+L+PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESDRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRLIESSRPNSELVMVCGFASSVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDAAPVIELESSGVPRGRS-----------APGIRPRRW
         VRLIE+SRPNSEL MVCGF  SVKRKSKGRAHAL+    ++P TP V        +GP+S    PVIEL+ SG   G             +P    R  
Subjt:  AVRLIESSRPNSELVMVCGFASSVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDAAPVIELESSGVPRGRS-----------APGIRPRRW

Query:  TPRPRRWTPRFWA------------RSFADRVDDPAARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFV
        +P  RR   +  +             S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF+
Subjt:  TPRPRRWTPRFWA------------RSFADRVDDPAARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFV

Query:  ASIQSALAVKAELDGREVLAAREKEEFSAALEDASSTIKDELLKAHSEVETLKVEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDMLQ
        ASI  A+ VKAELDGRE LAA+E+E   AALE A++T+K ELLKA  EV+ L+ EV+++ +LLKKE ++ KA LRAAHAIT+GLE       KEKDD+ Q
Subjt:  ASIQSALAVKAELDGREVLAAREKEEFSAALEDASSTIKDELLKAHSEVETLKVEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDMLQ

Query:  ALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSSLKRRYAEKWASGPGGTPGPQALVDQYVRD
         LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+ LK++Y+EKWASGP GTP PQ+LVD+YVR+
Subjt:  ALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSSLKRRYAEKWASGPGGTPGPQALVDQYVRD

Query:  LDSDYSDPEED--------QVGSTQE
        LDSDYSD EE+        +VG+TQE
Subjt:  LDSDYSDPEED--------QVGSTQE
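
The % identity column can be related to the alignment blocks by counting the letters on the middle line, where a letter marks an identical residue, `+` a similar one, and a blank a mismatch or gap. The sketch below assumes that midline convention and uses a 20-residue fragment taken from the alignments above; `percent_identity` is a hypothetical helper, and the figure reported on the page covers the full alignment, which may be longer than what is displayed.

```python
def percent_identity(query, midline):
    """Percentage of alignment columns whose midline character is a letter.

    The alignment length is taken from the query line, so midlines whose
    trailing blanks were stripped are still handled correctly.
    """
    length = len(query)
    if length == 0:
        raise ValueError("empty alignment")
    identical = sum(1 for ch in midline[:length] if ch.isalpha())
    return 100.0 * identical / length


# Fragment of a query line and its midline (one blank = one mismatch).
query_fragment = "KWFYASGEWLAKDESDRSFF"
midline_fragment = "KWFYASGEWLAKDES RSFF"
print(round(percent_identity(query_fragment, midline_fragment), 2))
```

To score a whole hit, the query lines and midlines of all its blocks would be concatenated before calling the function.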

TrEMBL top hits | e value | % identity | Alignment
A0A6J1CR42 uncharacterized protein LOC111013826 | 1.1e-126 | 92.46
Query:  VFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        +FEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDV+QLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  VFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESDRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSELVMVCGFAS
        KWFYASGEWLAKDES RSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVR IE SRPNS L MVC FAS
Subjt:  KWFYASGEWLAKDESDRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSELVMVCGFAS

Query:  SVKRKSKGRAHALEAAQSSKPATPAVAGPASEDAAPVIELESSGVPRGRSAP
         VKRKSKGRAHALEAAQSSKP TPAV GPASED APVIELESSG P     P
Subjt:  SVKRKSKGRAHALEAAQSSKPATPAVAGPASEDAAPVIELESSGVPRGRSAP

A0A6J1D971 uncharacterized protein LOC111018538 | 3.7e-130 | 91.23
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEDASSTIKD
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASST+KD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEDASSTIKD

Query:  ELLKAHSEVETLKVEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLK EVESQAELLKKEEDRR+AQLRAAHAITRGLE       KEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKVEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSSLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGALQAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLS LKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSSLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGALQAGS

A0A6J1DWD2 uncharacterized protein LOC111024680 | 3.0e-103 | 97.4
Query:  VFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        +FEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDV+QLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  VFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESDRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSEL
        KWFYASGEWLAKDES RSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVR IESSRPNSEL
Subjt:  KWFYASGEWLAKDESDRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSEL

A0A6J1DXS5 uncharacterized protein LOC111025502 | 2.2e-178 | 92.75
Query:  MSSSFSSNLGFDLARRLESELEEVENFRLSDDGEDSDASTSGHGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKVFEYGLR
        MSSS SSNL  DLARRLES+LEE+EN R+SDDGEDSDASTSG GLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFK+FEYGLR
Subjt:  MSSSFSSNLGFDLARRLESELEEVENFRLSDDGEDSDASTSGHGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKVFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DV+QLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESDRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSELVMVCGFASSVKRKSK
        EWLAKDES RSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVR IESSRPNSEL MVCGFAS VKRKSK
Subjt:  EWLAKDESDRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSELVMVCGFASSVKRKSK

Query:  GRAHALEAAQSSKPATPAVAGPASEDAAPVIELESSGVPRGRSAP
        GRAHALEAAQSSKPATPAV GPASED A VIELESSG P     P
Subjt:  GRAHALEAAQSSKPATPAVAGPASEDAAPVIELESSGVPRGRSAP

A0A6J1DZB3 uncharacterized protein LOC111025665 | 2.6e-168 | 64.45
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESDRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDES R+FFDVPTRFGNLVSI+L+PEL QA+FDTLK+YK+ FPR RK+ TLVTD+LLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESDRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNP

Query:  AVRLIESSRPNSELVMVCGFASSVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDAAPVIELESSGVPRGRS-----------APGIRPRRW
         VRLIE+SRPNSEL MVCGF  SVKRKSKGRAHAL+    ++P TP V        +GP+S    PVIEL+ SG   G             +P    R  
Subjt:  AVRLIESSRPNSELVMVCGFASSVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDAAPVIELESSGVPRGRS-----------APGIRPRRW

Query:  TPRPRRWTPRFWA------------RSFADRVDDPAARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFV
        +P  RR   +  +             S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF+
Subjt:  TPRPRRWTPRFWA------------RSFADRVDDPAARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFV

Query:  ASIQSALAVKAELDGREVLAAREKEEFSAALEDASSTIKDELLKAHSEVETLKVEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDMLQ
        ASI  A+ VKAELDGRE LAA+E+E   AALE A++T+K ELLKA  EV+ L+ EV+++ +LLKKE ++ KA LRAAHAIT+GLE       KEKDD+ Q
Subjt:  ASIQSALAVKAELDGREVLAAREKEEFSAALEDASSTIKDELLKAHSEVETLKVEVESQAELLKKEEDRRKAQLRAAHAITRGLE-------KEKDDMLQ

Query:  ALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSSLKRRYAEKWASGPGGTPGPQALVDQYVRD
         LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+ LK++Y+EKWASGP GTP PQ+LVD+YVR+
Subjt:  ALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSSLKRRYAEKWASGPGGTPGPQALVDQYVRD

Query:  LDSDYSDPEED--------QVGSTQE
        LDSDYSD EE+        +VG+TQE
Subjt:  LDSDYSDPEED--------QVGSTQE

SwissProt top hits | e value | % identity | Alignment
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic | 2.3e-04 | 28.21
Query:  SRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKVFEYG--LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEA
        S   E  L  L+  F +   + LR+P   ERAD+PP G+ TLY + F YG  L LP+   V E++    +A +Q+       + +L  L  +  R  E  
Subjt:  SRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGWVTLYFKVFEYG--LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEA

Query:  ELLDVNQLLACFEAKRIAK-KPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGE
          + +  L    E +R+ K +  R+Y+   KG   I   P+  + +   +F+ + E
Subjt:  ELLDVNQLLACFEAKRIAK-KPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGE

Arabidopsis top hits | e value | % identity | Alignment
AT2G15420.1 myosin heavy chain-related | 1.3e-05 | 24.83
Query:  PENILLRLPEEGERADNPPEGWVTLYFKVF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVNQLLACFEAKRIA
        P  I L  P+  +R   PPEG++ LY   F   GL  PL  F+ E+  R  +A +Q+          LAIL       +E    +D +         R+ 
Subjt:  PENILLRLPEEGERADNPPEGWVTLYFKVF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVNQLLACFEAKRIA

Query:  KKPGRFYMCARKGAGGIVKGPTS-IKGWVRKWFYASGEWLAKDESDRSFFDVPTRF------------GNLVSIRLVPELTQASFDTLKYYKERFPRGRK
        + PG +Y  A K    IV G  S I GW R++F+      + +  D  F D  T              G L +I  + EL    + T  + + R  R R 
Subjt:  KKPGRFYMCARKGAGGIVKGPTS-IKGWVRKWFYASGEWLAKDESDRSFFDVPTRF------------GNLVSIRLVPELTQASFDTLKYYKERFPRGRK

Query:  VGTLVTDELLLESGLLDYNPAVRLIESSRPNSELVMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVAGPASEDAAPVIE-LESSGVPR-GRSAPGIRP
        +G ++     L   + ++     L+    P+  +        +   +S GR  A E+A        +   P +ED     + +  S +P  G S P    
Subjt:  VGTLVTDELLLESGLLDYNPAVRLIESSRPNSELVMVCGFASSVKRKSKGRAHALEAAQSSKPATPAVAGPASEDAAPVIE-LESSGVPR-GRSAPGIRP

Query:  RRWTPR-------------PRRWTPRFWA----RSFADRVDDPAARMGGTS--DVTARFRVEPSSSGVRDQVSRISAASLDRCLRR------ASKFVSDP
        ++ T R             PRR T    A     S+  +  D A     TS  D+ +R R      G  D  S     S+DR + R      A K     
Subjt:  RRWTPR-------------PRRWTPRFWA----RSFADRVDDPAARMGGTS--DVTARFRVEPSSSGVRDQVSRISAASLDRCLRR------ASKFVSDP

Query:  GSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR--EKEEFSAALEDASSTIKDELLKAHSEVETLKVEVESQAELLKKEEDR-RKAQLRAAHAIT
        G+  + +     +A V++ + A    AE +  + LA     + E SA LE  SS + +++    S V+  ++++E+  +    E  R RK+++    A  
Subjt:  GSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR--EKEEFSAALEDASSTIKDELLKAHSEVETLKVEVESQAELLKKEEDR-RKAQLRAAHAIT

Query:  RGLEKEKDDMLQALE---AKDKELEHAT-AELETAKERLSNGV-LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA
        +  + +    LQ LE    K   +  AT  ELE  +  L NGV  LE +     D D F +  + A    L+ GI+
Subjt:  RGLEKEKDDMLQALE---AKDKELEHAT-AELETAKERLSNGV-LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA


Sequences
CDS sequence
ATGAGCCATCTTGGAGCACCAATAGAGGTCCTCCACGTGTCCAGGGTATTCCCTTCCCCAAACATTGGCCCCCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCA
TTCGACTTGCTTTGGACACGTGGCGACTTCCTATTCGTGGGAAAATACAACCGTCGCGGAAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTCAGGGA
GGATCCTAGCCGCTCGTGGACTACACGTGTACGGTGGGGAAATTCTTCCGACGGGCTATAAATGCCCCCAATCCTTCAGGTCATACCTTACGTTCCCTGAATTCTTGGAG
TTCGATCTGAAGGCAGCTCGAACCCTTGATAGGTCGGTCTCTTCCCTCTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATTCGATTTAGC
TCGTAGGTTAGAGTCCGAGCTCGAGGAGGTAGAAAACTTTAGACTCTCCGATGACGGGGAAGATAGTGACGCCTCCACTTCAGGTCATGGTTTGGAATACCCTTCTAGGA
TACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGG
GTCACTCTCTACTTCAAAGTGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAA
TGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACGTAAACCAGCTTCTCGCGTGCTTCGAAGCGA
AAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGACCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTAC
GCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGATCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACTAGTCCCCGAGCTTACGCAAGC
CTCCTTCGACACGCTGAAATATTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACC
CCGCAGTTCGTCTCATTGAATCCTCAAGGCCGAACTCTGAACTTGTCATGGTTTGCGGGTTTGCAAGCAGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAG
GCCGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGCAGGGCCTGCCTCGGAAGATGCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGTCCCTCGAGGGAGAAGCGC
CCCAGGGATCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGGTGGACGCCACGCTTTTGGGCGAGGAGTTTTGCAGATCGGGTGGACGATCCTGCGGCCAGGATGGGCG
GGACGTCCGACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCCGGAGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCG
TCCAAATTTGTGAGCGACCCTGGGTCCGTTCTGCAGAGGACCATCGATTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCCCTGGCTGTAAAGGCCGAGCTGGA
TGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAAGAGTTCTCTGCTGCCTTGGAGGATGCTTCCTCCACCATAAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGA
CTTTGAAGGTCGAGGTGGAGTCTCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAAGGAG
AAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTATTGGAGGA
ATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCG
ATCTCAGCAGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGGGATCTGGACTCTGAC
TACTCCGATCCCGAAGAGGACCAGGTCGGCTCCACTCAAGAGGGCGCTCTTCAAGCAGGCTCTTAG
mRNA sequence
ATGAGCCATCTTGGAGCACCAATAGAGGTCCTCCACGTGTCCAGGGTATTCCCTTCCCCAAACATTGGCCCCCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCA
TTCGACTTGCTTTGGACACGTGGCGACTTCCTATTCGTGGGAAAATACAACCGTCGCGGAAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTCAGGGA
GGATCCTAGCCGCTCGTGGACTACACGTGTACGGTGGGGAAATTCTTCCGACGGGCTATAAATGCCCCCAATCCTTCAGGTCATACCTTACGTTCCCTGAATTCTTGGAG
TTCGATCTGAAGGCAGCTCGAACCCTTGATAGGTCGGTCTCTTCCCTCTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATTCGATTTAGC
TCGTAGGTTAGAGTCCGAGCTCGAGGAGGTAGAAAACTTTAGACTCTCCGATGACGGGGAAGATAGTGACGCCTCCACTTCAGGTCATGGTTTGGAATACCCTTCTAGGA
TACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGG
GTCACTCTCTACTTCAAAGTGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAA
TGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTCTTTTGGCTACGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACGTAAACCAGCTTCTCGCGTGCTTCGAAGCGA
AAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGACCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTAC
GCTTCCGGGGAATGGCTCGCAAAGGACGAGTCAGATCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACTAGTCCCCGAGCTTACGCAAGC
CTCCTTCGACACGCTGAAATATTACAAGGAGCGCTTTCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACCGACGAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACC
CCGCAGTTCGTCTCATTGAATCCTCAAGGCCGAACTCTGAACTTGTCATGGTTTGCGGGTTTGCAAGCAGCGTGAAGCGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAG
GCCGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGCAGGGCCTGCCTCGGAAGATGCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGTCCCTCGAGGGAGAAGCGC
CCCAGGGATCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGGTGGACGCCACGCTTTTGGGCGAGGAGTTTTGCAGATCGGGTGGACGATCCTGCGGCCAGGATGGGCG
GGACGTCCGACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCCGGAGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCG
TCCAAATTTGTGAGCGACCCTGGGTCCGTTCTGCAGAGGACCATCGATTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCCCTGGCTGTAAAGGCCGAGCTGGA
TGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAAGAGTTCTCTGCTGCCTTGGAGGATGCTTCCTCCACCATAAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGA
CTTTGAAGGTCGAGGTGGAGTCTCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAAGGAG
AAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTATTGGAGGA
ATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCG
ATCTCAGCAGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGGGATCTGGACTCTGAC
TACTCCGATCCCGAAGAGGACCAGGTCGGCTCCACTCAAGAGGGCGCTCTTCAAGCAGGCTCTTAG
Protein sequence
MSHLGAPIEVLHVSRVFPSPNIGPLCLVRSRPGREVHSTCFGHVATSYSWENTTVAEDLSSEYSNIPTLRISGRILAARGLHVYGGEILPTGYKCPQSFRSYLTFPEFLE
FDLKAARTLDRSVSSLSLSNVVAMSSSFSSNLGFDLARRLESELEEVENFRLSDDGEDSDASTSGHGLEYPSRIPEHYLGSLRRGFAIPENILLRLPEEGERADNPPEGW
VTLYFKVFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVNQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFY
ASGEWLAKDESDRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDELLLESGLLDYNPAVRLIESSRPNSELVMVCGFASSVKRKSKGRAHALE
AAQSSKPATPAVAGPASEDAAPVIELESSGVPRGRSAPGIRPRRWTPRPRRWTPRFWARSFADRVDDPAARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRA
SKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEDASSTIKDELLKAHSEVETLKVEVESQAELLKKEEDRRKAQLRAAHAITRGLEKE
KDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSSLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSD
YSDPEEDQVGSTQEGALQAGS
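
The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a minimal sketch of that relationship, the code below translates a nucleotide string with the standard genetic code (assumed here, since the page does not name a translation table); `translate` is a hypothetical helper, and the two short inputs are the first and last codons of the CDS listed above.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGAGCCATCTTGGAGCA"))  # first six codons of the CDS
print(translate("GCAGGCTCTTAG"))        # last three codons plus the stop
```

Pasting the full CDS (with line breaks) into `translate` should reproduce the listed protein if the standard table applies.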