CuGenDBv2

Moc03g07190 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g07190
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: myosin heavy chain-related
Genome location: chr3:5070875..5078925
RNA-Seq Expression: Moc03g07190
Synteny: Moc03g07190
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
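
To illustrate how the genome location field can be read, the following minimal sketch parses a `chrom:start..end` string and reports the span it covers. It assumes the coordinates are 1-based and inclusive, which this page does not state. `Location` and `parse_location` are hypothetical names introduced here, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr3:5070875..5078925")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 8051 bp on chr3.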


Homology
GenBank top hits | e value | %identity | Alignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | 2.0e-104 | 91.51
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTNELLLESRLLDYNPAVRPIESSRPNSELEFETAHPC
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVT+ELLLES LLDYNPAVRPIE SRPNS L    A  C
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTNELLLESRLLDYNPAVRPIESSRPNSELEFETAHPC

Query:  RGRACLGRSSPG
        R  + + R S G
Subjt:  RGRACLGRSSPG

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] | 8.9e-121 | 85.37
Query:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVRAPGSVLQRTIDYAAEVRLVPHFFLLEFAQQAELDGREALAAREEEEFSAALEAASSTM
        G   + A+ RIEPSS GVR+QV+RISAASLDRCLRRASKFV APGSVLQRTIDYAAE  +         A +AELDGRE LAARE+EEFSAALE ASSTM
Subjt:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVRAPGSVLQRTIDYAAEVRLVPHFFLLEFAQQAELDGREALAAREEEEFSAALEAASSTM

Query:  KDELLKAHSEVETLKAEVESQAELLKKEDDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNEVLLEESFRQHPD
        KDELLKAHSEVETLKAEVESQAELLKKE+DRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSN VLLEE+FRQHPD
Subjt:  KDELLKAHSEVETLKAEVESQAELLKKEDDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNEVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQINFSGLKRRYAEKWASGPSGTPGPQALVDQYVSDVDSDYSDREEDQVDSTQEGAPPAGA
        FDGFAKDFSDAGFKFLMKGIASDMPDLQI+ SGLKRRYAEKWASGP GTPGPQALVDQYV D+DSDYSD EEDQV STQEGA P G+
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQINFSGLKRRYAEKWASGPSGTPGPQALVDQYVSDVDSDYSDREEDQVDSTQEGAPPAGA

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia] | 4.0e-105 | 98.96
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTNELLLESRLLDYNPAVRPIESSRPNSEL
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVT+ELLLES LLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTNELLLESRLLDYNPAVRPIESSRPNSEL

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | 5.0e-156 | 97.19
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFSIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGF+IPENILLRLPEEGERAD+PPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFSIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTNELLLESRLLDYNPAVRPIESSRPNSEL
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVT+ELLLES LLDYNPAVRPIESSRPNSEL
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTNELLLESRLLDYNPAVRPIESSRPNSEL

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | 2.4e-150 | 60.08
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTNELLLESRLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVT++LLLES LLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTNELLLESRLLDYNP

Query:  AVRPIESSRPNSELEF------ETAHPCRGRACLGRSSPG-ERAGVFWGSLEGEASQGSDRGGGRP----DRGGGRPAFGRGG--------------GGE
         VR IE+SRPNSEL              +GRA   ++  G E           + + G       P    D  GGR    R                 GE
Subjt:  AVRPIESSRPNSELEF------ETAHPCRGRACLGRSSPG-ERAGVFWGSLEGEASQGSDRGGGRP----DRGGGRPAFGRGG--------------GGE

Query:  APLKRRKKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVRAPGSVLQRTIDYAAEVRL
        +PL+RR+KKKK  S SE GA   LP   AD VDDP ARM GTS+V  RF +EPSS GV++QV+RISA  LDR LRRASKFV  PGSVLQRTID  AE  +
Subjt:  APLKRRKKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVRAPGSVLQRTIDYAAEVRL

Query:  VPHFFLLEFAQQAELDGREALAAREEEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEDDRRKAQLRAAHAITRGLEREKFQLLKEKDDM
              L    +AELDGREALAA+E E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+
Subjt:  VPHFFLLEFAQQAELDGREALAAREEEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEDDRRKAQLRAAHAITRGLEREKFQLLKEKDDM

Query:  LQALEAKDKELEHATAELETAKERLSNEVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQINFSGLKRRYAEKWASGPSGTPGPQALVDQYV
         Q LE KD  +   T EL+  KERL+N  LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQI+ +GLK++Y+EKWASGP+GTP PQ+LVD+YV
Subjt:  LQALEAKDKELEHATAELETAKERLSNEVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQINFSGLKRRYAEKWASGPSGTPGPQALVDQYV

Query:  SDVDSDYSDREED--------QVDSTQEGAP
         ++DSDYSD EE+        +V +TQE  P
Subjt:  SDVDSDYSDREED--------QVDSTQEGAP

TrEMBL top hits | e value | %identity | Alignment
A0A6J1CR42 uncharacterized protein LOC111013826 | 9.6e-105 | 91.51
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTNELLLESRLLDYNPAVRPIESSRPNSELEFETAHPC
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVT+ELLLES LLDYNPAVRPIE SRPNS L    A  C
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTNELLLESRLLDYNPAVRPIESSRPNSELEFETAHPC

Query:  RGRACLGRSSPG
        R  + + R S G
Subjt:  RGRACLGRSSPG

A0A6J1D971 uncharacterized protein LOC111018538 | 4.3e-121 | 85.37
Query:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVRAPGSVLQRTIDYAAEVRLVPHFFLLEFAQQAELDGREALAAREEEEFSAALEAASSTM
        G   + A+ RIEPSS GVR+QV+RISAASLDRCLRRASKFV APGSVLQRTIDYAAE  +         A +AELDGRE LAARE+EEFSAALE ASSTM
Subjt:  GTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVRAPGSVLQRTIDYAAEVRLVPHFFLLEFAQQAELDGREALAAREEEEFSAALEAASSTM

Query:  KDELLKAHSEVETLKAEVESQAELLKKEDDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNEVLLEESFRQHPD
        KDELLKAHSEVETLKAEVESQAELLKKE+DRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSN VLLEE+FRQHPD
Subjt:  KDELLKAHSEVETLKAEVESQAELLKKEDDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNEVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQINFSGLKRRYAEKWASGPSGTPGPQALVDQYVSDVDSDYSDREEDQVDSTQEGAPPAGA
        FDGFAKDFSDAGFKFLMKGIASDMPDLQI+ SGLKRRYAEKWASGP GTPGPQALVDQYV D+DSDYSD EEDQV STQEGA P G+
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQINFSGLKRRYAEKWASGPSGTPGPQALVDQYVSDVDSDYSDREEDQVDSTQEGAPPAGA

A0A6J1DWD2 uncharacterized protein LOC111024680 | 1.9e-105 | 98.96
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTNELLLESRLLDYNPAVRPIESSRPNSEL
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVT+ELLLES LLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTNELLLESRLLDYNPAVRPIESSRPNSEL

A0A6J1DXS5 uncharacterized protein LOC111025502 | 2.4e-156 | 97.19
Query:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFSIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLR
        MSSSISSNL SDLARRLES+LEEIEN RISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGF+IPENILLRLPEEGERAD+PPEGWVTLYFKMFEYGLR
Subjt:  MSSSISSNLGSDLARRLESELEEIENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFSIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLR

Query:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
        LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG
Subjt:  LPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASG

Query:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTNELLLESRLLDYNPAVRPIESSRPNSEL
        EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVT+ELLLES LLDYNPAVRPIESSRPNSEL
Subjt:  EWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTNELLLESRLLDYNPAVRPIESSRPNSEL

A0A6J1DZB3 uncharacterized protein LOC111025665 | 1.2e-150 | 60.08
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTNELLLESRLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVT++LLLES LLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTNELLLESRLLDYNP

Query:  AVRPIESSRPNSELEF------ETAHPCRGRACLGRSSPG-ERAGVFWGSLEGEASQGSDRGGGRP----DRGGGRPAFGRGG--------------GGE
         VR IE+SRPNSEL              +GRA   ++  G E           + + G       P    D  GGR    R                 GE
Subjt:  AVRPIESSRPNSELEF------ETAHPCRGRACLGRSSPG-ERAGVFWGSLEGEASQGSDRGGGRP----DRGGGRPAFGRGG--------------GGE

Query:  APLKRRKKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVRAPGSVLQRTIDYAAEVRL
        +PL+RR+KKKK  S SE GA   LP   AD VDDP ARM GTS+V  RF +EPSS GV++QV+RISA  LDR LRRASKFV  PGSVLQRTID  AE  +
Subjt:  APLKRRKKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVRAPGSVLQRTIDYAAEVRL

Query:  VPHFFLLEFAQQAELDGREALAAREEEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEDDRRKAQLRAAHAITRGLEREKFQLLKEKDDM
              L    +AELDGREALAA+E E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+
Subjt:  VPHFFLLEFAQQAELDGREALAAREEEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEDDRRKAQLRAAHAITRGLEREKFQLLKEKDDM

Query:  LQALEAKDKELEHATAELETAKERLSNEVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQINFSGLKRRYAEKWASGPSGTPGPQALVDQYV
         Q LE KD  +   T EL+  KERL+N  LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQI+ +GLK++Y+EKWASGP+GTP PQ+LVD+YV
Subjt:  LQALEAKDKELEHATAELETAKERLSNEVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQINFSGLKRRYAEKWASGPSGTPGPQALVDQYV

Query:  SDVDSDYSDREED--------QVDSTQEGAP
         ++DSDYSD EE+        +V +TQE  P
Subjt:  SDVDSDYSDREED--------QVDSTQEGAP

SwissProt top hits | e value | %identity | Alignment
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic | 3.7e-05 | 26.73
Query:  SRIPEHYLGSLRRGFSIPENILLRLPEEGERADHPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEA
        S   E  L  L+  F +   + LR+P   ERAD PP G+ TLY + F YG  L LP+   V E++    +A +Q+       + +L  L  +  R  E  
Subjt:  SRIPEHYLGSLRRGFSIPENILLRLPEEGERADHPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEA

Query:  ELLDVDQLLACFEAKRIAK-KPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFG----NLVSIRPVPELTQASFDTLKY
          + +  L    E +R+ K +  R+Y+   KG   I   P+  + +   +F+ + E    ++       V TR+G     L  + P+P+   ++F  L  
Subjt:  ELLDVDQLLACFEAKRIAK-KPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFG----NLVSIRPVPELTQASFDTLKY

Query:  YK
         K
Subjt:  YK

Arabidopsis top hits | e value | %identity | Alignment
AT2G15420.1 myosin heavy chain-related | 2.5e-04 | 23.87
Query:  PENILLRLPEEGERADHPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIA
        P  I L  P+  +R   PPEG++ LY   F   GL  PL  F+ E+  R  +A +Q+          LAIL       +E    +D D         R+ 
Subjt:  PENILLRLPEEGERADHPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEAKRIA

Query:  KKPGRFYMCARKGAGGIVKGPTS-IKGWVRKWFYAS--------------GEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRG
        + PG +Y  A K    IV G  S I GW R++F+                 +W    E      D P  F  L +I  + EL    + T  + + R  R 
Subjt:  KKPGRFYMCARKGAGGIVKGPTS-IKGWVRKWFYAS--------------GEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQASFDTLKYYKERFPRG

Query:  RKVGTLVTNELLLESRLLDYNPAVRPIESSRPNSELEFETAHPCRGRACLGRSSPGERAGVFWGSLEGEASQGSDRGGGRP---DRGGGRPAFGRGG---
        R +G ++           +    +  +E S   +E   +  +  RG   LGR S  E A               DR   RP   D+G       R     
Subjt:  RKVGTLVTNELLLESRLLDYNPAVRPIESSRPNSELEFETAHPCRGRACLGRSSPGERAGVFWGSLEGEASQGSDRGGGRP---DRGGGRPAFGRGG---

Query:  -GGEAPLKRRKKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTAR-FRIEPSSLGVREQVTRISAA-----SLDRCLRR------ASKFVRAP
         GG  P K+R  +  A    E G+ +V      +     A   G +    A+      ++    + V+RI  A     S+DR + R      A K  ++ 
Subjt:  -GGEAPLKRRKKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTAR-FRIEPSSLGVREQVTRISAA-----SLDRCLRR------ASKFVRAP

Query:  GSVLQRTIDYAAEVRLVPHFFLLEFAQQAELDGREALAAREEEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEDDRRKAQLRAAHAITR
        G+  + +     + ++       E A+   L  + A     E E SA LE  SS + +++    S V+    E   Q E L K      A+LR +     
Subjt:  GSVLQRTIDYAAEVRLVPHFFLLEFAQQAELDGREALAAREEEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEDDRRKAQLRAAHAITR

Query:  GLEREKF------------QLLKEK----DDMLQALEAKDKELEHATAELETA
          ER+K             +L+K+K       ++ LE +++ L++   +LE A
Subjt:  GLEREKF------------QLLKEK----DDMLQALEAKDKELEHATAELETA
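
The alignments above carry a middle line between each Query/Subjt pair. As a rough sketch of how such a line can be tallied, the code below counts identities and positives for one alignment block. It assumes the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap column). `alignment_stats` is a hypothetical helper, and the result for a single block is not expected to reproduce the %identity values listed for whole hits.

```python
from typing import Tuple


def alignment_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block.

    Assumption: letters in the midline mark identical residues and '+'
    marks conservative substitutions; everything else is a mismatch or gap.
    """
    # Trailing spaces are often lost in copied text, so pad to the query width.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Last block of the first GenBank hit, copied from this page.
ident, pos, cols = alignment_stats("RGRACLGRSSPG", "R  + + R S G")
print(f"identities={ident}/{cols} ({100 * ident / cols:.2f}%), positives={pos}/{cols}")
```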


Sequences
CDS sequence
ATGCTAAGTATGAGGGCCGATGTGAACCTGGCCGAGGTCCGACCTACCGGGAAGCTCGGTGGGGGCCGATGTGAGCTACTGTCAGTCCGCCCAAGTATTCAGATC
GGTCCGGAGGCCGAGTTCGAGCTGCAATCTGAAATACACTGTTGTGCATATCCTTGCATAAACAGGGAAATGTTGGAGGTTCAAGAGCGGAGTAGCATAGTCGTG
CAATTGAGAACCAAGTCATGCGATGATTATGGAGTATCAATTTATGGAAAAAGGGCAGCGCAATCAACTAGAATAGCTTGTCTTGTCATCCCTCTTGTCTTTATC
CCTCATCCTTTGGTGATTTGGGAGGATGACTATCCTTTTGAGCCTTGCCATTGGGAGGCAAGTTCGAGTCATAGCATACGCAAAGATTTGCACAACAGTGTGTTC
CTGGTTGTTGTAGCTCGAACCCGGCCTCCGGACCGACCTGAACGCTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGGGCCTTC
CACGTGTCCCGAGTATTCCCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTTATTCGACCTGCTTTGGACACGTGGCGA
CTTCTTATTCGTGGGAAAACACAACCGTCGCGGAAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTCAGAGAGGATCCCAGCCGCTCGTTGAT
TACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACATAATTGCCATGTCGTCCTCTATTAGCAGCAACCTAGGATCCGATCTAGCTCGT
AGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTCTAGAATCTCCGATGACGGGGAGGATAGCGATGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGG
ATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCTCTATCCCCGAAAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACCATCCTCCAGAT
GGATGGGTCACTCTCTACTTCAAAATGTTAATATTCCGACGCTTCGGATCTCAGAGAGGATCCCAGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCG
GTCTCTTCCCTCTCTCTTTCGAACATAATTGCCATGTCGTCCTCTATTAGCAGCAACCTAGGATCCGATCTAGCTCGTAGGTTAGAGTCTGAGCTCGAGGAGATA
GAAAACTTTAGAATCTCCGATGATGGGGAGGATAGCGATGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTT
CGTAGGGGGTTCTCTATCCCCGAAAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACCATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATG
TTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATT
TTCGCTTTGGCCATCCTCTTTTGGCTTCGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACGTAGACCAGCTTCTCGCGTGCTTCGAGGCGAAAAGGATAGCT
AAGAAGCCCGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCG
GGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCC
TCCTTCGATACTCTGAAATACTACAAGGAGCGCTTCCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACGAACGAACTGCTGCTTGAGTCCAGGCTGCTAGATTAC
AACCCTGCAGTTCGTCCCATCGAATCCTCAAGGCCGAACTCTGAACTTGAGTTCGAAACCGCCCACCCCTGCCGTGGTAGGGCCTGCCTCGGAAGATCCAGCCCT
GGTGAACGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGTCCCAGGGATCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGGCGGACGTCCCGCCTTT
GGGCGAGGAGGCGGGGGGGAAGCCCCTCTAAAGCGAAGAAAGAAGAAAAAGAAGGCGATCTCTCCCTCGGAGGTCGGAGCCTGCAGGGTCTTGCCTGCAGGTTGG
GCTGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGATGTGACGGCGCGGTTCAGAATTGAGCCGTCAAGTCTCGGGGTGAGGGAGCAGGTGACC
CGCATCTCAGCTGCGAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGGGCCCCTGGGTCCGTTCTGCAGAGGACCATTGACTACGCCGCCGAGGTA
AGACTAGTGCCCCATTTTTTTTTGCTTGAGTTCGCCCAACAGGCCGAGCTGGATGGGAGGGAAGCTTTGGCAGCGAGGGAGGAAGAGGAGTTCTCCGCTGCCTTG
GAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCCGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTACTGAAGAAGGAG
GATGACAGGCGCAAGGCTCAACTCCGAGCTGCCCACGCCATCACCAGGGGCTTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCG
CTCGAAGCGAAGGATAAGGAGTTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGAAGTCCTGCTGGAGGAATCGTTTAGGCAACAT
CCTGACTTCGATGGATTTGCTAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCAATTTCAGTGGT
CTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGTGATGTGGACTCTGACTACTCC
GATCGCGAAGAGGACCAGGTCGACTCCACTCAGGAGGGCGCTCCCCCAGCAGGCGCTTAG
mRNA sequence
ATGCTAAGTATGAGGGCCGATGTGAACCTGGCCGAGGTCCGACCTACCGGGAAGCTCGGTGGGGGCCGATGTGAGCTACTGTCAGTCCGCCCAAGTATTCAGATC
GGTCCGGAGGCCGAGTTCGAGCTGCAATCTGAAATACACTGTTGTGCATATCCTTGCATAAACAGGGAAATGTTGGAGGTTCAAGAGCGGAGTAGCATAGTCGTG
CAATTGAGAACCAAGTCATGCGATGATTATGGAGTATCAATTTATGGAAAAAGGGCAGCGCAATCAACTAGAATAGCTTGTCTTGTCATCCCTCTTGTCTTTATC
CCTCATCCTTTGGTGATTTGGGAGGATGACTATCCTTTTGAGCCTTGCCATTGGGAGGCAAGTTCGAGTCATAGCATACGCAAAGATTTGCACAACAGTGTGTTC
CTGGTTGTTGTAGCTCGAACCCGGCCTCCGGACCGACCTGAACGCTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGGGCCTTC
CACGTGTCCCGAGTATTCCCTTCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTTATTCGACCTGCTTTGGACACGTGGCGA
CTTCTTATTCGTGGGAAAACACAACCGTCGCGGAAGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTCAGAGAGGATCCCAGCCGCTCGTTGAT
TACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCTCTCTTTCGAACATAATTGCCATGTCGTCCTCTATTAGCAGCAACCTAGGATCCGATCTAGCTCGT
AGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTCTAGAATCTCCGATGACGGGGAGGATAGCGATGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGG
ATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCTCTATCCCCGAAAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACCATCCTCCAGAT
GGATGGGTCACTCTCTACTTCAAAATGTTAATATTCCGACGCTTCGGATCTCAGAGAGGATCCCAGCCGCTCGTTGATTACACGTCTCGAACCCTTGGTAGGTCG
GTCTCTTCCCTCTCTCTTTCGAACATAATTGCCATGTCGTCCTCTATTAGCAGCAACCTAGGATCCGATCTAGCTCGTAGGTTAGAGTCTGAGCTCGAGGAGATA
GAAAACTTTAGAATCTCCGATGATGGGGAGGATAGCGATGCCTCCACTTCAGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTT
CGTAGGGGGTTCTCTATCCCCGAAAACATCCTCCTCAGGCTTCCGGAGGAGGGGGAGAGAGCTGACCATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATG
TTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATT
TTCGCTTTGGCCATCCTCTTTTGGCTTCGAGCTCGGGATAGTGAGGAGGCCGAGCTGTTGGACGTAGACCAGCTTCTCGCGTGCTTCGAGGCGAAAAGGATAGCT
AAGAAGCCCGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCG
GGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCC
TCCTTCGATACTCTGAAATACTACAAGGAGCGCTTCCCGAGGGGTAGGAAGGTCGGAACCCTGGTGACGAACGAACTGCTGCTTGAGTCCAGGCTGCTAGATTAC
AACCCTGCAGTTCGTCCCATCGAATCCTCAAGGCCGAACTCTGAACTTGAGTTCGAAACCGCCCACCCCTGCCGTGGTAGGGCCTGCCTCGGAAGATCCAGCCCT
GGTGAACGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGTCCCAGGGATCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGGCGGACGTCCCGCCTTT
GGGCGAGGAGGCGGGGGGGAAGCCCCTCTAAAGCGAAGAAAGAAGAAAAAGAAGGCGATCTCTCCCTCGGAGGTCGGAGCCTGCAGGGTCTTGCCTGCAGGTTGG
GCTGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGATGTGACGGCGCGGTTCAGAATTGAGCCGTCAAGTCTCGGGGTGAGGGAGCAGGTGACC
CGCATCTCAGCTGCGAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGGGCCCCTGGGTCCGTTCTGCAGAGGACCATTGACTACGCCGCCGAGGTA
AGACTAGTGCCCCATTTTTTTTTGCTTGAGTTCGCCCAACAGGCCGAGCTGGATGGGAGGGAAGCTTTGGCAGCGAGGGAGGAAGAGGAGTTCTCCGCTGCCTTG
GAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCCGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTACTGAAGAAGGAG
GATGACAGGCGCAAGGCTCAACTCCGAGCTGCCCACGCCATCACCAGGGGCTTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCG
CTCGAAGCGAAGGATAAGGAGTTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGAAGTCCTGCTGGAGGAATCGTTTAGGCAACAT
CCTGACTTCGATGGATTTGCTAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCAATTTCAGTGGT
CTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGTGATGTGGACTCTGACTACTCC
GATCGCGAAGAGGACCAGGTCGACTCCACTCAGGAGGGCGCTCCCCCAGCAGGCGCTTAG
Protein sequence
MLSMRADVNLAEVRPTGKLGGGRCELLSVRPSIQIGPEAEFELQSEIHCCAYPCINREMLEVQERSSIVVQLRTKSCDDYGVSIYGKRAAQSTRIACLVIPLVFI
PHPLVIWEDDYPFEPCHWEASSSHSIRKDLHNSVFLVVVARTRPPDRPERLGGPAQKGEHSDDQVSIGAFHVSRVFPSPNIGPLSVWSDLDLAEKFIRPALDTWR
LLIRGKTQPSRKIYRRNIQIFRRFGSQRGSQPLVDYTSRTLGRSVSSLSLSNIIAMSSSISSNLGSDLARRLESELEEIENSRISDDGEDSDASTSGQGLEYPSR
IPEHYLGSLRRGFSIPENILLRLPEEGERADHPPDGWVTLYFKMLIFRRFGSQRGSQPLVDYTSRTLGRSVSSLSLSNIIAMSSSISSNLGSDLARRLESELEEI
ENFRISDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFSIPENILLRLPEEGERADHPPEGWVTLYFKMFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVI
FALAILFWLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELTQA
SFDTLKYYKERFPRGRKVGTLVTNELLLESRLLDYNPAVRPIESSRPNSELEFETAHPCRGRACLGRSSPGERAGVFWGSLEGEASQGSDRGGGRPDRGGGRPAF
GRGGGGEAPLKRRKKKKKAISPSEVGACRVLPAGWADRVDDPAARMGGTSDVTARFRIEPSSLGVREQVTRISAASLDRCLRRASKFVRAPGSVLQRTIDYAAEV
RLVPHFFLLEFAQQAELDGREALAAREEEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEDDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQA
LEAKDKELEHATAELETAKERLSNEVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQINFSGLKRRYAEKWASGPSGTPGPQALVDQYVSDVDSDYS
DREEDQVDSTQEGAPPAGA
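
The relationship between the CDS and protein sequences can be checked with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1) and stops at the first stop codon. `translate` is a hypothetical helper written for this illustration; the page does not say which tool produced its protein sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases and last 15 bases of the CDS shown on this page.
print(translate("ATGCTAAGTATGAGGGCCGATGTG"))
print(translate("CCAGCAGGCGCTTAG"))
```

The two fragments translate to the first eight and last four residues of the protein sequence listed above.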