CuGenDBv2

Moc03g19980 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g19980
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: myosin heavy chain-related
Genome location: chr3:13466148..13471680
RNA-Seq Expression: Moc03g19980
Synteny: Moc03g19980
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
                     GO:0043167 - ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (E-value, % identity, alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia] | E-value: 6.6e-106 | % identity: 83.46
Query:  MCARKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELMQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKG TSIKGWVRKWFYASGEWLAKDES              V+IRPVPEL QA FDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELMQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AV------------PMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPVPVIELESSGDPSREKRPRDQTEAVDVSPLGEEVREEAPLKRRR
        AV             MVCGFASNVKRKSKG+AHALEAAQSSKP TPAVVGPASEDP PVIELESS  PSREKRPRDQTEAVDVSPLGEEVREE PLKRRR
Subjt:  AV------------PMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPVPVIELESSGDPSREKRPRDQTEAVDVSPLGEEVREEAPLKRRR

Query:  KKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVELSSSGVRDQ
        KKKKTTSPLEVGARG LPASFADRVDDPEARMGGT DVT RFRVE SSSGVRDQ
Subjt:  KKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVELSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | E-value: 2.3e-111 | % identity: 80.95
Query:  MFEYDLRLPLHPFVQELLFRTGLAPAQVAPNG------------------EEAELLDVDQLLASFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVR
        MFEY LRLPLHPFVQE LFRTGLAPAQVAPNG                  EEAELLDVDQLLA FEAKRIAKKPGRFYMCARKGAGGIVKG TSIKGWVR
Subjt:  MFEYDLRLPLHPFVQELLFRTGLAPAQVAPNG------------------EEAELLDVDQLLASFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELMQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAV------------PMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPEL QA FDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAV             MVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELMQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAV------------PMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPVPVIELESSGDPSREKRPRDQTEAV-------DVSPLGE
         VKRKSKGRAHALEAAQSSKP TPAVVGPASEDP PVIELESSG PSREKRPRDQTEAV       DV PLGE
Subjt:  NVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPVPVIELESSGDPSREKRPRDQTEAV-------DVSPLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] | E-value: 9.9e-102 | % identity: 75.79
Query:  GTSDVTVRFRVELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYTAETFVASIQSALAVKAELDGREALEAREKEEFSAALEAASSTMKD
        G   +  + R+E SSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDY AE FVASIQSALAVKAELDGRE L AREKEEFSAALE ASSTMKD
Subjt:  GTSDVTVRFRVELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYTAETFVASIQSALAVKAELDGREALEAREKEEFSAALEAASSTMKD

Query:  ELLKAHYEVEVLKAKVKAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLFKEKDDMLQALEVKEEELKHATVELEMVKERLNNGALLEESFRQHPEFD
        ELLKAH EVE LKA+V+++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQL KEKDDMLQALE K++EL+HAT ELE  KERL+NG LLEE+FRQHP+FD
Subjt:  ELLKAHYEVEVLKAKVKAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLFKEKDDMLQALEVKEEELKHATVELEMVKERLNNGALLEESFRQHPEFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKK------------------------RDLDSDYSELEEDQVGTTQEGAPQAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDL GLK+                        RDLDSDYS+ EEDQVG+TQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKK------------------------RDLDSDYSELEEDQVGTTQEGAPQAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | E-value: 1.0e-159 | % identity: 84.79
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVSLYFKMFEYD
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWV+LYFKMFEY 
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVSLYFKMFEYD

Query:  LRLPLHPFVQELLFRTGLAPAQVAPNG------------------EEAELLDVDQLLASFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVRKWFYA
        LRLPLHPFVQE LFRTGLAPAQVAPNG                  EEAEL DVDQLLA FEAKRIAKKPGRFYMCARKGAGGIVKG TSIKGWVRKWFYA
Subjt:  LRLPLHPFVQELLFRTGLAPAQVAPNG------------------EEAELLDVDQLLASFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELMQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAV------------PMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPEL QA FDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAV             MVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELMQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAV------------PMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPAVVGPASEDPVPVIELESSGDPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAVVGPASEDP  VIELESSG PSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVVGPASEDPVPVIELESSGDPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | E-value: 3.4e-171 | % identity: 65.67
Query:  MCARKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELMQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKG TSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELMQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AV------------PMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPVPVIELESSGDPSREKRPRDQTEAVDVSPLGEEVRE
         V             MVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SG  S EKR R+++EA+DVSPL  EVR 
Subjt:  AV------------PMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPVPVIELESSGDPSREKRPRDQTEAVDVSPLGEEVRE

Query:  EAPLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYTAETF
        E+PL+RRRKKKKT+S  E GARG LP S AD VDDPEARM GTS+V +RF +E SSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AE F
Subjt:  EAPLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYTAETF

Query:  VASIQSALAVKAELDGREALEAREKEEFSAALEAASSTMKDELLKAHYEVEVLKAKVKAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLFKEKDDML
        +ASI  A+ VKAELDGREAL A+E+E   AALEAA +T+K ELLKA  EV++L+A+V AK +LLKKE ++ KA LRAAHAITKGLEKEKFQL KEKDD+ 
Subjt:  VASIQSALAVKAELDGREALEAREKEEFSAALEAASSTMKDELLKAHYEVEVLKAKVKAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLFKEKDDML

Query:  QALEVKEEELKHATVELEMVKERLNNGALLEESFRQHPEFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKK------------------------R
        Q LE K+  +   T EL+ +KERL NG LLEESFRQHP+FDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKK                        R
Subjt:  QALEVKEEELKHATVELEMVKERLNNGALLEESFRQHPEFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKK------------------------R

Query:  DLDSDYSELEED--------QVGTTQEGAP--QAGS
        +LDSDYS++EE+        +VGTTQE  P  Q GS
Subjt:  DLDSDYSELEED--------QVGTTQEGAP--QAGS

TrEMBL top hits (E-value, % identity, alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298 | E-value: 3.2e-106 | % identity: 83.46
Query:  MCARKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELMQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKG TSIKGWVRKWFYASGEWLAKDES              V+IRPVPEL QA FDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELMQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AV------------PMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPVPVIELESSGDPSREKRPRDQTEAVDVSPLGEEVREEAPLKRRR
        AV             MVCGFASNVKRKSKG+AHALEAAQSSKP TPAVVGPASEDP PVIELESS  PSREKRPRDQTEAVDVSPLGEEVREE PLKRRR
Subjt:  AV------------PMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPVPVIELESSGDPSREKRPRDQTEAVDVSPLGEEVREEAPLKRRR

Query:  KKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVELSSSGVRDQ
        KKKKTTSPLEVGARG LPASFADRVDDPEARMGGT DVT RFRVE SSSGVRDQ
Subjt:  KKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVELSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC111013826 | E-value: 1.1e-111 | % identity: 80.95
Query:  MFEYDLRLPLHPFVQELLFRTGLAPAQVAPNG------------------EEAELLDVDQLLASFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVR
        MFEY LRLPLHPFVQE LFRTGLAPAQVAPNG                  EEAELLDVDQLLA FEAKRIAKKPGRFYMCARKGAGGIVKG TSIKGWVR
Subjt:  MFEYDLRLPLHPFVQELLFRTGLAPAQVAPNG------------------EEAELLDVDQLLASFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELMQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAV------------PMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPEL QA FDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAV             MVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELMQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAV------------PMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPVPVIELESSGDPSREKRPRDQTEAV-------DVSPLGE
         VKRKSKGRAHALEAAQSSKP TPAVVGPASEDP PVIELESSG PSREKRPRDQTEAV       DV PLGE
Subjt:  NVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPVPVIELESSGDPSREKRPRDQTEAV-------DVSPLGE

A0A6J1D971 uncharacterized protein LOC111018538 | E-value: 4.8e-102 | % identity: 75.79
Query:  GTSDVTVRFRVELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYTAETFVASIQSALAVKAELDGREALEAREKEEFSAALEAASSTMKD
        G   +  + R+E SSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDY AE FVASIQSALAVKAELDGRE L AREKEEFSAALE ASSTMKD
Subjt:  GTSDVTVRFRVELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYTAETFVASIQSALAVKAELDGREALEAREKEEFSAALEAASSTMKD

Query:  ELLKAHYEVEVLKAKVKAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLFKEKDDMLQALEVKEEELKHATVELEMVKERLNNGALLEESFRQHPEFD
        ELLKAH EVE LKA+V+++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQL KEKDDMLQALE K++EL+HAT ELE  KERL+NG LLEE+FRQHP+FD
Subjt:  ELLKAHYEVEVLKAKVKAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLFKEKDDMLQALEVKEEELKHATVELEMVKERLNNGALLEESFRQHPEFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKK------------------------RDLDSDYSELEEDQVGTTQEGAPQAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDL GLK+                        RDLDSDYS+ EEDQVG+TQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKK------------------------RDLDSDYSELEEDQVGTTQEGAPQAGS

A0A6J1DXS5 uncharacterized protein LOC111025502 | E-value: 5.0e-160 | % identity: 84.79
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVSLYFKMFEYD
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWV+LYFKMFEY 
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVSLYFKMFEYD

Query:  LRLPLHPFVQELLFRTGLAPAQVAPNG------------------EEAELLDVDQLLASFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVRKWFYA
        LRLPLHPFVQE LFRTGLAPAQVAPNG                  EEAEL DVDQLLA FEAKRIAKKPGRFYMCARKGAGGIVKG TSIKGWVRKWFYA
Subjt:  LRLPLHPFVQELLFRTGLAPAQVAPNG------------------EEAELLDVDQLLASFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELMQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAV------------PMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPEL QA FDTLKYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAV             MVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELMQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAV------------PMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPAVVGPASEDPVPVIELESSGDPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAVVGPASEDP  VIELESSG PSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVVGPASEDPVPVIELESSGDPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC111025665 | E-value: 1.7e-171 | % identity: 65.67
Query:  MCARKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELMQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKG TSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLVSI+ +PEL QA FDTLK+YK+HFPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVSIRPVPELMQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AV------------PMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPVPVIELESSGDPSREKRPRDQTEAVDVSPLGEEVRE
         V             MVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SG  S EKR R+++EA+DVSPL  EVR 
Subjt:  AV------------PMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPVPVIELESSGDPSREKRPRDQTEAVDVSPLGEEVRE

Query:  EAPLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYTAETF
        E+PL+RRRKKKKT+S  E GARG LP S AD VDDPEARM GTS+V +RF +E SSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AE F
Subjt:  EAPLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYTAETF

Query:  VASIQSALAVKAELDGREALEAREKEEFSAALEAASSTMKDELLKAHYEVEVLKAKVKAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLFKEKDDML
        +ASI  A+ VKAELDGREAL A+E+E   AALEAA +T+K ELLKA  EV++L+A+V AK +LLKKE ++ KA LRAAHAITKGLEKEKFQL KEKDD+ 
Subjt:  VASIQSALAVKAELDGREALEAREKEEFSAALEAASSTMKDELLKAHYEVEVLKAKVKAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLFKEKDDML

Query:  QALEVKEEELKHATVELEMVKERLNNGALLEESFRQHPEFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKK------------------------R
        Q LE K+  +   T EL+ +KERL NG LLEESFRQHP+FDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKK                        R
Subjt:  QALEVKEEELKHATVELEMVKERLNNGALLEESFRQHPEFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKK------------------------R

Query:  DLDSDYSELEED--------QVGTTQEGAP--QAGS
        +LDSDYS++EE+        +VGTTQE  P  Q GS
Subjt:  DLDSDYSELEED--------QVGTTQEGAP--QAGS

SwissProt top hits (E-value, % identity, alignment)
No hits found

Arabidopsis top hits (E-value, % identity, alignment)
AT1G32010.1 myosin heavy chain-related | E-value: 7.8e-04 | % identity: 22.28
Query:  RLESELEEIENFRFSDDGEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVSLYFKMF-EYDLRLPLHPFVQ
        R+ ++ +   N    D+ E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E  LR P+  F+ 
Subjt:  RLESELEEIENFRFSDDGEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVSLYFKMF-EYDLRLPLHPFVQ

Query:  ELLFRTGLAPAQ--VAPNGEEAEL----------LDVDQLLASFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVRKWFYA
               +A +Q  VA     A L          L V+ +       ++  K G+ Y+ + +G   +  G +  + W+  +FYA
Subjt:  ELLFRTGLAPAQ--VAPNGEEAEL----------LDVDQLLASFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVRKWFYA


Sequences
CDS sequence
ATGTCACAAGTGTTGCCACCCACCAAGACTCTTGGAGTAGAGCATCACCTCACAGAAGATCCATCAGAGATCAAGGAAACGAAAGATGAAAAAGAAATGGTAGAAATCAA
CACTCAATTCAACACATGGACCAACAATGATGGCCTTCTTACCTCATGGCTTCTTGGAATCATTGTTGAAGAAATGGTGGCTTTGATTAAAGGTACTGATACTGCAAAGC
AGGTTTTCTCTCCTCCAGTAGTAGATTATGAAGCCATGGGTAATAGATATGAAGGTGGCTGTTCGCTGGCCGTGGGTAGCAGATTTGAAACCTGTTTATGCAAGGATATG
CACAACAGTGTATTTCAGATTGCAGCTCGAACTCGGCCTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCGGACGATCAAGTCAGTATAGG
TCGGATTCCCAGTTTAGTTCGAGGGTATTCTCTTCCCCAAACATTGGCCCTCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGACTTGCTTTGGACGC
GTGGAGACTTCCTATTCGTGGGAAAATATAACCGCTGCGGTGGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTTCGGGAGGATCCTAGCCGCTCGTTG
ATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTT
AGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTA
GGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGA
TGGGTCTCTCTCTACTTCAAAATGTTTGAGTACGACCTCAGACTTCCCCTTCACCCTTTCGTCCAAGAGCTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCC
CAATGGTGAAGAGGCCGAGCTGCTGGACGTAGACCAGCTCCTCGCAAGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCG
CAGGCGGGATAGTTAAGGGGTCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGAC
GTTCCCACTAGGTTTGGGAACCTGGTTTCAATCCGACCAGTCCCCGAGCTTATGCAAGCCTTCTTCGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAA
GGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCTGGACTGCTAGATTACAACCCTGCAGTTCCCATGGTTTGCGGATTTGCAAGCAACGTGAAGCGCAAGTCCA
AGGGTCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGTCCCAGTGATCGAGCTGGAGTCTTCT
GGGGATCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGAGGTGAGGGAGGAGGCCCCCCTGAAGCGAAGGAGGAAGAA
GAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGA
CAGTACGGTTCAGAGTCGAGCTGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTAAGT
GACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACACCGCTGAGACGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAAGCTCT
GGAAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCTTCCACCATGAAGGATGAGCTGCTGAAAGCTCACTATGAGGTGGAAGTTTTGAAGGCCAAGG
TGAAGGCCAAGGCCGAGTTGCTGAAGAAAGAAGAGGACAGGCGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCTTC
AAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGTGAAGGAGGAGGAGTTGAAGCACGCGACTGTTGAGCTGGAGATGGTGAAGGAGCGTCTCAACAATGGAGCCCTATT
GGAGGAATCGTTCAGGCAACATCCTGAATTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTC
AGATCGATCTCGGTGGTCTGAAGAAGAGAGATCTGGACTCTGACTACTCCGAACTCGAAGAGGATCAGGTCGGCACCACTCAGGAGGGCGCTCCTCAAGCAGGCTCTTAG
mRNA sequence
ATGTCACAAGTGTTGCCACCCACCAAGACTCTTGGAGTAGAGCATCACCTCACAGAAGATCCATCAGAGATCAAGGAAACGAAAGATGAAAAAGAAATGGTAGAAATCAA
CACTCAATTCAACACATGGACCAACAATGATGGCCTTCTTACCTCATGGCTTCTTGGAATCATTGTTGAAGAAATGGTGGCTTTGATTAAAGGTACTGATACTGCAAAGC
AGGTTTTCTCTCCTCCAGTAGTAGATTATGAAGCCATGGGTAATAGATATGAAGGTGGCTGTTCGCTGGCCGTGGGTAGCAGATTTGAAACCTGTTTATGCAAGGATATG
CACAACAGTGTATTTCAGATTGCAGCTCGAACTCGGCCTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCGGACGATCAAGTCAGTATAGG
TCGGATTCCCAGTTTAGTTCGAGGGTATTCTCTTCCCCAAACATTGGCCCTCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTCATTCGACTTGCTTTGGACGC
GTGGAGACTTCCTATTCGTGGGAAAATATAACCGCTGCGGTGGATTTATCGTCGGAATATTCAAATATTCCGACGCTTCGGATCTTCGGGAGGATCCTAGCCGCTCGTTG
ATTACACGTCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCACTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTT
AGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTA
GGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGA
TGGGTCTCTCTCTACTTCAAAATGTTTGAGTACGACCTCAGACTTCCCCTTCACCCTTTCGTCCAAGAGCTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCC
CAATGGTGAAGAGGCCGAGCTGCTGGACGTAGACCAGCTCCTCGCAAGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCG
CAGGCGGGATAGTTAAGGGGTCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTCGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGAC
GTTCCCACTAGGTTTGGGAACCTGGTTTCAATCCGACCAGTCCCCGAGCTTATGCAAGCCTTCTTCGACACGTTGAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAA
GGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCTGGACTGCTAGATTACAACCCTGCAGTTCCCATGGTTTGCGGATTTGCAAGCAACGTGAAGCGCAAGTCCA
AGGGTCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGTCCCAGTGATCGAGCTGGAGTCTTCT
GGGGATCCTTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGAGGTGAGGGAGGAGGCCCCCCTGAAGCGAAGGAGGAAGAA
GAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGACGTGA
CAGTACGGTTCAGAGTCGAGCTGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTAAGT
GACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACACCGCTGAGACGTTTGTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAAGCTCT
GGAAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCTTCCACCATGAAGGATGAGCTGCTGAAAGCTCACTATGAGGTGGAAGTTTTGAAGGCCAAGG
TGAAGGCCAAGGCCGAGTTGCTGAAGAAAGAAGAGGACAGGCGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCTTC
AAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGTGAAGGAGGAGGAGTTGAAGCACGCGACTGTTGAGCTGGAGATGGTGAAGGAGCGTCTCAACAATGGAGCCCTATT
GGAGGAATCGTTCAGGCAACATCCTGAATTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTC
AGATCGATCTCGGTGGTCTGAAGAAGAGAGATCTGGACTCTGACTACTCCGAACTCGAAGAGGATCAGGTCGGCACCACTCAGGAGGGCGCTCCTCAAGCAGGCTCTTAG
Protein sequence
MSQVLPPTKTLGVEHHLTEDPSEIKETKDEKEMVEINTQFNTWTNNDGLLTSWLLGIIVEEMVALIKGTDTAKQVFSPPVVDYEAMGNRYEGGCSLAVGSRFETCLCKDM
HNSVFQIAARTRPPDRSEYLGGPAQKGEHSDDQVSIGRIPSLVRGYSLPQTLALSLSGPISTWQRSSFDLLWTRGDFLFVGKYNRCGGFIVGIFKYSDASDLREDPSRSL
ITRLEPLVGRSLPSLSLSNVVAMSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEG
WVSLYFKMFEYDLRLPLHPFVQELLFRTGLAPAQVAPNGEEAELLDVDQLLASFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVRKWFYASGEWLAKDESGRSFFD
VPTRFGNLVSIRPVPELMQAFFDTLKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVPMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPVPVIELESS
GDPSREKRPRDQTEAVDVSPLGEEVREEAPLKRRRKKKKTTSPLEVGARGALPASFADRVDDPEARMGGTSDVTVRFRVELSSSGVRDQVSRISAASLDRCLRRASKFVS
DPGSVLQRTIDYTAETFVASIQSALAVKAELDGREALEAREKEEFSAALEAASSTMKDELLKAHYEVEVLKAKVKAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLF
KEKDDMLQALEVKEEELKHATVELEMVKERLNNGALLEESFRQHPEFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRDLDSDYSELEEDQVGTTQEGAPQAGS