; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g34500 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g34500
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionmyosin heavy chain-related
Genome locationchr8:25203933..25206230
RNA-Seq ExpressionMoc08g34500
SyntenyMoc08g34500
Gene Ontology termsGO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]2.7e-11286.17Show/hide
Query:  CWTKGAGGIVKGPTSIKRWVRKWFYASGEWLANDESGRSFFDVLTRFGNLVSIRPVPKLTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPA
        C  KGA GIVKGPTSIK WVRKWFYASGEWLA DES              V+IRPVP+LTQASFDTLKYYKE FPRGRKVGTLVTDKLLLESGLLDYNPA
Subjt:  CWTKGAGGIVKGPTSIKRWVRKWFYASGEWLANDESGRSFFDVLTRFGNLVSIRPVPKLTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPA

Query:  VRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVTELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRK
        VRPIESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQ+SKP TPAVVGPASEDPAPV ELESS GPSREKRPRDQTEAVD  PLGEEVREEVPLKRRRK
Subjt:  VRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVTELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRK

Query:  TKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVMARFRVEPSSSGVRNQ
         KKTTSPLEVGA GVLPASFADRVDDPEARMGGT DV  RFRVEPSSSGVR+Q
Subjt:  TKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVMARFRVEPSSSGVRNQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]6.1e-10478.02Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGW-VSFSLWPSFF-------GYEL-------------GIVKRAS----CWTKGAGGIVKGPTSIKRWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGW V F+L   F+         EL              I K+      C  KGAGGIVKGPTSIK WVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGW-VSFSLWPSFF-------GYEL-------------GIVKRAS----CWTKGAGGIVKGPTSIKRWVR

Query:  KWFYASGEWLANDESGRSFFDVLTRFGNLVSIRPVPKLTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLA DESGRSFFDV TRFGNLVSIRPVP+LTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLANDESGRSFFDVLTRFGNLVSIRPVPKLTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVTELESSGGPSREKRPRDQTEAVDAL-------PLGE
         VKRKSKGRAHALEAAQ+SKP TPAVVGPASEDPAPV ELESSGGPSREKRPRDQTEAVDA        PLGE
Subjt:  NVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVTELESSGGPSREKRPRDQTEAVDAL-------PLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]4.8e-11783.51Show/hide
Query:  GTSDVMARFRVEPSSSGVRNQVSHISAASLDRCLRRASKFVARSNFLLCT----FSQAFVASIQSALAVKAEMDGREVLAAREKEEFSAALEAASSTMKD
        G   ++A+ R+EPSSSGVR+QVS ISAASLDRCLRRASKFV+    +L       ++AFVASIQSALAVKAE+DGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVMARFRVEPSSSGVRNQVSHISAASLDRCLRRASKFVARSNFLLCT----FSQAFVASIQSALAVKAEMDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQAFEAKDKELKHATAELETANERLSNGVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQA EAKDKEL+HATAELETA ERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQAFEAKDKELKHATAELETANERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPSPQALVDKYVRDLDSDDSDLEEDQVGTTQEGAPQADS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLK+RYAE+WASGPGGTP PQALVD+YVRDLDSD SD EEDQVG+TQEGA    S
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPSPQALVDKYVRDLDSDDSDLEEDQVGTTQEGAPQADS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]7.6e-15583.94Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGW-VSFSLWPSFF-------GYELGIV---------KRAS--------CWTKGAGGIVKGPTSIKRWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGW V F+L   F+         EL  V         KR +        C  KGAGGIVKGPTSIK WVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGW-VSFSLWPSFF-------GYELGIV---------KRAS--------CWTKGAGGIVKGPTSIKRWVRKWFYA

Query:  SGEWLANDESGRSFFDVLTRFGNLVSIRPVPKLTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWLA DESGRSFFDV TRFGNLVSIRPVP+LTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLANDESGRSFFDVLTRFGNLVSIRPVPKLTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQNSKPATPAVVGPASEDPAPVTELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQ+SKPATPAVVGPASEDPA V ELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQNSKPATPAVVGPASEDPAPVTELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]6.2e-18166.92Show/hide
Query:  CWTKGAGGIVKGPTSIKRWVRKWFYASGEWLANDESGRSFFDVLTRFGNLVSIRPVPKLTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPA
        C  KG GGIVKGPTSIK WV KWF+ASGEWLA DESGR+FFDV TRFGNLVSI+ +P+L QA+FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP 
Subjt:  CWTKGAGGIVKGPTSIKRWVRKWFYASGEWLANDESGRSFFDVLTRFGNLVSIRPVPKLTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPA

Query:  VRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQNSKPATPAV--------VGPASEDPAPVTELESSGGPSREKRPRDQTEAVDALPLGEEVREE
        VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PV EL+ SGG S EKR R+++EA+D  PL  EVR E
Subjt:  VRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQNSKPATPAV--------VGPASEDPAPVTELESSGGPSREKRPRDQTEAVDALPLGEEVREE

Query:  VPLKRRRKTKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVMARFRVEPSSSGVRNQVSHISAASLDRCLRRASKFVARSNFLL----CTFSQAFV
         PL+RRRK KKT+S  E GA G LP S AD VDDPEARM GTS+V  RF +EPSSSGV++QVS ISA  LDR LRRASKFV+    +L       ++AF+
Subjt:  VPLKRRRKTKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVMARFRVEPSSSGVRNQVSHISAASLDRCLRRASKFVARSNFLL----CTFSQAFV

Query:  ASIQSALAVKAEMDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQ
        ASI  A+ VKAE+DGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+ K +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q
Subjt:  ASIQSALAVKAEMDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQ

Query:  AFEAKDKELKHATAELETANERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPSPQALVDKYVRD
          E KD  +   T EL+   ERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLKK+Y+E+WASGP GTP PQ+LVDKYVR+
Subjt:  AFEAKDKELKHATAELETANERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPSPQALVDKYVRD

Query:  LDSDDSDLEED--------QVGTTQEGAP
        LDSD SD+EE+        +VGTTQE  P
Subjt:  LDSDDSDLEED--------QVGTTQEGAP

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092981.3e-11286.17Show/hide
Query:  CWTKGAGGIVKGPTSIKRWVRKWFYASGEWLANDESGRSFFDVLTRFGNLVSIRPVPKLTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPA
        C  KGA GIVKGPTSIK WVRKWFYASGEWLA DES              V+IRPVP+LTQASFDTLKYYKE FPRGRKVGTLVTDKLLLESGLLDYNPA
Subjt:  CWTKGAGGIVKGPTSIKRWVRKWFYASGEWLANDESGRSFFDVLTRFGNLVSIRPVPKLTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPA

Query:  VRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVTELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRK
        VRPIESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQ+SKP TPAVVGPASEDPAPV ELESS GPSREKRPRDQTEAVD  PLGEEVREEVPLKRRRK
Subjt:  VRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVTELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRK

Query:  TKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVMARFRVEPSSSGVRNQ
         KKTTSPLEVGA GVLPASFADRVDDPEARMGGT DV  RFRVEPSSSGVR+Q
Subjt:  TKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVMARFRVEPSSSGVRNQ

A0A6J1CR42 uncharacterized protein LOC1110138262.9e-10478.02Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGW-VSFSLWPSFF-------GYEL-------------GIVKRAS----CWTKGAGGIVKGPTSIKRWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGW V F+L   F+         EL              I K+      C  KGAGGIVKGPTSIK WVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGW-VSFSLWPSFF-------GYEL-------------GIVKRAS----CWTKGAGGIVKGPTSIKRWVR

Query:  KWFYASGEWLANDESGRSFFDVLTRFGNLVSIRPVPKLTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLA DESGRSFFDV TRFGNLVSIRPVP+LTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLANDESGRSFFDVLTRFGNLVSIRPVPKLTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVTELESSGGPSREKRPRDQTEAVDAL-------PLGE
         VKRKSKGRAHALEAAQ+SKP TPAVVGPASEDPAPV ELESSGGPSREKRPRDQTEAVDA        PLGE
Subjt:  NVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVTELESSGGPSREKRPRDQTEAVDAL-------PLGE

A0A6J1D971 uncharacterized protein LOC1110185382.3e-11783.51Show/hide
Query:  GTSDVMARFRVEPSSSGVRNQVSHISAASLDRCLRRASKFVARSNFLLCT----FSQAFVASIQSALAVKAEMDGREVLAAREKEEFSAALEAASSTMKD
        G   ++A+ R+EPSSSGVR+QVS ISAASLDRCLRRASKFV+    +L       ++AFVASIQSALAVKAE+DGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVMARFRVEPSSSGVRNQVSHISAASLDRCLRRASKFVARSNFLLCT----FSQAFVASIQSALAVKAEMDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQAFEAKDKELKHATAELETANERLSNGVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQA EAKDKEL+HATAELETA ERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQAFEAKDKELKHATAELETANERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPSPQALVDKYVRDLDSDDSDLEEDQVGTTQEGAPQADS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLK+RYAE+WASGPGGTP PQALVD+YVRDLDSD SD EEDQVG+TQEGA    S
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPSPQALVDKYVRDLDSDDSDLEEDQVGTTQEGAPQADS

A0A6J1DXS5 uncharacterized protein LOC1110255023.7e-15583.94Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGW-VSFSLWPSFF-------GYELGIV---------KRAS--------CWTKGAGGIVKGPTSIKRWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGW V F+L   F+         EL  V         KR +        C  KGAGGIVKGPTSIK WVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGW-VSFSLWPSFF-------GYELGIV---------KRAS--------CWTKGAGGIVKGPTSIKRWVRKWFYA

Query:  SGEWLANDESGRSFFDVLTRFGNLVSIRPVPKLTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWLA DESGRSFFDV TRFGNLVSIRPVP+LTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLANDESGRSFFDVLTRFGNLVSIRPVPKLTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQNSKPATPAVVGPASEDPAPVTELESSGGPSREKRPRDQTEAVD
        SKGRAHALEAAQ+SKPATPAVVGPASEDPA V ELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQNSKPATPAVVGPASEDPAPVTELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256653.0e-18166.92Show/hide
Query:  CWTKGAGGIVKGPTSIKRWVRKWFYASGEWLANDESGRSFFDVLTRFGNLVSIRPVPKLTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPA
        C  KG GGIVKGPTSIK WV KWF+ASGEWLA DESGR+FFDV TRFGNLVSI+ +P+L QA+FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP 
Subjt:  CWTKGAGGIVKGPTSIKRWVRKWFYASGEWLANDESGRSFFDVLTRFGNLVSIRPVPKLTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPA

Query:  VRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQNSKPATPAV--------VGPASEDPAPVTELESSGGPSREKRPRDQTEAVDALPLGEEVREE
        VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PV EL+ SGG S EKR R+++EA+D  PL  EVR E
Subjt:  VRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQNSKPATPAV--------VGPASEDPAPVTELESSGGPSREKRPRDQTEAVDALPLGEEVREE

Query:  VPLKRRRKTKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVMARFRVEPSSSGVRNQVSHISAASLDRCLRRASKFVARSNFLL----CTFSQAFV
         PL+RRRK KKT+S  E GA G LP S AD VDDPEARM GTS+V  RF +EPSSSGV++QVS ISA  LDR LRRASKFV+    +L       ++AF+
Subjt:  VPLKRRRKTKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVMARFRVEPSSSGVRNQVSHISAASLDRCLRRASKFVARSNFLL----CTFSQAFV

Query:  ASIQSALAVKAEMDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQ
        ASI  A+ VKAE+DGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+ K +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q
Subjt:  ASIQSALAVKAEMDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQ

Query:  AFEAKDKELKHATAELETANERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPSPQALVDKYVRD
          E KD  +   T EL+   ERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLKK+Y+E+WASGP GTP PQ+LVDKYVR+
Subjt:  AFEAKDKELKHATAELETANERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPSPQALVDKYVRD

Query:  LDSDDSDLEED--------QVGTTQEGAP
        LDSD SD+EE+        +VGTTQE  P
Subjt:  LDSDDSDLEED--------QVGTTQEGAP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42060.1 myosin heavy chain-related2.6e-0422.84Show/hide
Query:  SRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNG----------------WVSFSL
        SR    + G        PE +   IPE  +R  + PEG++ L+   F E GL  PL  F+  +  R  +A +Q++                    V   L
Subjt:  SRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPAQVAPNG----------------WVSFSL

Query:  WPSFFGYELGIVKRASCWTKGAGG--IVKGPTS-IKRWVRKWFYASGEWLANDESGRSFFDV
        +     + +G+  R+        G  I  G TS ++ W + +F+A    ++ D++  S  ++
Subjt:  WPSFFGYELGIVKRASCWTKGAGG--IVKGPTS-IKRWVRKWFYASGEWLANDESGRSFFDV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGG
GAGGATAGTGATGCCTCCACCTCGGGTCAAGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAAC
ATCCTCCTCAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTT
CACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTAC
GAGCTCGGGATAGTGAAGAGGGCGAGCTGTTGGACGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGAGATGGGTGAGGAAGTGGTTCTACGCT
TCTGGGGAATGGCTTGCAAATGACGAGTCAGGTCGTTCCTTCTTTGACGTCCTCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCAAGCTTACGCAA
GCCTCCTTCGACACGTTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGACTGCTAGAT
TACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAATTAGCCATGGTTTGCGGGTTTGCGAGTAACGTGAAACGCAAGTCCAAGGGTCGAGCC
CATGCTCTTGAGGCTGCCCAGAATTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGACCGAGCTGGAGTCTTCTGGGGGT
CCCTCGAGGGAGAAGCGCCCCAGGGATCAGACTGAGGCGGTGGACGCCTTGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGACG
AAGAAGACGACCTCCCCCTTGGAGGTCGGAGCTTGTGGGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGAT
GTGATGGCACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGAACCAGGTGTCCCACATCTCAGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAA
TTTGTAGCTCGGTCTAACTTTCTTCTTTGTACCTTTTCTCAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGATGGATGGGAGGGAAGTT
CTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAG
GCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAGGGGCTTAGAGAAGGAGAAG
TTCCAACTCCTGAAGGAGAAGGACGACATGCTCCAGGCATTTGAAGCGAAGGATAAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGCGAATGAGCGTCTC
AGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATT
GCTTCCGACATGCCTGACCTTCAGATCGATCTCAGTGGTCTGAAAAAGAGGTATGCCGAGCAGTGGGCGTCTGGGCCTGGCGGCACCCCTAGCCCCCAAGCGTTG
GTGGATAAGTATGTCAGAGATCTGGATTCTGACGACTCCGATCTCGAAGAGGACCAGGTCGGCACCACACAGGAGGGCGCTCCTCAGGCAGACTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGG
GAGGATAGTGATGCCTCCACCTCGGGTCAAGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAAC
ATCCTCCTCAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTT
CACCCTTTTGTCCAAGAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTAC
GAGCTCGGGATAGTGAAGAGGGCGAGCTGTTGGACGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGAGATGGGTGAGGAAGTGGTTCTACGCT
TCTGGGGAATGGCTTGCAAATGACGAGTCAGGTCGTTCCTTCTTTGACGTCCTCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCCAAGCTTACGCAA
GCCTCCTTCGACACGTTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGACTGCTAGAT
TACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAATTAGCCATGGTTTGCGGGTTTGCGAGTAACGTGAAACGCAAGTCCAAGGGTCGAGCC
CATGCTCTTGAGGCTGCCCAGAATTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGACCGAGCTGGAGTCTTCTGGGGGT
CCCTCGAGGGAGAAGCGCCCCAGGGATCAGACTGAGGCGGTGGACGCCTTGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGACG
AAGAAGACGACCTCCCCCTTGGAGGTCGGAGCTTGTGGGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGAT
GTGATGGCACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGAACCAGGTGTCCCACATCTCAGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAA
TTTGTAGCTCGGTCTAACTTTCTTCTTTGTACCTTTTCTCAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGATGGATGGGAGGGAAGTT
CTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAG
GCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAGGGGCTTAGAGAAGGAGAAG
TTCCAACTCCTGAAGGAGAAGGACGACATGCTCCAGGCATTTGAAGCGAAGGATAAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGCGAATGAGCGTCTC
AGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATT
GCTTCCGACATGCCTGACCTTCAGATCGATCTCAGTGGTCTGAAAAAGAGGTATGCCGAGCAGTGGGCGTCTGGGCCTGGCGGCACCCCTAGCCCCCAAGCGTTG
GTGGATAAGTATGTCAGAGATCTGGATTCTGACGACTCCGATCTCGAAGAGGACCAGGTCGGCACCACACAGGAGGGCGCTCCTCAGGCAGACTCTTAG
Protein sequenceShow/hide protein sequence
MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPL
HPFVQEFLFRTGLAPAQVAPNGWVSFSLWPSFFGYELGIVKRASCWTKGAGGIVKGPTSIKRWVRKWFYASGEWLANDESGRSFFDVLTRFGNLVSIRPVPKLTQ
ASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQNSKPATPAVVGPASEDPAPVTELESSGG
PSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKTKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVMARFRVEPSSSGVRNQVSHISAASLDRCLRRASK
FVARSNFLLCTFSQAFVASIQSALAVKAEMDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEK
FQLLKEKDDMLQAFEAKDKELKHATAELETANERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKKRYAEQWASGPGGTPSPQAL
VDKYVRDLDSDDSDLEEDQVGTTQEGAPQADS