; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g01220 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g01220
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionINVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: my s in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).
Genome locationchr3:862246..867289
RNA-Seq ExpressionMoc03g01220
SyntenyMoc03g01220
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]6.9e-11690.76Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVPELTQASFDTLKYYKEHFLRGRKVGTLVNDKLLLESGVLDYSPAVRPI
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES      V  R    VPELTQASFDTLKYYKEHF RGRKVGTLV DKLLLESG+LDY+PAVRPI
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVPELTQASFDTLKYYKEHFLRGRKVGTLVNDKLLLESGVLDYSPAVRPI

Query:  ESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAVDVLPLGEEVREEVPLKRRRKKKKT
        ESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSSKP TPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAVDV PLGEEVREEVPLKRRRKKKKT
Subjt:  ESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAVDVLPLGEEVREEVPLKRRRKKKKT

Query:  TSPLEVGARGVLPASFADRVDDPEARMGGTSDATARFRVEPSSSGVRDQ
        TSPLEVGARGVLPASFADRVDDPEARMGGT D T RFRVEPSSSGVRDQ
Subjt:  TSPLEVGARGVLPASFADRVDDPEARMGGTSDATARFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]7.4e-11884.25Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNG------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNG                  EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNG------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNL-----VPELTQASFDTLKYYKEHFLRGRKVGTLVNDKLLLESGVLDYSPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNL     VPELTQASFDTLKYYKE F RGRKVGTLV D+LLLESG+LDY+PAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNL-----VPELTQASFDTLKYYKEHFLRGRKVGTLVNDKLLLESGVLDYSPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAV-------DVLPLGE
         VKRKSKGRAHALEAAQSSKP TPAVVGPASEDPAPVIELESS GPSREKRPRDQTEAV       DV PLGE
Subjt:  NVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAV-------DVLPLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]2.9e-10678.87Show/hide
Query:  GTSDATARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTIKD
        G     A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASST+KD
Subjt:  GTSDATARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTIKD

Query:  ELLNAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDDELKHATAELETAKERLSNG--------------
        ELL AHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKD EL+HATAELETAKERLSNG              
Subjt:  ELLNAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDDELKHATAELETAKERLSNG--------------

Query:  ----------------GIVSDMPDLRIDLSGLKKKYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAG
                        GI SDMPDL+IDLSGLK++YAE+WASGPGGTPGPQALVD+YVRDLDSDYSD EEDQVG+TQEGA   G
Subjt:  ----------------GIVSDMPDLRIDLSGLKKKYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAG

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.5e-16687.61Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRLSDDGEDSDASTSGQGLEYPSRIHEHYLGSLRRGFAIPENILLRIPEEGGRADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R+SDDGEDSDASTSGQGLEYPSRI EHYLGSLRRGFAIPENILLR+PEEG RADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRLSDDGEDSDASTSGQGLEYPSRIHEHYLGSLRRGFAIPENILLRIPEEGGRADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNG------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNG                  EEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNG------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNL-----VPELTQASFDTLKYYKEHFLRGRKVGTLVNDKLLLESGVLDYSPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNL     VPELTQASFDTLKYYKE F RGRKVGTLV D+LLLESG+LDY+PAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNL-----VPELTQASFDTLKYYKEHFLRGRKVGTLVNDKLLLESGVLDYSPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAVVGPASEDPA VIELESS GPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]6.8e-17265.28Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLV-----PELTQASFDTLKYYKEHFLRGRKVGTLVNDKLLLESGVLDYSP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLV     PEL QA+FDTLK+YK+HF R RK+ TLV DKLLLESG+LDY+P
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLV-----PELTQASFDTLKYYKEHFLRGRKVGTLVNDKLLLESGVLDYSP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSRGPSREKRPRDQTEAVDVLPLGEEVRE
         VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ S G S EKR R+++EA+DV PL  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSRGPSREKRPRDQTEAVDVLPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDATARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E GARG LP S AD VDDPEARM GTS+   RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDATARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTIKDELLNAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELL A  EV+IL+AEV+ K +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTIKDELLNAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDDELKHATAELETAKERLSNG------------------------------GIVSDMPDLRIDLSGLKKKYAEQWASGPGGTPGPQALVDKYVR
        Q LE KD  +   T EL+  KERL+NG                              GI +DMP L+IDL+GLKKKY+E+WASGP GTP PQ+LVDKYVR
Subjt:  QALEAKDDELKHATAELETAKERLSNG------------------------------GIVSDMPDLRIDLSGLKKKYAEQWASGPGGTPGPQALVDKYVR

Query:  DLDSDYSDLEED--------QVGTTQEGAP
        +LDSDYSD+EE+        +VGTTQE  P
Subjt:  DLDSDYSDLEED--------QVGTTQEGAP

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092983.4e-11690.76Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVPELTQASFDTLKYYKEHFLRGRKVGTLVNDKLLLESGVLDYSPAVRPI
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES      V  R    VPELTQASFDTLKYYKEHF RGRKVGTLV DKLLLESG+LDY+PAVRPI
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVPELTQASFDTLKYYKEHFLRGRKVGTLVNDKLLLESGVLDYSPAVRPI

Query:  ESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAVDVLPLGEEVREEVPLKRRRKKKKT
        ESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSSKP TPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAVDV PLGEEVREEVPLKRRRKKKKT
Subjt:  ESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAVDVLPLGEEVREEVPLKRRRKKKKT

Query:  TSPLEVGARGVLPASFADRVDDPEARMGGTSDATARFRVEPSSSGVRDQ
        TSPLEVGARGVLPASFADRVDDPEARMGGT D T RFRVEPSSSGVRDQ
Subjt:  TSPLEVGARGVLPASFADRVDDPEARMGGTSDATARFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138263.6e-11884.25Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNG------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNG                  EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNG------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVR

Query:  KWFYASGEWLAKDESGRSFFDVPTRFGNL-----VPELTQASFDTLKYYKEHFLRGRKVGTLVNDKLLLESGVLDYSPAVRPIESSRPNSELAMVCGFAS
        KWFYASGEWLAKDESGRSFFDVPTRFGNL     VPELTQASFDTLKYYKE F RGRKVGTLV D+LLLESG+LDY+PAVRPIE SRPNS LAMVC FAS
Subjt:  KWFYASGEWLAKDESGRSFFDVPTRFGNL-----VPELTQASFDTLKYYKEHFLRGRKVGTLVNDKLLLESGVLDYSPAVRPIESSRPNSELAMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAV-------DVLPLGE
         VKRKSKGRAHALEAAQSSKP TPAVVGPASEDPAPVIELESS GPSREKRPRDQTEAV       DV PLGE
Subjt:  NVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAV-------DVLPLGE

A0A6J1D971 uncharacterized protein LOC1110185381.4e-10678.87Show/hide
Query:  GTSDATARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTIKD
        G     A+ R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASST+KD
Subjt:  GTSDATARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTIKD

Query:  ELLNAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDDELKHATAELETAKERLSNG--------------
        ELL AHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKD EL+HATAELETAKERLSNG              
Subjt:  ELLNAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDDELKHATAELETAKERLSNG--------------

Query:  ----------------GIVSDMPDLRIDLSGLKKKYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAG
                        GI SDMPDL+IDLSGLK++YAE+WASGPGGTPGPQALVD+YVRDLDSDYSD EEDQVG+TQEGA   G
Subjt:  ----------------GIVSDMPDLRIDLSGLKKKYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAG

A0A6J1DXS5 uncharacterized protein LOC1110255027.1e-16787.61Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRLSDDGEDSDASTSGQGLEYPSRIHEHYLGSLRRGFAIPENILLRIPEEGGRADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R+SDDGEDSDASTSGQGLEYPSRI EHYLGSLRRGFAIPENILLR+PEEG RADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRLSDDGEDSDASTSGQGLEYPSRIHEHYLGSLRRGFAIPENILLRIPEEGGRADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNG------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNG                  EEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNG------------------EEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESGRSFFDVPTRFGNL-----VPELTQASFDTLKYYKEHFLRGRKVGTLVNDKLLLESGVLDYSPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDVPTRFGNL     VPELTQASFDTLKYYKE F RGRKVGTLV D+LLLESG+LDY+PAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVPTRFGNL-----VPELTQASFDTLKYYKEHFLRGRKVGTLVNDKLLLESGVLDYSPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAVD
        SKGRAHALEAAQSSKPATPAVVGPASEDPA VIELESS GPSREKRPRDQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSRGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256653.3e-17265.28Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLV-----PELTQASFDTLKYYKEHFLRGRKVGTLVNDKLLLESGVLDYSP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTRFGNLV     PEL QA+FDTLK+YK+HF R RK+ TLV DKLLLESG+LDY+P
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLV-----PELTQASFDTLKYYKEHFLRGRKVGTLVNDKLLLESGVLDYSP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSRGPSREKRPRDQTEAVDVLPLGEEVRE
         VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ S G S EKR R+++EA+DV PL  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSRGPSREKRPRDQTEAVDVLPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDATARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E GARG LP S AD VDDPEARM GTS+   RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDATARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTIKDELLNAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+K ELL A  EV+IL+AEV+ K +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTIKDELLNAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDDELKHATAELETAKERLSNG------------------------------GIVSDMPDLRIDLSGLKKKYAEQWASGPGGTPGPQALVDKYVR
        Q LE KD  +   T EL+  KERL+NG                              GI +DMP L+IDL+GLKKKY+E+WASGP GTP PQ+LVDKYVR
Subjt:  QALEAKDDELKHATAELETAKERLSNG------------------------------GIVSDMPDLRIDLSGLKKKYAEQWASGPGGTPGPQALVDKYVR

Query:  DLDSDYSDLEED--------QVGTTQEGAP
        +LDSDYSD+EE+        +VGTTQE  P
Subjt:  DLDSDYSDLEED--------QVGTTQEGAP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G38190.1 INVOLVED IN: biological_process unknown6.9e-0523.37Show/hide
Query:  RLESELEEIENFRLSDDGEDSDASTSGQGLEY------PSRIHEHYLGSLRRGFAIPENILLRIPEEGGRADNPPEGWVTLYFKMF-EYGLRLPLHPFVQ
        R +++ +   N    D+ E +D + SG+  +       P+      +G       +P  + +RIP +  R  + PEG++ L+   F E GLR P+  F+ 
Subjt:  RLESELEEIENFRLSDDGEDSDASTSGQGLEY------PSRIHEHYLGSLRRGFAIPENILLRIPEEGGRADNPPEGWVTLYFKMF-EYGLRLPLHPFVQ

Query:  EFLFRTGLAPAQ--VAPNGEEAEL----------LDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
         F     +A +Q  VA     A L          L V+ +       ++  K G+ Y+ + +G   +   P+  + W+  +FYA
Subjt:  EFLFRTGLAPAQ--VAPNGEEAEL----------LDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCACTTGCAGGTCAGAGTTGGGAGCAAATTCCCGAAGAAGCAGCGCCACAGCACCAGTACACAGCCATGGCGCCTCCAGACAAAGCTTGGGCCTCTTTCCGTGATGA
CAGCGCCGCAGCGCTGTCTTGTAGCGCCATGGCGCAACGAACTTGCTCGTCCGTCACAATTTTGGGACAGCGCCATGGGATGAACAAAGAACTATATGCTCTCGAAAGGG
ATAAGAGCGGATTGTTGGCGATGACTTGGTCTATTGTTTCTGATCTACCTTGTTTATGCAAGGATATGCACAACAGTGTATTTCAGATTGCAGCTCGAACTCGGCCTCCG
GACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCCAGTTTAGTTCGAGGTCAGAAAATCATCATACC
TGATCGCGGAGTCGGACCTCGGCCAGGTTCACCCCGGCTCTCATACTTAGCATCTGTCAACGCTAGTGGTGGTGATCTCGGCGGTCCGAGCTGGGGCATGACTCATGGGT
CACCTTGGGGCACCAATGGCTGTCCTCCACGTGTCCTGGGTATTCTTTCCCCCAAACATTGGCCGCCTCTCTGTCTGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCA
CTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTTGAGGAGATAGAAAACTT
TAGGCTCTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACATGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCG
CTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAAGGGGGGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTTAAAATGTTTGAGTACGGCCTCAGA
CTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGAACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGTGAAGAGGCCGAGCTGTTGGACGTAGACCAGCTCCT
CGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGGGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGG
TGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTCCCCGAGCTTACGCAA
GCCTCTTTCGACACGTTGAAATATTACAAGGAGCATTTTCTGAGGGGTAGGAAGGTCGGAACTTTGGTGAACGACAAGCTGCTGCTTGAGTCCGGGGTGCTAGACTACAG
CCCGGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAATTAGCCATGGTTTGCGGGTTTGCGAGTAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTG
AGGCCGCCCAGAGTTCAAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTAGGGGTCCCTCGAGGGAGAAG
CGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTTGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACGACCTCCCCCTT
GGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGTGGGACGTCCGATGCGACGGCACGGTTCAGAGTTGAGC
CGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTAAGTGACCCAGGGTCCGTTCTGCAG
AGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGA
GTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATTAAGGATGAGCTGCTGAATGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTAT
TGAAGAAGGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCTTGAAGGAGAAGGACGACATGCTC
CAGGCGCTTGAAGCGAAGGATGATGAGCTGAAGCACGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGTCTCAGCAATGGAGGCATTGTTTCCGACATGCCTGACCTTCG
GATCGATCTCAGTGGTCTGAAAAAGAAGTATGCCGAGCAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGTCAGAGATCTGGACT
CTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCACCACTCAGGAGGGCGCTCCTCAGGCAGGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCACTTGCAGGTCAGAGTTGGGAGCAAATTCCCGAAGAAGCAGCGCCACAGCACCAGTACACAGCCATGGCGCCTCCAGACAAAGCTTGGGCCTCTTTCCGTGATGA
CAGCGCCGCAGCGCTGTCTTGTAGCGCCATGGCGCAACGAACTTGCTCGTCCGTCACAATTTTGGGACAGCGCCATGGGATGAACAAAGAACTATATGCTCTCGAAAGGG
ATAAGAGCGGATTGTTGGCGATGACTTGGTCTATTGTTTCTGATCTACCTTGTTTATGCAAGGATATGCACAACAGTGTATTTCAGATTGCAGCTCGAACTCGGCCTCCG
GACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTCAGTATAGGTCGGATTCCCAGTTTAGTTCGAGGTCAGAAAATCATCATACC
TGATCGCGGAGTCGGACCTCGGCCAGGTTCACCCCGGCTCTCATACTTAGCATCTGTCAACGCTAGTGGTGGTGATCTCGGCGGTCCGAGCTGGGGCATGACTCATGGGT
CACCTTGGGGCACCAATGGCTGTCCTCCACGTGTCCTGGGTATTCTTTCCCCCAAACATTGGCCGCCTCTCTGTCTGCTCGAACCCTTGGTAGGTCGGTCTCTTCCCTCA
CTTTCTCTTTCGAACGTAGTTGCCATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTTGAGGAGATAGAAAACTT
TAGGCTCTCCGATGACGGGGAGGATAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACATGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCG
CTATCCCTGAGAACATCCTCCTTAGGATTCCGGAGGAAGGGGGGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTTAAAATGTTTGAGTACGGCCTCAGA
CTTCCCCTTCACCCTTTTGTCCAAGAATTTCTCTTCCGAACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGTGAAGAGGCCGAGCTGTTGGACGTAGACCAGCTCCT
CGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGGGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGG
TGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTCCCCGAGCTTACGCAA
GCCTCTTTCGACACGTTGAAATATTACAAGGAGCATTTTCTGAGGGGTAGGAAGGTCGGAACTTTGGTGAACGACAAGCTGCTGCTTGAGTCCGGGGTGCTAGACTACAG
CCCGGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAATTAGCCATGGTTTGCGGGTTTGCGAGTAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTG
AGGCCGCCCAGAGTTCAAAACCTGCCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTAGGGGTCCCTCGAGGGAGAAG
CGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTTGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACGACCTCCCCCTT
GGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGTGGGACGTCCGATGCGACGGCACGGTTCAGAGTTGAGC
CGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTAAGTGACCCAGGGTCCGTTCTGCAG
AGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGA
GTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATTAAGGATGAGCTGCTGAATGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTAT
TGAAGAAGGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCTTGAAGGAGAAGGACGACATGCTC
CAGGCGCTTGAAGCGAAGGATGATGAGCTGAAGCACGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGTCTCAGCAATGGAGGCATTGTTTCCGACATGCCTGACCTTCG
GATCGATCTCAGTGGTCTGAAAAAGAAGTATGCCGAGCAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGTCAGAGATCTGGACT
CTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCACCACTCAGGAGGGCGCTCCTCAGGCAGGCTAA
Protein sequenceShow/hide protein sequence
MALAGQSWEQIPEEAAPQHQYTAMAPPDKAWASFRDDSAAALSCSAMAQRTCSSVTILGQRHGMNKELYALERDKSGLLAMTWSIVSDLPCLCKDMHNSVFQIAARTRPP
DRSEYLGGPAQKGEHSDDQVSIGRIPSLVRGQKIIIPDRGVGPRPGSPRLSYLASVNASGGDLGGPSWGMTHGSPWGTNGCPPRVLGILSPKHWPPLCLLEPLVGRSLPS
LSLSNVVAMSSSFSSNLGSDEDLARRLESELEEIENFRLSDDGEDSDASTSGQGLEYPSRIHEHYLGSLRRGFAIPENILLRIPEEGGRADNPPEGWVTLYFKMFEYGLR
LPLHPFVQEFLFRTGLAPAQVAPNGEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRFGNLVPELTQ
ASFDTLKYYKEHFLRGRKVGTLVNDKLLLESGVLDYSPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSRGPSREK
RPRDQTEAVDVLPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDATARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQ
RTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTIKDELLNAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML
QALEAKDDELKHATAELETAKERLSNGGIVSDMPDLRIDLSGLKKKYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAG