CuGenDBv2

Moc07g21940 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g21940
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: myosin heavy chain-related
Genome location: chr7:16096800..16102243
RNA-Seq Expression: Moc07g21940
Synteny: Moc07g21940
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
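
The genome location string in the record above can be split into a sequence name and coordinates with a few lines of code. The following is a minimal sketch, not part of CuGenDB: the names `GenomicInterval` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF3), which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomicInterval(NamedTuple):
    """A named region on a chromosome or scaffold (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, so both ends count.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomicInterval:
    """Parse a 'seqid:start..end' string such as 'chr7:16096800..16102243'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomicInterval(seqid, start, end)


interval = parse_location("chr7:16096800..16102243")
print(interval.seqid, interval.start, interval.end, interval.length)
```

Under the inclusive-coordinate assumption the gene spans 5444 bp on chr7.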


Homology
GenBank top hits (e value, % identity, alignment)
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia] (e value: 3.2e-98; % identity: 48.72)
Query:  RRKKKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRIELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE-------
        +R+KKKK  S  EVGAC VLPA FADRVDDP ARMGGTSDVTARFRIE SSSGVRDQVSRISAASLDRCLRRASKFVS PGSVL R IDYAAE       
Subjt:  RRKKKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRIELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE-------

Query:  ------AELDRREVLAAMEKEEFSAALEAASSTMKDELLRAHSEADILKVEV------------------------------------------------
              AELD REVLAA EKEEFSAALEAASSTMKDELL+AHSE + LK EV                                                
Subjt:  ------AELDRREVLAAMEKEEFSAALEAASSTMKDELLRAHSEADILKVEV------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------EAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATTELKAAK
                                            EAKAELLK+E++R KA LRAAHAITKGLEKEKFQLLKEKDDMLQALE KD  + R   ELKA K
Subjt:  ------------------------------------EAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATTELKAAK

Query:  ERLGNEVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGITSDMPDLHIDLSSLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED--------Q
        ERL N  LLE +FRQHPDFDGFAKDFSDAGFKF MKGI +D+P L +DL  LKKRYAE+WASGPNGT GP +LV+KYVRDLDSDYSDL+ED        +
Subjt:  ERLGNEVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGITSDMPDLHIDLSSLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED--------Q

Query:  VGTTQEG
        VGTTQEG
Subjt:  VGTTQEG

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] (e value: 1.6e-110; % identity: 80)
Query:  GTSDVTARFRIELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE-------------AELDRREVLAAMEKEEFSAALEAASSTMKD
        G   + A+ RIE SSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAE             AELD REVLAA EKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE-------------AELDRREVLAAMEKEEFSAALEAASSTMKD

Query:  ELLRAHSEADILKVEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATTELKAAKERLGNEVLLEESFRQHPDFD
        ELL+AHSE + LK EVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAKD+EL+ AT EL+ AKERL N VLLEE+FRQHPDFD
Subjt:  ELLRAHSEADILKVEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATTELKAAKERLGNEVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFFMKGITSDMPDLHIDLSSLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEEDQVGTTQEGAQ-TGS
        GFAKDFSDAGFKF MKGI SDMPDL IDLS LK+RYAE+WASGP GTPGPQALV++YVRDLDSDYSD EEDQVG+TQEGA  TGS
Subjt:  GFAKDFSDAGFKFFMKGITSDMPDLHIDLSSLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEEDQVGTTQEGAQ-TGS

XP_022156120.1 uncharacterized protein LOC111023084 [Momordica charantia] (e value: 1.6e-94; % identity: 100)
Query:  MLIFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLP
        MLIFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLP
Subjt:  MLIFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLP

Query:  RVELISEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQ
        RVELISEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQ
Subjt:  RVELISEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQ

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] (e value: 1.3e-115; % identity: 77.74)
Query:  DEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGPLRKGFAIPENILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLRLHPFVQE
        + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLG LR+GFAIPENILLR+PEEGERADNPPEGWVTLY KMFEYGLRL LHPFVQE
Subjt:  DEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGPLRKGFAIPENILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLRLHPFVQE

Query:  FLFRTWLAPAQVAPNGWGVIFALAILFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRYYMCARKGAEGIVKGPTSIKGWVRKWFYASGEWLAKDESG
        FLFRT LAPAQVAPNGWGVIFALAILFWLRARD+EEAEL DVDQLLACFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDESG
Subjt:  FLFRTWLAPAQVAPNGWGVIFALAILFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRYYMCARKGAEGIVKGPTSIKGWVRKWFYASGEWLAKDESG

Query:  RSFFDVPTRSNSDFILCSINPASSRAYSSLIRHAEILQGALSEGQEGRNLGD--RQAAARVRVVRDYNPAVRPIETSRPNSEL
        RSFFDVPTR  +   L SI P      +S     + L+        GR +G            + DYNPAVRPIE+SRPNSEL
Subjt:  RSFFDVPTRSNSDFILCSINPASSRAYSSLIRHAEILQGALSEGQEGRNLGD--RQAAARVRVVRDYNPAVRPIETSRPNSEL

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] (e value: 1.7e-136; % identity: 56.4)
Query:  MCARKGAEGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRSNSDFILCSINPASSRAYSSLIRHAEILQGALSEGQEGRNLGDRQAAARV---
        MCARKG  GIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTR   + +   + P  ++A    ++H            +     DR+    V   
Subjt:  MCARKGAEGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRSNSDFILCSINPASSRAYSSLIRHAEILQGALSEGQEGRNLGDRQAAARV---

Query:  ----RVVRDYNPAVRPIETSRPNSELDL--------------------------RVTASLRRSNPSDRAG--------VFWRSFAG-----EALRDQTEA
              + DYNP VR IE SRPNSEL +                           VT ++ R+     +G        V     +G     +  R+++EA
Subjt:  ----RVVRDYNPAVRPIETSRPNSELDL--------------------------RVTASLRRSNPSDRAG--------VFWRSFAG-----EALRDQTEA

Query:  VDVSPLGEEVGEAAPLKRRKKKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRIELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSV
        +DVSPL E  GE +PL+RR+KKKKT+S  E GA G LP S AD VDDPEARM GTS+V  RF +E SSSGV+DQVSRISA  LDR LRRASKFVSDPGSV
Subjt:  VDVSPLGEEVGEAAPLKRRKKKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRIELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSV

Query:  LQRTIDYAAE-------------AELDRREVLAAMEKEEFSAALEAASSTMKDELLRAHSEADILKVEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKE
        LQRTID  AE             AELD RE LAA E+E   AALEAA +T+K ELL+A  E DIL+ EV+AK +LLKKE ++ KA LRAAHAITKGLEKE
Subjt:  LQRTIDYAAE-------------AELDRREVLAAMEKEEFSAALEAASSTMKDELLRAHSEADILKVEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKE

Query:  KFQLLKEKDDMLQALEAKDEELKRATTELKAAKERLGNEVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGITSDMPDLHIDLSSLKKRYAEQWASGPNGT
        KFQLLKEKDD+ Q LE KD  + R TTELK  KERL N  LLEESFRQHPDFDGFAKDFSDAGFKF MKGI +DMP L IDL+ LKK+Y+E+WASGPNGT
Subjt:  KFQLLKEKDDMLQALEAKDEELKRATTELKAAKERLGNEVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGITSDMPDLHIDLSSLKKRYAEQWASGPNGT

Query:  PGPQALVNKYVRDLDSDYSDLEED--------QVGTTQE
        P PQ+LV+KYVR+LDSDYSD+EE+        +VGTTQE
Subjt:  PGPQALVNKYVRDLDSDYSDLEED--------QVGTTQE

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CLV1 uncharacterized protein LOC111012467 (e value: 1.5e-98; % identity: 48.72)
Query:  RRKKKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRIELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE-------
        +R+KKKK  S  EVGAC VLPA FADRVDDP ARMGGTSDVTARFRIE SSSGVRDQVSRISAASLDRCLRRASKFVS PGSVL R IDYAAE       
Subjt:  RRKKKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRIELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE-------

Query:  ------AELDRREVLAAMEKEEFSAALEAASSTMKDELLRAHSEADILKVEV------------------------------------------------
              AELD REVLAA EKEEFSAALEAASSTMKDELL+AHSE + LK EV                                                
Subjt:  ------AELDRREVLAAMEKEEFSAALEAASSTMKDELLRAHSEADILKVEV------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------EAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATTELKAAK
                                            EAKAELLK+E++R KA LRAAHAITKGLEKEKFQLLKEKDDMLQALE KD  + R   ELKA K
Subjt:  ------------------------------------EAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATTELKAAK

Query:  ERLGNEVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGITSDMPDLHIDLSSLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED--------Q
        ERL N  LLE +FRQHPDFDGFAKDFSDAGFKF MKGI +D+P L +DL  LKKRYAE+WASGPNGT GP +LV+KYVRDLDSDYSDL+ED        +
Subjt:  ERLGNEVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGITSDMPDLHIDLSSLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEED--------Q

Query:  VGTTQEG
        VGTTQEG
Subjt:  VGTTQEG

A0A6J1D971 uncharacterized protein LOC111018538 (e value: 7.9e-111; % identity: 80)
Query:  GTSDVTARFRIELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE-------------AELDRREVLAAMEKEEFSAALEAASSTMKD
        G   + A+ RIE SSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAE             AELD REVLAA EKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE-------------AELDRREVLAAMEKEEFSAALEAASSTMKD

Query:  ELLRAHSEADILKVEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATTELKAAKERLGNEVLLEESFRQHPDFD
        ELL+AHSE + LK EVE++AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAKD+EL+ AT EL+ AKERL N VLLEE+FRQHPDFD
Subjt:  ELLRAHSEADILKVEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATTELKAAKERLGNEVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFFMKGITSDMPDLHIDLSSLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEEDQVGTTQEGAQ-TGS
        GFAKDFSDAGFKF MKGI SDMPDL IDLS LK+RYAE+WASGP GTPGPQALV++YVRDLDSDYSD EEDQVG+TQEGA  TGS
Subjt:  GFAKDFSDAGFKFFMKGITSDMPDLHIDLSSLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEEDQVGTTQEGAQ-TGS

A0A6J1DTY7 uncharacterized protein LOC111023084 (e value: 8.0e-95; % identity: 100)
Query:  MLIFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLP
        MLIFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLP
Subjt:  MLIFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLP

Query:  RVELISEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQ
        RVELISEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQ
Subjt:  RVELISEGSCLTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQ

A0A6J1DXS5 uncharacterized protein LOC111025502 (e value: 6.3e-116; % identity: 77.74)
Query:  DEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGPLRKGFAIPENILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLRLHPFVQE
        + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLG LR+GFAIPENILLR+PEEGERADNPPEGWVTLY KMFEYGLRL LHPFVQE
Subjt:  DEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGPLRKGFAIPENILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLRLHPFVQE

Query:  FLFRTWLAPAQVAPNGWGVIFALAILFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRYYMCARKGAEGIVKGPTSIKGWVRKWFYASGEWLAKDESG
        FLFRT LAPAQVAPNGWGVIFALAILFWLRARD+EEAEL DVDQLLACFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDESG
Subjt:  FLFRTWLAPAQVAPNGWGVIFALAILFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRYYMCARKGAEGIVKGPTSIKGWVRKWFYASGEWLAKDESG

Query:  RSFFDVPTRSNSDFILCSINPASSRAYSSLIRHAEILQGALSEGQEGRNLGD--RQAAARVRVVRDYNPAVRPIETSRPNSEL
        RSFFDVPTR  +   L SI P      +S     + L+        GR +G            + DYNPAVRPIE+SRPNSEL
Subjt:  RSFFDVPTRSNSDFILCSINPASSRAYSSLIRHAEILQGALSEGQEGRNLGD--RQAAARVRVVRDYNPAVRPIETSRPNSEL

A0A6J1DZB3 uncharacterized protein LOC111025665 (e value: 8.4e-137; % identity: 56.4)
Query:  MCARKGAEGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRSNSDFILCSINPASSRAYSSLIRHAEILQGALSEGQEGRNLGDRQAAARV---
        MCARKG  GIVKGPTSIKGWV KWF+ASGEWLAKDESGR+FFDVPTR   + +   + P  ++A    ++H            +     DR+    V   
Subjt:  MCARKGAEGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRSNSDFILCSINPASSRAYSSLIRHAEILQGALSEGQEGRNLGDRQAAARV---

Query:  ----RVVRDYNPAVRPIETSRPNSELDL--------------------------RVTASLRRSNPSDRAG--------VFWRSFAG-----EALRDQTEA
              + DYNP VR IE SRPNSEL +                           VT ++ R+     +G        V     +G     +  R+++EA
Subjt:  ----RVVRDYNPAVRPIETSRPNSELDL--------------------------RVTASLRRSNPSDRAG--------VFWRSFAG-----EALRDQTEA

Query:  VDVSPLGEEVGEAAPLKRRKKKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRIELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSV
        +DVSPL E  GE +PL+RR+KKKKT+S  E GA G LP S AD VDDPEARM GTS+V  RF +E SSSGV+DQVSRISA  LDR LRRASKFVSDPGSV
Subjt:  VDVSPLGEEVGEAAPLKRRKKKKKTTSPLEVGACGVLPASFADRVDDPEARMGGTSDVTARFRIELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSV

Query:  LQRTIDYAAE-------------AELDRREVLAAMEKEEFSAALEAASSTMKDELLRAHSEADILKVEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKE
        LQRTID  AE             AELD RE LAA E+E   AALEAA +T+K ELL+A  E DIL+ EV+AK +LLKKE ++ KA LRAAHAITKGLEKE
Subjt:  LQRTIDYAAE-------------AELDRREVLAAMEKEEFSAALEAASSTMKDELLRAHSEADILKVEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKE

Query:  KFQLLKEKDDMLQALEAKDEELKRATTELKAAKERLGNEVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGITSDMPDLHIDLSSLKKRYAEQWASGPNGT
        KFQLLKEKDD+ Q LE KD  + R TTELK  KERL N  LLEESFRQHPDFDGFAKDFSDAGFKF MKGI +DMP L IDL+ LKK+Y+E+WASGPNGT
Subjt:  KFQLLKEKDDMLQALEAKDEELKRATTELKAAKERLGNEVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGITSDMPDLHIDLSSLKKRYAEQWASGPNGT

Query:  PGPQALVNKYVRDLDSDYSDLEED--------QVGTTQE
        P PQ+LV+KYVR+LDSDYSD+EE+        +VGTTQE
Subjt:  PGPQALVNKYVRDLDSDYSDLEED--------QVGTTQE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G32010.1 myosin heavy chain-related (e value: 4.7e-07; % identity: 23.68)
Query:  RLESELEEIENFRFSDDGEDSDASTSGQGLEY------PSRIPEHYLGPLRKGFAIPENILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLRLHPFVQ
        R+ ++ +   N    D+ E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR  +  F+ 
Subjt:  RLESELEEIENFRFSDDGEDSDASTSGQGLEY------PSRIPEHYLGPLRKGFAIPENILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLRLHPFVQ

Query:  EFLFRTWLAPAQVAPNGWGVIFALAILFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRYYMCARKGAEGIVKGPTSIKGWVRKWFYA
         F     +A +Q+       I   A L  L AR       L V+ +       ++  K G++Y+ + +G + +  GP+  + W+  +FYA
Subjt:  EFLFRTWLAPAQVAPNGWGVIFALAILFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRYYMCARKGAEGIVKGPTSIKGWVRKWFYA

AT2G15420.1 myosin heavy chain-related (e value: 1.8e-06; % identity: 32.09)
Query:  PENILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLRLHPFVQEFLFRTWLAPAQVAPNGWGVIFALAILFWLRARDNEEAELLDVDQLLACFEAKRIA
        P  I L  P+  +R   PPEG++ LY   F   GL   L  F+ E+  R  +A +Q+          LAIL        E    +D D         R+ 
Subjt:  PENILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLRLHPFVQEFLFRTWLAPAQVAPNGWGVIFALAILFWLRARDNEEAELLDVDQLLACFEAKRIA

Query:  KKPGRYYMCARKGAEGIVKGPTS-IKGWVRKWFY
        + PG YY  A K    IV G  S I GW R++F+
Subjt:  KKPGRYYMCARKGAEGIVKGPTS-IKGWVRKWFY

AT5G38190.1 INVOLVED IN: biological_process unknown (e value: 4.0e-06; % identity: 24.58)
Query:  RFSDD-GEDSDASTSGQGLEY------PSRIPEHYLGPLRKGFAIPENILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLRLHPFVQEFLFRTWLAPA
        R++DD  E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR  +  F+  F     +A +
Subjt:  RFSDD-GEDSDASTSGQGLEY------PSRIPEHYLGPLRKGFAIPENILLRIPEEGERADNPPEGWVTLYLKMF-EYGLRLRLHPFVQEFLFRTWLAPA

Query:  QVAPNGWGVIFALAILFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRYYMCARKGAEGIVKGPTSIKGWVRKWFYA
        Q+       I   A L  L AR       L V+ +       ++  K G++Y+ + +G + +   P+  + W+  +FYA
Subjt:  QVAPNGWGVIFALAILFWLRARDNEEAELLDVDQLLACFEAKRIAKKPGRYYMCARKGAEGIVKGPTSIKGWVRKWFYA


Sequences
CDS sequence
ATGTTGATCTTCAAGCTCGAGCGGATAGACGGGTTCGTGGAGGCGACGGCAGTGCTGGCCGGAATATCGACTCGATGCCATTTCAAGTTGTCGGCGGCGATGTTCTCCCT
GTACGTGTACCACCATTCCCTCCGGTGCAACATAGCCCTCCAAATAATGCCTCCCTTCTTCACCACATACAATTTCTTCGACCAGAACCCCAACTCCTCATTCTACATCG
AAAACTTCTTCCGCGCTCTCCTCAACTTTCAGCGCAGCGGATGCTCTTCGCTGAGCTTCGCCCTCGACCACCACCTCCCCCGCGTGGAGCTGATCTCAGAAGGATCTTGC
CTCACGCAGCGCGTGATTGAATTGCCCCTCTCTCCTGCAGAGGAGAAGGCCTCCACAGAAATCGACTACTCAGTCTTTGTGTCCATTGATTTGCAAGACTTCAAGCCCGT
GGCAACCATGTTTGATCGCGCTCCTTATGTTCGCGTTACTTTGTCGCATTCGGGCGTGAGGTTTGCTTATGAAGACGAGGAGATTACTCTCACCGCACAGATGTGCCACA
CCAAGGAAAAACAGCATGGGTTGGATAGTGTTTATGCAAGAATATGCACAACAGTGTTTATTCTGATCGCAGCTCGAACTCGGTCTCCGGACCGATCTGAACACTTGGGC
GGACCTGCACAAAAAGGCGTACACTCCAACGATCAAGTCAGTATAGCTGTCCTCCACGTGTCCAGGGTATTTTCTCCCCCAAACATCGGCCCCCTCTCTGTCCGGTTCGA
TCTCGACCTGGCAGAGAAGTTCATTCGATTCGCTTCGGACACGTGGCGACTTCCTATTCGTGGAAAAATACAACCGTTGCGGTCATACCTTACGCTTCCTGAATTCTTGG
AGTTCGATCTGAAGGCAGCTCGAACCCTTGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGACGATGGGGAG
GATAGTGATGCTTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCCGAGCACTACCTCGGTCCCCTTCGTAAGGGGTTCGCTATCCCTGAAAACATCCTCCT
TAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACCTCAAAATGTTTGAGTACGGCCTCAGACTTCGCCTTCACCCTTTCGTAC
AAGAGTTTCTTTTCCGAACTTGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGATGGGGTGTCATTTTTGCTTTGGCCATCCTTTTTTGGTTACGAGCTCGGGACAATGAA
GAGGCCGAACTATTAGATGTTGATCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTACTATATGTGTGCAAGAAAAGGCGCAGAAGGTAT
AGTTAAGGGGCCGACCTCCATCAAAGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTA
GGTCTAACTCGGATTTTATCTTGTGCAGTATCAATCCGGCCAGTTCCCGAGCTTACTCAAGCCTCATTCGACACGCTGAAATATTACAAGGAGCACTTTCCGAGGGGCAG
GAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTCGAGTCCGGGTTGTTAGAGATTACAACCCCGCAGTTCGTCCCATTGAAACCTCAAGGCCGAACTCCGAACTAGA
TTTGCGAGTAACGGCCAGCCTCAGAAGATCCAACCCGAGTGATCGAGCTGGAGTCTTTTGGAGGTCCTTCGCGGGAGAAGCGCTAAGAGATCAGACCGAGGCGGTGGACG
TCTCGCCCTTGGGTGAGGAGGTGGGGGAGGCGGCCCCTCTGAAGCGGAGGAAGAAGAAGAAGAAAACCACCTCCCCCTTGGAGGTCGGAGCATGTGGGGTCCTGCCCGCG
AGCTTCGCAGACCGGGTGGACGATCCTGAAGCCAGGATGGGCGGGACGTCCGACGTGACAGCACGGTTCAGAATTGAACTATCAAGTTCTGGGGTGAGGGACCAGGTGTC
CCGCATATCAGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCGGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCCGAGC
TGGATAGGAGGGAAGTTCTGGCAGCGATGGAGAAGGAGGAATTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTAAGGGCTCACTCTGAGGCG
GACATTCTGAAGGTCGAGGTGGAGGCTAAGGCCGAGCTGTTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCCGCCCATGCCATCACCAAGGGCCTGGAGAA
GGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTTCAGGCGCTTGAAGCGAAGGACGAGGAACTGAAGCGTGCGACTACCGAGCTAAAGGCGGCGAAGGAGCGTC
TCGGCAACGAAGTCCTGCTGGAGGAGTCTTTCAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCTTCATGAAGGGCATTACT
TCCGACATGCCCGACCTTCATATCGATCTCAGTAGTCTGAAGAAGAGATATGCCGAGCAGTGGGCTTCTGGGCCTAACGGTACCCCTGGCCCCCAAGCGTTGGTGAATAA
GTATGTCAGAGATCTGGACTCTGACTACTCCGACCTCGAAGAGGACCAGGTCGGCACCACTCAAGAGGGCGCTCAAACAGGCTCTTAG
mRNA sequence
ATGTTGATCTTCAAGCTCGAGCGGATAGACGGGTTCGTGGAGGCGACGGCAGTGCTGGCCGGAATATCGACTCGATGCCATTTCAAGTTGTCGGCGGCGATGTTCTCCCT
GTACGTGTACCACCATTCCCTCCGGTGCAACATAGCCCTCCAAATAATGCCTCCCTTCTTCACCACATACAATTTCTTCGACCAGAACCCCAACTCCTCATTCTACATCG
AAAACTTCTTCCGCGCTCTCCTCAACTTTCAGCGCAGCGGATGCTCTTCGCTGAGCTTCGCCCTCGACCACCACCTCCCCCGCGTGGAGCTGATCTCAGAAGGATCTTGC
CTCACGCAGCGCGTGATTGAATTGCCCCTCTCTCCTGCAGAGGAGAAGGCCTCCACAGAAATCGACTACTCAGTCTTTGTGTCCATTGATTTGCAAGACTTCAAGCCCGT
GGCAACCATGTTTGATCGCGCTCCTTATGTTCGCGTTACTTTGTCGCATTCGGGCGTGAGGTTTGCTTATGAAGACGAGGAGATTACTCTCACCGCACAGATGTGCCACA
CCAAGGAAAAACAGCATGGGTTGGATAGTGTTTATGCAAGAATATGCACAACAGTGTTTATTCTGATCGCAGCTCGAACTCGGTCTCCGGACCGATCTGAACACTTGGGC
GGACCTGCACAAAAAGGCGTACACTCCAACGATCAAGTCAGTATAGCTGTCCTCCACGTGTCCAGGGTATTTTCTCCCCCAAACATCGGCCCCCTCTCTGTCCGGTTCGA
TCTCGACCTGGCAGAGAAGTTCATTCGATTCGCTTCGGACACGTGGCGACTTCCTATTCGTGGAAAAATACAACCGTTGCGGTCATACCTTACGCTTCCTGAATTCTTGG
AGTTCGATCTGAAGGCAGCTCGAACCCTTGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGACGATGGGGAG
GATAGTGATGCTTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCCGAGCACTACCTCGGTCCCCTTCGTAAGGGGTTCGCTATCCCTGAAAACATCCTCCT
TAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACCTCAAAATGTTTGAGTACGGCCTCAGACTTCGCCTTCACCCTTTCGTAC
AAGAGTTTCTTTTCCGAACTTGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGATGGGGTGTCATTTTTGCTTTGGCCATCCTTTTTTGGTTACGAGCTCGGGACAATGAA
GAGGCCGAACTATTAGATGTTGATCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTACTATATGTGTGCAAGAAAAGGCGCAGAAGGTAT
AGTTAAGGGGCCGACCTCCATCAAAGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGGTCGTTCCTTCTTTGACGTTCCCACTA
GGTCTAACTCGGATTTTATCTTGTGCAGTATCAATCCGGCCAGTTCCCGAGCTTACTCAAGCCTCATTCGACACGCTGAAATATTACAAGGAGCACTTTCCGAGGGGCAG
GAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTCGAGTCCGGGTTGTTAGAGATTACAACCCCGCAGTTCGTCCCATTGAAACCTCAAGGCCGAACTCCGAACTAGA
TTTGCGAGTAACGGCCAGCCTCAGAAGATCCAACCCGAGTGATCGAGCTGGAGTCTTTTGGAGGTCCTTCGCGGGAGAAGCGCTAAGAGATCAGACCGAGGCGGTGGACG
TCTCGCCCTTGGGTGAGGAGGTGGGGGAGGCGGCCCCTCTGAAGCGGAGGAAGAAGAAGAAGAAAACCACCTCCCCCTTGGAGGTCGGAGCATGTGGGGTCCTGCCCGCG
AGCTTCGCAGACCGGGTGGACGATCCTGAAGCCAGGATGGGCGGGACGTCCGACGTGACAGCACGGTTCAGAATTGAACTATCAAGTTCTGGGGTGAGGGACCAGGTGTC
CCGCATATCAGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGACCCGGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCCGAGC
TGGATAGGAGGGAAGTTCTGGCAGCGATGGAGAAGGAGGAATTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTAAGGGCTCACTCTGAGGCG
GACATTCTGAAGGTCGAGGTGGAGGCTAAGGCCGAGCTGTTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCCGCCCATGCCATCACCAAGGGCCTGGAGAA
GGAGAAGTTCCAACTCCTCAAGGAGAAGGACGACATGCTTCAGGCGCTTGAAGCGAAGGACGAGGAACTGAAGCGTGCGACTACCGAGCTAAAGGCGGCGAAGGAGCGTC
TCGGCAACGAAGTCCTGCTGGAGGAGTCTTTCAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCTTCATGAAGGGCATTACT
TCCGACATGCCCGACCTTCATATCGATCTCAGTAGTCTGAAGAAGAGATATGCCGAGCAGTGGGCTTCTGGGCCTAACGGTACCCCTGGCCCCCAAGCGTTGGTGAATAA
GTATGTCAGAGATCTGGACTCTGACTACTCCGACCTCGAAGAGGACCAGGTCGGCACCACTCAAGAGGGCGCTCAAACAGGCTCTTAG
Protein sequence
MLIFKLERIDGFVEATAVLAGISTRCHFKLSAAMFSLYVYHHSLRCNIALQIMPPFFTTYNFFDQNPNSSFYIENFFRALLNFQRSGCSSLSFALDHHLPRVELISEGSC
LTQRVIELPLSPAEEKASTEIDYSVFVSIDLQDFKPVATMFDRAPYVRVTLSHSGVRFAYEDEEITLTAQMCHTKEKQHGLDSVYARICTTVFILIAARTRSPDRSEHLG
GPAQKGVHSNDQVSIAVLHVSRVFSPPNIGPLSVRFDLDLAEKFIRFASDTWRLPIRGKIQPLRSYLTLPEFLEFDLKAARTLGSDEDLARRLESELEEIENFRFSDDGE
DSDASTSGQGLEYPSRIPEHYLGPLRKGFAIPENILLRIPEEGERADNPPEGWVTLYLKMFEYGLRLRLHPFVQEFLFRTWLAPAQVAPNGWGVIFALAILFWLRARDNE
EAELLDVDQLLACFEAKRIAKKPGRYYMCARKGAEGIVKGPTSIKGWVRKWFYASGEWLAKDESGRSFFDVPTRSNSDFILCSINPASSRAYSSLIRHAEILQGALSEGQ
EGRNLGDRQAAARVRVVRDYNPAVRPIETSRPNSELDLRVTASLRRSNPSDRAGVFWRSFAGEALRDQTEAVDVSPLGEEVGEAAPLKRRKKKKKTTSPLEVGACGVLPA
SFADRVDDPEARMGGTSDVTARFRIELSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAELDRREVLAAMEKEEFSAALEAASSTMKDELLRAHSEA
DILKVEVEAKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKDEELKRATTELKAAKERLGNEVLLEESFRQHPDFDGFAKDFSDAGFKFFMKGIT
SDMPDLHIDLSSLKKRYAEQWASGPNGTPGPQALVNKYVRDLDSDYSDLEEDQVGTTQEGAQTGS
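
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; the function name `translate` and the choice to report unknown codons as `X` are illustrative assumptions, not part of the CuGenDB record. The example uses only the first 33 nucleotides of the CDS and the first 11 residues of the protein as listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), written in the usual
# compact form: codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    # Remove line breaks and spaces, and normalise case.
    sequence = "".join(cds.split()).upper()
    if len(sequence) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for index in range(0, len(sequence), 3):
        # Codons with ambiguous bases are reported as 'X' (an assumption).
        amino_acid = CODON_TABLE.get(sequence[index:index + 3], "X")
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 33 nt of the CDS and first 11 residues of the protein listed above.
cds_prefix = "ATGTTGATCTTCAAGCTCGAGCGGATAGACGGG"
protein_prefix = "MLIFKLERIDG"
print(translate(cds_prefix) == protein_prefix)
```

Passing the full CDS (with line breaks left in) to `translate` and comparing the result with the full protein sequence extends the same check to the whole record.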