; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g27680 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g27680
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionINVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: my s in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).
Genome locationchr4:20417153..20419409
RNA-Seq ExpressionMoc04g27680
SyntenyMoc04g27680
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]7.8e-10785.14Show/hide
Query:  MCARKGAGGIVKGSTSIKGWVRKWFYASGECLQRTSRVVPSLTFPLVSIRPVLELTQASFDTLKYYKEHFPRGTKVGTLVTDKLLLESGLLDYNPAVRPI
        MCARKGA GIVKG TSIKGWVRKWFYASGE L +            V+IRPV ELTQASFDTLKYYKEHFPRG KVGTLVTDKLLLESGLLDYNPAVRPI
Subjt:  MCARKGAGGIVKGSTSIKGWVRKWFYASGECLQRTSRVVPSLTFPLVSIRPVLELTQASFDTLKYYKEHFPRGTKVGTLVTDKLLLESGLLDYNPAVRPI

Query:  ESSRPNSELVMVCGFASNVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRHRVQTEAADVSSLGEEVREEAPLKRRRKKKKT
        ESSRPNSEL MVCGFASNVKRKSKG+AHALEAAQSS+P TPAV GPASEDPAPVIELESS GPSREKR R QTEA DVS LGEEVREE PLKRRRKKKKT
Subjt:  ESSRPNSELVMVCGFASNVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRHRVQTEAADVSSLGEEVREEAPLKRRRKKKKT

Query:  TSPLKVGARGALPASFADRVDHPEARMGGTSDVTARFRVEPLSSGVRDQ
        TSPL+VGARG LPASFADRVD PEARMGGT DVT RFRVEP SSGVRDQ
Subjt:  TSPLKVGARGALPASFADRVDHPEARMGGTSDVTARFRVEPLSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]1.1e-11683.88Show/hide
Query:  MFEYGLRLPLHPFVQEFLFETGLAPAQVAPNGWGVIFALAIFFWLRARDSEEAELLDVDQLLTCFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVR
        MFEYGLRLPLHPFVQEFLF TGLAPAQVAPNGWGVIFALAI FWLRARDSEEAELLDVDQLL CFEAKRIAKKPGRFYMCARKGAGGIVKG TSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFETGLAPAQVAPNGWGVIFALAIFFWLRARDSEEAELLDVDQLLTCFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVR

Query:  KWFYASGECLQ-----RTSRVVPSLTFPLVSIRPVLELTQASFDTLKYYKEHFPRGTKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFAS
        KWFYASGE L      R+   VP+    LVSIRPV ELTQASFDTLKYYKE FPRG KVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS L MVC FAS
Subjt:  KWFYASGECLQ-----RTSRVVPSLTFPLVSIRPVLELTQASFDTLKYYKEHFPRGTKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRHR-------VQTEAADVSSLGE
         VKRKSKGRAHALEAAQSS+P TPAV GPASEDPAPVIELESS GPSREKR R        QTEAADV  LGE
Subjt:  NVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRHR-------VQTEAADVSSLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]7.0e-11684.87Show/hide
Query:  GTSDVTARFRVEPLSSGVRDQMSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDRREALAAREKEEFSTALEAASSTMKD
        G   + A+ R+EP SSGVRDQ+SRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELD RE LAAREKEEFS ALE ASSTMKD
Subjt:  GTSDVTARFRVEPLSSGVRDQMSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDRREALAAREKEEFSTALEAASSTMKD

Query:  ELLKAHSKVEILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLNSGALLEESFRQHPDFD
        ELLKAHS+VE LKAEVE++ ELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+HATAELE  KERL++G LLEE+FRQHPDFD
Subjt:  ELLKAHSKVEILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLNSGALLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQINLGGLKKRYAEQWASGPSGTPGFQALVDKYVRDLDSNYSDLEED
        GFAKDFSDAGFKFLMKGIASDMPDLQI+L GLK+RYAE+WASGP GTPG QALVD+YVRDLDS+YSD EED
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQINLGGLKKRYAEQWASGPSGTPGFQALVDKYVRDLDSNYSDLEED

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.7e-16587.32Show/hide
Query:  MSSSFSSDLGSDEDLARWLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SS+L  + DLAR LES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARWLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFETGLAPAQVAPNGWGVIFALAIFFWLRARDSEEAELLDVDQLLTCFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVRKWFYA
        LRLPLHPFVQEFLF TGLAPAQVAPNGWGVIFALAI FWLRARDSEEAEL DVDQLL CFEAKRIAKKPGRFYMCARKGAGGIVKG TSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFETGLAPAQVAPNGWGVIFALAIFFWLRARDSEEAELLDVDQLLTCFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVRKWFYA

Query:  SGECLQ-----RTSRVVPSLTFPLVSIRPVLELTQASFDTLKYYKEHFPRGTKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFASNVKRK
        SGE L      R+   VP+    LVSIRPV ELTQASFDTLKYYKE FPRG KVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL MVCGFAS VKRK
Subjt:  SGECLQ-----RTSRVVPSLTFPLVSIRPVLELTQASFDTLKYYKEHFPRGTKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFASNVKRK

Query:  SKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRHRVQTEAAD
        SKGRAHALEAAQSS+PATPAV GPASEDPA VIELESS GPSREKR R QTEA D
Subjt:  SKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRHRVQTEAAD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]8.7e-17568.36Show/hide
Query:  MCARKGAGGIVKGSTSIKGWVRKWFYASGECLQ-----RTSRVVPSLTFPLVSIRPVLELTQASFDTLKYYKEHFPRGTKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKG TSIKGWV KWF+ASGE L      R    VP+    LVSI+ + EL QA+FDTLK+YK+HFPR  K+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGSTSIKGWVRKWFYASGECLQ-----RTSRVVPSLTFPLVSIRPVLELTQASFDTLKYYKEHFPRGTKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELVMVCGFASNVKRKSKGRAHALEAAQSSEPATPAV--------AGPASEDPAPVIELESSEGPSREKRHRVQTEAADVSSLGEEVRE
         VR IE+SRPNSEL MVCGF  +VKRKSKGRAHAL+    +EP TP V        +GP+S  P PVIEL+ S G S EKR R ++EA DVS L  EVR 
Subjt:  AVRPIESSRPNSELVMVCGFASNVKRKSKGRAHALEAAQSSEPATPAV--------AGPASEDPAPVIELESSEGPSREKRHRVQTEAADVSSLGEEVRE

Query:  EAPLKRRRKKKKTTSPLKVGARGALPASFADRVDHPEARMGGTSDVTARFRVEPLSSGVRDQMSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E+PL+RRRKKKKT+S  + GARG LP S AD VD PEARM GTS+V  RF +EP SSGV+DQ+SRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EAPLKRRRKKKKTTSPLKVGARGALPASFADRVDHPEARMGGTSDVTARFRVEPLSSGVRDQMSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDRREALAAREKEEFSTALEAASSTMKDELLKAHSKVEILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELD REALAA+E+E    ALEAA +T+K ELLKA  +V+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDRREALAAREKEEFSTALEAASSTMKDELLKAHSKVEILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML

Query:  QALEAKEEELKHATAELEMVKERLNSGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQINLGGLKKRYAEQWASGPSGTPGFQALVDKYVR
        Q LE K+  +   T EL+ +KERL +G LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQI+L GLKK+Y+E+WASGP+GTP  Q+LVDKYVR
Subjt:  QALEAKEEELKHATAELEMVKERLNSGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQINLGGLKKRYAEQWASGPSGTPGFQALVDKYVR

Query:  DLDSNYSDLEED
        +LDS+YSD+EE+
Subjt:  DLDSNYSDLEED

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092983.8e-10785.14Show/hide
Query:  MCARKGAGGIVKGSTSIKGWVRKWFYASGECLQRTSRVVPSLTFPLVSIRPVLELTQASFDTLKYYKEHFPRGTKVGTLVTDKLLLESGLLDYNPAVRPI
        MCARKGA GIVKG TSIKGWVRKWFYASGE L +            V+IRPV ELTQASFDTLKYYKEHFPRG KVGTLVTDKLLLESGLLDYNPAVRPI
Subjt:  MCARKGAGGIVKGSTSIKGWVRKWFYASGECLQRTSRVVPSLTFPLVSIRPVLELTQASFDTLKYYKEHFPRGTKVGTLVTDKLLLESGLLDYNPAVRPI

Query:  ESSRPNSELVMVCGFASNVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRHRVQTEAADVSSLGEEVREEAPLKRRRKKKKT
        ESSRPNSEL MVCGFASNVKRKSKG+AHALEAAQSS+P TPAV GPASEDPAPVIELESS GPSREKR R QTEA DVS LGEEVREE PLKRRRKKKKT
Subjt:  ESSRPNSELVMVCGFASNVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRHRVQTEAADVSSLGEEVREEAPLKRRRKKKKT

Query:  TSPLKVGARGALPASFADRVDHPEARMGGTSDVTARFRVEPLSSGVRDQ
        TSPL+VGARG LPASFADRVD PEARMGGT DVT RFRVEP SSGVRDQ
Subjt:  TSPLKVGARGALPASFADRVDHPEARMGGTSDVTARFRVEPLSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138265.3e-11783.88Show/hide
Query:  MFEYGLRLPLHPFVQEFLFETGLAPAQVAPNGWGVIFALAIFFWLRARDSEEAELLDVDQLLTCFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVR
        MFEYGLRLPLHPFVQEFLF TGLAPAQVAPNGWGVIFALAI FWLRARDSEEAELLDVDQLL CFEAKRIAKKPGRFYMCARKGAGGIVKG TSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFETGLAPAQVAPNGWGVIFALAIFFWLRARDSEEAELLDVDQLLTCFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVR

Query:  KWFYASGECLQ-----RTSRVVPSLTFPLVSIRPVLELTQASFDTLKYYKEHFPRGTKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFAS
        KWFYASGE L      R+   VP+    LVSIRPV ELTQASFDTLKYYKE FPRG KVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS L MVC FAS
Subjt:  KWFYASGECLQ-----RTSRVVPSLTFPLVSIRPVLELTQASFDTLKYYKEHFPRGTKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFAS

Query:  NVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRHR-------VQTEAADVSSLGE
         VKRKSKGRAHALEAAQSS+P TPAV GPASEDPAPVIELESS GPSREKR R        QTEAADV  LGE
Subjt:  NVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRHR-------VQTEAADVSSLGE

A0A6J1D971 uncharacterized protein LOC1110185383.4e-11684.87Show/hide
Query:  GTSDVTARFRVEPLSSGVRDQMSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDRREALAAREKEEFSTALEAASSTMKD
        G   + A+ R+EP SSGVRDQ+SRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELD RE LAAREKEEFS ALE ASSTMKD
Subjt:  GTSDVTARFRVEPLSSGVRDQMSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDRREALAAREKEEFSTALEAASSTMKD

Query:  ELLKAHSKVEILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLNSGALLEESFRQHPDFD
        ELLKAHS+VE LKAEVE++ ELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+HATAELE  KERL++G LLEE+FRQHPDFD
Subjt:  ELLKAHSKVEILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELEMVKERLNSGALLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQINLGGLKKRYAEQWASGPSGTPGFQALVDKYVRDLDSNYSDLEED
        GFAKDFSDAGFKFLMKGIASDMPDLQI+L GLK+RYAE+WASGP GTPG QALVD+YVRDLDS+YSD EED
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQINLGGLKKRYAEQWASGPSGTPGFQALVDKYVRDLDSNYSDLEED

A0A6J1DXS5 uncharacterized protein LOC1110255028.0e-16687.32Show/hide
Query:  MSSSFSSDLGSDEDLARWLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SS+L  + DLAR LES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARWLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFETGLAPAQVAPNGWGVIFALAIFFWLRARDSEEAELLDVDQLLTCFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVRKWFYA
        LRLPLHPFVQEFLF TGLAPAQVAPNGWGVIFALAI FWLRARDSEEAEL DVDQLL CFEAKRIAKKPGRFYMCARKGAGGIVKG TSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFETGLAPAQVAPNGWGVIFALAIFFWLRARDSEEAELLDVDQLLTCFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVRKWFYA

Query:  SGECLQ-----RTSRVVPSLTFPLVSIRPVLELTQASFDTLKYYKEHFPRGTKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFASNVKRK
        SGE L      R+   VP+    LVSIRPV ELTQASFDTLKYYKE FPRG KVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL MVCGFAS VKRK
Subjt:  SGECLQ-----RTSRVVPSLTFPLVSIRPVLELTQASFDTLKYYKEHFPRGTKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFASNVKRK

Query:  SKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRHRVQTEAAD
        SKGRAHALEAAQSS+PATPAV GPASEDPA VIELESS GPSREKR R QTEA D
Subjt:  SKGRAHALEAAQSSEPATPAVAGPASEDPAPVIELESSEGPSREKRHRVQTEAAD

A0A6J1DZB3 uncharacterized protein LOC1110256654.2e-17568.36Show/hide
Query:  MCARKGAGGIVKGSTSIKGWVRKWFYASGECLQ-----RTSRVVPSLTFPLVSIRPVLELTQASFDTLKYYKEHFPRGTKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKG TSIKGWV KWF+ASGE L      R    VP+    LVSI+ + EL QA+FDTLK+YK+HFPR  K+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGSTSIKGWVRKWFYASGECLQ-----RTSRVVPSLTFPLVSIRPVLELTQASFDTLKYYKEHFPRGTKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELVMVCGFASNVKRKSKGRAHALEAAQSSEPATPAV--------AGPASEDPAPVIELESSEGPSREKRHRVQTEAADVSSLGEEVRE
         VR IE+SRPNSEL MVCGF  +VKRKSKGRAHAL+    +EP TP V        +GP+S  P PVIEL+ S G S EKR R ++EA DVS L  EVR 
Subjt:  AVRPIESSRPNSELVMVCGFASNVKRKSKGRAHALEAAQSSEPATPAV--------AGPASEDPAPVIELESSEGPSREKRHRVQTEAADVSSLGEEVRE

Query:  EAPLKRRRKKKKTTSPLKVGARGALPASFADRVDHPEARMGGTSDVTARFRVEPLSSGVRDQMSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E+PL+RRRKKKKT+S  + GARG LP S AD VD PEARM GTS+V  RF +EP SSGV+DQ+SRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EAPLKRRRKKKKTTSPLKVGARGALPASFADRVDHPEARMGGTSDVTARFRVEPLSSGVRDQMSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDRREALAAREKEEFSTALEAASSTMKDELLKAHSKVEILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELD REALAA+E+E    ALEAA +T+K ELLKA  +V+IL+AEV+AK +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDRREALAAREKEEFSTALEAASSTMKDELLKAHSKVEILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML

Query:  QALEAKEEELKHATAELEMVKERLNSGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQINLGGLKKRYAEQWASGPSGTPGFQALVDKYVR
        Q LE K+  +   T EL+ +KERL +G LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQI+L GLKK+Y+E+WASGP+GTP  Q+LVDKYVR
Subjt:  QALEAKEEELKHATAELEMVKERLNSGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQINLGGLKKRYAEQWASGPSGTPGFQALVDKYVR

Query:  DLDSNYSDLEED
        +LDS+YSD+EE+
Subjt:  DLDSNYSDLEED

SwissProt top hitse value%identityAlignment
Q9LEX8 Uncharacterized protein At3g60930, chloroplastic1.2e-0429.55Show/hide
Query:  SRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFETGLAPAQVAPNGWGVIFALAIFFWLRARDSEEA
        S   E  L  L+  F +   + LR+P   ERAD+PP G+ TLY + F YG  L LP+   V E++    +A +Q+       +  + I      R  E  
Subjt:  SRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG--LRLPLHPFVQEFLFETGLAPAQVAPNGWGVIFALAIFFWLRARDSEEA

Query:  ELLDVDQLLTCFEAKRIAK-KPGRFYMCARKG
          + +  L    E +R+ K +  R+Y+   KG
Subjt:  ELLDVDQLLTCFEAKRIAK-KPGRFYMCARKG

Arabidopsis top hitse value%identityAlignment
AT3G42060.1 myosin heavy chain-related7.8e-0427.74Show/hide
Query:  SRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFETGLAPAQVAPNGWGVIFALAIFFWLRARDSEEAE
        SR    + G        PE +   IPE  +R  + PEG++ L+   F E GL  PL  F+  +     +A +Q++         L I        +EE  
Subjt:  SRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFETGLAPAQVAPNGWGVIFALAIFFWLRARDSEEAE

Query:  LLDVD--QLLTCFEAKRIAKKPGRFYMCARKGAG-GIVKGSTS-IKGWVRKWFYA
        ++D+D  + +T F    I  K  R  +CA    G  I  G TS ++ W + +F+A
Subjt:  LLDVD--QLLTCFEAKRIAKKPGRFYMCARKGAG-GIVKGSTS-IKGWVRKWFYA

AT5G38190.1 INVOLVED IN: biological_process unknown2.4e-0524.58Show/hide
Query:  RFSDD-GEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFETGLAPA
        R++DD  E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F+  F     +A +
Subjt:  RFSDD-GEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFETGLAPA

Query:  QVAPNGWGVIFALAIFFWLRARDSEEAELLDVDQLLTCFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVRKWFYA
        Q+       I   A    L AR       +++ + LT F   ++  K G+ Y+ + +G   +    +  + W+  +FYA
Subjt:  QVAPNGWGVIFALAIFFWLRARDSEEAELLDVDQLLTCFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVRKWFYA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAGCAGCGACTTAGGATCCGATGAGGATTTAGCTCGTTGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGGCTTCCCCTTCACCCTTTTGTCCAA
GAATTTCTCTTCGAAACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCTTTTTTTGGCTACGAGCTCGGGATAGTGAAGA
GGCCGAGCTGTTGGACGTAGACCAGCTCCTCACATGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAG
TTAAGGGGTCGACCTCCATCAAAGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGCTTGCAAAGGACGAGTCGGGTCGTTCCTTCTTTGACGTTCCCACTAGTT
TCAATCCGACCAGTCCTCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGCATTTTCCGAGGGGTACGAAGGTCGGAACCTTGGTGACCGACAAGCT
GCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGTCATGGTTTGCGGATTTGCAAGCAACGTGAAAC
GCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGGAACCTGCCACTCCTGCCGTGGCAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTG
GAGTCTTCTGAGGGTCCTTCGAGGGAGAAGCGCCACAGGGTTCAGACCGAGGCGGCGGACGTCTCGTCCTTGGGCGAGGAGGTGAGGGAGGAGGCCCCTCTGAAGCGAAG
GAGGAAGAAGAAGAAGACCACCTCCCCCTTGAAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGGACCATCCTGAGGCCAGGATGGGCGGGACGT
CCGACGTGACAGCACGGTTCAGAGTCGAGCCGTTAAGTTCTGGGGTGAGGGACCAGATGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAA
TTTGTAAGTGACCCGGGGTCCGTCCTGCAGAGGACCATCGACTATGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTTTGGCCGTGAAGGCCGAGCTGGATAGGAG
GGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTACTGCCTTGGAGGCTGCCTCTTCCACCATGAAGGATGAGCTGCTGAAAGCTCACTCTAAGGTGGAAATTTTGA
AGGCTGAGGTGGAGGCCAAGACCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTC
CAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCACGCGACTGCTGAGCTGGAGATGGTGAAGGAGCGTCTCAACAGTGG
AGCCTTATTGGAGGAATCGTTCAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGC
CTGACCTTCAGATCAATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCTTCCAAGCGTTGGTGGATAAGTACGTCAGA
GATCTGGACTCTAACTACTCCGACCTCGAAGAGGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAGCAGCGACTTAGGATCCGATGAGGATTTAGCTCGTTGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGGCTTCCCCTTCACCCTTTTGTCCAA
GAATTTCTCTTCGAAACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCTTTTTTTGGCTACGAGCTCGGGATAGTGAAGA
GGCCGAGCTGTTGGACGTAGACCAGCTCCTCACATGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAG
TTAAGGGGTCGACCTCCATCAAAGGATGGGTGAGGAAGTGGTTCTACGCTTCCGGGGAATGCTTGCAAAGGACGAGTCGGGTCGTTCCTTCTTTGACGTTCCCACTAGTT
TCAATCCGACCAGTCCTCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGCATTTTCCGAGGGGTACGAAGGTCGGAACCTTGGTGACCGACAAGCT
GCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGTCATGGTTTGCGGATTTGCAAGCAACGTGAAAC
GCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGGAACCTGCCACTCCTGCCGTGGCAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTG
GAGTCTTCTGAGGGTCCTTCGAGGGAGAAGCGCCACAGGGTTCAGACCGAGGCGGCGGACGTCTCGTCCTTGGGCGAGGAGGTGAGGGAGGAGGCCCCTCTGAAGCGAAG
GAGGAAGAAGAAGAAGACCACCTCCCCCTTGAAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGGACCATCCTGAGGCCAGGATGGGCGGGACGT
CCGACGTGACAGCACGGTTCAGAGTCGAGCCGTTAAGTTCTGGGGTGAGGGACCAGATGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAA
TTTGTAAGTGACCCGGGGTCCGTCCTGCAGAGGACCATCGACTATGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTTTGGCCGTGAAGGCCGAGCTGGATAGGAG
GGAAGCTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTACTGCCTTGGAGGCTGCCTCTTCCACCATGAAGGATGAGCTGCTGAAAGCTCACTCTAAGGTGGAAATTTTGA
AGGCTGAGGTGGAGGCCAAGACCGAGCTGCTGAAGAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTC
CAACTCCTCAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCACGCGACTGCTGAGCTGGAGATGGTGAAGGAGCGTCTCAACAGTGG
AGCCTTATTGGAGGAATCGTTCAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGC
CTGACCTTCAGATCAATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCTTCCAAGCGTTGGTGGATAAGTACGTCAGA
GATCTGGACTCTAACTACTCCGACCTCGAAGAGGATTAG
Protein sequenceShow/hide protein sequence
MSSSFSSDLGSDEDLARWLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPLHPFVQ
EFLFETGLAPAQVAPNGWGVIFALAIFFWLRARDSEEAELLDVDQLLTCFEAKRIAKKPGRFYMCARKGAGGIVKGSTSIKGWVRKWFYASGECLQRTSRVVPSLTFPLV
SIRPVLELTQASFDTLKYYKEHFPRGTKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELVMVCGFASNVKRKSKGRAHALEAAQSSEPATPAVAGPASEDPAPVIEL
ESSEGPSREKRHRVQTEAADVSSLGEEVREEAPLKRRRKKKKTTSPLKVGARGALPASFADRVDHPEARMGGTSDVTARFRVEPLSSGVRDQMSRISAASLDRCLRRASK
FVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDRREALAAREKEEFSTALEAASSTMKDELLKAHSKVEILKAEVEAKTELLKKEEDRRKAQLRAAHAITKGLEKEKF
QLLKEKDDMLQALEAKEEELKHATAELEMVKERLNSGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQINLGGLKKRYAEQWASGPSGTPGFQALVDKYVR
DLDSNYSDLEED