; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS003782 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS003782
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionSOUL heme-binding family protein
Genome locationscaffold127:406792..411010
RNA-Seq ExpressionMS003782
SyntenyMS003782
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358
IPR032710 - NTF2-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN65404.1 hypothetical protein Csa_019846 [Cucumis sativus]7.3e-13165.15Show/hide
Query:  QVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLRGR-------TKVRKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHVRFRD
        Q+SLQNF S PT+    RP KSGR T   PRLL  R       TK  K VVR  L D+ PPKST+DV RLVDFL+EDL H+FD QGID TAYDE VRFRD
Subjt:  QVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLRGR-------TKVRKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHVRFRD

Query:  PITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGLWDIF
        PITK++ I GY+ NI+LLR+LFRP+F LHWVK+TGPYEITTRWT VMKF LLPWKPELV TG SIM I+PETGKFC+HVDLWDS+QNN+YFS+EGLWD+F
Subjt:  PITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGLWDIF

Query:  KQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPTHLTHIL
        KQ RFY+TPELESP+Y ILKRTA YEVRKYAPFI VET  DKL GSA FN VA        G       +P+ T   T  F   +PK            +
Subjt:  KQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPTHLTHIL

Query:  SGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM
           PDP+QD + LR ++GGIAAVLKFSGKP E +VQEKAKELR SLIKDGLKP  GCLLARYNDP RTW+F+M
Subjt:  SGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM

XP_008463332.1 PREDICTED: uncharacterized protein LOC103501513 isoform X1 [Cucumis melo]2.1e-13365.25Show/hide
Query:  MGGGQVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLRGRTKV-------RKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHV
        M   Q+SLQNFLS PT+    RP KSGR T   PRLL+ RT          K VVR  L D+SPPKSTVDV RLVDFLYEDL H+FD QGID TAYDE V
Subjt:  MGGGQVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLRGRTKV-------RKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHV

Query:  RFRDPITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGL
        RFRDPITK++ I GY+ NI+LLR++FRP+F LHWVK+TGPYEITTRWT +MKF LLPWKPEL+ TGTSIM I+PETGKFC+HVDLWDS+QNN+YFS+EGL
Subjt:  RFRDPITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGL

Query:  WDIFKQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPTHL
        WD+FKQ RFY+TPELESP+Y ILKRT  YEVRKYAPFI VET  DKL GSA FN VA      ++    E + +  P  TQT  F   +PK         
Subjt:  WDIFKQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPTHL

Query:  THILSGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM
           +   PDP+QD I LR ++GGIAAVLKFSGKP+E +VQEKAKELR SLIKDGLKP  GCLLARYNDP RTW+F+M
Subjt:  THILSGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM

XP_022144956.1 uncharacterized protein LOC111014503 isoform X1 [Momordica charantia]1.5e-13666.75Show/hide
Query:  MGGGQVSLQNFLSIPTVGCGFRPKKSGRKT--GPEPRLLRGRT-------KVRKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDE
        M   Q+SLQNFLS PT G GFRP KSG  T  G  PRLL+ RT       +  K  VR  L D+SPPKS VDVDRLVDFLYEDLRH+FD QGID TAYDE
Subjt:  MGGGQVSLQNFLSIPTVGCGFRPKKSGRKT--GPEPRLLRGRT-------KVRKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDE

Query:  HVRFRDPITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLE
        HVRFRDPITK++ I GY  NI+LLR+LFRP+F LHWVK+TGPYEITTRWT VMKFVLLPWKPE + TG SIM I+PETGKFC+HVDLWDS+QNN+YFSLE
Subjt:  HVRFRDPITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLE

Query:  GLWDIFKQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPT
        GL D+FKQ RFY+TPELESP+Y+ILKRTANYEVRKY PF+ VET  DKL GSA FN VA      ++    E + +  P  TQT  F   +PK       
Subjt:  GLWDIFKQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPT

Query:  HLTHILSGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM
             ++  PDP+QD I LR ++GGIAAVLKFSGKP+E+MVQEKAKELR  LIKDGLKP KGCLLARYNDP RTWSF+M
Subjt:  HLTHILSGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM

XP_022144957.1 uncharacterized protein LOC111014503 isoform X2 [Momordica charantia]1.1e-17986.49Show/hide
Query:  MGGGQVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLRGRTKVRKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHVRFRDPIT
        MGGGQVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLR RTKVRKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHVRFRDPIT
Subjt:  MGGGQVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLRGRTKVRKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHVRFRDPIT

Query:  KYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGLWDIFKQF
        KYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGLWDIFKQF
Subjt:  KYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGLWDIFKQF

Query:  RFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPTHLTHILSGF
        RFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVA                                                 F
Subjt:  RFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPTHLTHILSGF

Query:  PDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM
        PDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM
Subjt:  PDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM

XP_038879422.1 uncharacterized protein LOC120071301 isoform X1 [Benincasa hispida]5.1e-13265.68Show/hide
Query:  QVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLRGRTKV-------RKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHVRFRD
        Q+SLQNF S PT+G G RP +SGR T   PRL + RT          K VVR  L D+SPPKSTVDV RLVDFLYEDLRH+FD QGID TAYDE VRFRD
Subjt:  QVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLRGRTKV-------RKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHVRFRD

Query:  PITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGLWDIF
        PIT ++ I GY+ NI+LLR+LFRP+F LHWVK+TGPYEITTRWT VMKFVLLPWKPELV TG SIM I+PETGKFC+HVDLWDS+QNN+YFS+EGLWD+F
Subjt:  PITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGLWDIF

Query:  KQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPTHLTHIL
        KQ R+Y+TP LESP+Y ILKRTANYEVRKYA FI VET  DKL GSA FN VA      ++    E + +  P  TQT  F    PK            +
Subjt:  KQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPTHLTHIL

Query:  SGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM
           PDP+QD I LR ++G IAAVLKFSGKP+E +VQEKAKELR SLIKDGLKP  GCLLARYNDP RTW+F+M
Subjt:  SGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM

TrEMBL top hitse value%identityAlignment
A0A0A0LWP3 Uncharacterized protein3.5e-13165.15Show/hide
Query:  QVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLRGR-------TKVRKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHVRFRD
        Q+SLQNF S PT+    RP KSGR T   PRLL  R       TK  K VVR  L D+ PPKST+DV RLVDFL+EDL H+FD QGID TAYDE VRFRD
Subjt:  QVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLRGR-------TKVRKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHVRFRD

Query:  PITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGLWDIF
        PITK++ I GY+ NI+LLR+LFRP+F LHWVK+TGPYEITTRWT VMKF LLPWKPELV TG SIM I+PETGKFC+HVDLWDS+QNN+YFS+EGLWD+F
Subjt:  PITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGLWDIF

Query:  KQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPTHLTHIL
        KQ RFY+TPELESP+Y ILKRTA YEVRKYAPFI VET  DKL GSA FN VA        G       +P+ T   T  F   +PK            +
Subjt:  KQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPTHLTHIL

Query:  SGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM
           PDP+QD + LR ++GGIAAVLKFSGKP E +VQEKAKELR SLIKDGLKP  GCLLARYNDP RTW+F+M
Subjt:  SGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM

A0A1S3CJ12 uncharacterized protein LOC103501513 isoform X19.9e-13465.25Show/hide
Query:  MGGGQVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLRGRTKV-------RKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHV
        M   Q+SLQNFLS PT+    RP KSGR T   PRLL+ RT          K VVR  L D+SPPKSTVDV RLVDFLYEDL H+FD QGID TAYDE V
Subjt:  MGGGQVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLRGRTKV-------RKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHV

Query:  RFRDPITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGL
        RFRDPITK++ I GY+ NI+LLR++FRP+F LHWVK+TGPYEITTRWT +MKF LLPWKPEL+ TGTSIM I+PETGKFC+HVDLWDS+QNN+YFS+EGL
Subjt:  RFRDPITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGL

Query:  WDIFKQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPTHL
        WD+FKQ RFY+TPELESP+Y ILKRT  YEVRKYAPFI VET  DKL GSA FN VA      ++    E + +  P  TQT  F   +PK         
Subjt:  WDIFKQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPTHL

Query:  THILSGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM
           +   PDP+QD I LR ++GGIAAVLKFSGKP+E +VQEKAKELR SLIKDGLKP  GCLLARYNDP RTW+F+M
Subjt:  THILSGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM

A0A5A7TMX2 SOUL heme-binding family protein isoform 18.7e-13062.5Show/hide
Query:  MGGGQVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLRGRTKV-------RKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHV
        M   Q+SLQNFLS PT+    RP KSGR T   PRLL+ RT          K VVR  L D+SPPKSTVDV RLVDFLYEDL H+FD QGID TAYDE V
Subjt:  MGGGQVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLRGRTKV-------RKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHV

Query:  RFRDPITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKT--------------GPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLW
        RFRDPITK++ I GY+ NI+LLR++FRP+F LHWVK+                PYEITTRWT +MKF LLPWKPEL+ TGTSIM I+PETGKFC+HVDLW
Subjt:  RFRDPITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKT--------------GPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLW

Query:  DSVQNNNYFSLEGLWDIFKQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFG
        DS+QNN+YFS+EGLWD+FKQ RFY+TPELESP+Y ILKRT  YEVRKYAPFI VET  DKL GSA FN VA      ++    E + +  P  TQT  F 
Subjt:  DSVQNNNYFSLEGLWDIFKQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFG

Query:  PHTPKPISLAPTHLTHILSGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVMV
          +PK            +   PDP+QD I LR ++GGIAAVLKFSGKP+E +VQEKAKELR SLIKDGLKP  GCLLARYNDP RTW+F+MV
Subjt:  PHTPKPISLAPTHLTHILSGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVMV

A0A6J1CUY2 uncharacterized protein LOC111014503 isoform X17.3e-13766.75Show/hide
Query:  MGGGQVSLQNFLSIPTVGCGFRPKKSGRKT--GPEPRLLRGRT-------KVRKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDE
        M   Q+SLQNFLS PT G GFRP KSG  T  G  PRLL+ RT       +  K  VR  L D+SPPKS VDVDRLVDFLYEDLRH+FD QGID TAYDE
Subjt:  MGGGQVSLQNFLSIPTVGCGFRPKKSGRKT--GPEPRLLRGRT-------KVRKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDE

Query:  HVRFRDPITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLE
        HVRFRDPITK++ I GY  NI+LLR+LFRP+F LHWVK+TGPYEITTRWT VMKFVLLPWKPE + TG SIM I+PETGKFC+HVDLWDS+QNN+YFSLE
Subjt:  HVRFRDPITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLE

Query:  GLWDIFKQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPT
        GL D+FKQ RFY+TPELESP+Y+ILKRTANYEVRKY PF+ VET  DKL GSA FN VA      ++    E + +  P  TQT  F   +PK       
Subjt:  GLWDIFKQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPT

Query:  HLTHILSGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM
             ++  PDP+QD I LR ++GGIAAVLKFSGKP+E+MVQEKAKELR  LIKDGLKP KGCLLARYNDP RTWSF+M
Subjt:  HLTHILSGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM

A0A6J1CV62 uncharacterized protein LOC111014503 isoform X25.4e-18086.49Show/hide
Query:  MGGGQVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLRGRTKVRKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHVRFRDPIT
        MGGGQVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLR RTKVRKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHVRFRDPIT
Subjt:  MGGGQVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLRGRTKVRKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHVRFRDPIT

Query:  KYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGLWDIFKQF
        KYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGLWDIFKQF
Subjt:  KYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGLWDIFKQF

Query:  RFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPTHLTHILSGF
        RFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVA                                                 F
Subjt:  RFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPTHLTHILSGF

Query:  PDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM
        PDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM
Subjt:  PDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVM

SwissProt top hitse value%identityAlignment
Q9SR77 Heme-binding-like protein At3g10130, chloroplastic2.9e-0525Show/hide
Query:  FYETPELESPQYQILKRTANYEVRKYAPFISV------ETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQT-----HIFGPHTPKPISLAP
        F   P+LE+  +++L RT  YE+R+  P+         ETG D    S  FN +A      +T  + E + +  P  T+            TP   S A 
Subjt:  FYETPELESPQYQILKRTANYEVRKYAPFISV------ETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQT-----HIFGPHTPKPISLAP

Query:  T----HLTHIL-----SGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCL---LARYNDP
              ++ ++     S  P PK  ++ ++ +   I AV+ FSG  ++  ++ + +ELR +L  D    ++  +   +A+YN P
Subjt:  T----HLTHIL-----SGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCL---LARYNDP

Arabidopsis top hitse value%identityAlignment
AT2G37970.1 SOUL heme-binding family protein6.0e-0641.94Show/hide
Query:  PDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDP
        P P  + + ++   G    V+KFSG  SE++V EK K+L   L KDG K     +LARYN P
Subjt:  PDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLARYNDP

AT3G10130.1 SOUL heme-binding family protein2.1e-0625Show/hide
Query:  FYETPELESPQYQILKRTANYEVRKYAPFISV------ETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQT-----HIFGPHTPKPISLAP
        F   P+LE+  +++L RT  YE+R+  P+         ETG D    S  FN +A      +T  + E + +  P  T+            TP   S A 
Subjt:  FYETPELESPQYQILKRTANYEVRKYAPFISV------ETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQT-----HIFGPHTPKPISLAP

Query:  T----HLTHIL-----SGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCL---LARYNDP
              ++ ++     S  P PK  ++ ++ +   I AV+ FSG  ++  ++ + +ELR +L  D    ++  +   +A+YN P
Subjt:  T----HLTHIL-----SGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCL---LARYNDP

AT5G20140.1 SOUL heme-binding family protein1.1e-10558.97Show/hide
Query:  STVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHVRFRDPITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTG
        STV+++ LV FLYEDL H+FD QGID TAYDE V+FRDPITK++ I GY+ NIA L+ +F PQF LHW K+TGPYEITTRWT VMKF+ LPWKPELV TG
Subjt:  STVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHVRFRDPITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTG

Query:  TSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGLWDIFKQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTG
         SIM+++PET KFC+H+DLWDS++NN+YFSLEGL D+FKQ R Y+TP+LE+P+YQILKRTANYEVR Y PFI VET  DKL GS+ FN VA      ++ 
Subjt:  TSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGLWDIFKQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTG

Query:  LLCESLSLPLPTHTQTHIFGPHTPKPISLA-PTHLTHILSGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLAR
        +  E + +  P  TQT      +   + +  P+     LS  P P ++ ++L+ ++GG AA +KFSGKP+E++VQ K  ELR SL KDGL+  KGC+LAR
Subjt:  LLCESLSLPLPTHTQTHIFGPHTPKPISLA-PTHLTHILSGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLAR

Query:  YNDPSRTWSFVM
        YNDP RTW+F+M
Subjt:  YNDPSRTWSFVM

AT5G20140.2 SOUL heme-binding family protein1.1e-10558.97Show/hide
Query:  STVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHVRFRDPITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTG
        STV+++ LV FLYEDL H+FD QGID TAYDE V+FRDPITK++ I GY+ NIA L+ +F PQF LHW K+TGPYEITTRWT VMKF+ LPWKPELV TG
Subjt:  STVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHVRFRDPITKYNGIRGYMLNIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTG

Query:  TSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGLWDIFKQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTG
         SIM+++PET KFC+H+DLWDS++NN+YFSLEGL D+FKQ R Y+TP+LE+P+YQILKRTANYEVR Y PFI VET  DKL GS+ FN VA      ++ 
Subjt:  TSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGLWDIFKQFRFYETPELESPQYQILKRTANYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTG

Query:  LLCESLSLPLPTHTQTHIFGPHTPKPISLA-PTHLTHILSGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLAR
        +  E + +  P  TQT      +   + +  P+     LS  P P ++ ++L+ ++GG AA +KFSGKP+E++VQ K  ELR SL KDGL+  KGC+LAR
Subjt:  LLCESLSLPLPTHTQTHIFGPHTPKPISLA-PTHLTHILSGFPDPKQDAISLRTIDGGIAAVLKFSGKPSENMVQEKAKELRYSLIKDGLKPIKGCLLAR

Query:  YNDPSRTWSFVM
        YNDP RTW+F+M
Subjt:  YNDPSRTWSFVM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGGAGGCCAAGTTTCCCTCCAAAACTTCCTCTCAATCCCAACCGTTGGTTGCGGTTTCCGGCCGAAGAAATCCGGCCGGAAAACCGGGCCGGAACCCCGTCTACT
CAGAGGCCGGACTAAAGTTCGCAAGTGCGTTGTTCGATCACGATTGGCAGATCGAAGCCCACCGAAATCGACGGTCGACGTTGACCGATTGGTTGATTTCTTATACGAAG
ATCTCCGCCATGTCTTCGACGCACAGGGGATTGATCCGACGGCGTACGACGAACACGTCCGGTTCCGGGACCCCATCACCAAGTACAATGGCATTCGCGGCTATATGCTT
AATATTGCCCTCTTGCGACAACTCTTCAGGCCCCAGTTCCTCTTGCACTGGGTTAAAAAGACTGGACCGTATGAAATAACTACAAGATGGACTGCGGTAATGAAGTTTGT
TCTTCTACCATGGAAACCAGAATTGGTTTTGACGGGAACTTCCATAATGGATATCGATCCAGAGACGGGCAAGTTTTGTAACCATGTGGATCTCTGGGATTCGGTACAAA
ACAATAACTATTTTTCTCTAGAAGGTCTGTGGGATATATTTAAGCAGTTCCGGTTTTATGAGACTCCAGAATTGGAATCACCCCAATATCAGATATTGAAAAGGACTGCA
AATTATGAGGTGAGGAAATATGCACCATTTATATCAGTAGAAACAGGTGAGGACAAGCTCTATGGGTCCGCTGTATTCAACAGGGTTGCTAGTCTCTCTCTCTCCCTCTC
AACAGGGTTGCTGTGTGAGTCTCTCTCTCTCCCTCTCCCCACACACACCCAAACCCATATCTTTGGCCCACACACACCCAAACCCATATCTTTGGCCCCCACACACCTAA
CACACATCCTTTCAGGTTTCCCAGATCCTAAACAAGACGCAATCAGCTTGAGGACGATTGATGGAGGTATTGCCGCAGTGTTGAAATTCAGTGGAAAACCTTCTGAAAAT
ATGGTGCAAGAGAAGGCCAAAGAATTGCGGTATAGTCTCATAAAAGACGGTCTTAAACCCATTAAGGGCTGTTTGCTTGCTCGGTACAACGACCCCAGCCGAACATGGAG
CTTTGTAATGGTA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGGAGGCCAAGTTTCCCTCCAAAACTTCCTCTCAATCCCAACCGTTGGTTGCGGTTTCCGGCCGAAGAAATCCGGCCGGAAAACCGGGCCGGAACCCCGTCTACT
CAGAGGCCGGACTAAAGTTCGCAAGTGCGTTGTTCGATCACGATTGGCAGATCGAAGCCCACCGAAATCGACGGTCGACGTTGACCGATTGGTTGATTTCTTATACGAAG
ATCTCCGCCATGTCTTCGACGCACAGGGGATTGATCCGACGGCGTACGACGAACACGTCCGGTTCCGGGACCCCATCACCAAGTACAATGGCATTCGCGGCTATATGCTT
AATATTGCCCTCTTGCGACAACTCTTCAGGCCCCAGTTCCTCTTGCACTGGGTTAAAAAGACTGGACCGTATGAAATAACTACAAGATGGACTGCGGTAATGAAGTTTGT
TCTTCTACCATGGAAACCAGAATTGGTTTTGACGGGAACTTCCATAATGGATATCGATCCAGAGACGGGCAAGTTTTGTAACCATGTGGATCTCTGGGATTCGGTACAAA
ACAATAACTATTTTTCTCTAGAAGGTCTGTGGGATATATTTAAGCAGTTCCGGTTTTATGAGACTCCAGAATTGGAATCACCCCAATATCAGATATTGAAAAGGACTGCA
AATTATGAGGTGAGGAAATATGCACCATTTATATCAGTAGAAACAGGTGAGGACAAGCTCTATGGGTCCGCTGTATTCAACAGGGTTGCTAGTCTCTCTCTCTCCCTCTC
AACAGGGTTGCTGTGTGAGTCTCTCTCTCTCCCTCTCCCCACACACACCCAAACCCATATCTTTGGCCCACACACACCCAAACCCATATCTTTGGCCCCCACACACCTAA
CACACATCCTTTCAGGTTTCCCAGATCCTAAACAAGACGCAATCAGCTTGAGGACGATTGATGGAGGTATTGCCGCAGTGTTGAAATTCAGTGGAAAACCTTCTGAAAAT
ATGGTGCAAGAGAAGGCCAAAGAATTGCGGTATAGTCTCATAAAAGACGGTCTTAAACCCATTAAGGGCTGTTTGCTTGCTCGGTACAACGACCCCAGCCGAACATGGAG
CTTTGTAATGGTA
Protein sequenceShow/hide protein sequence
MGGGQVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLRGRTKVRKCVVRSRLADRSPPKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHVRFRDPITKYNGIRGYML
NIALLRQLFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVDLWDSVQNNNYFSLEGLWDIFKQFRFYETPELESPQYQILKRTA
NYEVRKYAPFISVETGEDKLYGSAVFNRVASLSLSLSTGLLCESLSLPLPTHTQTHIFGPHTPKPISLAPTHLTHILSGFPDPKQDAISLRTIDGGIAAVLKFSGKPSEN
MVQEKAKELRYSLIKDGLKPIKGCLLARYNDPSRTWSFVMV