CuGenDBv2

Lag0014829 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0014829
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Unknown protein
Genome location: chr12:5046012..5054291
RNA-Seq Expression: Lag0014829
Synteny: Lag0014829
Gene Ontology terms: NA
InterPro domains: NA
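As a minimal sketch, the genome location string above can be split into its chromosome and coordinates, and the locus span computed from them. The function names `parse_location` and `span_length` are hypothetical, and the span assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr12:5046012..5054291")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the locus spans 8280 bp on chr12.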


Homology
GenBank top hits (e value, % identity, alignment)
KAG6589061.1 hypothetical protein SDJN03_17626, partial [Cucurbita argyrosperma subsp. sororia]; e value: 7.1e-183; % identity: 77.28
Query:  WLLGLPTSVQQ-KYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLGTLPSCYNTYRRLCQIGKNSLEAEKVIREFIPRVLIRKSQD
        WLLGLPTSV   K+ DHSDFLNK+NLPESLLREDDVFYETVKTR+EEAFG LNVET  L               + K   + ++VI+EFIP+VL R+SQD
Subjt:  WLLGLPTSVQQ-KYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLGTLPSCYNTYRRLCQIGKNSLEAEKVIREFIPRVLIRKSQD

Query:  CRQLEIVKRLSQFLNDPKNF----RRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIKRHRHGWGRDRLINLLTKISKKMLSS
        C QLEIVKRL+Q LNDPKNF    RR CS T TSS PS  DAASQVLYRLGDLPTQGL+AMHRKL GV+VMPQ+KRH+HGWGRDRLIN+LTKISKKMLSS
Subjt:  CRQLEIVKRLSQFLNDPKNF----RRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIKRHRHGWGRDRLINLLTKISKKMLSS

Query:  LGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECS
        L EGD+LQESLAKAMA+A+LSLKLV G HNSS IEFY FSPQIKTLHNEIVKAIWFVR K++ +KLKQLKSLLDPD+KV NR LRTAIK+MLIDYLFEC 
Subjt:  LGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECS

Query:  DMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDDDFDDNDEDSCD-GLPREDNGSHS
        DMDTVPKSLLKALAMINADSRSA HS  S+DE EEEVECVF+LSAQMKQVVWDLLPN +FE +FADAYMEELEESDDDFDD+   SCD GLPREDNGSHS
Subjt:  DMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDDDFDDNDEDSCD-GLPREDNGSHS

Query:  VYVEGMGESMPANLDHSSVGNVMSPSQASMKNADVEPFQCSEPIHFTWE
        VYVEGMGESMP NL +SSVGNV+SPSQASMK+ DV P Q SE  HFT E
Subjt:  VYVEGMGESMPANLDHSSVGNVMSPSQASMKNADVEPFQCSEPIHFTWE

XP_022136208.1 uncharacterized protein LOC111007960 [Momordica charantia]; e value: 9.3e-183; % identity: 76.87
Query:  WLLGLPTSVQ-QKYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLG--------------TLPSCYN------TYRRLCQIGKNSL
        WLLGLPTS   QK SDHSDFLNKRNLPE LLREDDVFYETVKTR+EEAFG LNVETRH G               + SC +       Y     + ++S+
Subjt:  WLLGLPTSVQ-QKYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLG--------------TLPSCYN------TYRRLCQIGKNSL

Query:  EAEK-------VIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIKRH
        + EK       V+REFIP VL RKSQDC QLE+VK+LSQ LNDP NFRRRCS T+TSSSPS+HDAASQVLYRLGDLPTQGLLAM RKLEGV+VMPQIKRH
Subjt:  EAEK-------VIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIKRH

Query:  RHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDA
        RHGWGRDRLIN+LT+ S KMLSS GEGDELQESLAKAMAVADLSLKLVPG HNSS IEFY F PQIKTLHNEIVKAIW VR K N QKLKQLKSLLDPDA
Subjt:  RHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDA

Query:  KVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDD
        KVSNR LRTAIKKMLIDYLFECSDMDTVPKSLLKALA+INADSR+A +S  S +EIE+EVECVF+LSAQMKQVVWDLLPNCDFE DFADAYMEELEESDD
Subjt:  KVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDD

Query:  DFDDNDEDSCDGLPREDNGSHSV----YVEGMGESMPANLDHSSVGNVMSPSQA
        DFDDND D+CDGLP +DNGSHS     +VEGMGESMPANL+HSSVGN +SPS A
Subjt:  DFDDNDEDSCDGLPREDNGSHSV----YVEGMGESMPANLDHSSVGNVMSPSQA

XP_038890245.1 uncharacterized protein LOC120079870 isoform X1 [Benincasa hispida]; e value: 9.6e-188; % identity: 75.8
Query:  WLLGLPTSVQQ-KYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLG--------------TLPSCYN--TYRRL-----------C
        WLLGLPTSV + KYSDHSDFLNKRNLPESLLREDDVFYETVKTR+EEAFGVLNVETRHLG               + SC N  + R L            
Subjt:  WLLGLPTSVQQ-KYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLG--------------TLPSCYN--TYRRL-----------C

Query:  QIGKNSLEAEKVIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIKRH
        Q+ K   + ++ I+EFIP+VL RKS+DCRQLE+VK LSQ  ND KNFRRRCS T+TSSS S HDA SQVLY LGDLPTQ LLAM RKLEGV+ MPQ+KRH
Subjt:  QIGKNSLEAEKVIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIKRH

Query:  RHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDA
        RHGWGRDRLINLLTKIS+KMLSS+GEGDELQESLAKAMAVADLS KLVPGRHNSS IEFY FSPQIKTLHNEIVKAIWFVR K N QKLKQLKSLLDPDA
Subjt:  RHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDA

Query:  KVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDD
        KVS+R+LR +IK MLIDYLFECSDMDTVPKSLLKALA++NADSRSA  SV S+DEIEE+ ECVF+LSAQMKQVVWDLLPNCDFE DFADAYMEELEESDD
Subjt:  KVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDD

Query:  DFDDNDEDSCDGLPREDNGSHSVYVEGMGESMPANLDHSSVGNVMSPSQASMKNADVEPFQCSEPIH
        DF++ ++DSCDG P+ED    SVYVEGMGESMPANLDHSSVGN+++PSQAS+ NADVE  Q S P+H
Subjt:  DFDDNDEDSCDGLPREDNGSHSVYVEGMGESMPANLDHSSVGNVMSPSQASMKNADVEPFQCSEPIH

XP_038890247.1 uncharacterized protein LOC120079870 isoform X2 [Benincasa hispida]; e value: 9.6e-188; % identity: 75.8
Query:  WLLGLPTSVQQ-KYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLG--------------TLPSCYN--TYRRL-----------C
        WLLGLPTSV + KYSDHSDFLNKRNLPESLLREDDVFYETVKTR+EEAFGVLNVETRHLG               + SC N  + R L            
Subjt:  WLLGLPTSVQQ-KYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLG--------------TLPSCYN--TYRRL-----------C

Query:  QIGKNSLEAEKVIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIKRH
        Q+ K   + ++ I+EFIP+VL RKS+DCRQLE+VK LSQ  ND KNFRRRCS T+TSSS S HDA SQVLY LGDLPTQ LLAM RKLEGV+ MPQ+KRH
Subjt:  QIGKNSLEAEKVIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIKRH

Query:  RHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDA
        RHGWGRDRLINLLTKIS+KMLSS+GEGDELQESLAKAMAVADLS KLVPGRHNSS IEFY FSPQIKTLHNEIVKAIWFVR K N QKLKQLKSLLDPDA
Subjt:  RHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDA

Query:  KVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDD
        KVS+R+LR +IK MLIDYLFECSDMDTVPKSLLKALA++NADSRSA  SV S+DEIEE+ ECVF+LSAQMKQVVWDLLPNCDFE DFADAYMEELEESDD
Subjt:  KVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDD

Query:  DFDDNDEDSCDGLPREDNGSHSVYVEGMGESMPANLDHSSVGNVMSPSQASMKNADVEPFQCSEPIH
        DF++ ++DSCDG P+ED    SVYVEGMGESMPANLDHSSVGN+++PSQAS+ NADVE  Q S P+H
Subjt:  DFDDNDEDSCDGLPREDNGSHSVYVEGMGESMPANLDHSSVGNVMSPSQASMKNADVEPFQCSEPIH

XP_038890248.1 uncharacterized protein LOC120079870 isoform X3 [Benincasa hispida]; e value: 9.6e-188; % identity: 75.8
Query:  WLLGLPTSVQQ-KYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLG--------------TLPSCYN--TYRRL-----------C
        WLLGLPTSV + KYSDHSDFLNKRNLPESLLREDDVFYETVKTR+EEAFGVLNVETRHLG               + SC N  + R L            
Subjt:  WLLGLPTSVQQ-KYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLG--------------TLPSCYN--TYRRL-----------C

Query:  QIGKNSLEAEKVIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIKRH
        Q+ K   + ++ I+EFIP+VL RKS+DCRQLE+VK LSQ  ND KNFRRRCS T+TSSS S HDA SQVLY LGDLPTQ LLAM RKLEGV+ MPQ+KRH
Subjt:  QIGKNSLEAEKVIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIKRH

Query:  RHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDA
        RHGWGRDRLINLLTKIS+KMLSS+GEGDELQESLAKAMAVADLS KLVPGRHNSS IEFY FSPQIKTLHNEIVKAIWFVR K N QKLKQLKSLLDPDA
Subjt:  RHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDA

Query:  KVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDD
        KVS+R+LR +IK MLIDYLFECSDMDTVPKSLLKALA++NADSRSA  SV S+DEIEE+ ECVF+LSAQMKQVVWDLLPNCDFE DFADAYMEELEESDD
Subjt:  KVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDD

Query:  DFDDNDEDSCDGLPREDNGSHSVYVEGMGESMPANLDHSSVGNVMSPSQASMKNADVEPFQCSEPIH
        DF++ ++DSCDG P+ED    SVYVEGMGESMPANLDHSSVGN+++PSQAS+ NADVE  Q S P+H
Subjt:  DFDDNDEDSCDGLPREDNGSHSVYVEGMGESMPANLDHSSVGNVMSPSQASMKNADVEPFQCSEPIH

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K6A1 Uncharacterized protein; e value: 1.3e-182; % identity: 74.52
Query:  WLLGLPTS-VQQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLG--------------TLPSCYN------TYRRLCQIGKNSL
        WLLGLPTS  ++KY DHSDFLNK+NLPESLLREDD+FYETVKTR+EEAFG L VETRHLG               + SC N       Y     + +NS 
Subjt:  WLLGLPTS-VQQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLG--------------TLPSCYN------TYRRLCQIGKNSL

Query:  EAEK-------VIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIKRH
        + EK        IREFIP+VL  KSQDCRQLEIVK+L+Q LND KNFRRR STT+TSS  S HDA S VLY LGDLPTQ LLAMHRKL GV+ MPQ+KR+
Subjt:  EAEK-------VIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIKRH

Query:  RHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDA
        RHG GRD LINLLTKISKKMLSS+GEGDELQESLAKAMAVADLSLKLVPGRHNSS IEFYHFSPQ+K+LHNEIVKAIW +  + N QKLKQ+KSLLDP+A
Subjt:  RHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDA

Query:  KVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDD
        KVS+RSLR+ IK+MLIDYLFECSDMDTVPKSLLKALA+INADS+ A HSV S+DEIEEEVECVF+LSAQMKQVVWDLLPNCDFE DFADAYMEELEESD+
Subjt:  KVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDD

Query:  DFDDNDEDSCDGLPREDNGSHSVYVEGMGESMPANLDHSSVGNVMSPSQASMKNADVEPFQCSEPIHFTWE
        D DD+DEDSCDGLPREDN S SVYVEGMGESMPANLDHSSVGN++SPS     NADVE FQ S P+HF  E
Subjt:  DFDDNDEDSCDGLPREDNGSHSVYVEGMGESMPANLDHSSVGNVMSPSQASMKNADVEPFQCSEPIHFTWE

A0A6J1C3N7 uncharacterized protein LOC111007960; e value: 4.5e-183; % identity: 76.87
Query:  WLLGLPTSVQ-QKYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLG--------------TLPSCYN------TYRRLCQIGKNSL
        WLLGLPTS   QK SDHSDFLNKRNLPE LLREDDVFYETVKTR+EEAFG LNVETRH G               + SC +       Y     + ++S+
Subjt:  WLLGLPTSVQ-QKYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLG--------------TLPSCYN------TYRRLCQIGKNSL

Query:  EAEK-------VIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIKRH
        + EK       V+REFIP VL RKSQDC QLE+VK+LSQ LNDP NFRRRCS T+TSSSPS+HDAASQVLYRLGDLPTQGLLAM RKLEGV+VMPQIKRH
Subjt:  EAEK-------VIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIKRH

Query:  RHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDA
        RHGWGRDRLIN+LT+ S KMLSS GEGDELQESLAKAMAVADLSLKLVPG HNSS IEFY F PQIKTLHNEIVKAIW VR K N QKLKQLKSLLDPDA
Subjt:  RHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDA

Query:  KVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDD
        KVSNR LRTAIKKMLIDYLFECSDMDTVPKSLLKALA+INADSR+A +S  S +EIE+EVECVF+LSAQMKQVVWDLLPNCDFE DFADAYMEELEESDD
Subjt:  KVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDD

Query:  DFDDNDEDSCDGLPREDNGSHSV----YVEGMGESMPANLDHSSVGNVMSPSQA
        DFDDND D+CDGLP +DNGSHS     +VEGMGESMPANL+HSSVGN +SPS A
Subjt:  DFDDNDEDSCDGLPREDNGSHSV----YVEGMGESMPANLDHSSVGNVMSPSQA

A0A6J1EKS9 uncharacterized protein LOC111435237 isoform X1; e value: 1.7e-182; % identity: 74.37
Query:  WLLGLPTSVQQ-KYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLG--------------TLPSCYN------TYRRLCQIGKNSL
        WLLGLPTSV   K+ DHSDFLNK+NLPESLLREDDVFYETVKTR+EEAFG LNVET  LG               + SC N       Y       ++S+
Subjt:  WLLGLPTSVQQ-KYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLG--------------TLPSCYN------TYRRLCQIGKNSL

Query:  EAEK-------VIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNF----RRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQ
          EK       VI+EFIP+VL R+SQDC QLEIVKRL+Q LNDPKNF    RR CS T TSS PS  DAASQVLYRLGDLPTQGL+AMHRKL GV+VMPQ
Subjt:  EAEK-------VIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNF----RRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQ

Query:  IKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLL
        +KRH+HGWGRDRLIN+LTKISKKMLSSL EGD+LQESLAKAMA+A+LSLKLV G HNSS IEFY FSPQIKTLHNEIVKAIWFVR K++ +KLKQLKSLL
Subjt:  IKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLL

Query:  DPDAKVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELE
        DPD+KV NR LRTAIK+MLIDYLFEC DMDTVPKSLLKALAMINADSRSA HS  S+DE EEEVECVF+LSAQMKQVVWDLLPN +FE +FADAYMEELE
Subjt:  DPDAKVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELE

Query:  ESDDDFDDNDEDSCD-GLPREDNGSHSVYVEGMGESMPANLDHSSVGNVMSPSQASMKNADVEPFQCSEPIHFTWE
        ESDDDFDD+   SCD GLPREDNGSHSVYVEGMGESMP NL +SSVGNV+SPSQASMK+ DV P Q SE  HFT E
Subjt:  ESDDDFDDNDEDSCD-GLPREDNGSHSVYVEGMGESMPANLDHSSVGNVMSPSQASMKNADVEPFQCSEPIHFTWE

A0A6J1ERM5 uncharacterized protein LOC111435237 isoform X2; e value: 1.7e-182; % identity: 74.37
Query:  WLLGLPTSVQQ-KYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLG--------------TLPSCYN------TYRRLCQIGKNSL
        WLLGLPTSV   K+ DHSDFLNK+NLPESLLREDDVFYETVKTR+EEAFG LNVET  LG               + SC N       Y       ++S+
Subjt:  WLLGLPTSVQQ-KYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLG--------------TLPSCYN------TYRRLCQIGKNSL

Query:  EAEK-------VIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNF----RRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQ
          EK       VI+EFIP+VL R+SQDC QLEIVKRL+Q LNDPKNF    RR CS T TSS PS  DAASQVLYRLGDLPTQGL+AMHRKL GV+VMPQ
Subjt:  EAEK-------VIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNF----RRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQ

Query:  IKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLL
        +KRH+HGWGRDRLIN+LTKISKKMLSSL EGD+LQESLAKAMA+A+LSLKLV G HNSS IEFY FSPQIKTLHNEIVKAIWFVR K++ +KLKQLKSLL
Subjt:  IKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLL

Query:  DPDAKVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELE
        DPD+KV NR LRTAIK+MLIDYLFEC DMDTVPKSLLKALAMINADSRSA HS  S+DE EEEVECVF+LSAQMKQVVWDLLPN +FE +FADAYMEELE
Subjt:  DPDAKVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELE

Query:  ESDDDFDDNDEDSCD-GLPREDNGSHSVYVEGMGESMPANLDHSSVGNVMSPSQASMKNADVEPFQCSEPIHFTWE
        ESDDDFDD+   SCD GLPREDNGSHSVYVEGMGESMP NL +SSVGNV+SPSQASMK+ DV P Q SE  HFT E
Subjt:  ESDDDFDDNDEDSCD-GLPREDNGSHSVYVEGMGESMPANLDHSSVGNVMSPSQASMKNADVEPFQCSEPIHFTWE

A0A6J1HWI3 uncharacterized protein LOC111468137 isoform X1; e value: 1.9e-181; % identity: 73.89
Query:  WLLGLPTSVQ-QKYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLGT--------------LPSCYN------TYRRLCQIGKNSL
        WLLGLPTSV   KYSDHSD LNKRNLPESLLREDDVF+ TVKTR+EEAFGVLN+ETRHLG               + SC +       Y     + ++S+
Subjt:  WLLGLPTSVQ-QKYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLGT--------------LPSCYN------TYRRLCQIGKNSL

Query:  EAEK-------VIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIKRH
        + EK       VIREFIP+VL RKSQDC QLE  KRLSQ LND  NFRR  S T TSS+ SFHDAASQVLY LGD+PTQ LLAM RKLEGV+ +PQIK  
Subjt:  EAEK-------VIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIKRH

Query:  RHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDA
        + GWGRDRLINLLTKISKKMLSSLGEG ELQESLAKAMAVADLSLKLVPGRHN S IEFY FSPQIKTLHNEIVKAIWFVR   NI+KLK+LK LLDPDA
Subjt:  RHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDA

Query:  KVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDD
        +VS+R LR  IK+ML DYLFECSDMDTVPKSLLKALAMI  DSRSAPHSV S+DEI EEVE VF+LSAQMKQVVWDLLPNCDFE DFADAYMEELEESDD
Subjt:  KVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDD

Query:  DFDDNDEDSCDGLPREDNGSHSVYVEGMGESMPANLDHSSVGNVMSPSQASMKNADVEPFQCSEPIHFTWE
        D+ DND++  DGLP+ED+G HSV+VEGMGESMPANLD++SVGN++SPSQAS+KNADVEPF+CS+P  FT E
Subjt:  DFDDNDEDSCDGLPREDNGSHSVYVEGMGESMPANLDHSSVGNVMSPSQASMKNADVEPFQCSEPIHFTWE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G40520.1 unknown protein; e value: 2.0e-74; % identity: 47.74
Query:  KNSLEAEKVIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTM--TSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIKRHR
        K  L+ +++IR+ + R   +  +   + EI+ +L Q L+DP NFR  C   +  T +  S  DAA +VL  L  L TQ L AM RKL+G +++PQ+K  R
Subjt:  KNSLEAEKVIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTM--TSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIKRHR

Query:  HGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDAK
         G  R  LIN + + S+KMLS L  GD+LQE LAKA++V DLSLKL PG   ++  +F+ FSP+ K L NEIVKA+W +R K   ++LK+L   LDP+A+
Subjt:  HGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDAK

Query:  VSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDDD
        VSN SLR+A++K LI+YLFECSD+DT+PKSL++AL+++N+ + +  H V  R+ IEEE EC+  +SAQ+KQ+    +PN + + DF DAYME+LE+SDD+
Subjt:  VSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDDD

Query:  FDDNDEDSCD
         DD+D+D  D
Subjt:  FDDNDEDSCD

AT5G40520.2 unknown protein; e value: 2.1e-84; % identity: 43.39
Query:  DYGA----ISDLLETDSINLEWLLGLPTSVQQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVL--------NVETRHL--------GTLPS
        D+G+    I D  E   +   WLLG   S   K  DH+       +PESLLREDD+FYET+K+R+EEAFG          +V+ + L          L S
Subjt:  DYGA----ISDLLETDSINLEWLLGLPTSVQQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVL--------NVETRHL--------GTLPS

Query:  CYNTYRRL---------CQIGKNSLEAEKVIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTM--TSSSPSFHDAASQVLYRLGDLPTQG
          N    L             K  L+ +++IR+ + R   +  +   + EI+ +L Q L+DP NFR  C   +  T +  S  DAA +VL  L  L TQ 
Subjt:  CYNTYRRL---------CQIGKNSLEAEKVIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTM--TSSSPSFHDAASQVLYRLGDLPTQG

Query:  LLAMHRKLEGVQVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFV
        L AM RKL+G +++PQ+K  R G  R  LIN + + S+KMLS L  GD+LQE LAKA++V DLSLKL PG   ++  +F+ FSP+ K L NEIVKA+W +
Subjt:  LLAMHRKLEGVQVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFV

Query:  RNKANIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPN
        R K   ++LK+L   LDP+A+VSN SLR+A++K LI+YLFECSD+DT+PKSL++AL+++N+ + +  H V  R+ IEEE EC+  +SAQ+KQ+    +PN
Subjt:  RNKANIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPN

Query:  CDFETDFADAYMEELEESDDDFDDNDEDSCD
         + + DF DAYME+LE+SDD+ DD+D+D  D
Subjt:  CDFETDFADAYMEELEESDDDFDDNDEDSCD
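In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue and `+` marks a conservative substitution. The sketch below counts both for one block, using the last AT5G40520.1 segment as input. The function name `midline_stats` is hypothetical, and a per-block figure is not the same as the whole-alignment % identity reported for each hit.

```python
def midline_stats(query, midline):
    """Return (identities, positives, aligned_length) for one alignment block.

    A letter in the midline is an identical residue; '+' is a conservative
    substitution. Positives include identities, following BLAST usage.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final block of the AT5G40520.1 alignment (query line and midline).
identities, positives, length = midline_stats("FDDNDEDSCD", " DD+D+D  D")
print(f"identities {identities}/{length}, positives {positives}/{length}")
```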


Sequences
CDS sequence
ATGCAAGAGATGGTTGTCGACGCGGCGGATGAAGGTCGTTGGCACGGCGAAGATGGTTCGTCGGCGTGGTATGAGAGGGGGAGAAACGACCGAGCATGCAAGAGATGGTT
GTCGACGCGGCGGATGAAGGTCGTTGACACGGCGGAGATGGTTCGTCGGCGTGGTGTGAGAGGGTTTCTACTTTTGGAGGAAGAAGATGGACGGGTGGGCTGCTGCATGA
AGGCTATTCTGGTGGTGAGGGTGGGCGGTGACATGACGTTTTTACCCCTTGTATTTACGGTTATTTACAGACGCCGTTTACGTCATTTCCGAGGACAAAGATGTGTTTTG
CAGTATTTAATTGCAATTATTCATTCTGGTGGTGAGGGTGGGCGCCAAGGCATCTCAAGCCTCTTCCTCTCCATCTTCACGATATTTGCAGCCGCCCAACCCTCATCTCC
GGCAACTTCTTCGTCAGTCTCAACACCACCACCAGCGGCAGGTGGTATCTTCTCCCGTCGTCAGTTCGGCAGCGGCGGCGTGCGTGGTCTTAGTCCGACAGCGGCGGCGG
TTCGCGGTTTTGGGTGCAGCGGCGTGTTTTTTTCGGTAGATTTCAGCAGCAAAGCACGTCCAACCCTCGTCAGGCGTTGTTTCCGCAGCGTGCTGGTCCGTTTCAGCCTC
ATAAAACGCGCGGGCAGCAACACTTCAGCAGGGGGCGCGACGGACAGCAGTGGTGGGGGCTTTTTCCGGCGGTTTCTAACACTCCTAGGCGTGGATGCTGATTATGGTGC
AATTTCCGATTTACTGGAAACGGATTCGATAAATTTAGAATGGCTGCTGGGCCTTCCTACATCTGTTCAACAGAAGTATTCAGATCATTCAGACTTTTTAAACAAACGAA
ACTTGCCTGAATCGTTGCTGAGGGAAGATGATGTTTTCTATGAGACTGTCAAAACAAGAATTGAAGAAGCTTTTGGAGTGTTAAATGTTGAAACAAGGCATCTTGGGACT
TTACCTTCTTGCTATAATACTTACAGAAGACTCTGTCAAATTGGAAAAAACTCGCTGGAAGCTGAAAAGGTTATCAGAGAATTTATTCCAAGAGTTCTGATTAGGAAAAG
TCAAGATTGCCGTCAATTAGAAATTGTTAAACGATTGTCTCAATTTCTCAACGACCCAAAAAATTTCCGAAGAAGATGCTCAACAACTATGACATCAAGTTCGCCATCTT
TCCATGATGCAGCATCGCAGGTACTGTATAGATTAGGAGACCTGCCCACCCAAGGTCTCTTAGCTATGCATCGAAAGCTTGAAGGAGTTCAAGTTATGCCTCAGATAAAA
CGCCACAGGCATGGGTGGGGCCGTGATCGTCTTATTAATCTTCTTACCAAAATTAGTAAGAAGATGCTTTCATCGCTTGGTGAAGGAGATGAATTGCAAGAATCACTAGC
AAAAGCCATGGCGGTGGCTGATTTATCACTTAAACTAGTACCAGGTCGCCATAATTCATCCGAAATTGAGTTTTATCACTTCTCACCCCAAATAAAAACCTTGCACAATG
AAATAGTAAAAGCCATATGGTTTGTTAGAAACAAGGCTAATATTCAGAAGCTCAAACAGTTGAAGTCTTTGTTGGATCCTGATGCTAAAGTGTCGAATAGGAGTCTAAGA
ACAGCTATTAAGAAGATGTTAATAGACTATCTTTTTGAGTGTAGTGATATGGACACTGTGCCAAAGTCTCTTTTGAAAGCTTTAGCTATGATAAATGCAGATTCTCGAAG
TGCACCACATTCAGTTTCCTCACGAGATGAAATTGAAGAGGAGGTTGAATGTGTATTTACTTTGAGTGCTCAGATGAAACAAGTAGTTTGGGATTTACTACCTAACTGTG
ACTTTGAAACCGACTTTGCTGATGCATATATGGAAGAGTTAGAAGAAAGTGATGATGATTTTGATGATAATGACGAAGATAGTTGTGATGGTTTGCCTCGAGAAGACAAT
GGCTCCCATTCTGTTTATGTTGAAGGTATGGGAGAATCAATGCCGGCCAATTTGGATCATTCATCAGTGGGGAATGTCATGTCCCCTAGTCAGGCGTCCATGAAAAATGC
AGATGTGGAGCCTTTTCAATGTTCTGAACCTATCCATTTTACTTGGGAGGTTCCTTAG
mRNA sequence
ATGCAAGAGATGGTTGTCGACGCGGCGGATGAAGGTCGTTGGCACGGCGAAGATGGTTCGTCGGCGTGGTATGAGAGGGGGAGAAACGACCGAGCATGCAAGAGATGGTT
GTCGACGCGGCGGATGAAGGTCGTTGACACGGCGGAGATGGTTCGTCGGCGTGGTGTGAGAGGGTTTCTACTTTTGGAGGAAGAAGATGGACGGGTGGGCTGCTGCATGA
AGGCTATTCTGGTGGTGAGGGTGGGCGGTGACATGACGTTTTTACCCCTTGTATTTACGGTTATTTACAGACGCCGTTTACGTCATTTCCGAGGACAAAGATGTGTTTTG
CAGTATTTAATTGCAATTATTCATTCTGGTGGTGAGGGTGGGCGCCAAGGCATCTCAAGCCTCTTCCTCTCCATCTTCACGATATTTGCAGCCGCCCAACCCTCATCTCC
GGCAACTTCTTCGTCAGTCTCAACACCACCACCAGCGGCAGGTGGTATCTTCTCCCGTCGTCAGTTCGGCAGCGGCGGCGTGCGTGGTCTTAGTCCGACAGCGGCGGCGG
TTCGCGGTTTTGGGTGCAGCGGCGTGTTTTTTTCGGTAGATTTCAGCAGCAAAGCACGTCCAACCCTCGTCAGGCGTTGTTTCCGCAGCGTGCTGGTCCGTTTCAGCCTC
ATAAAACGCGCGGGCAGCAACACTTCAGCAGGGGGCGCGACGGACAGCAGTGGTGGGGGCTTTTTCCGGCGGTTTCTAACACTCCTAGGCGTGGATGCTGATTATGGTGC
AATTTCCGATTTACTGGAAACGGATTCGATAAATTTAGAATGGCTGCTGGGCCTTCCTACATCTGTTCAACAGAAGTATTCAGATCATTCAGACTTTTTAAACAAACGAA
ACTTGCCTGAATCGTTGCTGAGGGAAGATGATGTTTTCTATGAGACTGTCAAAACAAGAATTGAAGAAGCTTTTGGAGTGTTAAATGTTGAAACAAGGCATCTTGGGACT
TTACCTTCTTGCTATAATACTTACAGAAGACTCTGTCAAATTGGAAAAAACTCGCTGGAAGCTGAAAAGGTTATCAGAGAATTTATTCCAAGAGTTCTGATTAGGAAAAG
TCAAGATTGCCGTCAATTAGAAATTGTTAAACGATTGTCTCAATTTCTCAACGACCCAAAAAATTTCCGAAGAAGATGCTCAACAACTATGACATCAAGTTCGCCATCTT
TCCATGATGCAGCATCGCAGGTACTGTATAGATTAGGAGACCTGCCCACCCAAGGTCTCTTAGCTATGCATCGAAAGCTTGAAGGAGTTCAAGTTATGCCTCAGATAAAA
CGCCACAGGCATGGGTGGGGCCGTGATCGTCTTATTAATCTTCTTACCAAAATTAGTAAGAAGATGCTTTCATCGCTTGGTGAAGGAGATGAATTGCAAGAATCACTAGC
AAAAGCCATGGCGGTGGCTGATTTATCACTTAAACTAGTACCAGGTCGCCATAATTCATCCGAAATTGAGTTTTATCACTTCTCACCCCAAATAAAAACCTTGCACAATG
AAATAGTAAAAGCCATATGGTTTGTTAGAAACAAGGCTAATATTCAGAAGCTCAAACAGTTGAAGTCTTTGTTGGATCCTGATGCTAAAGTGTCGAATAGGAGTCTAAGA
ACAGCTATTAAGAAGATGTTAATAGACTATCTTTTTGAGTGTAGTGATATGGACACTGTGCCAAAGTCTCTTTTGAAAGCTTTAGCTATGATAAATGCAGATTCTCGAAG
TGCACCACATTCAGTTTCCTCACGAGATGAAATTGAAGAGGAGGTTGAATGTGTATTTACTTTGAGTGCTCAGATGAAACAAGTAGTTTGGGATTTACTACCTAACTGTG
ACTTTGAAACCGACTTTGCTGATGCATATATGGAAGAGTTAGAAGAAAGTGATGATGATTTTGATGATAATGACGAAGATAGTTGTGATGGTTTGCCTCGAGAAGACAAT
GGCTCCCATTCTGTTTATGTTGAAGGTATGGGAGAATCAATGCCGGCCAATTTGGATCATTCATCAGTGGGGAATGTCATGTCCCCTAGTCAGGCGTCCATGAAAAATGC
AGATGTGGAGCCTTTTCAATGTTCTGAACCTATCCATTTTACTTGGGAGGTTCCTTAG
Protein sequence
MQEMVVDAADEGRWHGEDGSSAWYERGRNDRACKRWLSTRRMKVVDTAEMVRRRGVRGFLLLEEEDGRVGCCMKAILVVRVGGDMTFLPLVFTVIYRRRLRHFRGQRCVL
QYLIAIIHSGGEGGRQGISSLFLSIFTIFAAAQPSSPATSSSVSTPPPAAGGIFSRRQFGSGGVRGLSPTAAAVRGFGCSGVFFSVDFSSKARPTLVRRCFRSVLVRFSL
IKRAGSNTSAGGATDSSGGGFFRRFLTLLGVDADYGAISDLLETDSINLEWLLGLPTSVQQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGVLNVETRHLGT
LPSCYNTYRRLCQIGKNSLEAEKVIREFIPRVLIRKSQDCRQLEIVKRLSQFLNDPKNFRRRCSTTMTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEGVQVMPQIK
RHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYHFSPQIKTLHNEIVKAIWFVRNKANIQKLKQLKSLLDPDAKVSNRSLR
TAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPHSVSSRDEIEEEVECVFTLSAQMKQVVWDLLPNCDFETDFADAYMEELEESDDDFDDNDEDSCDGLPREDN
GSHSVYVEGMGESMPANLDHSSVGNVMSPSQASMKNADVEPFQCSEPIHFTWEVP
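The protein sequence can be checked against the CDS by translating it codon by codon. This is a minimal sketch that assumes the standard genetic code and uses only the first 48 nucleotides of the CDS above; `CODON_TABLE` and `translate` are hypothetical names, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 48 nucleotides of the CDS sequence listed above.
print(translate("ATGCAAGAGATGGTTGTCGACGCGGCGGATGAAGGTCGTTGGCACGGC"))
```

The output should match the start of the protein sequence (MQEMVVDAADEGRWHG). Running the full CDS through the same function is the natural next step.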