; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G004410 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G004410
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionProtein of unknown function (DUF3754)
Genome locationchr05:5639240..5655976
RNA-Seq ExpressionLsi05G004410
SyntenyLsi05G004410
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR022227 - Protein of unknown function DUF3754


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581139.1 hypothetical protein SDJN03_21141, partial [Cucurbita argyrosperma subsp. sororia]3.6e-25365.76Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPV+KPKLIMTLANLIEH+SDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQ LSS+EIEVLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR
        QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQ NLPDFVDK                RRG GIDRT+DFFFMEK+D+LIGR
Subjt:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR

Query:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM
        FWAYLLRLTRLEKIF RRPS R MEDRKKNDEI P+A+ DDL+VERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRAS KS SERGIYVKHFKNIPM
Subjt:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM

Query:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
        AD+EIVLPEKKNPGLTPMDWVKFLVSA+VGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
Subjt:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV

Query:  IQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQK------------------------------------------------
        IQQ                    EDLDLRCEELIKEEFGE+CNFEVDDAV K                                                
Subjt:  IQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQK------------------------------------------------

Query:  -------------------RICRRIDREDLLNENEAM------AMDSSATNWLPNNDQEE---------------------------------EGSHTLG
                           R   +  +E +  E E++       + S+ ++ L  +D++E                                  G+  L 
Subjt:  -------------------RICRRIDREDLLNENEAM------AMDSSATNWLPNNDQEE---------------------------------EGSHTLG

Query:  EGVGYPHPQAHAYQ----RLVQPSRELNI--------------QFELGTFCNL---MIFWKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYR
        +    P       Q    +L Q   + N               Q+ L    ++    +  KLLT YFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYR
Subjt:  EGVGYPHPQAHAYQ----RLVQPSRELNI--------------QFELGTFCNL---MIFWKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDHFYR

Query:  TKVNTIIMRIWMFFLKISGLKSLLFGASRSRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELL
        TKVN II RIWMFFL I GLK LLF ASRS QSQVFSKQIDIST+S+DDGLYVERIRVENM+LGF +L
Subjt:  TKVNTIIMRIWMFFLKISGLKSLLFGASRSRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELL

TYK27184.1 uncharacterized protein E5676_scaffold236G00370 [Cucumis melo var. makuwa]1.5e-21487.83Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQ+LSS+EI+VLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR
        QVMEKSNFKI SDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQ NLPDFVDK                RRGTGIDRTSDFFFMEK+DMLIGR
Subjt:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR

Query:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM
        FW+YLLRLTRLEKI  RRPS+R MEDRKKNDEIP +AEQ DL+VERVRLENMELSA NLLGKVTIQEPTFDRIIVVYRRASTKSK ERGIYVKHFKNIPM
Subjt:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM

Query:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
        ADMEIVLPEKKNPGLTPMDWVKF+VSA+VGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
Subjt:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV

Query:  IQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQK
        IQQ                    EDLDLRCEELIKEEFGEHCNFEVDDAVQK
Subjt:  IQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQK

XP_008449987.1 PREDICTED: uncharacterized protein LOC103491708 isoform X2 [Cucumis melo]4.3e-21487.61Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQ+LSS+EI+VLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR
        QVMEKSNFKI SDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQ NLPDFVDK                RRGTGIDRTSDFFF+EK+DMLIGR
Subjt:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR

Query:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM
        FW+YLLRLTRLEKI  RRPS+R MEDRKKNDEIP +AEQ DL+VERVRLENMELSA NLLGKVTIQEPTFDRIIVVYRRASTKSK ERGIYVKHFKNIPM
Subjt:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM

Query:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
        ADMEIVLPEKKNPGLTPMDWVKF+VSA+VGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
Subjt:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV

Query:  IQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQK
        IQQ                    EDLDLRCEELIKEEFGEHCNFEVDDAVQK
Subjt:  IQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQK

XP_031738641.1 uncharacterized protein LOC101204725 isoform X2 [Cucumis sativus]1.3e-21590.8Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQ+LSS+EIEVLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR
        QVMEKSNFKI SDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQ NLPDFVDK                RRGTGID+TSDFFFMEK+DMLIGR
Subjt:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR

Query:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM
        FWAYLLRLTRLEKI  RRP +R  EDRKKNDEIPP+A+Q DL+VERVRLENMELSA NLLGKVTIQEPTFDRIIVVYRRASTKSK ERGIYVKHFKNIPM
Subjt:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM

Query:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
        ADMEIVLPEKKNPGLTPMDWVKF+VSA+VGLVA+VGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
Subjt:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV

Query:  IQQ---EDLDLRCEELIKEEFGEHCNFEVDDAVQK
        IQQ   EDLDLRCEELIKEEFGEHCNFEVDDAVQK
Subjt:  IQQ---EDLDLRCEELIKEEFGEHCNFEVDDAVQK

XP_038874654.1 uncharacterized protein LOC120067213 [Benincasa hispida]4.5e-21988.94Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKR+EYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSS+EIEVLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR
        QVMEKSNFKI SDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQ NLPDFVDK                RRGTGIDRTSDFF+MEK+DMLIGR
Subjt:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR

Query:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM
        FWAYLL LTRLEKIFSRRPS RL EDRKKNDEI P+AEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSK ERGIYVKHFKNIPM
Subjt:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM

Query:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
        ADMEIVLPEKKNPGLTPMDWVKFLVSA+VGLVAVVGS+EMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
Subjt:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV

Query:  IQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQK
        IQQ                    EDLDLRCEELIKEEFGEHCNFEVDDAVQK
Subjt:  IQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQK

TrEMBL top hitse value%identityAlignment
A0A1S3BNA9 uncharacterized protein LOC103491708 isoform X22.1e-21487.61Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQ+LSS+EI+VLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR
        QVMEKSNFKI SDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQ NLPDFVDK                RRGTGIDRTSDFFF+EK+DMLIGR
Subjt:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR

Query:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM
        FW+YLLRLTRLEKI  RRPS+R MEDRKKNDEIP +AEQ DL+VERVRLENMELSA NLLGKVTIQEPTFDRIIVVYRRASTKSK ERGIYVKHFKNIPM
Subjt:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM

Query:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
        ADMEIVLPEKKNPGLTPMDWVKF+VSA+VGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
Subjt:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV

Query:  IQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQK
        IQQ                    EDLDLRCEELIKEEFGEHCNFEVDDAVQK
Subjt:  IQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQK

A0A1S4DXZ1 uncharacterized protein LOC103491708 isoform X12.1e-21487.61Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQ+LSS+EI+VLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR
        QVMEKSNFKI SDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQ NLPDFVDK                RRGTGIDRTSDFFF+EK+DMLIGR
Subjt:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR

Query:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM
        FW+YLLRLTRLEKI  RRPS+R MEDRKKNDEIP +AEQ DL+VERVRLENMELSA NLLGKVTIQEPTFDRIIVVYRRASTKSK ERGIYVKHFKNIPM
Subjt:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM

Query:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
        ADMEIVLPEKKNPGLTPMDWVKF+VSA+VGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
Subjt:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV

Query:  IQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQK
        IQQ                    EDLDLRCEELIKEEFGEHCNFEVDDAVQK
Subjt:  IQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQK

A0A5D3DUK4 Uncharacterized protein7.2e-21587.83Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQ+LSS+EI+VLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR
        QVMEKSNFKI SDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQ NLPDFVDK                RRGTGIDRTSDFFFMEK+DMLIGR
Subjt:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR

Query:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM
        FW+YLLRLTRLEKI  RRPS+R MEDRKKNDEIP +AEQ DL+VERVRLENMELSA NLLGKVTIQEPTFDRIIVVYRRASTKSK ERGIYVKHFKNIPM
Subjt:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM

Query:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
        ADMEIVLPEKKNPGLTPMDWVKF+VSA+VGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
Subjt:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV

Query:  IQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQK
        IQQ                    EDLDLRCEELIKEEFGEHCNFEVDDAVQK
Subjt:  IQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQK

A0A6J1D477 uncharacterized protein LOC111016875 isoform X28.0e-21489.2Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKR+EYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR
        QVMEKSNFKITSDEEIE+ALSGQYLLNLPITVD+SKLDKVLLKKYFATHPQ +LPDFVDK                RRG GIDRT+DFFFMEK+DMLIGR
Subjt:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR

Query:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM
        FWAYLLRLTRLEKIFSRRPS R M DRKKND+IPP+A+  DL VER+RLENMELSARN+LGK+TIQEPTFDRIIVVYRRASTKSK ERGIYVKHFKNIPM
Subjt:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM

Query:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
        ADMEIVLPEKKNPGLTPMDWV FLVSA+VGLVAVVGSIEMPKADFWVI AVLSTVIGYCAKTYFTFQQN+ TYQNLITQSMY+KQLDSGRGTLLHLCDDV
Subjt:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV

Query:  IQQ---EDLDLRCEELIKEEFGEHCNFEVDDAVQK
        IQQ   EDLDLRCEELIKEEFGE CNFEVDDAVQK
Subjt:  IQQ---EDLDLRCEELIKEEFGEHCNFEVDDAVQK

A0A6J1IYZ4 uncharacterized protein LOC1114819384.7e-21487.17Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPV+KPKLIMTLANLIEH+SDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQ LSS+EIEVLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR
        QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQ NLPDFVDK                RRG GIDRT+DFFFMEK+D+LIGR
Subjt:  QVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGR

Query:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM
        FWAYLLRLTRLEKIFSRRPS R MEDRKKNDEI P+A+ DDL+VERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRAS KS SERGIYVKHFKNIPM
Subjt:  FWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPM

Query:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
        AD+EIVLPEKKNPGLTPMDWVKFLVSA+VGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV
Subjt:  ADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDV

Query:  IQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQK
        IQQ                    EDLDLRCEELIKEEFGE CNFEVDDAV K
Subjt:  IQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G46915.1 Protein of unknown function (DUF3754)4.3e-1026.95Show/hide
Query:  ENMELSARNLLGKVTIQEPTFDRIIVVYRRAST------KSKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEM---
        + ++ S   LL   T+QEP F+ +I++Y + ++      K ++   + ++ F+ IP+ D+ ++ P KK      +D V+  +++++GL A   + +    
Subjt:  ENMELSARNLLGKVTIQEPTFDRIIVVYRRAST------KSKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEM---

Query:  ---PKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDVIQQE
           P A F  + AV + VI Y  +    ++Q    YQ L+ +++Y+K L SG G++  L D   QQ+
Subjt:  ---PKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDVIQQE

AT3G19340.1 Protein of unknown function (DUF3754)1.0e-18472.83Show/hide
Query:  NKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLFQVM
        NKEVIRLE ESVIP+LKPKLIMTLANLIEHS+DR EFLKLCKRIEYT+RAWYLLQFEDLMQLYSLFDPVHGAQK++QQ+L+S+EI+VLEQNFL+YLFQVM
Subjt:  NKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLFQVM

Query:  EKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGRFWA
        EKSNFKITS+EE+E+A SGQYLLNLPI VDESKLDK LLK+YF  HP  N+PDF DK                RRG G+D+T+D+FFMEK+D++I RFW+
Subjt:  EKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGRFWA

Query:  YLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPMADM
        +L+R+TRLEK+ ++R S+   +D KK+DE  P+ + D+L VER+RLEN +LS ++ L K+TIQEPTFDR+IVVYRRAS+K+  ERGIYVKHFKNIPMADM
Subjt:  YLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPMADM

Query:  EIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDVIQQ
        EIVLPEK+NPGLTPMDWVKFL+SAVVGLVAV+ S+EMPK+D WVI A+LSTV+GYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDVIQQ
Subjt:  EIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDVIQQ

Query:  --------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQK
                            EDLDLRCEELIKEEFG  CNF+V+DAVQK
Subjt:  --------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQK

AT5G13940.1 aminopeptidases6.6e-12856.37Show/hide
Query:  IEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLFQVMEKSNFKITSDEEIEIALSGQYLLNLPI
        I+   +R EFL+ C+R+E TIRAWY L FEDLMQLYSLF+PV GA +L QQ+LS+ EI+ LE  FL +LFQVMEKSNFK+ ++EEI++ALS QY LNLPI
Subjt:  IEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLFQVMEKSNFKITSDEEIEIALSGQYLLNLPI

Query:  TVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGRFWAYLLRLTRLEK-IFSRRPSARLMEDRKK
         V+E+KLD  LL +YF+  P+ +LP F DK                RRG GID    +FF+ KID ++ R W +LL +T L++ ++ ++    L E    
Subjt:  TVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGRFWAYLLRLTRLEK-IFSRRPSARLMEDRKK

Query:  NDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVV
          +I    E+D L +ER+R+E ++LS  NL+ K+TIQEPTF+RIIVVYRR S K +SER IYVKHFK IPMADMEIVLPEKKNPGLTP+DWVKFLVSA +
Subjt:  NDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVV

Query:  GLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDVIQQ---------------------EDLDL
        GLV VV S+ + KAD  VI A+LSTV+ YC KTYFTFQ+N+  YQ+LIT+S+YDKQLDSGRGTLLHLCD+VIQQ                     E+LD+
Subjt:  GLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDVIQQ---------------------EDLDL

Query:  RCEELIKEEFGEHCNFEVDDAVQK
        + E  IKEEF E CNF+VDDA+ K
Subjt:  RCEELIKEEFGEHCNFEVDDAVQK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGAACAAGGAAGTAATTCGGCTGGAGCGTGAATCTGTCATTCCGGTGTTGAAGCCGAAGTTGATAATGACCTTGGCGAATCTAATTGAACATAGTTCTGATCG
GGCTGAGTTCTTAAAACTTTGCAAGAGAATTGAGTACACGATTCGAGCTTGGTACCTTCTACAATTTGAGGATTTGATGCAACTCTACTCTCTCTTTGATCCTGTTCATG
GAGCCCAGAAGCTGGAGCAGCAGCATCTATCCTCTGAGGAAATTGAAGTGCTTGAACAAAATTTTCTTTCTTACTTGTTTCAGGTTATGGAAAAGAGCAATTTCAAAATA
ACAAGTGACGAGGAGATTGAAATTGCACTTTCAGGGCAGTATCTCCTAAATCTTCCTATTACAGTTGATGAGTCCAAGCTTGACAAGGTTCTTTTAAAGAAATATTTCGC
TACACATCCTCAAGTCAACCTACCAGACTTTGTTGACAAGGTGCCAAAGTCTGGCTCTATTTCCTCCTTTCAACCCCCCTATTTTCTTAGGCGAGGAACTGGAATTGACC
GAACGAGTGATTTCTTTTTCATGGAGAAAATAGACATGCTTATTGGAAGATTTTGGGCATATCTCTTAAGGTTAACGAGGTTAGAAAAGATTTTCTCCAGAAGGCCAAGT
GCACGCCTTATGGAAGATAGGAAGAAGAATGATGAGATTCCCCCTAATGCTGAACAAGATGACCTAAATGTTGAAAGAGTTCGTCTGGAGAATATGGAACTGAGTGCTCG
CAATTTGCTGGGCAAGGTTACTATTCAAGAACCTACCTTTGATAGGATTATTGTAGTTTACAGGCGAGCAAGTACGAAGTCTAAATCTGAACGTGGAATATATGTCAAGC
ATTTTAAAAACATTCCAATGGCTGATATGGAAATAGTACTTCCTGAAAAGAAAAATCCAGGACTGACTCCAATGGACTGGGTTAAGTTCCTTGTATCTGCTGTAGTTGGG
CTGGTTGCTGTTGTGGGTTCAATTGAAATGCCCAAGGCGGATTTTTGGGTCATTTTCGCTGTTCTCTCTACAGTTATTGGTTACTGTGCAAAGACATATTTCACGTTTCA
GCAAAACATGGCTACATATCAGAACTTGATTACACAATCCATGTATGACAAACAGCTGGATAGTGGAAGGGGCACACTTCTTCATTTGTGTGACGACGTGATCCAACAGG
AAGATCTTGATCTACGGTGTGAGGAGTTGATCAAAGAAGAGTTTGGTGAGCACTGCAACTTTGAAGTGGACGATGCCGTTCAAAAGAGAATTTGCCGCCGAATAGACCGG
GAGGATTTATTGAACGAAAATGAGGCCATGGCCATGGATTCATCTGCCACGAACTGGCTTCCGAACAATGACCAAGAAGAAGAGGGAAGTCATACGCTTGGAGAGGGAGT
CGGTTATCCCCATCCTCAAGCCCACGCTTATCAGCGCCTTGTCCAGCCATCTCGAGAGTTGAATATTCAATTCGAGCTTGGTACCTTCTGCAATTTGATGATCTTTTGGA
AGCTTTTGACGAAATACTTCATGGAGAATCCTCACGACAATCTTCCCTATTTTGCTGATAAGTACATAATTTTCCGTCGTGGTATTGGGATTGATCAAATGAACGATCAC
TTTTACCGAACGAAAGTAAATACCATCATTATGCGAATATGGATGTTCTTTCTCAAAATCTCAGGGTTAAAGAGCCTTCTATTTGGAGCGTCAAGAAGCCGCCAAAGTCA
GGTATTTTCAAAACAAATTGACATCAGTACAGAGTCAGAGGATGATGGCTTGTATGTTGAACGGATTCGCGTTGAGAACATGACACTTGGGTTTGAACTCTTACCACTCT
ATAATTTAGCTTCTTGCATTAATGGTTGGATTAGTTTAACTAGGGAAGGCCCTTGCTGTCTTCCTAGCAAGCAGGAGGTCAAGCATGGGCATGCACTTTGTCTTTCAGAT
TCCTTGATTGCTTGCTTTGATCATCTATTGTATGTCATTCCAGGATCTCTACGCTATTGA
mRNA sequenceShow/hide mRNA sequence
TACAAAACAGCATTTCCCTCCGGAGCGAGCGGGGCTCGGTCAGTTCCATTCCATGAGAAGTTTTTCTAGTTGAGGGGAGAGAAATTTTTGTAGATTCAAAACCTAAATTG
CAGTGTATTATTGTCATTATCGAGAAGATAATTGGGTCTGTTCATCCTAATTATGATTTGATTTGTTTTGGAATCGACTCGGGACTGTTCTTGACACTGTTTCAGACGTT
ACAGTTTTGCTGATTTGGAGGTATTATTGTCTCTTCTCCCTCTTTCTTAGGCCACTGGGAAACTCGTTGAGCTTCTAAACAATGGGGAAGAACAAGGAAGTAATTCGGCT
GGAGCGTGAATCTGTCATTCCGGTGTTGAAGCCGAAGTTGATAATGACCTTGGCGAATCTAATTGAACATAGTTCTGATCGGGCTGAGTTCTTAAAACTTTGCAAGAGAA
TTGAGTACACGATTCGAGCTTGGTACCTTCTACAATTTGAGGATTTGATGCAACTCTACTCTCTCTTTGATCCTGTTCATGGAGCCCAGAAGCTGGAGCAGCAGCATCTA
TCCTCTGAGGAAATTGAAGTGCTTGAACAAAATTTTCTTTCTTACTTGTTTCAGGTTATGGAAAAGAGCAATTTCAAAATAACAAGTGACGAGGAGATTGAAATTGCACT
TTCAGGGCAGTATCTCCTAAATCTTCCTATTACAGTTGATGAGTCCAAGCTTGACAAGGTTCTTTTAAAGAAATATTTCGCTACACATCCTCAAGTCAACCTACCAGACT
TTGTTGACAAGGTGCCAAAGTCTGGCTCTATTTCCTCCTTTCAACCCCCCTATTTTCTTAGGCGAGGAACTGGAATTGACCGAACGAGTGATTTCTTTTTCATGGAGAAA
ATAGACATGCTTATTGGAAGATTTTGGGCATATCTCTTAAGGTTAACGAGGTTAGAAAAGATTTTCTCCAGAAGGCCAAGTGCACGCCTTATGGAAGATAGGAAGAAGAA
TGATGAGATTCCCCCTAATGCTGAACAAGATGACCTAAATGTTGAAAGAGTTCGTCTGGAGAATATGGAACTGAGTGCTCGCAATTTGCTGGGCAAGGTTACTATTCAAG
AACCTACCTTTGATAGGATTATTGTAGTTTACAGGCGAGCAAGTACGAAGTCTAAATCTGAACGTGGAATATATGTCAAGCATTTTAAAAACATTCCAATGGCTGATATG
GAAATAGTACTTCCTGAAAAGAAAAATCCAGGACTGACTCCAATGGACTGGGTTAAGTTCCTTGTATCTGCTGTAGTTGGGCTGGTTGCTGTTGTGGGTTCAATTGAAAT
GCCCAAGGCGGATTTTTGGGTCATTTTCGCTGTTCTCTCTACAGTTATTGGTTACTGTGCAAAGACATATTTCACGTTTCAGCAAAACATGGCTACATATCAGAACTTGA
TTACACAATCCATGTATGACAAACAGCTGGATAGTGGAAGGGGCACACTTCTTCATTTGTGTGACGACGTGATCCAACAGGAAGATCTTGATCTACGGTGTGAGGAGTTG
ATCAAAGAAGAGTTTGGTGAGCACTGCAACTTTGAAGTGGACGATGCCGTTCAAAAGAGAATTTGCCGCCGAATAGACCGGGAGGATTTATTGAACGAAAATGAGGCCAT
GGCCATGGATTCATCTGCCACGAACTGGCTTCCGAACAATGACCAAGAAGAAGAGGGAAGTCATACGCTTGGAGAGGGAGTCGGTTATCCCCATCCTCAAGCCCACGCTT
ATCAGCGCCTTGTCCAGCCATCTCGAGAGTTGAATATTCAATTCGAGCTTGGTACCTTCTGCAATTTGATGATCTTTTGGAAGCTTTTGACGAAATACTTCATGGAGAAT
CCTCACGACAATCTTCCCTATTTTGCTGATAAGTACATAATTTTCCGTCGTGGTATTGGGATTGATCAAATGAACGATCACTTTTACCGAACGAAAGTAAATACCATCAT
TATGCGAATATGGATGTTCTTTCTCAAAATCTCAGGGTTAAAGAGCCTTCTATTTGGAGCGTCAAGAAGCCGCCAAAGTCAGGTATTTTCAAAACAAATTGACATCAGTA
CAGAGTCAGAGGATGATGGCTTGTATGTTGAACGGATTCGCGTTGAGAACATGACACTTGGGTTTGAACTCTTACCACTCTATAATTTAGCTTCTTGCATTAATGGTTGG
ATTAGTTTAACTAGGGAAGGCCCTTGCTGTCTTCCTAGCAAGCAGGAGGTCAAGCATGGGCATGCACTTTGTCTTTCAGATTCCTTGATTGCTTGCTTTGATCATCTATT
GTATGTCATTCCAGGATCTCTACGCTATTGA
Protein sequenceShow/hide protein sequence
MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLFQVMEKSNFKI
TSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKVPKSGSISSFQPPYFLRRGTGIDRTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPS
ARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVG
LVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDVIQQEDLDLRCEELIKEEFGEHCNFEVDDAVQKRICRRIDR
EDLLNENEAMAMDSSATNWLPNNDQEEEGSHTLGEGVGYPHPQAHAYQRLVQPSRELNIQFELGTFCNLMIFWKLLTKYFMENPHDNLPYFADKYIIFRRGIGIDQMNDH
FYRTKVNTIIMRIWMFFLKISGLKSLLFGASRSRQSQVFSKQIDISTESEDDGLYVERIRVENMTLGFELLPLYNLASCINGWISLTREGPCCLPSKQEVKHGHALCLSD
SLIACFDHLLYVIPGSLRY