; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10011368 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10011368
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF3754)
Genome locationChr01:5315024..5326907
RNA-Seq ExpressionHG10011368
SyntenyHG10011368
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR022227 - Protein of unknown function DUF3754


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK27184.1 uncharacterized protein E5676_scaffold236G00370 [Cucumis melo var. makuwa]7.9e-24286.35Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQ+LSS+EI+VLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID
        Q                             VMEKSNFKI SDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQ NLPDFVDKYVIFRRGTGID
Subjt:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID

Query:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK
        RTSDFFFMEK+DMLIGRFW+YLLRLTRLEKI  RRPS+R MEDRKKNDEIP +AEQ DL+VERVRLENMELSA NLLGKVTIQEPTFDRIIVVYRRASTK
Subjt:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK

Query:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
        SK ERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKF+VSA+VGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
Subjt:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD

Query:  KQLDSGRGTLLHLCDDVIQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTT
        KQLDSGRGTLLHLCDDVIQQ                    EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGI+SRDTIGRYYCVGLKRANEIIGPTT
Subjt:  KQLDSGRGTLLHLCDDVIQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTT

Query:  EELVLKARQGVNP
        EELVLKARQG+NP
Subjt:  EELVLKARQGVNP

XP_008449987.1 PREDICTED: uncharacterized protein LOC103491708 isoform X2 [Cucumis melo]2.3e-24186.16Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQ+LSS+EI+VLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID
        Q                             VMEKSNFKI SDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQ NLPDFVDKYVIFRRGTGID
Subjt:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID

Query:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK
        RTSDFFF+EK+DMLIGRFW+YLLRLTRLEKI  RRPS+R MEDRKKNDEIP +AEQ DL+VERVRLENMELSA NLLGKVTIQEPTFDRIIVVYRRASTK
Subjt:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK

Query:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
        SK ERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKF+VSA+VGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
Subjt:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD

Query:  KQLDSGRGTLLHLCDDVIQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTT
        KQLDSGRGTLLHLCDDVIQQ                    EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGI+SRDTIGRYYCVGLKRANEIIGPTT
Subjt:  KQLDSGRGTLLHLCDDVIQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTT

Query:  EELVLKARQGVNP
        EELVLKARQG+NP
Subjt:  EELVLKARQGVNP

XP_022983322.1 uncharacterized protein LOC111481938 [Cucurbita maxima]3.3e-24085.19Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPV+KPKLIMTLANLIEH+SDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQ LSS+EIEVLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID
        Q                             VMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQ NLPDFVDKYVIFRRG GID
Subjt:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID

Query:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK
        RT+DFFFMEK+D+LIGRFWAYLLRLTRLEKIFSRRPS R MEDRKKNDEI P+A+ DDL+VERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRAS K
Subjt:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK

Query:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
        S SERGIYVKHFKNIPMAD+EIVLPEKKNPGLTPMDWVKFLVSA+VGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
Subjt:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD

Query:  KQLDSGRGTLLHLCDDVIQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTT
        KQLDSGRGTLLHLCDDVIQQ                    EDLDLRCEELIKEEFGE CNFEVDDAV KLEKLGI+SRD++GRYYCVGLKRANEI+GPTT
Subjt:  KQLDSGRGTLLHLCDDVIQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTT

Query:  EELVLKARQGVNP
        EELV KARQGVNP
Subjt:  EELVLKARQGVNP

XP_031738641.1 uncharacterized protein LOC101204725 isoform X2 [Cucumis sativus]3.6e-24289.11Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQ+LSS+EIEVLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID
        Q                             VMEKSNFKI SDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQ NLPDFVDKYVIFRRGTGID
Subjt:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID

Query:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK
        +TSDFFFMEK+DMLIGRFWAYLLRLTRLEKI  RRP +R  EDRKKNDEIPP+A+Q DL+VERVRLENMELSA NLLGKVTIQEPTFDRIIVVYRRASTK
Subjt:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK

Query:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
        SK ERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKF+VSA+VGLVA+VGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
Subjt:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD

Query:  KQLDSGRGTLLHLCDDVIQQ---EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTTEELVLKARQGVNP
        KQLDSGRGTLLHLCDDVIQQ   EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIG TTEELVLKARQGVNP
Subjt:  KQLDSGRGTLLHLCDDVIQQ---EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTTEELVLKARQGVNP

XP_038874654.1 uncharacterized protein LOC120067213 [Benincasa hispida]4.1e-24687.7Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKR+EYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSS+EIEVLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID
        Q                             VMEKSNFKI SDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQ NLPDFVDKYVIFRRGTGID
Subjt:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID

Query:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK
        RTSDFF+MEK+DMLIGRFWAYLL LTRLEKIFSRRPS RL EDRKKNDEI P+AEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK
Subjt:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK

Query:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
        SK ERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSA+VGLVAVVGS+EMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
Subjt:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD

Query:  KQLDSGRGTLLHLCDDVIQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTT
        KQLDSGRGTLLHLCDDVIQQ                    EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTT
Subjt:  KQLDSGRGTLLHLCDDVIQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTT

Query:  EELVLKARQGVN
        EELVLKARQGVN
Subjt:  EELVLKARQGVN

TrEMBL top hitse value%identityAlignment
A0A1S3BNA9 uncharacterized protein LOC103491708 isoform X21.1e-24186.16Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQ+LSS+EI+VLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID
        Q                             VMEKSNFKI SDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQ NLPDFVDKYVIFRRGTGID
Subjt:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID

Query:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK
        RTSDFFF+EK+DMLIGRFW+YLLRLTRLEKI  RRPS+R MEDRKKNDEIP +AEQ DL+VERVRLENMELSA NLLGKVTIQEPTFDRIIVVYRRASTK
Subjt:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK

Query:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
        SK ERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKF+VSA+VGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
Subjt:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD

Query:  KQLDSGRGTLLHLCDDVIQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTT
        KQLDSGRGTLLHLCDDVIQQ                    EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGI+SRDTIGRYYCVGLKRANEIIGPTT
Subjt:  KQLDSGRGTLLHLCDDVIQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTT

Query:  EELVLKARQGVNP
        EELVLKARQG+NP
Subjt:  EELVLKARQGVNP

A0A5D3DUK4 Uncharacterized protein3.8e-24286.35Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQ+LSS+EI+VLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID
        Q                             VMEKSNFKI SDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQ NLPDFVDKYVIFRRGTGID
Subjt:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID

Query:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK
        RTSDFFFMEK+DMLIGRFW+YLLRLTRLEKI  RRPS+R MEDRKKNDEIP +AEQ DL+VERVRLENMELSA NLLGKVTIQEPTFDRIIVVYRRASTK
Subjt:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK

Query:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
        SK ERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKF+VSA+VGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
Subjt:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD

Query:  KQLDSGRGTLLHLCDDVIQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTT
        KQLDSGRGTLLHLCDDVIQQ                    EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGI+SRDTIGRYYCVGLKRANEIIGPTT
Subjt:  KQLDSGRGTLLHLCDDVIQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTT

Query:  EELVLKARQGVNP
        EELVLKARQG+NP
Subjt:  EELVLKARQGVNP

A0A6J1D477 uncharacterized protein LOC111016875 isoform X22.8e-24087.27Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKR+EYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID
        Q                             VMEKSNFKITSDEEIE+ALSGQYLLNLPITVD+SKLDKVLLKKYFATHPQ +LPDFVDKYV+FRRG GID
Subjt:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID

Query:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK
        RT+DFFFMEK+DMLIGRFWAYLLRLTRLEKIFSRRPS R M DRKKND+IPP+A+  DL VER+RLENMELSARN+LGK+TIQEPTFDRIIVVYRRASTK
Subjt:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK

Query:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
        SK ERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWV FLVSA+VGLVAVVGSIEMPKADFWVI AVLSTVIGYCAKTYFTFQQN+ TYQNLITQSMY+
Subjt:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD

Query:  KQLDSGRGTLLHLCDDVIQQ---EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTTEELVLKARQGVN
        KQLDSGRGTLLHLCDDVIQQ   EDLDLRCEELIKEEFGE CNFEVDDAVQKLEKLGI+SRDTIGRYYCVGLKRANEIIGPTTEELVLKARQG N
Subjt:  KQLDSGRGTLLHLCDDVIQQ---EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTTEELVLKARQGVN

A0A6J1F2A2 uncharacterized protein LOC1114415303.6e-24084.99Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPV+KPKLIMTLANLIEH+SDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQ LSS+EIEVLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID
        Q                             VMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQ NLPDFVDKYVIFRRG GID
Subjt:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID

Query:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK
        RT+DFFFMEK+D+LIGRFWAYLLRLTRLEKIF RRPS R MEDRKKNDEI P+A+ DDL+VERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRAS K
Subjt:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK

Query:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
        S SERGIYVKHFKNIPMAD+EIVLPEKKNPGLTPMDWVKFLVSA+VGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
Subjt:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD

Query:  KQLDSGRGTLLHLCDDVIQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTT
        KQLDSGRGTLLHLCDDVIQQ                    EDLDLRCEELIKEEFGE+CNFEVDDAV KLEKLGI+SRD++GRYYCVGLKRANEI+GPTT
Subjt:  KQLDSGRGTLLHLCDDVIQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTT

Query:  EELVLKARQGVNP
        EELV KARQGVNP
Subjt:  EELVLKARQGVNP

A0A6J1IYZ4 uncharacterized protein LOC1114819381.6e-24085.19Show/hide
Query:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF
        MGKNKEVIRLERESVIPV+KPKLIMTLANLIEH+SDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQ LSS+EIEVLEQNFLSYLF
Subjt:  MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLF

Query:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID
        Q                             VMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQ NLPDFVDKYVIFRRG GID
Subjt:  QGDLCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGID

Query:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK
        RT+DFFFMEK+D+LIGRFWAYLLRLTRLEKIFSRRPS R MEDRKKNDEI P+A+ DDL+VERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRAS K
Subjt:  RTSDFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTK

Query:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
        S SERGIYVKHFKNIPMAD+EIVLPEKKNPGLTPMDWVKFLVSA+VGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD
Subjt:  SKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYD

Query:  KQLDSGRGTLLHLCDDVIQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTT
        KQLDSGRGTLLHLCDDVIQQ                    EDLDLRCEELIKEEFGE CNFEVDDAV KLEKLGI+SRD++GRYYCVGLKRANEI+GPTT
Subjt:  KQLDSGRGTLLHLCDDVIQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTT

Query:  EELVLKARQGVNP
        EELV KARQGVNP
Subjt:  EELVLKARQGVNP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G46915.1 Protein of unknown function (DUF3754)3.1e-1026.95Show/hide
Query:  ENMELSARNLLGKVTIQEPTFDRIIVVYRRAST------KSKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEM---
        + ++ S   LL   T+QEP F+ +I++Y + ++      K ++   + ++ F+ IP+ D+ ++ P KK      +D V+  +++++GL A   + +    
Subjt:  ENMELSARNLLGKVTIQEPTFDRIIVVYRRAST------KSKSERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEM---

Query:  ---PKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDVIQQE
           P A F  + AV + VI Y  +    ++Q    YQ L+ +++Y+K L SG G++  L D   QQ+
Subjt:  ---PKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDVIQQE

AT3G19340.1 Protein of unknown function (DUF3754)1.1e-20972.35Show/hide
Query:  NKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLFQGD
        NKEVIRLE ESVIP+LKPKLIMTLANLIEHS+DR EFLKLCKRIEYT+RAWYLLQFEDLMQLYSLFDPVHGAQK++QQ+L+S+EI+VLEQNFL+YLFQ  
Subjt:  NKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLFQGD

Query:  LCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGIDRTS
                                   VMEKSNFKITS+EE+E+A SGQYLLNLPI VDESKLDK LLK+YF  HP  N+PDF DKYVIFRRG G+D+T+
Subjt:  LCGSYSEHLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGIDRTS

Query:  DFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKS
        D+FFMEK+D++I RFW++L+R+TRLEK+ ++R S+   +D KK+DE  P+ + D+L VER+RLEN +LS ++ L K+TIQEPTFDR+IVVYRRAS+K+  
Subjt:  DFFFMEKIDMLIGRFWAYLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKS

Query:  ERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQL
        ERGIYVKHFKNIPMADMEIVLPEK+NPGLTPMDWVKFL+SAVVGLVAV+ S+EMPK+D WVI A+LSTV+GYCAKTYFTFQQNMATYQNLITQSMYDKQL
Subjt:  ERGIYVKHFKNIPMADMEIVLPEKKNPGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQL

Query:  DSGRGTLLHLCDDVIQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTTEEL
        DSGRGTLLHLCDDVIQQ                    EDLDLRCEELIKEEFG  CNF+V+DAVQKLEKLGI++RDTIGRYYC+GLKRANEIIG TTEEL
Subjt:  DSGRGTLLHLCDDVIQQ--------------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTTEEL

Query:  VLKARQGVNP
        VLKA+QGV P
Subjt:  VLKARQGVNP

AT5G13940.1 aminopeptidases1.3e-14656.02Show/hide
Query:  IEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLFQGDLCGSYSEHLPFAGFKLVHVTFHITLYK
        I+   +R EFL+ C+R+E TIRAWY L FEDLMQLYSLF+PV GA +L QQ+LS+ EI+ LE  FL +LFQ                             
Subjt:  IEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLFQGDLCGSYSEHLPFAGFKLVHVTFHITLYK

Query:  VMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGIDRTSDFFFMEKIDMLIGRFWAYLLRLTRLEK
        VMEKSNFK+ ++EEI++ALS QY LNLPI V+E+KLD  LL +YF+  P+ +LP F DKY+IFRRG GID    +FF+ KID ++ R W +LL +T L++
Subjt:  VMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGIDRTSDFFFMEKIDMLIGRFWAYLLRLTRLEK

Query:  -IFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPMADMEIVLPEKKN
         ++ ++    L E      +I    E+D L +ER+R+E ++LS  NL+ K+TIQEPTF+RIIVVYRR S K +SER IYVKHFK IPMADMEIVLPEKKN
Subjt:  -IFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPMADMEIVLPEKKN

Query:  PGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDVIQQ---------
        PGLTP+DWVKFLVSA +GLV VV S+ + KAD  VI A+LSTV+ YC KTYFTFQ+N+  YQ+LIT+S+YDKQLDSGRGTLLHLCD+VIQQ         
Subjt:  PGLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDVIQQ---------

Query:  ------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTTEELVLKARQG
                    E+LD++ E  IKEEF E CNF+VDDA+ KLEKLG++SRD+  +Y CV +K ANEI+G TTEE+VLKARQG
Subjt:  ------------EDLDLRCEELIKEEFGEHCNFEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTTEELVLKARQG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGAACAAGGAAGTAATTCGGCTGGAGCGTGAATCTGTCATTCCGGTGTTGAAGCCGAAGTTGATAATGACCTTGGCGAATCTAATTGAACATAGTTCTGATCG
GGCTGAGTTCTTAAAACTTTGCAAGAGAATTGAGTACACGATTCGAGCTTGGTACCTTCTACAATTTGAGGATTTGATGCAACTCTACTCTCTCTTTGATCCTGTTCATG
GAGCCCAGAAGCTGGAGCAGCAGCATCTATCCTCTGAGGAAATTGAAGTGCTTGAACAAAATTTTCTTTCTTACTTGTTTCAGGGAGATCTATGTGGTTCTTATTCTGAG
CATTTACCCTTTGCTGGTTTCAAACTTGTACATGTGACGTTCCACATTACTTTGTACAAAGTTATGGAAAAGAGCAATTTCAAAATAACAAGTGACGAGGAGATTGAAAT
TGCACTTTCAGGGCAGTATCTCCTAAATCTTCCTATTACAGTTGATGAGTCCAAGCTTGACAAGGTTCTTTTAAAGAAATATTTCGCTACACATCCTCAAGTCAACCTAC
CAGACTTTGTTGACAAGTATGTTATTTTCAGGCGAGGAACTGGAATTGACCGAACGAGTGATTTCTTTTTCATGGAGAAAATAGACATGCTTATTGGAAGATTTTGGGCA
TATCTCTTAAGGTTAACGAGGTTAGAAAAGATTTTCTCCAGAAGGCCAAGTGCACGCCTTATGGAAGATAGGAAGAAGAATGATGAGATTCCCCCTAATGCTGAACAAGA
TGACCTAAATGTTGAAAGAGTTCGTCTGGAGAATATGGAACTGAGTGCTCGCAATTTGCTGGGCAAGGTTACTATTCAAGAACCTACCTTTGATAGGATTATTGTAGTTT
ACAGGCGAGCAAGTACGAAGTCTAAATCTGAACGTGGAATATATGTCAAGCATTTTAAAAACATTCCAATGGCTGATATGGAAATAGTACTTCCTGAAAAGAAAAATCCA
GGACTGACTCCAATGGACTGGGTTAAGTTCCTTGTATCTGCTGTAGTTGGGCTGGTTGCTGTTGTGGGTTCAATTGAAATGCCCAAGGCGGATTTTTGGGTCATTTTCGC
TGTTCTCTCTACAGTTATTGGTTACTGTGCAAAGACATATTTCACGTTTCAGCAAAACATGGCTACATATCAGAACTTGATTACACAATCCATGTATGACAAACAGCTGG
ATAGTGGAAGGGGCACACTTCTTCATTTGTGTGACGACGTGATCCAACAGGAAGATCTTGATCTACGGTGTGAGGAGTTGATCAAAGAAGAGTTTGGTGAGCACTGCAAC
TTTGAAGTGGACGATGCCGTTCAAAAGTTGGAGAAATTGGGAATCATTTCTCGGGATACAATTGGACGTTATTACTGTGTCGGATTAAAACGCGCCAATGAGATCATAGG
ACCGACTACAGAGGAGCTTGTCCTGAAAGCAAGGCAGGGGGTCAATCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAGAACAAGGAAGTAATTCGGCTGGAGCGTGAATCTGTCATTCCGGTGTTGAAGCCGAAGTTGATAATGACCTTGGCGAATCTAATTGAACATAGTTCTGATCG
GGCTGAGTTCTTAAAACTTTGCAAGAGAATTGAGTACACGATTCGAGCTTGGTACCTTCTACAATTTGAGGATTTGATGCAACTCTACTCTCTCTTTGATCCTGTTCATG
GAGCCCAGAAGCTGGAGCAGCAGCATCTATCCTCTGAGGAAATTGAAGTGCTTGAACAAAATTTTCTTTCTTACTTGTTTCAGGGAGATCTATGTGGTTCTTATTCTGAG
CATTTACCCTTTGCTGGTTTCAAACTTGTACATGTGACGTTCCACATTACTTTGTACAAAGTTATGGAAAAGAGCAATTTCAAAATAACAAGTGACGAGGAGATTGAAAT
TGCACTTTCAGGGCAGTATCTCCTAAATCTTCCTATTACAGTTGATGAGTCCAAGCTTGACAAGGTTCTTTTAAAGAAATATTTCGCTACACATCCTCAAGTCAACCTAC
CAGACTTTGTTGACAAGTATGTTATTTTCAGGCGAGGAACTGGAATTGACCGAACGAGTGATTTCTTTTTCATGGAGAAAATAGACATGCTTATTGGAAGATTTTGGGCA
TATCTCTTAAGGTTAACGAGGTTAGAAAAGATTTTCTCCAGAAGGCCAAGTGCACGCCTTATGGAAGATAGGAAGAAGAATGATGAGATTCCCCCTAATGCTGAACAAGA
TGACCTAAATGTTGAAAGAGTTCGTCTGGAGAATATGGAACTGAGTGCTCGCAATTTGCTGGGCAAGGTTACTATTCAAGAACCTACCTTTGATAGGATTATTGTAGTTT
ACAGGCGAGCAAGTACGAAGTCTAAATCTGAACGTGGAATATATGTCAAGCATTTTAAAAACATTCCAATGGCTGATATGGAAATAGTACTTCCTGAAAAGAAAAATCCA
GGACTGACTCCAATGGACTGGGTTAAGTTCCTTGTATCTGCTGTAGTTGGGCTGGTTGCTGTTGTGGGTTCAATTGAAATGCCCAAGGCGGATTTTTGGGTCATTTTCGC
TGTTCTCTCTACAGTTATTGGTTACTGTGCAAAGACATATTTCACGTTTCAGCAAAACATGGCTACATATCAGAACTTGATTACACAATCCATGTATGACAAACAGCTGG
ATAGTGGAAGGGGCACACTTCTTCATTTGTGTGACGACGTGATCCAACAGGAAGATCTTGATCTACGGTGTGAGGAGTTGATCAAAGAAGAGTTTGGTGAGCACTGCAAC
TTTGAAGTGGACGATGCCGTTCAAAAGTTGGAGAAATTGGGAATCATTTCTCGGGATACAATTGGACGTTATTACTGTGTCGGATTAAAACGCGCCAATGAGATCATAGG
ACCGACTACAGAGGAGCTTGTCCTGAAAGCAAGGCAGGGGGTCAATCCTTGA
Protein sequenceShow/hide protein sequence
MGKNKEVIRLERESVIPVLKPKLIMTLANLIEHSSDRAEFLKLCKRIEYTIRAWYLLQFEDLMQLYSLFDPVHGAQKLEQQHLSSEEIEVLEQNFLSYLFQGDLCGSYSE
HLPFAGFKLVHVTFHITLYKVMEKSNFKITSDEEIEIALSGQYLLNLPITVDESKLDKVLLKKYFATHPQVNLPDFVDKYVIFRRGTGIDRTSDFFFMEKIDMLIGRFWA
YLLRLTRLEKIFSRRPSARLMEDRKKNDEIPPNAEQDDLNVERVRLENMELSARNLLGKVTIQEPTFDRIIVVYRRASTKSKSERGIYVKHFKNIPMADMEIVLPEKKNP
GLTPMDWVKFLVSAVVGLVAVVGSIEMPKADFWVIFAVLSTVIGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDVIQQEDLDLRCEELIKEEFGEHCN
FEVDDAVQKLEKLGIISRDTIGRYYCVGLKRANEIIGPTTEELVLKARQGVNP