CuGenDBv2

Tan0013111 (gene) of Snake gourd v1 genome

Gene ID: Tan0013111
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DUF1308 domain-containing protein
Genome location: LG01:20928340..20931678
RNA-Seq Expression: Tan0013111
Synteny: Tan0013111
Gene Ontology terms: NA
InterPro domains: IPR010733 - Domain of unknown function DUF1308
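
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be parsed and the genomic span computed as shown here. The function name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("LG01:20928340..20931678")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans more bases than the mRNA listed further down, which would be consistent with the presence of introns.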


Homology
GenBank top hits (e value, % identity, alignment)
XP_022923545.1 uncharacterized protein LOC111431203 isoform X1 [Cucurbita moschata] (e value: 5.4e-221; % identity: 86.24)
Query:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVD
        M EPD VELAKQRC  ++D I+ALPSSTNIT+SSS+TLHKLALRELNFLSRCSSSSS PLSLNIGHLEAIVHILQ PSV GISRVCKPIPS CSKAVYVD
Subjt:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVD

Query:  IICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVN
        IICTLNRNPVW+IVSDRKPRYISW + HRSKGLKSR+EEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATE+NFNFSD DF FSEID DWVN
Subjt:  IICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVN

Query:  VLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIALV
        VLPR YKEACVLEIKVNDRNC    +NC SK  STGV+EPE LDKY+ER+LG PFCSVV AMKPNPM     +ES SLEHLL GD DLINFDTTALIALV
Subjt:  VLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIALV

Query:  SGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKH-IMVVPDTTSKRM
        SGISNG VAKLLATPE EL+QKYKSNYDFVI QVMSEIQKPILVELSS LSGKRGIICQSVHSEFKEL+ MCGGP EKSR+N+LLKH IMVVPD  SKRM
Subjt:  SGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKH-IMVVPDTTSKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        TCLPTTRKLALKNK+VFGTGDYWNAPTLTANMSFVRAVSQTGMSL T EHRPRALTGD
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

XP_022923546.1 uncharacterized protein LOC111431203 isoform X2 [Cucurbita moschata] (e value: 2.2e-222; % identity: 86.43)
Query:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVD
        M EPD VELAKQRC  ++D I+ALPSSTNIT+SSS+TLHKLALRELNFLSRCSSSSS PLSLNIGHLEAIVHILQ PSV GISRVCKPIPS CSKAVYVD
Subjt:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVD

Query:  IICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVN
        IICTLNRNPVW+IVSDRKPRYISW + HRSKGLKSR+EEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATE+NFNFSD DF FSEID DWVN
Subjt:  IICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVN

Query:  VLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIALV
        VLPR YKEACVLEIKVNDRNC    +NC SK  STGV+EPE LDKY+ER+LG PFCSVV AMKPNPM     +ES SLEHLL GD DLINFDTTALIALV
Subjt:  VLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIALV

Query:  SGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIMVVPDTTSKRMT
        SGISNG VAKLLATPE EL+QKYKSNYDFVI QVMSEIQKPILVELSS LSGKRGIICQSVHSEFKEL+ MCGGP EKSR+N+LLKHIMVVPD  SKRMT
Subjt:  SGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIMVVPDTTSKRMT

Query:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        CLPTTRKLALKNK+VFGTGDYWNAPTLTANMSFVRAVSQTGMSL T EHRPRALTGD
Subjt:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

XP_022965050.1 uncharacterized protein LOC111465028 isoform X2 [Cucurbita maxima] (e value: 5.6e-218; % identity: 86.21)
Query:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVD
        M EPD VELAKQRC  ++D I+ALP+STNITVSSS+TLHKLALRELNFLSRCSSSSS PLSLNIGHLEAIVHILQ PSV GISRVCKPIPSSC KAVYVD
Subjt:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVD

Query:  IICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVN
        IICTLNRNPVW+IVSDRKPRYISW + HRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATE+NFNFSD DF FSEID DWVN
Subjt:  IICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVN

Query:  VLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIALV
        VLPR YKEACVLEIKVNDRNC    +N  SK  STGVDE E LDKY+ER+LG PFCSVV AMKPNPM     +ES SLEHLL  D DLINFDTTALIALV
Subjt:  VLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIALV

Query:  SGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIMVVPDTTSKRMT
        SGISNG VAKLLATPE EL+QKYKSNYDFVI QVMSEIQKPILVELSS LSGKRGIICQSVHSEFKEL+ MCGGP EKSRAN+LLKHIMVVPD  SKRM 
Subjt:  SGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIMVVPDTTSKRMT

Query:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        CLPTTRKLALKNK+VFGTGDYWNAPTLTANMSFVRAVSQTGMSL T EHRPRALTGD
Subjt:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

XP_023553240.1 uncharacterized protein LOC111810716 [Cucurbita pepo subsp. pepo] (e value: 2.4e-221; % identity: 86.43)
Query:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVD
        M EPD VEL KQRC  ++D I+ALPSSTNI+VSSS+TLHKLALRELNFLSRCSSSSS PLSLNIGHLEAIVHILQ PSV GISRVCKPIPS CSKAVYVD
Subjt:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVD

Query:  IICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVN
        IICTLNRNPVW+IVSDRKPRYISW + HRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATE+NF+FSD DF FSEID DWVN
Subjt:  IICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVN

Query:  VLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIALV
        VLPR YKEACVLEIKVNDRNC    +NC SK  STGVDEPE LDKY+ER+LG PFCSVV AMKPNPM     +ES SLEHLL GD DLINFDTTALIALV
Subjt:  VLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIALV

Query:  SGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIMVVPDTTSKRMT
        SGISNG VAKLLATPE EL+QKYKSNYDFVI QVMSEIQKPILVELSS LSGKRGIICQSVHSEFKEL+ MCGGP EKSRAN+LLKHIMVVPD  SKRMT
Subjt:  SGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIMVVPDTTSKRMT

Query:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        CLPTTRKLALKNK+VFGTGDYWNA TLTANMSFVRAVSQTGMSL T EHRPRALTGD
Subjt:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

XP_038906087.1 UPF0415 protein C7orf25 homolog [Benincasa hispida] (e value: 2.8e-225; % identity: 87.31)
Query:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVD
        M EP+ +ELAKQRC  +ID I+ LPSSTNITVSSS+TLHKLALRELNFLSRCSSSSS PLSLNIGHLEAIVHILQ PSVTGISRVCKPIPSSCSK VYVD
Subjt:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVD

Query:  IICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVN
        IICTL++NPVWVIVSDRKPRYISW+K HRSKGLKSRLEEV+DAARSLQALEPCSIILFFSHGLDQFILE+LRDEF+A E+NFNFSDFDFGFSEIDGDWVN
Subjt:  IICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVN

Query:  VLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIALV
        VLPRSY+EA VLEIKVNDR C     N  S   STGVD+PE LD Y+ER++ DPFCSVVMAMKPNPM     MESASLEH LGGDNDLINFDTTALIALV
Subjt:  VLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIALV

Query:  SGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIMVVPDTTSKRMT
        SGISNG VAKLLATPESELRQKYKSNYDFVIGQ MSEI+KPILVELSSLL+GKRGIICQSVHSEFKEL+ MCGGPNEKSRANHLLKHI+VVPD  SKRMT
Subjt:  SGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIMVVPDTTSKRMT

Query:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
Subjt:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L776 DUF1308 domain-containing protein (e value: 6.7e-217; % identity: 85.37)
Query:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIP-SSCSKAVYV
        M EP+ VELAKQRC  I+D I+ LPSSTNI+VS +QTLHKLALRELNFLSRCSSSSS PLSLNIGHLEAIVHILQ PSVTGISRVCKPIP SS S+AVYV
Subjt:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIP-SSCSKAVYV

Query:  DIICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWV
        DIICTLNRNPVWVIVSDRKPRYISW+K HRSKGLKSRLEEV+DAARSL ALEPCSIILFFSHGLDQFILERLRDEF+ATE++FNFSDFDF FSEIDGDW+
Subjt:  DIICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWV

Query:  NVLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIAL
        NVLPRSY+EACVLEIKVNDRNC    +N  SK  S+GVDEPE L+   E + GD FCSVVMAMKPNPM     MESA+ E LLGGD+DLINFDTTALIAL
Subjt:  NVLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIAL

Query:  VSGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIMVVPDTTSKRM
        VSGISNG  AKLL+ PE+ELRQKYKSNYDFVIGQ MSEI+KPILVELSSLLSGKRGIICQS HSEFKELI MCGGPNEKSRANHLLKHIMVV D  SKRM
Subjt:  VSGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIMVVPDTTSKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

A0A6J1E731 uncharacterized protein LOC111431203 isoform X2 (e value: 1.1e-222; % identity: 86.43)
Query:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVD
        M EPD VELAKQRC  ++D I+ALPSSTNIT+SSS+TLHKLALRELNFLSRCSSSSS PLSLNIGHLEAIVHILQ PSV GISRVCKPIPS CSKAVYVD
Subjt:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVD

Query:  IICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVN
        IICTLNRNPVW+IVSDRKPRYISW + HRSKGLKSR+EEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATE+NFNFSD DF FSEID DWVN
Subjt:  IICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVN

Query:  VLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIALV
        VLPR YKEACVLEIKVNDRNC    +NC SK  STGV+EPE LDKY+ER+LG PFCSVV AMKPNPM     +ES SLEHLL GD DLINFDTTALIALV
Subjt:  VLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIALV

Query:  SGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIMVVPDTTSKRMT
        SGISNG VAKLLATPE EL+QKYKSNYDFVI QVMSEIQKPILVELSS LSGKRGIICQSVHSEFKEL+ MCGGP EKSR+N+LLKHIMVVPD  SKRMT
Subjt:  SGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIMVVPDTTSKRMT

Query:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        CLPTTRKLALKNK+VFGTGDYWNAPTLTANMSFVRAVSQTGMSL T EHRPRALTGD
Subjt:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

A0A6J1EC49 uncharacterized protein LOC111431203 isoform X1 (e value: 2.6e-221; % identity: 86.24)
Query:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVD
        M EPD VELAKQRC  ++D I+ALPSSTNIT+SSS+TLHKLALRELNFLSRCSSSSS PLSLNIGHLEAIVHILQ PSV GISRVCKPIPS CSKAVYVD
Subjt:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVD

Query:  IICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVN
        IICTLNRNPVW+IVSDRKPRYISW + HRSKGLKSR+EEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATE+NFNFSD DF FSEID DWVN
Subjt:  IICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVN

Query:  VLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIALV
        VLPR YKEACVLEIKVNDRNC    +NC SK  STGV+EPE LDKY+ER+LG PFCSVV AMKPNPM     +ES SLEHLL GD DLINFDTTALIALV
Subjt:  VLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIALV

Query:  SGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKH-IMVVPDTTSKRM
        SGISNG VAKLLATPE EL+QKYKSNYDFVI QVMSEIQKPILVELSS LSGKRGIICQSVHSEFKEL+ MCGGP EKSR+N+LLKH IMVVPD  SKRM
Subjt:  SGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKH-IMVVPDTTSKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        TCLPTTRKLALKNK+VFGTGDYWNAPTLTANMSFVRAVSQTGMSL T EHRPRALTGD
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

A0A6J1HML8 uncharacterized protein LOC111465028 isoform X2 (e value: 2.7e-218; % identity: 86.21)
Query:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVD
        M EPD VELAKQRC  ++D I+ALP+STNITVSSS+TLHKLALRELNFLSRCSSSSS PLSLNIGHLEAIVHILQ PSV GISRVCKPIPSSC KAVYVD
Subjt:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVD

Query:  IICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVN
        IICTLNRNPVW+IVSDRKPRYISW + HRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATE+NFNFSD DF FSEID DWVN
Subjt:  IICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVN

Query:  VLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIALV
        VLPR YKEACVLEIKVNDRNC    +N  SK  STGVDE E LDKY+ER+LG PFCSVV AMKPNPM     +ES SLEHLL  D DLINFDTTALIALV
Subjt:  VLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIALV

Query:  SGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIMVVPDTTSKRMT
        SGISNG VAKLLATPE EL+QKYKSNYDFVI QVMSEIQKPILVELSS LSGKRGIICQSVHSEFKEL+ MCGGP EKSRAN+LLKHIMVVPD  SKRM 
Subjt:  SGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIMVVPDTTSKRMT

Query:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        CLPTTRKLALKNK+VFGTGDYWNAPTLTANMSFVRAVSQTGMSL T EHRPRALTGD
Subjt:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

A0A6J1HPY3 uncharacterized protein LOC111465028 isoform X1 (e value: 6.7e-217; % identity: 86.03)
Query:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVD
        M EPD VELAKQRC  ++D I+ALP+STNITVSSS+TLHKLALRELNFLSRCSSSSS PLSLNIGHLEAIVHILQ PSV GISRVCKPIPSSC KAVYVD
Subjt:  MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVD

Query:  IICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVN
        IICTLNRNPVW+IVSDRKPRYISW + HRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATE+NFNFSD DF FSEID DWVN
Subjt:  IICTLNRNPVWVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVN

Query:  VLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIALV
        VLPR YKEACVLEIKVNDRNC    +N  SK  STGVDE E LDKY+ER+LG PFCSVV AMKPNPM     +ES SLEHLL  D DLINFDTTALIALV
Subjt:  VLPRSYKEACVLEIKVNDRNC---VANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPM-----MESASLEHLLGGDNDLINFDTTALIALV

Query:  SGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKH-IMVVPDTTSKRM
        SGISNG VAKLLATPE EL+QKYKSNYDFVI QVMSEIQKPILVELSS LSGKRGIICQSVHSEFKEL+ MCGGP EKSRAN+LLKH IMVVPD  SKRM
Subjt:  SGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKH-IMVVPDTTSKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
         CLPTTRKLALKNK+VFGTGDYWNAPTLTANMSFVRAVSQTGMSL T EHRPRALTGD
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

SwissProt top hits (e value, % identity, alignment)
Q08AW5 UPF0415 protein C7orf25 homolog (e value: 4.3e-19; % identity: 35.93)
Query:  INFDTTALIALVSGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHI
        +N D T LI  VS +S+G    L        ++K  +       Q   E Q+ +L  L S +  K    C+S   +F+ ++   GGP EK RA  L+K I
Subjt:  INFDTTALIALVSGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHI

Query:  MVVPDTTSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
         VVPD  S+R + L  + K+  ++  +FGTG+   A T+TAN  FVRA +  G+    F H+PRALT
Subjt:  MVVPDTTSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

Q1LZE8 UPF0415 protein C7orf25 homolog (e value: 1.2e-18; % identity: 34.13)
Query:  INFDTTALIALVSGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHI
        +N D T LI  VS +S G    +        ++K  +       Q   E ++ +L +L + +  K    C+S   +F+ ++   GGP E+ RA  L+K I
Subjt:  INFDTTALIALVSGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHI

Query:  MVVPDTTSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
         VVPD  S+R   L  + K+  ++  +FGTGD   A T+TAN  FVRA +  G+    F H+PRALT
Subjt:  MVVPDTTSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

Q5BKL1 UPF0415 protein C7orf25 homolog (e value: 1.9e-19; % identity: 25.75)
Query:  DGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSS-----SSSIPLSLNIGHLEAIVHILQ--QPSVTGISRVCKPIPSSCSKAV
        D +++AKQ    +I R +AL  S    V     L      EL FL +  +       S   S N+ HL+A++   +  +  V+ +   C        +++
Subjt:  DGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSS-----SSSIPLSLNIGHLEAIVHILQ--QPSVTGISRVCKPIPSSCSKAV

Query:  YVDIICTLNRNPVWVIVSDRKPRYISWFKSHRSK-GLKSRLEEVVD--AARSLQALEPCS--IILFFSHGLDQFILERLRD----------EFRATEYNF
         VD++   N    WV    RK   +      R + G KS +E+  D   A S Q ++  S  II  F + + + + E+L++              ++ N 
Subjt:  YVDIICTLNRNPVWVIVSDRKPRYISWFKSHRSK-GLKSRLEEVVD--AARSLQALEPCS--IILFFSHGLDQFILERLRD----------EFRATEYNF

Query:  NFSDFDFGF-SEIDGDWVNVLPRSYKEACVLEIKVNDRNCVANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPMMESASLEHLLGGDNDLI
           ++  G  S+ DGD  +VL  S         KV+  N VA+                      E+    C                           +
Subjt:  NFSDFDFGF-SEIDGDWVNVLPRSYKEACVLEIKVNDRNCVANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPMMESASLEHLLGGDNDLI

Query:  NFDTTALIALVSGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIM
        N D T LI  VS +S+G         E   ++K  +       Q   E Q+ +L  L+S +  K    C+    +F+ ++   GGP EK RA  L+K I 
Subjt:  NFDTTALIALVSGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIM

Query:  VVPDTTSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
        VVPD  S+R   L ++ K+  ++  +FGTG+   A T+TAN  FVRA +  G+    F H+PRALT
Subjt:  VVPDTTSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

Q803H0 UPF0415 protein C7orf25 homolog (e value: 1.3e-20; % identity: 35.93)
Query:  INFDTTALIALVSGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHI
        +N D T LI  VS +S+G+               +      +  Q   E Q+ +L  L   + GK    CQS   +F+ ++   GGP EKSRA  LL  +
Subjt:  INFDTTALIALVSGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHI

Query:  MVVPDTTSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
         VVPD  S+R   L  + K+  ++ ++FGTGD   A T+TAN  FVRA +  G+    F H+PRALT
Subjt:  MVVPDTTSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

Q9BPX7 UPF0415 protein C7orf25 (e value: 9.5e-19; % identity: 34.13)
Query:  INFDTTALIALVSGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHI
        +N D T LI  VS +S G    +        ++K  +       Q   E ++ +L +L + +  K    C+S   +F+ ++   GGP E+ RA  L+K I
Subjt:  INFDTTALIALVSGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHI

Query:  MVVPDTTSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
         VVPD  S+R   L  + K+  ++  +FGTGD   A T+TAN  FVRA +  G+    F H+PRALT
Subjt:  MVVPDTTSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

Arabidopsis top hits (e value, % identity, alignment)
AT1G73380.1 unknown protein (e value: 2.0e-120; % identity: 54.65)
Query:  VELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSS-SSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVDIICTL
        +E+AKQRC  +I  I+ LP ST IT S  +TL KLA  EL+FLS  SS  S  PLS+NIGH+E++V ILQ PS+TG+SRVCKPIP      V+VD++CTL
Subjt:  VELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSS-SSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVDIICTL

Query:  NRNPVWVIVSDRKPRYISW-FKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNF-SDFDFGFS---EIDGDWVN
         + PVW+IVSDR PRYISW    H SKGL+SR+E+++ AA S   L+P S+ILFF++GL   + E+L+DEF A  ++F F SD D   S   + D +WVN
Subjt:  NRNPVWVIVSDRKPRYISW-FKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNF-SDFDFGFS---EIDGDWVN

Query:  VL-PRSYKEACVLEIKVNDRNCVANCCSKESSTGVDEPENLDKYLERELG--DPFCSVVMAMKPNPMMESASLEHLLGGDNDLINFDTTALIALVSGISN
        V+  RSYKEA  +EIK+ D+      C   +S    E E L +    EL   D F +V+ +M+             L G++ LINFDTTAL+ALVSGISN
Subjt:  VL-PRSYKEACVLEIKVNDRNCVANCCSKESSTGVDEPENLDKYLERELG--DPFCSVVMAMKPNPMMESASLEHLLGGDNDLINFDTTALIALVSGISN

Query:  GSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIMVVPDTTSKRMTCLPTT
        G   +L+  PE EL +K+K N  FVI Q  SEI+KP LV++ ++LSGKRGI+C+SV SEFKEL++M  GPNEK RA  LLK +MVV D  S+R+  LPTT
Subjt:  GSVAKLLATPESELRQKYKSNYDFVIGQVMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIMVVPDTTSKRMTCLPTT

Query:  RKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        RKLA+KNK VFGTGD W APTLTANM+FVRAV+Q+GMSL T +H PRALTGD
Subjt:  RKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD


Sequences
CDS sequence
ATGGAAGAACCAGATGGAGTAGAATTGGCAAAGCAAAGATGCATAGTGATTATCGACAGAATCAAAGCACTGCCTTCTTCCACCAACATCACCGTTTCAAGCAGCCAAAC
TCTCCACAAATTGGCTCTTCGCGAACTCAATTTCCTCTCTCGCTGCTCCTCCTCGTCCTCCATCCCGCTCAGCTTGAACATTGGCCACCTCGAGGCCATTGTTCACATTC
TTCAACAGCCTTCCGTCACTGGAATTTCACGCGTCTGTAAGCCGATTCCATCTTCCTGTTCGAAAGCTGTTTATGTTGATATAATCTGCACTTTGAATAGGAATCCTGTG
TGGGTTATTGTATCCGATAGAAAACCTAGGTATATTTCCTGGTTTAAGAGCCATAGAAGTAAGGGCTTGAAATCTCGACTTGAGGAAGTGGTCGATGCGGCTCGCTCTTT
GCAGGCCTTAGAACCTTGCTCGATCATCCTGTTTTTTTCGCATGGACTCGATCAGTTTATTCTGGAAAGACTTCGAGATGAGTTTAGGGCTACTGAGTATAATTTCAATT
TCTCTGATTTCGATTTTGGTTTCTCTGAGATTGATGGCGATTGGGTTAATGTGCTTCCGAGAAGCTATAAAGAAGCCTGTGTTCTTGAAATTAAAGTTAATGATAGGAAT
TGTGTTGCAAATTGCTGTAGTAAGGAAAGTTCTACTGGTGTGGATGAGCCAGAGAATTTGGACAAGTATCTGGAGAGAGAACTGGGAGATCCTTTCTGCTCTGTTGTTAT
GGCAATGAAACCTAATCCTATGATGGAATCCGCGAGTTTGGAACATTTGTTGGGTGGTGACAATGATTTAATAAACTTTGATACCACGGCGTTGATTGCATTAGTGTCAG
GCATTAGTAATGGTTCTGTTGCTAAATTATTGGCTACCCCTGAGAGTGAATTGAGACAGAAGTACAAGAGTAACTATGATTTTGTTATTGGTCAGGTGATGTCTGAAATT
CAGAAGCCCATTCTTGTGGAGTTGAGTTCTCTCTTATCTGGAAAAAGAGGTATAATATGCCAAAGTGTTCACTCTGAGTTCAAGGAACTAATTGCAATGTGTGGAGGGCC
TAATGAGAAGTCCAGAGCAAACCACTTACTAAAGCACATCATGGTTGTACCAGACACAACATCGAAACGTATGACGTGTCTTCCAACCACGAGAAAGTTGGCTTTGAAGA
ACAAGGTTGTGTTTGGTACTGGTGACTACTGGAATGCCCCAACCTTGACTGCTAACATGTCATTTGTTCGAGCAGTATCACAGACTGGGATGTCCCTTTTTACCTTTGAG
CATAGGCCGCGAGCTTTAACTGGTGATTAG
mRNA sequence
TGTGAATGCTGATTTTGGGGTTTCCTTGTCAATCCTTCTTCTTGGGCATTTTGATGAAACTTGTGACATTATGATCTTATAAGCCACAAAAATGGAAGAACCAGATGGAG
TAGAATTGGCAAAGCAAAGATGCATAGTGATTATCGACAGAATCAAAGCACTGCCTTCTTCCACCAACATCACCGTTTCAAGCAGCCAAACTCTCCACAAATTGGCTCTT
CGCGAACTCAATTTCCTCTCTCGCTGCTCCTCCTCGTCCTCCATCCCGCTCAGCTTGAACATTGGCCACCTCGAGGCCATTGTTCACATTCTTCAACAGCCTTCCGTCAC
TGGAATTTCACGCGTCTGTAAGCCGATTCCATCTTCCTGTTCGAAAGCTGTTTATGTTGATATAATCTGCACTTTGAATAGGAATCCTGTGTGGGTTATTGTATCCGATA
GAAAACCTAGGTATATTTCCTGGTTTAAGAGCCATAGAAGTAAGGGCTTGAAATCTCGACTTGAGGAAGTGGTCGATGCGGCTCGCTCTTTGCAGGCCTTAGAACCTTGC
TCGATCATCCTGTTTTTTTCGCATGGACTCGATCAGTTTATTCTGGAAAGACTTCGAGATGAGTTTAGGGCTACTGAGTATAATTTCAATTTCTCTGATTTCGATTTTGG
TTTCTCTGAGATTGATGGCGATTGGGTTAATGTGCTTCCGAGAAGCTATAAAGAAGCCTGTGTTCTTGAAATTAAAGTTAATGATAGGAATTGTGTTGCAAATTGCTGTA
GTAAGGAAAGTTCTACTGGTGTGGATGAGCCAGAGAATTTGGACAAGTATCTGGAGAGAGAACTGGGAGATCCTTTCTGCTCTGTTGTTATGGCAATGAAACCTAATCCT
ATGATGGAATCCGCGAGTTTGGAACATTTGTTGGGTGGTGACAATGATTTAATAAACTTTGATACCACGGCGTTGATTGCATTAGTGTCAGGCATTAGTAATGGTTCTGT
TGCTAAATTATTGGCTACCCCTGAGAGTGAATTGAGACAGAAGTACAAGAGTAACTATGATTTTGTTATTGGTCAGGTGATGTCTGAAATTCAGAAGCCCATTCTTGTGG
AGTTGAGTTCTCTCTTATCTGGAAAAAGAGGTATAATATGCCAAAGTGTTCACTCTGAGTTCAAGGAACTAATTGCAATGTGTGGAGGGCCTAATGAGAAGTCCAGAGCA
AACCACTTACTAAAGCACATCATGGTTGTACCAGACACAACATCGAAACGTATGACGTGTCTTCCAACCACGAGAAAGTTGGCTTTGAAGAACAAGGTTGTGTTTGGTAC
TGGTGACTACTGGAATGCCCCAACCTTGACTGCTAACATGTCATTTGTTCGAGCAGTATCACAGACTGGGATGTCCCTTTTTACCTTTGAGCATAGGCCGCGAGCTTTAA
CTGGTGATTAGTTATGTACTTCTATTTAAGATGTGTGTTCTTTTTGTCCAGGAAAAAAAAGATATAGCGGTAGTGATTGTAATTGTTGAGGAATATTGTGCGTACTGTAA
GTAATTGTTAAACAATATTGTAAGCATGAAATTACTTGTTGGGGCCATTATAACGTGGTTGTTTCTCTTATAATCTTATGGTCTATAGTTTTTTTTTTTTT
Protein sequence
MEEPDGVELAKQRCIVIIDRIKALPSSTNITVSSSQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEAIVHILQQPSVTGISRVCKPIPSSCSKAVYVDIICTLNRNPV
WVIVSDRKPRYISWFKSHRSKGLKSRLEEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFNFSDFDFGFSEIDGDWVNVLPRSYKEACVLEIKVNDRN
CVANCCSKESSTGVDEPENLDKYLERELGDPFCSVVMAMKPNPMMESASLEHLLGGDNDLINFDTTALIALVSGISNGSVAKLLATPESELRQKYKSNYDFVIGQVMSEI
QKPILVELSSLLSGKRGIICQSVHSEFKELIAMCGGPNEKSRANHLLKHIMVVPDTTSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFE
HRPRALTGD
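
The protein sequence should be the translation of the CDS sequence listed above. As a minimal sketch, assuming the standard genetic code, the check below translates the first 30 and last 12 nucleotides of the CDS. The `translate` helper is a hypothetical name written for illustration; it is not a CuGenDB tool.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 nt of the CDS; the protein begins MEEPDGVELA.
print(translate("ATGGAAGAACCAGATGGAGTAGAATTGGCA"))
# Last 12 nt of the CDS, ending in the TAG stop codon; the protein ends TGD.
print(translate("ACTGGTGATTAG"))
```

Passing the full CDS string to `translate` would allow the same comparison against the entire protein sequence.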