; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G03550 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G03550
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionDUF1308 domain-containing protein
Genome locationChr3:2893333..2896809
RNA-Seq ExpressionCSPI03G03550
SyntenyCSPI03G03550
Gene Ontology termsNA
InterPro domainsIPR010733 - Domain of unknown function DUF1308
IPR016129 - Peptidase family C14A, His active site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK19430.1 UPF0415 protein C7orf25-like protein [Cucumis melo var. makuwa]9.3e-25396.29Show/hide
Query:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV
        MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTL KLALRELNFLSRCS SSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSS+AVYV
Subjt:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV

Query:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI
        DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAA SL ALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI
Subjt:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI

Query:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL
        NVL RSY+EACVLEIKV+DRNCG TSSNYNSKVCSSGVDEP+ILN+NTEID GDSFCSVVMAMKPNPMNGIEDMESAN EQLLGGDSDLINFDTTALIAL
Subjt:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL

Query:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM
        VSGISNGCAAKLL+ PENEL+QKYKSNYDFVIGQAMSEIKKPILVEL SLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM
Subjt:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

XP_004147991.1 uncharacterized protein LOC101214095 isoform X1 [Cucumis sativus]1.6e-26098.91Show/hide
Query:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV
        MAEPNTVELAKQRCKAIMDII+TLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSS PLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV
Subjt:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV

Query:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI
        DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDF FSEIDGDWI
Subjt:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI

Query:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL
        NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFE+LLGGDSDLINFDTTALIAL
Subjt:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL

Query:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM
        VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQS HSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM
Subjt:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

XP_008448969.1 PREDICTED: UPF0415 protein C7orf25 homolog [Cucumis melo]3.2e-25396.51Show/hide
Query:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV
        MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTL KLALRELNFLSRCS SSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSS+AVYV
Subjt:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV

Query:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI
        DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAA SL ALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI
Subjt:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI

Query:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL
        NVL RSYEEACVLEIKV+DRNCG TSSNYNSKVCSSGVDEP+ILN+NTEID GDSFCSVVMAMKPNPMNGIEDMESAN EQLLGGDSDLINFDTTALIAL
Subjt:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL

Query:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM
        VSGISNGCAAKLL+ PENEL+QKYKSNYDFVIGQAMSEIKKPILVEL SLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM
Subjt:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

XP_022923546.1 uncharacterized protein LOC111431203 isoform X2 [Cucurbita moschata]1.7e-22284.5Show/hide
Query:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV
        MAEP+ VELAKQRC+A+MD+IE LPSSTNI++S ++TLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSV GISRVCKPIP S  S+AVYV
Subjt:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV

Query:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI
        DIICTLNRNPVW+IVSDRKPRYISW++GHRSKGLKSR+EEV+DAARSL ALEPCSIILFFSHGLDQFILERLRDEF+ATEF+FNFSD DF FSEID DW+
Subjt:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI

Query:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL
        NVLPR Y+EACVLEIKVNDRNCG+TSSN  SK+CS+GV+EPEIL+   E D G  FCSVV AMKPNPM GIED+ES + E LL GD+DLINFDTTALIAL
Subjt:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL

Query:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM
        VSGISNGC AKLL+ PE+EL+QKYKSNYDFVI Q MSEI+KPILVELSS LSGKRGIICQSVHSEFKEL+TMCGGP EKSR+N+LLKHIMVV DM SKRM
Subjt:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        TCLPTTRKLALKNK+VFGTGDYWNAPTLTANMSFVRAVSQTGMSL T EHRPRALTGD
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

XP_038906087.1 UPF0415 protein C7orf25 homolog [Benincasa hispida]7.4e-23489.08Show/hide
Query:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV
        MAEP T+ELAKQRC+A++DIIETLPSSTNI+VS ++TLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIP SS S+ VYV
Subjt:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV

Query:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI
        DIICTL++NPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSL ALEPCSIILFFSHGLDQFILE+LRDEFKA EF+FNFSDFDFGFSEIDGDW+
Subjt:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI

Query:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL
        NVLPRSYEEA VLEIKVNDR CGVTS NYNS  CS+GVD+PEIL+N  E D  D FCSVVMAMKPNPM GIEDMESA+ E  LGGD+DLINFDTTALIAL
Subjt:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL

Query:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM
        VSGISNGC AKLL+ PE+ELRQKYKSNYDFVIGQAMSEI+KPILVELSSLL+GKRGIICQSVHSEFKEL+TMCGGPNEKSRANHLLKHI+VV DM SKRM
Subjt:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

TrEMBL top hitse value%identityAlignment
A0A0A0L776 DUF1308 domain-containing protein7.7e-26198.91Show/hide
Query:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV
        MAEPNTVELAKQRCKAIMDII+TLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSS PLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV
Subjt:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV

Query:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI
        DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDF FSEIDGDWI
Subjt:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI

Query:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL
        NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFE+LLGGDSDLINFDTTALIAL
Subjt:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL

Query:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM
        VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQS HSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM
Subjt:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

A0A1S3BKD2 UPF0415 protein C7orf25 homolog1.6e-25396.51Show/hide
Query:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV
        MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTL KLALRELNFLSRCS SSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSS+AVYV
Subjt:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV

Query:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI
        DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAA SL ALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI
Subjt:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI

Query:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL
        NVL RSYEEACVLEIKV+DRNCG TSSNYNSKVCSSGVDEP+ILN+NTEID GDSFCSVVMAMKPNPMNGIEDMESAN EQLLGGDSDLINFDTTALIAL
Subjt:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL

Query:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM
        VSGISNGCAAKLL+ PENEL+QKYKSNYDFVIGQAMSEIKKPILVEL SLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM
Subjt:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

A0A5D3D7K2 UPF0415 protein C7orf25-like protein4.5e-25396.29Show/hide
Query:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV
        MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTL KLALRELNFLSRCS SSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSS+AVYV
Subjt:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV

Query:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI
        DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAA SL ALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI
Subjt:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI

Query:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL
        NVL RSY+EACVLEIKV+DRNCG TSSNYNSKVCSSGVDEP+ILN+NTEID GDSFCSVVMAMKPNPMNGIEDMESAN EQLLGGDSDLINFDTTALIAL
Subjt:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL

Query:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM
        VSGISNGCAAKLL+ PENEL+QKYKSNYDFVIGQAMSEIKKPILVEL SLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM
Subjt:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

A0A6J1E731 uncharacterized protein LOC111431203 isoform X28.3e-22384.5Show/hide
Query:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV
        MAEP+ VELAKQRC+A+MD+IE LPSSTNI++S ++TLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSV GISRVCKPIP S  S+AVYV
Subjt:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV

Query:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI
        DIICTLNRNPVW+IVSDRKPRYISW++GHRSKGLKSR+EEV+DAARSL ALEPCSIILFFSHGLDQFILERLRDEF+ATEF+FNFSD DF FSEID DW+
Subjt:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI

Query:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL
        NVLPR Y+EACVLEIKVNDRNCG+TSSN  SK+CS+GV+EPEIL+   E D G  FCSVV AMKPNPM GIED+ES + E LL GD+DLINFDTTALIAL
Subjt:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL

Query:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM
        VSGISNGC AKLL+ PE+EL+QKYKSNYDFVI Q MSEI+KPILVELSS LSGKRGIICQSVHSEFKEL+TMCGGP EKSR+N+LLKHIMVV DM SKRM
Subjt:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
        TCLPTTRKLALKNK+VFGTGDYWNAPTLTANMSFVRAVSQTGMSL T EHRPRALTGD
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

A0A6J1HML8 uncharacterized protein LOC111465028 isoform X21.6e-22184.72Show/hide
Query:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV
        MAEP+ VELAKQRC+A+MD+IE LP+STNI+VS ++TLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSV GISRVCKPIP SS  +AVYV
Subjt:  MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYV

Query:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI
        DIICTLNRNPVW+IVSDRKPRYISW++GHRSKGLKSRLEEV+DAARSL ALEPCSIILFFSHGLDQFILERLRDEF+ATEF+FNFSD DF FSEID DW+
Subjt:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWI

Query:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL
        NVLPR Y+EACVLEIKVNDRNCG+TSSN+NSK+CS+GVDE EIL+   E D G  FCSVV AMKPNPM GIED+ES + E LL  D+DLINFDTTALIAL
Subjt:  NVLPRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIAL

Query:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM
        VSGISNGC AKLL+ PE+EL+QKYKSNYDFVI Q MSEI+KPILVELSS LSGKRGIICQSVHSEFKEL+TMCGGP EKSRAN+LLKHIMVV DM SKRM
Subjt:  VSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
         CLPTTRKLALKNK+VFGTGDYWNAPTLTANMSFVRAVSQTGMSL T EHRPRALTGD
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD

SwissProt top hitse value%identityAlignment
Q1LZE8 UPF0415 protein C7orf25 homolog9.7e-1935.33Show/hide
Query:  INFDTTALIALVSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHI
        +N D T LI  VS +S G    +    E  L +           QA  E K+ +L +L + +  K    C+S   +F+ ++   GGP E+ RA  L+K I
Subjt:  INFDTTALIALVSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHI

Query:  MVVLDMVSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
         VV D  S+R   L  + K+  ++  +FGTGD   A T+TAN  FVRA +  G+    F H+PRALT
Subjt:  MVVLDMVSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

Q5M888 UPF0415 protein C7orf25 homolog3.3e-1927.23Show/hide
Query:  ELNFLSRCSS-----SSSTPLSLNIGHLEAIVHILQH-PSVTGISRVCKPIPSSSSSQAVYVDIICTLNRNPVWVIVSDRKPRYI-SWYKGHRSKGLKSR
        EL FL +  S       S   S N+ HL+AIV   ++   V  + RV     S    Q + VD++   N    WV    RK   + + + G    G KS 
Subjt:  ELNFLSRCSS-----SSSTPLSLNIGHLEAIVHILQH-PSVTGISRVCKPIPSSSSSQAVYVDIICTLNRNPVWVIVSDRKPRYI-SWYKGHRSKGLKSR

Query:  LEEVID---AARS--LHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGD--WINVLPRSYEEACVLEIKVNDRNCGVTSSNYN
        +E+  D   A+R   +    P  +  F++      +   + D+ K           D G S + GD   +N L    EE  + E                
Subjt:  LEEVID---AARS--LHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGD--WINVLPRSYEEACVLEIKVNDRNCGVTSSNYN

Query:  SKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIALVSGISNGCAAKLLSIPENELRQKYKSNYDF
            S   DE + L   T +D  +     V+A    P     D+               +N D T LI  VS +S G    +    E  L +        
Subjt:  SKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIALVSGISNGCAAKLLSIPENELRQKYKSNYDF

Query:  VIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTA
           QA  E K+ +L +L + +  K    C+S   +F+ ++   GGP E+ RA  L+K I VV D  S+R   L  + K+  ++  +FGTGD   A T+TA
Subjt:  VIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTA

Query:  NMSFVRAVSQTGMSLFTFEHRPRALT
        N  FVRA +  G+    F H+PRALT
Subjt:  NMSFVRAVSQTGMSLFTFEHRPRALT

Q803H0 UPF0415 protein C7orf25 homolog3.9e-2036.31Show/hide
Query:  INFDTTALIALVSGISNG-CAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKH
        +N D T LI  VS +S+G C      +   E              QA  E ++ +L  L   + GK    CQS   +F+ ++   GGP EKSRA  LL  
Subjt:  INFDTTALIALVSGISNG-CAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKH

Query:  IMVVLDMVSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
        + VV D  S+R   L  + K+  ++ ++FGTGD   A T+TAN  FVRA +  G+    F H+PRALT
Subjt:  IMVVLDMVSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

Q91WD4 UPF0415 protein C7orf25 homolog9.7e-1934.52Show/hide
Query:  INFDTTALIALVSGIS-NGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKH
        +N D T LI  VS +S  GC               +      +  QA  E K+ +L +L + +  K    C+S   +F+ ++   GGP E+ RA+ L+K 
Subjt:  INFDTTALIALVSGIS-NGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKH

Query:  IMVVLDMVSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
        I VV D  S+R   L  + K+  ++  +FGTGD   A T+TAN  FVRA +  G+    F H+PRALT
Subjt:  IMVVLDMVSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

Q9BPX7 UPF0415 protein C7orf257.4e-1935.33Show/hide
Query:  INFDTTALIALVSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHI
        +N D T LI  VS +S G    +    E  L +           QA  E K+ +L +L + +  K    C+S   +F+ ++   GGP E+ RA  L+K I
Subjt:  INFDTTALIALVSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHI

Query:  MVVLDMVSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
         VV D  S+R   L  + K+  ++  +FGTGD   A T+TAN  FVRA +  G+    F H+PRALT
Subjt:  MVVLDMVSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

Arabidopsis top hitse value%identityAlignment
AT1G73380.1 unknown protein1.8e-12454Show/hide
Query:  EPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSS-SSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYVD
        E   +E+AKQRC++++  IE LP ST I+ SC +TL KLA  EL+FLS  SS  S  PLS+NIGH+E++V ILQ PS+TG+SRVCKPIP       V+VD
Subjt:  EPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSS-SSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYVD

Query:  IICTLNRNPVWVIVSDRKPRYISWY-KGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNF-SDFDFGFS---EID
        ++CTL + PVW+IVSDR PRYISW    H SKGL+SR+E+++ AA S   L+P S+ILFF++GL   + E+L+DEF A  F F F SD D   S   + D
Subjt:  IICTLNRNPVWVIVSDRKPRYISWY-KGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNF-SDFDFGFS---EID

Query:  GDWINVL-PRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTT
         +W+NV+  RSY+EA  +EIK+ D+ C   +S     +  + V         TE+   D+F +V+ +M+                 LLG D  LINFDTT
Subjt:  GDWINVL-PRSYEEACVLEIKVNDRNCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTT

Query:  ALIALVSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDM
        AL+ALVSGISNGCA +L+ +PE EL +K+K N  FVI QA SEI+KP LV++ ++LSGKRGI+C+SV SEFKEL++M  GPNEK RA  LLK +MVV D 
Subjt:  ALIALVSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDM

Query:  VSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD
         S+R+  LPTTRKLA+KNK VFGTGD W APTLTANM+FVRAV+Q+GMSL T +H PRALTGD
Subjt:  VSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAACCAAATACAGTAGAATTGGCCAAGCAAAGATGCAAAGCGATTATGGACATAATCGAAACCCTACCTTCTTCCACCAACATCTCCGTTTCATGTACCCAAAC
TCTCCACAAATTGGCTCTTCGCGAGCTCAATTTCCTCTCTCGTTGCTCCTCCTCGTCTTCCACCCCTCTCAGCTTGAATATTGGGCACCTTGAAGCTATTGTTCACATTC
TTCAACACCCTTCCGTCACTGGAATTTCACGTGTTTGTAAGCCGATTCCATCTTCCTCCTCTTCGCAAGCTGTTTATGTTGATATAATTTGCACTTTGAATAGGAATCCT
GTGTGGGTTATTGTTTCCGATAGAAAACCTAGGTATATTTCTTGGTATAAGGGCCATAGAAGTAAGGGCTTGAAATCTCGACTTGAGGAAGTGATTGATGCGGCTCGCTC
TTTGCACGCCTTAGAACCTTGCTCGATCATTTTGTTTTTTTCGCATGGGCTTGATCAGTTTATTCTGGAAAGGCTCCGGGATGAATTCAAAGCCACTGAGTTTCATTTCA
ATTTCTCGGATTTTGATTTTGGTTTCTCTGAGATTGATGGTGATTGGATTAATGTGCTTCCAAGAAGCTATGAAGAAGCTTGTGTTCTTGAAATAAAAGTTAATGATAGG
AATTGTGGGGTTACGAGTTCAAATTATAACAGTAAAGTATGTTCTAGTGGTGTGGATGAGCCGGAGATTTTGAACAACAATACCGAGATAGATTTTGGGGATTCTTTCTG
CTCTGTTGTTATGGCAATGAAGCCTAATCCTATGAACGGTATCGAAGATATGGAATCCGCAAATTTTGAACAATTATTGGGTGGTGATAGTGATTTAATAAATTTTGATA
CCACGGCGTTGATTGCATTAGTGTCCGGCATTAGTAATGGCTGTGCTGCTAAATTATTGTCTATCCCAGAGAATGAATTGAGACAGAAGTACAAGAGTAACTATGATTTT
GTTATTGGTCAGGCAATGTCAGAAATTAAGAAGCCTATTCTTGTGGAGCTGAGTTCTCTATTGTCTGGAAAAAGAGGAATAATATGCCAGAGTGTTCACTCTGAGTTCAA
GGAACTTATAACAATGTGTGGAGGGCCGAATGAGAAGTCCAGAGCAAACCACTTACTAAAGCACATCATGGTTGTGCTGGACATGGTGTCAAAACGCATGACATGCCTCC
CGACCACGAGAAAGTTAGCTTTGAAGAACAAGGTTGTGTTCGGTACTGGTGACTACTGGAATGCCCCGACCTTAACTGCTAACATGTCATTTGTTCGAGCAGTATCGCAG
ACTGGGATGTCCCTTTTTACCTTTGAGCATAGGCCACGAGCTTTAACCGGTGATTAG
mRNA sequenceShow/hide mRNA sequence
ATTGTTTGGAAGTTATATATATATAGTTTTCTTGTCAATTCTTCTTCCCAAGGCATTAGGCATTTTGATGAAATTTGCCATTTGGGATATTTAAATGGAGAGAGTTTATC
ACCACTAAACCCCAGCAATGGCAGAACCAAATACAGTAGAATTGGCCAAGCAAAGATGCAAAGCGATTATGGACATAATCGAAACCCTACCTTCTTCCACCAACATCTCC
GTTTCATGTACCCAAACTCTCCACAAATTGGCTCTTCGCGAGCTCAATTTCCTCTCTCGTTGCTCCTCCTCGTCTTCCACCCCTCTCAGCTTGAATATTGGGCACCTTGA
AGCTATTGTTCACATTCTTCAACACCCTTCCGTCACTGGAATTTCACGTGTTTGTAAGCCGATTCCATCTTCCTCCTCTTCGCAAGCTGTTTATGTTGATATAATTTGCA
CTTTGAATAGGAATCCTGTGTGGGTTATTGTTTCCGATAGAAAACCTAGGTATATTTCTTGGTATAAGGGCCATAGAAGTAAGGGCTTGAAATCTCGACTTGAGGAAGTG
ATTGATGCGGCTCGCTCTTTGCACGCCTTAGAACCTTGCTCGATCATTTTGTTTTTTTCGCATGGGCTTGATCAGTTTATTCTGGAAAGGCTCCGGGATGAATTCAAAGC
CACTGAGTTTCATTTCAATTTCTCGGATTTTGATTTTGGTTTCTCTGAGATTGATGGTGATTGGATTAATGTGCTTCCAAGAAGCTATGAAGAAGCTTGTGTTCTTGAAA
TAAAAGTTAATGATAGGAATTGTGGGGTTACGAGTTCAAATTATAACAGTAAAGTATGTTCTAGTGGTGTGGATGAGCCGGAGATTTTGAACAACAATACCGAGATAGAT
TTTGGGGATTCTTTCTGCTCTGTTGTTATGGCAATGAAGCCTAATCCTATGAACGGTATCGAAGATATGGAATCCGCAAATTTTGAACAATTATTGGGTGGTGATAGTGA
TTTAATAAATTTTGATACCACGGCGTTGATTGCATTAGTGTCCGGCATTAGTAATGGCTGTGCTGCTAAATTATTGTCTATCCCAGAGAATGAATTGAGACAGAAGTACA
AGAGTAACTATGATTTTGTTATTGGTCAGGCAATGTCAGAAATTAAGAAGCCTATTCTTGTGGAGCTGAGTTCTCTATTGTCTGGAAAAAGAGGAATAATATGCCAGAGT
GTTCACTCTGAGTTCAAGGAACTTATAACAATGTGTGGAGGGCCGAATGAGAAGTCCAGAGCAAACCACTTACTAAAGCACATCATGGTTGTGCTGGACATGGTGTCAAA
ACGCATGACATGCCTCCCGACCACGAGAAAGTTAGCTTTGAAGAACAAGGTTGTGTTCGGTACTGGTGACTACTGGAATGCCCCGACCTTAACTGCTAACATGTCATTTG
TTCGAGCAGTATCGCAGACTGGGATGTCCCTTTTTACCTTTGAGCATAGGCCACGAGCTTTAACCGGTGATTAGTTGATGCGCACTTTTGATTTAAATGCGATATCTTTG
TTCTGTTAGTTCTGTTAGAAAAAGGAATGGTGGTAGGGAGATTTGAATTTTCAATTTTGTTTTTTCTTTCAGAATAAAATGTAGTTAGTTAAAGACAATTTCTCCCTTGT
TGTAGAATGATAAATTATTGGCCATATTTTGGTTGGTATATAAATTAAATCCGTGCATTGCATGCAGGAAAAGGAAAG
Protein sequenceShow/hide protein sequence
MAEPNTVELAKQRCKAIMDIIETLPSSTNISVSCTQTLHKLALRELNFLSRCSSSSSTPLSLNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQAVYVDIICTLNRNP
VWVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILERLRDEFKATEFHFNFSDFDFGFSEIDGDWINVLPRSYEEACVLEIKVNDR
NCGVTSSNYNSKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEQLLGGDSDLINFDTTALIALVSGISNGCAAKLLSIPENELRQKYKSNYDF
VIGQAMSEIKKPILVELSSLLSGKRGIICQSVHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQ
TGMSLFTFEHRPRALTGD