; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G02780 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G02780
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of Unknown Function (DUF239)
Genome locationClcChr01:2481568..2490649
RNA-Seq ExpressionClc01G02780
SyntenyClc01G02780
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR022212 - Domain of unknown function DUF3741
IPR025486 - Domain of unknown function DUF4378
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042752.1 DUF3741 domain-containing protein/DUF4378 domain-containing protein [Cucumis melo var. makuwa]0.0e+0084.26Show/hide
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF
        MGTKHMQSNSSMVGRVSKSHKM CKAVDRPSK+LQQPSP+SLV  SSK LDSTASTRVACCRS+RFCTCKSCMEY RHNEISL+LVQKNEA+EPFSSKKF
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF

Query:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER
        VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIED+T+SRNRKQQMMTFFDSRLS NKIREVGE +EP   + LKPCDRLP EDSDDSLSLER
Subjt:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER

Query:  IVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYNN
        IVVLKPN T SL AAVGTN+C SLKSHSS  KNGQSDK TLFSFRQIKRKMKQAMR G+KE ECLS+NG+ K+TP + RAPKDDGKQ V+ AT RSSY+ 
Subjt:  IVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYNN

Query:  IQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPSH
        IQTDDK I SSFQDSLERDQ D+AFYSRNGDKTASTSEST KKV Q AV SNLKRQKSKKHEGDKEVSRK+KAKPWGW MCFSDDDILPSNKPGCHT   
Subjt:  IQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPSH

Query:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCEESSR
         RYSHL NKKFI+EKK++ QND EQ CKTPEMVK+GA FAEA RDDDQLHASTTELNVSPVIF +  VDQDPIIEGSVK + ++ T+ QER++FCE  SR
Subjt:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCEESSR

Query:  FDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE
        FDN+ NT YCQRTNK +GF EKGNPELSK NLPLEVQPSA SVD FPSSSLQFQTVEDPNG CDR VQPLPE I +DQLL+DATSSNLA T GTAEPSSE
Subjt:  FDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE

Query:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI
        ALPINFE+DQCTGLARLQEVL+PAIASF+CC S SQC+LELLQVSKQNWNELS+DCHSSTWLQISFVDKVK+FSSQLCGDCVLLFDYFNEVLEDVFHCY+
Subjt:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI

Query:  RCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        RCS WLSSYK HIQAP+KES FYHEV+QH+DWSLLQQQPPQTLD LCLRDL+SR WI+YPTETEE+VTI+AESVLRELIIESVVYLG+
Subjt:  RCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

XP_004143902.1 uncharacterized protein LOC101217666 [Cucumis sativus]0.0e+0084.39Show/hide
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF
        MGTKHMQSNSSMVGRVSKSHKM CK VD PSK+LQQPSP+ LV ASSK LDSTASTRVACCR++RFCTCKSCMEY RHNEISL+LVQKNEA+EPFSSKKF
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF

Query:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER
        VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIE STDSRNRKQQMMTFFDSRLS NKIREVGEYEEP Y + LKPCDRLP EDSDDSLSLER
Subjt:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER

Query:  IVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYNN
        IVVLKPN T SL AAVGTN+C SLKSHSS IKNGQSDK TLFSFRQIKRKMKQAMR G+KE ECLSTNGI K+TP + R PKDDGKQ  +EATGRSSY+N
Subjt:  IVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYNN

Query:  IQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPSH
        IQTDDK I SSFQDSL RDQ D+AFYSRNGDKTASTSEST KK+ QSAV SNLKRQKSKKHEGDKEVSRK KAKPWGW MCFSDDDILPSNKPGC T   
Subjt:  IQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPSH

Query:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCEESSR
         RYSHL NKKFI+EKK+K QND E+ CKTPEMVK+GA FAEA R+DDQLHASTTELNVSPVIF +  VDQDP+IEGSVK V D+AT+ QER++FCE SSR
Subjt:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCEESSR

Query:  FDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE
        FD++ NT  CQ TNK KGF EKGNPELSKLNLPLEVQPS  SVD F SSSLQFQTVEDPNG CDR VQPLPE IH DQL++DATSSNLA TPGT EPSS 
Subjt:  FDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE

Query:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI
        ALPINFE+DQC+GLARLQEVL+PAIASF+CC S SQC+LELLQVSKQNWNELS+DCHSSTWLQISFVDKVK+FSSQLCGDCVLLFDYFNEVLEDVFHCY+
Subjt:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI

Query:  RCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        RCS WLSSYK HIQAP KESAFYHEV+QH+DWSLLQQQPPQTLD LCLRDL+SR WI+YPTETEE+VTI+AESVLRELIIESVVYLG+
Subjt:  RCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

XP_008437289.1 PREDICTED: uncharacterized protein LOC103482755 [Cucumis melo]0.0e+0084.39Show/hide
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF
        MGTKHMQSNSSMVGRVSKSHKM CKAVDRPSK+LQQPSP+SLV  SSK LDSTASTRVACCRS+RFCTCKSCMEY RHNEISL+LVQKNEA+EPFSSKKF
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF

Query:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER
        VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIED+T+SRNRKQQMMTFFDSRLS NKIREVGE +EP   + LKPCDRLP EDSDDSLSLER
Subjt:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER

Query:  IVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYNN
        IVVLKPN T SL AAVGTN+C SLKSHSS  KNGQSDK TLFSFRQIKRKMKQAMR G+KE ECLS+NG+ K+TP + RAPKDDGKQ V+ AT RSSY+ 
Subjt:  IVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYNN

Query:  IQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPSH
        IQTDDK I SSFQDSLERDQ D+AFYSRNGDKTASTSEST KKV Q AV SNLKRQKSKKHEGDKEVSRK+KAKPWGW MCFSDDDILPSNKPGCHT   
Subjt:  IQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPSH

Query:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCEESSR
         RYSHL NKKFI+EKK++ QND EQ CKTPEMVK+GA FAEA RDDDQLHASTTELNVSPVIF +  VDQDPIIEGSVK + ++ T+ QER++FCE SSR
Subjt:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCEESSR

Query:  FDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE
        FDN+ NT YCQRTNK +GF EKGNPELSK NLPLEVQPSA SVD FPSSSLQFQTVEDPNG CDR VQPLPE I +DQLL+DATSSNLA T GTAEPSSE
Subjt:  FDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE

Query:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI
        ALPINFE+DQCTGLARLQEVL+PAIASF+CC S SQC+LELLQVSKQNWNELS+DCHSSTWLQISFVDKVK+FSSQLCGDCVLLFDYFNEVLEDVFHCY+
Subjt:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI

Query:  RCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        RCS WLSSYK HIQAP+KES FYHEV+QH+DWSLLQQQPPQTLD LCLRDL+SR WI+YPTETEE+VTI+AESVLRELIIESVVYLG+
Subjt:  RCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

XP_022159509.1 uncharacterized protein LOC111025905 [Momordica charantia]0.0e+0074.46Show/hide
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSL-VTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKK
        MG KHM SNS+MVG+VS+SHK+L KAVDRP+KEL+ PSP+ L  T+S+K LD TASTRVACCRSRRFCTCKSC EYG+HNEISL+LVQKNEA EPFSSKK
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSL-VTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKK

Query:  FVGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLE
        FV VAD+QCKQLLDALGIFNSNKELF+NLLQDPNSLLI+ IEDSTDS + KQQM TF D RLS NK REVGEYEEP Y + LKPCDRLP+E+SDDS SLE
Subjt:  FVGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLE

Query:  RIVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYN
        RIVVLKPNPT S  +  GTN+C SL+SHSSFI N QSDKR+ FSFRQIKRKM+QAM  GKKE ECLS + ++KKTP V            MEATG+SS N
Subjt:  RIVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYN

Query:  NIQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPS
        NIQT  K I SS QDSL+RDQ+D+AFYSRNGDKTASTSEST K VGQSAV S+LKRQKSKKHEGDKEVSRK+K KPWGW MCFSDDDI+PSNKPG H   
Subjt:  NIQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPS

Query:  HTRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCE-ES
        H RYS LSNKKF+YEKKSK QN+ ++SC+ P+M K+ A  +E RRDDDQLH S TELN+S VIF DVKVD+D  IEGSVK + DIATIHQE N+FCE  S
Subjt:  HTRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCE-ES

Query:  SRFDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPS
        S FD++C T +CQRTNKT GF E+GN ELSKLN PLE QPSA SVD FPS S +FQ VEDPNGL DREVQP+PETIH DQLL+DATSSN  F P  AE S
Subjt:  SRFDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPS

Query:  SEALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHC
         EALPI+FE+  C GLARLQE L+P IASFNCC SISQ +LELLQ SKQNW+ELS++CHSS WL+I FVDK K+F SQLCGDCVLLFDYFNEVLEDVFHC
Subjt:  SEALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHC

Query:  YIRCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDL-RSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        YIRCSPWLSSYKAH QAPD ESA YHE++QHVDW LLQQQ PQTL+ L LRDL RSR WI+Y TETE++VTI+AES+LREL IESVVYLG+
Subjt:  YIRCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDL-RSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

XP_038874729.1 uncharacterized protein LOC120067270 [Benincasa hispida]0.0e+0088.71Show/hide
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF
        MGTKHMQSNS+MVGRVSKS KMLCKAV RPSKELQQPSP+SLVTASSK LDSTASTRVACCRSRRFCTCKSCMEYGRHNEISL+LVQKNEAAEPFSSKKF
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF

Query:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER
        VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFF SRLS NKIREVGEYEEPD S+ L PCDRLP +DSDDSLSLER
Subjt:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER

Query:  IVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYNN
        IVVLKPNPT SL AAVGTN+C SLKSHSSFIKNGQSDK TLFSFRQIK KMKQAM   KKE ECLSTNG+ KKT PV RAPKD+GKQMVM ATGRSSY+N
Subjt:  IVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYNN

Query:  IQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPSH
        IQTDDK I SSFQDSLERDQLD+AFYSRNGDKTASTSE TDKKVGQSAVTSNLKRQKSKKHEGD+EVSRK+KAKPWGW MCFSDDDILPSNKPGCHT S 
Subjt:  IQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPSH

Query:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCEESSR
         RYS+LSNKKFI+EKKSK QND EQSC T +MVK GA  A ARRDD+Q+HASTTELNVSPVIFSDVKVDQDPIIEGSVK + D ATI QERN+FCEESSR
Subjt:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCEESSR

Query:  FDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE
        FDNNCN  YCQRTNK KG  EKGNPELSKLN PLEVQP A SV+ FPSSSLQFQTVEDP+GLCDR+VQPLPETIH  QLL+DATSSNLAFT GTAEPSSE
Subjt:  FDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE

Query:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI
        ALPINFE+DQCTGLARLQEVLNPA+ASFNCCSSISQ VLELLQVSKQNWNELSLDCHSSTWLQISFVDKVK+FSSQLCGDCV+LFDYFNEVLEDVFHCYI
Subjt:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI

Query:  RCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        RCSPWLSSYKAHIQAPDKES FYHEV+QHVDWSLLQQQPPQTLDQLCLRDLRSR WINYPTETEE+VTI+ ESVLRELI+ESVVYLG+
Subjt:  RCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

TrEMBL top hitse value%identityAlignment
A0A0A0KNW5 Uncharacterized protein0.0e+0084.39Show/hide
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF
        MGTKHMQSNSSMVGRVSKSHKM CK VD PSK+LQQPSP+ LV ASSK LDSTASTRVACCR++RFCTCKSCMEY RHNEISL+LVQKNEA+EPFSSKKF
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF

Query:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER
        VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIE STDSRNRKQQMMTFFDSRLS NKIREVGEYEEP Y + LKPCDRLP EDSDDSLSLER
Subjt:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER

Query:  IVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYNN
        IVVLKPN T SL AAVGTN+C SLKSHSS IKNGQSDK TLFSFRQIKRKMKQAMR G+KE ECLSTNGI K+TP + R PKDDGKQ  +EATGRSSY+N
Subjt:  IVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYNN

Query:  IQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPSH
        IQTDDK I SSFQDSL RDQ D+AFYSRNGDKTASTSEST KK+ QSAV SNLKRQKSKKHEGDKEVSRK KAKPWGW MCFSDDDILPSNKPGC T   
Subjt:  IQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPSH

Query:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCEESSR
         RYSHL NKKFI+EKK+K QND E+ CKTPEMVK+GA FAEA R+DDQLHASTTELNVSPVIF +  VDQDP+IEGSVK V D+AT+ QER++FCE SSR
Subjt:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCEESSR

Query:  FDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE
        FD++ NT  CQ TNK KGF EKGNPELSKLNLPLEVQPS  SVD F SSSLQFQTVEDPNG CDR VQPLPE IH DQL++DATSSNLA TPGT EPSS 
Subjt:  FDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE

Query:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI
        ALPINFE+DQC+GLARLQEVL+PAIASF+CC S SQC+LELLQVSKQNWNELS+DCHSSTWLQISFVDKVK+FSSQLCGDCVLLFDYFNEVLEDVFHCY+
Subjt:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI

Query:  RCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        RCS WLSSYK HIQAP KESAFYHEV+QH+DWSLLQQQPPQTLD LCLRDL+SR WI+YPTETEE+VTI+AESVLRELIIESVVYLG+
Subjt:  RCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

A0A1S3AU75 uncharacterized protein LOC1034827550.0e+0084.39Show/hide
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF
        MGTKHMQSNSSMVGRVSKSHKM CKAVDRPSK+LQQPSP+SLV  SSK LDSTASTRVACCRS+RFCTCKSCMEY RHNEISL+LVQKNEA+EPFSSKKF
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF

Query:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER
        VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIED+T+SRNRKQQMMTFFDSRLS NKIREVGE +EP   + LKPCDRLP EDSDDSLSLER
Subjt:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER

Query:  IVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYNN
        IVVLKPN T SL AAVGTN+C SLKSHSS  KNGQSDK TLFSFRQIKRKMKQAMR G+KE ECLS+NG+ K+TP + RAPKDDGKQ V+ AT RSSY+ 
Subjt:  IVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYNN

Query:  IQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPSH
        IQTDDK I SSFQDSLERDQ D+AFYSRNGDKTASTSEST KKV Q AV SNLKRQKSKKHEGDKEVSRK+KAKPWGW MCFSDDDILPSNKPGCHT   
Subjt:  IQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPSH

Query:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCEESSR
         RYSHL NKKFI+EKK++ QND EQ CKTPEMVK+GA FAEA RDDDQLHASTTELNVSPVIF +  VDQDPIIEGSVK + ++ T+ QER++FCE SSR
Subjt:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCEESSR

Query:  FDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE
        FDN+ NT YCQRTNK +GF EKGNPELSK NLPLEVQPSA SVD FPSSSLQFQTVEDPNG CDR VQPLPE I +DQLL+DATSSNLA T GTAEPSSE
Subjt:  FDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE

Query:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI
        ALPINFE+DQCTGLARLQEVL+PAIASF+CC S SQC+LELLQVSKQNWNELS+DCHSSTWLQISFVDKVK+FSSQLCGDCVLLFDYFNEVLEDVFHCY+
Subjt:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI

Query:  RCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        RCS WLSSYK HIQAP+KES FYHEV+QH+DWSLLQQQPPQTLD LCLRDL+SR WI+YPTETEE+VTI+AESVLRELIIESVVYLG+
Subjt:  RCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

A0A5A7TH74 DUF3741 domain-containing protein/DUF4378 domain-containing protein0.0e+0084.26Show/hide
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF
        MGTKHMQSNSSMVGRVSKSHKM CKAVDRPSK+LQQPSP+SLV  SSK LDSTASTRVACCRS+RFCTCKSCMEY RHNEISL+LVQKNEA+EPFSSKKF
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF

Query:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER
        VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIED+T+SRNRKQQMMTFFDSRLS NKIREVGE +EP   + LKPCDRLP EDSDDSLSLER
Subjt:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER

Query:  IVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYNN
        IVVLKPN T SL AAVGTN+C SLKSHSS  KNGQSDK TLFSFRQIKRKMKQAMR G+KE ECLS+NG+ K+TP + RAPKDDGKQ V+ AT RSSY+ 
Subjt:  IVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYNN

Query:  IQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPSH
        IQTDDK I SSFQDSLERDQ D+AFYSRNGDKTASTSEST KKV Q AV SNLKRQKSKKHEGDKEVSRK+KAKPWGW MCFSDDDILPSNKPGCHT   
Subjt:  IQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPSH

Query:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCEESSR
         RYSHL NKKFI+EKK++ QND EQ CKTPEMVK+GA FAEA RDDDQLHASTTELNVSPVIF +  VDQDPIIEGSVK + ++ T+ QER++FCE  SR
Subjt:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCEESSR

Query:  FDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE
        FDN+ NT YCQRTNK +GF EKGNPELSK NLPLEVQPSA SVD FPSSSLQFQTVEDPNG CDR VQPLPE I +DQLL+DATSSNLA T GTAEPSSE
Subjt:  FDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE

Query:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI
        ALPINFE+DQCTGLARLQEVL+PAIASF+CC S SQC+LELLQVSKQNWNELS+DCHSSTWLQISFVDKVK+FSSQLCGDCVLLFDYFNEVLEDVFHCY+
Subjt:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI

Query:  RCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        RCS WLSSYK HIQAP+KES FYHEV+QH+DWSLLQQQPPQTLD LCLRDL+SR WI+YPTETEE+VTI+AESVLRELIIESVVYLG+
Subjt:  RCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

A0A6J1E2K4 uncharacterized protein LOC1110259050.0e+0074.46Show/hide
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSL-VTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKK
        MG KHM SNS+MVG+VS+SHK+L KAVDRP+KEL+ PSP+ L  T+S+K LD TASTRVACCRSRRFCTCKSC EYG+HNEISL+LVQKNEA EPFSSKK
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSL-VTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKK

Query:  FVGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLE
        FV VAD+QCKQLLDALGIFNSNKELF+NLLQDPNSLLI+ IEDSTDS + KQQM TF D RLS NK REVGEYEEP Y + LKPCDRLP+E+SDDS SLE
Subjt:  FVGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLE

Query:  RIVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYN
        RIVVLKPNPT S  +  GTN+C SL+SHSSFI N QSDKR+ FSFRQIKRKM+QAM  GKKE ECLS + ++KKTP V            MEATG+SS N
Subjt:  RIVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYN

Query:  NIQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPS
        NIQT  K I SS QDSL+RDQ+D+AFYSRNGDKTASTSEST K VGQSAV S+LKRQKSKKHEGDKEVSRK+K KPWGW MCFSDDDI+PSNKPG H   
Subjt:  NIQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPS

Query:  HTRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCE-ES
        H RYS LSNKKF+YEKKSK QN+ ++SC+ P+M K+ A  +E RRDDDQLH S TELN+S VIF DVKVD+D  IEGSVK + DIATIHQE N+FCE  S
Subjt:  HTRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCE-ES

Query:  SRFDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPS
        S FD++C T +CQRTNKT GF E+GN ELSKLN PLE QPSA SVD FPS S +FQ VEDPNGL DREVQP+PETIH DQLL+DATSSN  F P  AE S
Subjt:  SRFDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPS

Query:  SEALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHC
         EALPI+FE+  C GLARLQE L+P IASFNCC SISQ +LELLQ SKQNW+ELS++CHSS WL+I FVDK K+F SQLCGDCVLLFDYFNEVLEDVFHC
Subjt:  SEALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHC

Query:  YIRCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDL-RSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        YIRCSPWLSSYKAH QAPD ESA YHE++QHVDW LLQQQ PQTL+ L LRDL RSR WI+Y TETE++VTI+AES+LREL IESVVYLG+
Subjt:  YIRCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDL-RSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

A0A6J1K828 uncharacterized protein LOC1114910108.5e-29870.13Show/hide
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF
        MG KHMQSNSSMVG+ SKSHKM+CKA++RPS+E QQPSP+SL  ASS         RVACCR++R CTCK CME+GRHNEISL+LVQKNEAAEPF SKKF
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF

Query:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER
        VGVADKQCKQLLDALGIFNSN+ELFVNLL DPNSLLIKRIEDSTDSR+RKQQ  T+F+ RLS NKI+EVGEYEEP +S  LKPC      DSDDSLSLER
Subjt:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER

Query:  IVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYNN
        IVVLKPNP+ SL AAVGTN C SLKSHSSFIKN +S+K TLFSFRQIKRKMKQAMR G+KE ECLSTNG++ KTP V+R PKD                 
Subjt:  IVVLKPNPTVSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYNN

Query:  IQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPSH
                          DQLD+ FYSRNGDKTASTSES   KVGQSAVTS+LKRQKSKK EGDKEV RK+KAKPWGW MCFSDDDILPSNKPGC T +H
Subjt:  IQTDDKIIPSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPSH

Query:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCE-ESS
        TR+S LSNKKFIYE KSKSQND +Q C+TP+M         ARR+DDQ   S  ELNVS V+  DVKV++ P IEGS+K V DIATI QE N FCE  SS
Subjt:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCE-ESS

Query:  RFDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSS
         FDN  N           GF EK + ELSK NL LE Q S        SS  +FQTVEDPNGLCDREVQPLPET H   L+  A SS+LAFT  TA+ S+
Subjt:  RFDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSS

Query:  -EALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHC
         EALPI+FE+ QCTG A LQEV++PAI++FNCC S+S+ VLELLQVS QNWNELS+DCHSS WL+I FVDKVK+F +QLCGDCVL+FDYFNEVLEDVFHC
Subjt:  -EALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHC

Query:  YIRCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        YIRCSPWLSSYKAHIQAP+KE+  YHEV+QH+DW LLQQQPPQTLD LCLRDLR R WINYPTETEE+VTI+AESVLREL+IESVVYLG+
Subjt:  YIRCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G20170.1 Protein of Unknown Function (DUF239)4.6e-5435.22Show/hide
Query:  EDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPKGLIG-DTSKVERLLQDLPNINSCPPGSVPIRRTTREDLIA
        ++E+ EL   L  +NKPAIKSF+T+ G I+DC+DI KQ + DHPLLKNH+IQ+KPT IPK     +T K   L        SCP G+V I+RTT EDLI 
Subjt:  EDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPKGLIG-DTSKVERLLQDLPNINSCPPGSVPIRRTTREDLIA

Query:  ARSFKPLWSDQATDNLRPSTTIDAAGYQLATLNYRAKVYGAKSQINVWNPIPSVDQYSSASMWLSQGFRDQQNTIQVGW---------------------
         +  K L     T   +    ++  G   A   Y    YGA   IN+W+P  + DQ+S AS+++  GFRD   +I  GW                     
Subjt:  ARSFKPLWSDQATDNLRPSTTIDAAGYQLATLNYRAKVYGAKSQINVWNPIPSVDQYSSASMWLSQGFRDQQNTIQVGW---------------------

Query:  ---------------------------------------------GDASTGNWLFMLGDKYIGYWPKGL--LPGLENGATTAAWGGEIYSPTTEA-GPAM
                                                      D  TGNW F++ ++ IGYWPK L  + GL  GA+   WGGE++S   ++  P M
Subjt:  ---------------------------------------------GDASTGNWLFMLGDKYIGYWPKGL--LPGLENGATTAAWGGEIYSPTTEA-GPAM

Query:  GSSHFPEEGFSKSAFVNQIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC
        GS HFP+EGF K+AFVN ++V +  I +    PV   L +  + P C+ +  K      W   IF+GGP GC
Subjt:  GSSHFPEEGFSKSAFVNQIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC

AT2G20170.2 Protein of Unknown Function (DUF239)4.6e-5435.22Show/hide
Query:  EDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPKGLIG-DTSKVERLLQDLPNINSCPPGSVPIRRTTREDLIA
        ++E+ EL   L  +NKPAIKSF+T+ G I+DC+DI KQ + DHPLLKNH+IQ+KPT IPK     +T K   L        SCP G+V I+RTT EDLI 
Subjt:  EDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPKGLIG-DTSKVERLLQDLPNINSCPPGSVPIRRTTREDLIA

Query:  ARSFKPLWSDQATDNLRPSTTIDAAGYQLATLNYRAKVYGAKSQINVWNPIPSVDQYSSASMWLSQGFRDQQNTIQVGW---------------------
         +  K L     T   +    ++  G   A   Y    YGA   IN+W+P  + DQ+S AS+++  GFRD   +I  GW                     
Subjt:  ARSFKPLWSDQATDNLRPSTTIDAAGYQLATLNYRAKVYGAKSQINVWNPIPSVDQYSSASMWLSQGFRDQQNTIQVGW---------------------

Query:  ---------------------------------------------GDASTGNWLFMLGDKYIGYWPKGL--LPGLENGATTAAWGGEIYSPTTEA-GPAM
                                                      D  TGNW F++ ++ IGYWPK L  + GL  GA+   WGGE++S   ++  P M
Subjt:  ---------------------------------------------GDASTGNWLFMLGDKYIGYWPKGL--LPGLENGATTAAWGGEIYSPTTEA-GPAM

Query:  GSSHFPEEGFSKSAFVNQIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC
        GS HFP+EGF K+AFVN ++V +  I +    PV   L +  + P C+ +  K      W   IF+GGP GC
Subjt:  GSSHFPEEGFSKSAFVNQIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC

AT2G20170.3 Protein of Unknown Function (DUF239)4.9e-5636.9Show/hide
Query:  EDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPKGLIG-DTSKVERLLQDLPNINSCPPGSVPIRRTTREDLIA
        ++E+ EL   L  +NKPAIKSF+T+ G I+DC+DI KQ + DHPLLKNH+IQ+KPT IPK     +T K   L        SCP G+V I+RTT EDLI 
Subjt:  EDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPKGLIG-DTSKVERLLQDLPNINSCPPGSVPIRRTTREDLIA

Query:  ARSFKPLWSDQATDNLRPSTTIDAAGYQLATLNYRAKVYGAKSQINVWNPIPSVDQYSSASMWLSQGFRDQQNTIQVGW---------------------
         +  K L     T   +    ++  G   A   Y    YGA   IN+W+P  + DQ+S AS+++  GFRD   +I  GW                     
Subjt:  ARSFKPLWSDQATDNLRPSTTIDAAGYQLATLNYRAKVYGAKSQINVWNPIPSVDQYSSASMWLSQGFRDQQNTIQVGW---------------------

Query:  ----------------------------GDASTGNWLFMLGDKYIGYWPKGL--LPGLENGATTAAWGGEIYSPTTEA-GPAMGSSHFPEEGFSKSAFVN
                                     D  TGNW F++ ++ IGYWPK L  + GL  GA+   WGGE++S   ++  P MGS HFP+EGF K+AFVN
Subjt:  ----------------------------GDASTGNWLFMLGDKYIGYWPKGL--LPGLENGATTAAWGGEIYSPTTEA-GPAMGSSHFPEEGFSKSAFVN

Query:  QIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC
         ++V +  I +    PV   L +  + P C+ +  K      W   IF+GGP GC
Subjt:  QIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC

AT4G23380.1 Protein of Unknown Function (DUF239)9.9e-4933.92Show/hide
Query:  VLRELIIESVVYLGVDALTQLQMLSEDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPKGLI---GDTSKVERL
        VLR L+  S+V +   A     + SE+E+ E+  QLK +NKPAIKSFKTE  +I DC+DI+KQ + DH LL+NH++++KPT++PK  I       KV  +
Subjt:  VLRELIIESVVYLGVDALTQLQMLSEDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPKGLI---GDTSKVERL

Query:  LQDLPNINSCPPGSVPIRRTTREDLIAARSFKPLWSDQATDNLRPSTTIDAAGYQLATLNY-RAKVYGAKSQINVWNPIPSVDQYSSASMWLSQGFR-DQ
           L  I SCP G+V ++RTT +DLI ++  K +  +     L   + ID +G+  AT +Y    V G    IN+W+P  S DQ S A+M ++ G + +Q
Subjt:  LQDLPNINSCPPGSVPIRRTTREDLIAARSFKPLWSDQATDNLRPSTTIDAAGYQLATLNY-RAKVYGAKSQINVWNPIPSVDQYSSASMWLSQGFR-DQ

Query:  QNTIQVGW--------------------GDASTGNW-----------------------------------------LFMLGDKYIGYWPKGLL--PGLE
          +I VGW                    G   TG +                                            +    +GYWP+ L    GL 
Subjt:  QNTIQVGW--------------------GDASTGNW-----------------------------------------LFMLGDKYIGYWPKGLL--PGLE

Query:  NGATTAAWGGEIYSPTTEAGPAMGSSHFPEEGFSKSAFVNQIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC
         GA  A+WGG++YSP TE  P MGS HFP+EGF K+AFVN I +         + P    +      P C+       ED  W   ++FGGP GC
Subjt:  NGATTAAWGGEIYSPTTEAGPAMGSSHFPEEGFSKSAFVNQIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC

AT4G23390.1 Protein of Unknown Function (DUF239)1.2e-5434.93Show/hide
Query:  SEDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPK-----GLIGDTSKVERLLQDLPNINSCPPGSVPIRRTTR
        S++E+ E+   L  LNKPA+KSF+TE G I DC+DI KQ + DHPLLKNH+I++KPTTIPK          TS +     D+    SCP G+V ++R   
Subjt:  SEDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPK-----GLIGDTSKVERLLQDLPNINSCPPGSVPIRRTTR

Query:  EDLIAARSFKPLWSDQATDNLRPSTTIDAAGYQLATLNYRAKVYGAKSQINVWNPIPSVDQYSSASM--------------------------------W
        EDLI A+  + L  +           ID  G+  AT++Y+   YGAK  INVWNP  S DQ+S A+M                                W
Subjt:  EDLIAARSFKPLWSDQATDNLRPSTTIDAAGYQLATLNYRAKVYGAKSQINVWNPIPSVDQYSSASM--------------------------------W

Query:  LSQG--------------------------------FRDQQNTIQVG-WGDASTGNWLFMLGDKYIGYWPKGLL--PGLENGATTAAWGGEIYSPTTEAG
         + G                                +  QQ  ++V  + D  T +W F+L ++ IGYWPK L    GL +GA+   WGGE+YS   E  
Subjt:  LSQG--------------------------------FRDQQNTIQVG-WGDASTGNWLFMLGDKYIGYWPKGLL--PGLENGATTAAWGGEIYSPTTEAG

Query:  PAMGSSHFPEEGFSKSAFVNQIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC
        P+MGS HFP+EGF K+A+VN +++  + I++    P+ S L      P C+ +         W   I FGGP GC
Subjt:  PAMGSSHFPEEGFSKSAFVNQIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAACAAAGCATATGCAATCTAATTCTAGCATGGTTGGAAGAGTCTCAAAAAGCCACAAAATGCTGTGCAAAGCTGTTGATAGACCTAGCAAAGAACTTCAACAGCC
ATCTCCTAGAAGTTTGGTGACTGCATCATCAAAGAATCTAGATTCAACAGCATCAACAAGAGTAGCTTGCTGCAGAAGTCGAAGATTTTGTACTTGTAAAAGTTGTATGG
AGTATGGCCGACATAATGAGATTAGCCTAAGATTGGTTCAGAAGAATGAAGCGGCTGAGCCATTTTCAAGTAAAAAGTTTGTTGGTGTGGCTGATAAACAGTGTAAACAA
TTATTAGATGCATTGGGAATTTTCAATTCAAATAAGGAATTGTTTGTAAATCTACTACAAGACCCAAATTCTCTGTTAATTAAACGTATTGAAGACTCTACTGATTCAAG
GAACAGAAAACAGCAGATGATGACTTTCTTTGATAGTAGGTTGTCTGGAAACAAGATAAGAGAAGTGGGGGAATATGAGGAGCCTGACTACAGTAAAAAATTGAAGCCCT
GTGATAGATTACCCACTGAGGATAGCGATGATTCTCTATCCTTGGAAAGAATAGTGGTATTAAAGCCAAATCCAACTGTCTCACTACCTGCGGCTGTGGGAACCAATCAT
TGCCCCTCTCTGAAATCTCATTCTAGTTTCATAAAGAATGGGCAAAGTGACAAGAGAACTCTTTTTTCTTTTAGACAAATAAAGAGGAAGATGAAGCAAGCAATGAGGAC
AGGGAAAAAAGAACCCGAATGCCTATCAACTAATGGTATCGCCAAGAAAACTCCACCAGTTTTTAGGGCCCCAAAAGATGATGGTAAACAGATGGTTATGGAGGCAACTG
GAAGAAGTTCCTACAATAATATTCAAACAGATGATAAAATAATTCCTAGTTCATTTCAAGATTCCCTGGAAAGGGATCAACTAGACAGGGCATTTTACTCTAGAAATGGG
GACAAGACGGCTTCAACCAGTGAAAGTACTGACAAAAAGGTAGGCCAGTCAGCTGTGACAAGTAATCTCAAACGGCAGAAATCTAAGAAGCATGAAGGGGACAAAGAGGT
CTCAAGAAAAATTAAAGCAAAACCATGGGGGTGGGCGATGTGCTTTTCTGATGATGACATATTGCCATCAAATAAACCTGGATGTCATACTCCAAGCCATACGAGATATT
CCCACCTTAGCAATAAGAAGTTCATTTACGAGAAGAAGTCAAAATCTCAGAATGACGTGGAACAAAGTTGCAAAACACCAGAAATGGTTAAAATAGGAGCTCCTTTTGCA
GAGGCAAGGAGAGATGATGACCAATTGCACGCCTCAACTACAGAGTTGAATGTGTCACCCGTCATTTTTTCTGATGTCAAAGTGGATCAAGATCCAATTATTGAAGGGTC
TGTGAAGTTCGTAAACGACATTGCCACAATACACCAGGAAAGAAATAGTTTTTGTGAAGAGTCATCTAGATTTGATAACAATTGCAACACATGTTATTGTCAAAGAACCA
ACAAGACCAAGGGCTTTGAAGAGAAAGGAAATCCAGAGCTCTCTAAACTGAATTTGCCTTTGGAGGTTCAACCATCAGCTCTTTCAGTAGATCCATTTCCATCCAGCTCA
TTACAATTTCAGACAGTGGAAGATCCTAATGGTTTGTGTGATAGAGAAGTGCAGCCTCTACCAGAAACTATTCATGACGACCAACTTTTGATAGATGCTACCTCTAGTAA
TCTTGCTTTCACCCCAGGAACAGCTGAGCCATCTAGTGAAGCACTCCCCATTAATTTTGAAAAGGACCAGTGTACTGGTTTGGCAAGGTTGCAAGAGGTTCTCAATCCCG
CCATCGCTTCCTTTAACTGTTGTAGCTCCATCTCTCAGTGTGTACTTGAGCTGCTGCAAGTCTCAAAACAGAATTGGAATGAATTGTCATTGGATTGTCATTCTTCAACT
TGGCTGCAGATATCATTTGTTGACAAAGTGAAGATATTTAGTAGCCAGTTATGTGGTGATTGTGTGCTTCTTTTCGACTATTTTAATGAAGTCCTTGAGGATGTTTTCCA
CTGTTATATTAGATGCTCCCCATGGTTATCATCTTATAAGGCACACATTCAAGCACCTGATAAGGAAAGCGCTTTCTATCACGAGGTTATCCAACATGTGGATTGGTCAC
TTCTGCAGCAGCAGCCACCACAAACACTGGACCAACTTTGTTTAAGAGACTTGAGATCTAGAAAGTGGATCAATTATCCAACTGAAACTGAAGAGCTCGTTACCATTGTA
GCGGAATCGGTTTTAAGAGAATTAATCATTGAAAGTGTTGTTTACCTTGGTGTAGATGCTCTCACACAACTCCAAATGTTGTCTGAAGATGAACAGTTGGAACTCAACAC
ACAGCTGAAACAACTCAACAAACCTGCAATCAAGAGTTTCAAGACAGAATTTGGTGATATTATAGATTGTGTTGACATCTACAAACAACCTTCTCTTGATCATCCTTTGC
TCAAAAACCATGCAATCCAGATGAAGCCAACAACAATCCCGAAAGGGCTGATAGGCGATACATCAAAAGTCGAGAGGCTGCTGCAGGATCTTCCAAACATTAACAGCTGC
CCACCAGGATCAGTGCCAATCAGAAGGACTACAAGGGAAGATCTGATAGCAGCAAGAAGTTTCAAGCCTTTGTGGTCAGATCAAGCAACAGATAATCTCCGGCCAAGCAC
TACGATCGACGCCGCCGGTTATCAGCTTGCAACACTCAACTATAGGGCCAAAGTCTATGGAGCAAAATCACAAATCAATGTATGGAACCCAATTCCATCAGTGGATCAAT
ATAGTTCTGCTAGTATGTGGCTATCTCAAGGCTTTAGAGATCAACAGAACACCATACAAGTTGGTTGGGGAGATGCGAGCACGGGGAACTGGTTGTTCATGCTTGGAGAC
AAATACATTGGGTACTGGCCAAAGGGACTGCTGCCAGGCTTGGAAAATGGAGCAACTACTGCAGCATGGGGAGGGGAAATTTACAGCCCTACAACAGAAGCAGGGCCAGC
CATGGGGAGTAGCCATTTTCCTGAAGAGGGTTTCAGTAAAAGTGCTTTTGTGAATCAGATTCAAGTGGCAGAGTCTAGTATTTCAAGGGGATTTGTTGATCCAGTGGGTT
CACAGCTCAGTATTGTTTTGGACAAACCTATCTGTTTTGGGCTCATTAATAAGTTTACTGAAGATGGGAACTGGGGACATCATATCTTCTTTGGAGGGCCAAGTGGCTGC
AGGTGA
mRNA sequenceShow/hide mRNA sequence
CCCAATTTCAAATTTCAAAAACAGAGTAATAATAAAAAAACTGAACCAATTGCCTACAAGCGTCCCCACTGGTCTTTGTCCTCACTCAAAATTGCTCATCAATGGCAGGT
CCTGTAAGCAATAACACGACGTCGTTTCCTCAAATACCGAGACCCTCTACACCCCAGTACGACGTCGTACGCAGCACAATCCAAGCCCAAAACTTCGAAAGAGTAACATG
GAGTTGGGGCAGAGCGAGGGAAAGCAACCTTACAATTGCTTGAGTGTAAAGCTTTCTTCCTGCAAATTGGCAGTCCATTTCGCCATTAATCAATCCAAAATCCAATACCC
ATAATTCAACTTTCTTCTCTTTTCGTTTTCTATATTTGTCCCTTTTCAATTCGCTTTTGTTTCTCCTTCTCTCTCTTCCCCCACCAGTTTCCTCTTCGTCTTCTTCGTCT
TTCCCTGTTATTCCTCTCTTTTACATAGCTGTTTCCCCTGTTCATGTACGAGTAATGGGTCCATCTCCATTTTCTCCTTTTGCTTCTTGTGGATTCAAGCTGACCCAGGG
TCATCAACAAGACTAGTATCACGGGAGAAAAGAACCGATCGGCATGGAGCAGAGGAAGCTGTTCTTGGACCACATGTCCTTGAATTATTTTCATCTGGGCAATAGATTAT
GAAACAACCGCTGGTCAAACAAACAAGGGGTTTCTATAGAACAACGCATAAATAGATTCTGCAATGGGAACAAAGCATATGCAATCTAATTCTAGCATGGTTGGAAGAGT
CTCAAAAAGCCACAAAATGCTGTGCAAAGCTGTTGATAGACCTAGCAAAGAACTTCAACAGCCATCTCCTAGAAGTTTGGTGACTGCATCATCAAAGAATCTAGATTCAA
CAGCATCAACAAGAGTAGCTTGCTGCAGAAGTCGAAGATTTTGTACTTGTAAAAGTTGTATGGAGTATGGCCGACATAATGAGATTAGCCTAAGATTGGTTCAGAAGAAT
GAAGCGGCTGAGCCATTTTCAAGTAAAAAGTTTGTTGGTGTGGCTGATAAACAGTGTAAACAATTATTAGATGCATTGGGAATTTTCAATTCAAATAAGGAATTGTTTGT
AAATCTACTACAAGACCCAAATTCTCTGTTAATTAAACGTATTGAAGACTCTACTGATTCAAGGAACAGAAAACAGCAGATGATGACTTTCTTTGATAGTAGGTTGTCTG
GAAACAAGATAAGAGAAGTGGGGGAATATGAGGAGCCTGACTACAGTAAAAAATTGAAGCCCTGTGATAGATTACCCACTGAGGATAGCGATGATTCTCTATCCTTGGAA
AGAATAGTGGTATTAAAGCCAAATCCAACTGTCTCACTACCTGCGGCTGTGGGAACCAATCATTGCCCCTCTCTGAAATCTCATTCTAGTTTCATAAAGAATGGGCAAAG
TGACAAGAGAACTCTTTTTTCTTTTAGACAAATAAAGAGGAAGATGAAGCAAGCAATGAGGACAGGGAAAAAAGAACCCGAATGCCTATCAACTAATGGTATCGCCAAGA
AAACTCCACCAGTTTTTAGGGCCCCAAAAGATGATGGTAAACAGATGGTTATGGAGGCAACTGGAAGAAGTTCCTACAATAATATTCAAACAGATGATAAAATAATTCCT
AGTTCATTTCAAGATTCCCTGGAAAGGGATCAACTAGACAGGGCATTTTACTCTAGAAATGGGGACAAGACGGCTTCAACCAGTGAAAGTACTGACAAAAAGGTAGGCCA
GTCAGCTGTGACAAGTAATCTCAAACGGCAGAAATCTAAGAAGCATGAAGGGGACAAAGAGGTCTCAAGAAAAATTAAAGCAAAACCATGGGGGTGGGCGATGTGCTTTT
CTGATGATGACATATTGCCATCAAATAAACCTGGATGTCATACTCCAAGCCATACGAGATATTCCCACCTTAGCAATAAGAAGTTCATTTACGAGAAGAAGTCAAAATCT
CAGAATGACGTGGAACAAAGTTGCAAAACACCAGAAATGGTTAAAATAGGAGCTCCTTTTGCAGAGGCAAGGAGAGATGATGACCAATTGCACGCCTCAACTACAGAGTT
GAATGTGTCACCCGTCATTTTTTCTGATGTCAAAGTGGATCAAGATCCAATTATTGAAGGGTCTGTGAAGTTCGTAAACGACATTGCCACAATACACCAGGAAAGAAATA
GTTTTTGTGAAGAGTCATCTAGATTTGATAACAATTGCAACACATGTTATTGTCAAAGAACCAACAAGACCAAGGGCTTTGAAGAGAAAGGAAATCCAGAGCTCTCTAAA
CTGAATTTGCCTTTGGAGGTTCAACCATCAGCTCTTTCAGTAGATCCATTTCCATCCAGCTCATTACAATTTCAGACAGTGGAAGATCCTAATGGTTTGTGTGATAGAGA
AGTGCAGCCTCTACCAGAAACTATTCATGACGACCAACTTTTGATAGATGCTACCTCTAGTAATCTTGCTTTCACCCCAGGAACAGCTGAGCCATCTAGTGAAGCACTCC
CCATTAATTTTGAAAAGGACCAGTGTACTGGTTTGGCAAGGTTGCAAGAGGTTCTCAATCCCGCCATCGCTTCCTTTAACTGTTGTAGCTCCATCTCTCAGTGTGTACTT
GAGCTGCTGCAAGTCTCAAAACAGAATTGGAATGAATTGTCATTGGATTGTCATTCTTCAACTTGGCTGCAGATATCATTTGTTGACAAAGTGAAGATATTTAGTAGCCA
GTTATGTGGTGATTGTGTGCTTCTTTTCGACTATTTTAATGAAGTCCTTGAGGATGTTTTCCACTGTTATATTAGATGCTCCCCATGGTTATCATCTTATAAGGCACACA
TTCAAGCACCTGATAAGGAAAGCGCTTTCTATCACGAGGTTATCCAACATGTGGATTGGTCACTTCTGCAGCAGCAGCCACCACAAACACTGGACCAACTTTGTTTAAGA
GACTTGAGATCTAGAAAGTGGATCAATTATCCAACTGAAACTGAAGAGCTCGTTACCATTGTAGCGGAATCGGTTTTAAGAGAATTAATCATTGAAAGTGTTGTTTACCT
TGGTGTAGATGCTCTCACACAACTCCAAATGTTGTCTGAAGATGAACAGTTGGAACTCAACACACAGCTGAAACAACTCAACAAACCTGCAATCAAGAGTTTCAAGACAG
AATTTGGTGATATTATAGATTGTGTTGACATCTACAAACAACCTTCTCTTGATCATCCTTTGCTCAAAAACCATGCAATCCAGATGAAGCCAACAACAATCCCGAAAGGG
CTGATAGGCGATACATCAAAAGTCGAGAGGCTGCTGCAGGATCTTCCAAACATTAACAGCTGCCCACCAGGATCAGTGCCAATCAGAAGGACTACAAGGGAAGATCTGAT
AGCAGCAAGAAGTTTCAAGCCTTTGTGGTCAGATCAAGCAACAGATAATCTCCGGCCAAGCACTACGATCGACGCCGCCGGTTATCAGCTTGCAACACTCAACTATAGGG
CCAAAGTCTATGGAGCAAAATCACAAATCAATGTATGGAACCCAATTCCATCAGTGGATCAATATAGTTCTGCTAGTATGTGGCTATCTCAAGGCTTTAGAGATCAACAG
AACACCATACAAGTTGGTTGGGGAGATGCGAGCACGGGGAACTGGTTGTTCATGCTTGGAGACAAATACATTGGGTACTGGCCAAAGGGACTGCTGCCAGGCTTGGAAAA
TGGAGCAACTACTGCAGCATGGGGAGGGGAAATTTACAGCCCTACAACAGAAGCAGGGCCAGCCATGGGGAGTAGCCATTTTCCTGAAGAGGGTTTCAGTAAAAGTGCTT
TTGTGAATCAGATTCAAGTGGCAGAGTCTAGTATTTCAAGGGGATTTGTTGATCCAGTGGGTTCACAGCTCAGTATTGTTTTGGACAAACCTATCTGTTTTGGGCTCATT
AATAAGTTTACTGAAGATGGGAACTGGGGACATCATATCTTCTTTGGAGGGCCAAGTGGCTGCAGGTGAAGAAAAACTGATTCCAT
Protein sequenceShow/hide protein sequence
MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPRSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKFVGVADKQCKQ
LLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSGNKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLERIVVLKPNPTVSLPAAVGTNH
CPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRTGKKEPECLSTNGIAKKTPPVFRAPKDDGKQMVMEATGRSSYNNIQTDDKIIPSSFQDSLERDQLDRAFYSRNG
DKTASTSESTDKKVGQSAVTSNLKRQKSKKHEGDKEVSRKIKAKPWGWAMCFSDDDILPSNKPGCHTPSHTRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFA
EARRDDDQLHASTTELNVSPVIFSDVKVDQDPIIEGSVKFVNDIATIHQERNSFCEESSRFDNNCNTCYCQRTNKTKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSS
LQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSEALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSST
WLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYIRCSPWLSSYKAHIQAPDKESAFYHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIV
AESVLRELIIESVVYLGVDALTQLQMLSEDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPKGLIGDTSKVERLLQDLPNINSC
PPGSVPIRRTTREDLIAARSFKPLWSDQATDNLRPSTTIDAAGYQLATLNYRAKVYGAKSQINVWNPIPSVDQYSSASMWLSQGFRDQQNTIQVGWGDASTGNWLFMLGD
KYIGYWPKGLLPGLENGATTAAWGGEIYSPTTEAGPAMGSSHFPEEGFSKSAFVNQIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC
R