CuGenDBv2

CaUC01G002700 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC01G002700
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Protein of Unknown Function (DUF239)
Genome location: Ciama_Chr01:2758747..2768674
RNA-Seq Expression: CaUC01G002700
Synteny: CaUC01G002700
Gene Ontology terms: NA
InterPro domains:
  IPR004314 - Neprosin
  IPR022212 - Domain of unknown function DUF3741
  IPR025486 - Domain of unknown function DUF4378
  IPR025521 - Neprosin activation peptide


Homology
GenBank top hits (E-value, % identity, alignment)
KAA0042752.1 DUF3741 domain-containing protein/DUF4378 domain-containing protein [Cucumis melo var. makuwa] (E-value: 0.0e+00; % identity: 84.9)
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF
        MGTKHMQSNSSMVGRVSKSHKM CKAVDRPSK+LQQPSPKSLV  SSK LDSTASTRVACCRS+RFCTCKSCMEY RHNEISL+LVQKNEA+EPFSSKKF
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF

Query:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER
        VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIED+T+SRNRKQQMMTFFDSRLSENKIREVGE +EP   + LKPCDRLP EDSDDSLSLER
Subjt:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER

Query:  IVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYNN
        IVVLKPN TSSL AAVGTN+C SLKSHSS  KNGQSDK TLFSFRQIKRKMKQAMRVG+KE ECLS+NGM K+TP + RAPKDDGKQ V+ AT RSSY+ 
Subjt:  IVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYNN

Query:  IQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPSN
        IQTDDK ISSSFQDSLERDQ D+AFYSRNGDKTASTSEST KKV Q AV SNLKRQKSKK EGDKEVSRKMKAKPWGW MCFSDDDILPSNKPGCHT   
Subjt:  IQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPSN

Query:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCEESSR
         RYSHL NKKFI+EKK++ QND EQ CKTPEMVK+GA FAEA RDDDQLH STTELNVSPVIF +  VDQDPIIEGSVK +K++ T+ QER++FCE  SR
Subjt:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCEESSR

Query:  FDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE
        FDN  NT YCQRTNK +GF EKGNPELSK NLPLEVQPSA SVD FPSSSLQFQTVEDPNG CDR VQPLPE I +DQLL+DATSSNLA T GTAEPSSE
Subjt:  FDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE

Query:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI
        ALPINFE+DQCTGLARLQEVL+PAIASF+CC S SQC+LELLQVSKQNWNELS+DCHSSTWLQISFVDKVK+FSSQLCGDCVLLFDYFNEVLEDVFHCY+
Subjt:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI

Query:  RCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        RCS WLSSYK HIQAP+KES F+HEV+QH+DWSLLQQQPPQTLD LCLRDL+SR WI+YPTETEE+VTI+AESVLRELIIESVVYLG+
Subjt:  RCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

XP_004143902.1 uncharacterized protein LOC101217666 [Cucumis sativus] (E-value: 0.0e+00; % identity: 84.64)
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF
        MGTKHMQSNSSMVGRVSKSHKM CK VD PSK+LQQPSPK LV ASSK LDSTASTRVACCR++RFCTCKSCMEY RHNEISL+LVQKNEA+EPFSSKKF
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF

Query:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER
        VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIE STDSRNRKQQMMTFFDSRLSENKIREVGEYEEP Y + LKPCDRLP EDSDDSLSLER
Subjt:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER

Query:  IVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYNN
        IVVLKPN TSSL AAVGTN+C SLKSHSS IKNGQSDK TLFSFRQIKRKMKQAMRVG+KE ECLSTNG+ K+TP + R PKDDGKQ  +EATGRSSY+N
Subjt:  IVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYNN

Query:  IQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPSN
        IQTDDK ISSSFQDSL RDQ D+AFYSRNGDKTASTSEST KK+ QSAV SNLKRQKSKK EGDKEVSRK KAKPWGW MCFSDDDILPSNKPGC T   
Subjt:  IQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPSN

Query:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCEESSR
         RYSHL NKKFI+EKK+K QND E+ CKTPEMVK+GA FAEA R+DDQLH STTELNVSPVIF +  VDQDP+IEGSVK VKD+AT+ QER++FCE SSR
Subjt:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCEESSR

Query:  FDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE
        FD+  NT  CQ TNK KGF EKGNPELSKLNLPLEVQPS  SVD F SSSLQFQTVEDPNG CDR VQPLPE IH DQL++DATSSNLA TPGT EPSS 
Subjt:  FDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE

Query:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI
        ALPINFE+DQC+GLARLQEVL+PAIASF+CC S SQC+LELLQVSKQNWNELS+DCHSSTWLQISFVDKVK+FSSQLCGDCVLLFDYFNEVLEDVFHCY+
Subjt:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI

Query:  RCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        RCS WLSSYK HIQAP KESAF+HEV+QH+DWSLLQQQPPQTLD LCLRDL+SR WI+YPTETEE+VTI+AESVLRELIIESVVYLG+
Subjt:  RCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

XP_008437289.1 PREDICTED: uncharacterized protein LOC103482755 [Cucumis melo] (E-value: 0.0e+00; % identity: 85.03)
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF
        MGTKHMQSNSSMVGRVSKSHKM CKAVDRPSK+LQQPSPKSLV  SSK LDSTASTRVACCRS+RFCTCKSCMEY RHNEISL+LVQKNEA+EPFSSKKF
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF

Query:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER
        VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIED+T+SRNRKQQMMTFFDSRLSENKIREVGE +EP   + LKPCDRLP EDSDDSLSLER
Subjt:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER

Query:  IVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYNN
        IVVLKPN TSSL AAVGTN+C SLKSHSS  KNGQSDK TLFSFRQIKRKMKQAMRVG+KE ECLS+NGM K+TP + RAPKDDGKQ V+ AT RSSY+ 
Subjt:  IVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYNN

Query:  IQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPSN
        IQTDDK ISSSFQDSLERDQ D+AFYSRNGDKTASTSEST KKV Q AV SNLKRQKSKK EGDKEVSRKMKAKPWGW MCFSDDDILPSNKPGCHT   
Subjt:  IQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPSN

Query:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCEESSR
         RYSHL NKKFI+EKK++ QND EQ CKTPEMVK+GA FAEA RDDDQLH STTELNVSPVIF +  VDQDPIIEGSVK +K++ T+ QER++FCE SSR
Subjt:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCEESSR

Query:  FDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE
        FDN  NT YCQRTNK +GF EKGNPELSK NLPLEVQPSA SVD FPSSSLQFQTVEDPNG CDR VQPLPE I +DQLL+DATSSNLA T GTAEPSSE
Subjt:  FDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE

Query:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI
        ALPINFE+DQCTGLARLQEVL+PAIASF+CC S SQC+LELLQVSKQNWNELS+DCHSSTWLQISFVDKVK+FSSQLCGDCVLLFDYFNEVLEDVFHCY+
Subjt:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI

Query:  RCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        RCS WLSSYK HIQAP+KES F+HEV+QH+DWSLLQQQPPQTLD LCLRDL+SR WI+YPTETEE+VTI+AESVLRELIIESVVYLG+
Subjt:  RCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

XP_022159509.1 uncharacterized protein LOC111025905 [Momordica charantia] (E-value: 0.0e+00; % identity: 74.97)
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSL-VTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKK
        MG KHM SNS+MVG+VS+SHK+L KAVDRP+KEL+ PSPK L  T+S+K LD TASTRVACCRSRRFCTCKSC EYG+HNEISL+LVQKNEA EPFSSKK
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSL-VTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKK

Query:  FVGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLE
        FV VAD+QCKQLLDALGIFNSNKELF+NLLQDPNSLLI+ IEDSTDS + KQQM TF D RLSENK REVGEYEEP Y + LKPCDRLP+E+SDDS SLE
Subjt:  FVGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLE

Query:  RIVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYN
        RIVVLKPNPTSS  +  GTN+C SL+SHSSFI N QSDKR+ FSFRQIKRKM+QAM VGKKE ECLS + M+KKTP V            MEATG+SS N
Subjt:  RIVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYN

Query:  NIQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPS
        NIQT  K ISSS QDSL+RDQ+D+AFYSRNGDKTASTSEST K VGQSAV S+LKRQKSKK EGDKEVSRKMK KPWGW MCFSDDDI+PSNKPG H   
Subjt:  NIQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPS

Query:  NTRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCE-ES
        + RYS LSNKKF+YEKKSK QN+ ++SC+ P+M K+ A  +E RRDDDQLH S TELN+S VIF DVKVD+D  IEGSVK +KDIATIHQE N+FCE  S
Subjt:  NTRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCE-ES

Query:  SRFDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPS
        S FD+ C T +CQRTNK  GF E+GN ELSKLN PLE QPSA SVD FPS S +FQ VEDPNGL DREVQP+PETIH DQLL+DATSSN  F P  AE S
Subjt:  SRFDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPS

Query:  SEALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHC
         EALPI+FE+  C GLARLQE L+P IASFNCC SISQ +LELLQ SKQNW+ELS++CHSS WL+I FVDK K+F SQLCGDCVLLFDYFNEVLEDVFHC
Subjt:  SEALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHC

Query:  YIRCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDL-RSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        YIRCSPWLSSYKAH QAPD ESA +HE++QHVDW LLQQQ PQTL+ L LRDL RSR WI+Y TETE++VTI+AES+LREL IESVVYLG+
Subjt:  YIRCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDL-RSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

XP_038874729.1 uncharacterized protein LOC120067270 [Benincasa hispida] (E-value: 0.0e+00; % identity: 89.21)
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF
        MGTKHMQSNS+MVGRVSKS KMLCKAV RPSKELQQPSPKSLVTASSK LDSTASTRVACCRSRRFCTCKSCMEYGRHNEISL+LVQKNEAAEPFSSKKF
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF

Query:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER
        VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFF SRLSENKIREVGEYEEPD S+ L PCDRLP +DSDDSLSLER
Subjt:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER

Query:  IVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYNN
        IVVLKPNPTSSL AAVGTN+C SLKSHSSFIKNGQSDK TLFSFRQIK KMKQAM V KKE ECLSTNG+ KKT PV RAPKD+GKQMVM ATGRSSY+N
Subjt:  IVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYNN

Query:  IQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPSN
        IQTDDK ISSSFQDSLERDQLD+AFYSRNGDKTASTSE TDKKVGQSAVTSNLKRQKSKK EGD+EVSRKMKAKPWGW MCFSDDDILPSNKPGCHT S 
Subjt:  IQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPSN

Query:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCEESSR
         RYS+LSNKKFI+EKKSK QND EQSC T +MVK GA  A ARRDD+Q+H STTELNVSPVIFSDVKVDQDPIIEGSVK +KD ATI QERN+FCEESSR
Subjt:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCEESSR

Query:  FDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE
        FDN+CN  YCQRTNKIKG  EKGNPELSKLN PLEVQP A SV+ FPSSSLQFQTVEDP+GLCDR+VQPLPETIH  QLL+DATSSNLAFT GTAEPSSE
Subjt:  FDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE

Query:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI
        ALPINFE+DQCTGLARLQEVLNPA+ASFNCCSSISQ VLELLQVSKQNWNELSLDCHSSTWLQISFVDKVK+FSSQLCGDCV+LFDYFNEVLEDVFHCYI
Subjt:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI

Query:  RCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        RCSPWLSSYKAHIQAPDKES F+HEV+QHVDWSLLQQQPPQTLDQLCLRDLRSR WINYPTETEE+VTI+ ESVLRELI+ESVVYLG+
Subjt:  RCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

TrEMBL top hits (E-value, % identity, alignment)
A0A0A0KNW5 Uncharacterized protein (E-value: 0.0e+00; % identity: 84.64)
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF
        MGTKHMQSNSSMVGRVSKSHKM CK VD PSK+LQQPSPK LV ASSK LDSTASTRVACCR++RFCTCKSCMEY RHNEISL+LVQKNEA+EPFSSKKF
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF

Query:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER
        VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIE STDSRNRKQQMMTFFDSRLSENKIREVGEYEEP Y + LKPCDRLP EDSDDSLSLER
Subjt:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER

Query:  IVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYNN
        IVVLKPN TSSL AAVGTN+C SLKSHSS IKNGQSDK TLFSFRQIKRKMKQAMRVG+KE ECLSTNG+ K+TP + R PKDDGKQ  +EATGRSSY+N
Subjt:  IVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYNN

Query:  IQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPSN
        IQTDDK ISSSFQDSL RDQ D+AFYSRNGDKTASTSEST KK+ QSAV SNLKRQKSKK EGDKEVSRK KAKPWGW MCFSDDDILPSNKPGC T   
Subjt:  IQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPSN

Query:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCEESSR
         RYSHL NKKFI+EKK+K QND E+ CKTPEMVK+GA FAEA R+DDQLH STTELNVSPVIF +  VDQDP+IEGSVK VKD+AT+ QER++FCE SSR
Subjt:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCEESSR

Query:  FDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE
        FD+  NT  CQ TNK KGF EKGNPELSKLNLPLEVQPS  SVD F SSSLQFQTVEDPNG CDR VQPLPE IH DQL++DATSSNLA TPGT EPSS 
Subjt:  FDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE

Query:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI
        ALPINFE+DQC+GLARLQEVL+PAIASF+CC S SQC+LELLQVSKQNWNELS+DCHSSTWLQISFVDKVK+FSSQLCGDCVLLFDYFNEVLEDVFHCY+
Subjt:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI

Query:  RCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        RCS WLSSYK HIQAP KESAF+HEV+QH+DWSLLQQQPPQTLD LCLRDL+SR WI+YPTETEE+VTI+AESVLRELIIESVVYLG+
Subjt:  RCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

A0A1S3AU75 uncharacterized protein LOC103482755 (E-value: 0.0e+00; % identity: 85.03)
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF
        MGTKHMQSNSSMVGRVSKSHKM CKAVDRPSK+LQQPSPKSLV  SSK LDSTASTRVACCRS+RFCTCKSCMEY RHNEISL+LVQKNEA+EPFSSKKF
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF

Query:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER
        VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIED+T+SRNRKQQMMTFFDSRLSENKIREVGE +EP   + LKPCDRLP EDSDDSLSLER
Subjt:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER

Query:  IVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYNN
        IVVLKPN TSSL AAVGTN+C SLKSHSS  KNGQSDK TLFSFRQIKRKMKQAMRVG+KE ECLS+NGM K+TP + RAPKDDGKQ V+ AT RSSY+ 
Subjt:  IVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYNN

Query:  IQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPSN
        IQTDDK ISSSFQDSLERDQ D+AFYSRNGDKTASTSEST KKV Q AV SNLKRQKSKK EGDKEVSRKMKAKPWGW MCFSDDDILPSNKPGCHT   
Subjt:  IQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPSN

Query:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCEESSR
         RYSHL NKKFI+EKK++ QND EQ CKTPEMVK+GA FAEA RDDDQLH STTELNVSPVIF +  VDQDPIIEGSVK +K++ T+ QER++FCE SSR
Subjt:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCEESSR

Query:  FDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE
        FDN  NT YCQRTNK +GF EKGNPELSK NLPLEVQPSA SVD FPSSSLQFQTVEDPNG CDR VQPLPE I +DQLL+DATSSNLA T GTAEPSSE
Subjt:  FDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE

Query:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI
        ALPINFE+DQCTGLARLQEVL+PAIASF+CC S SQC+LELLQVSKQNWNELS+DCHSSTWLQISFVDKVK+FSSQLCGDCVLLFDYFNEVLEDVFHCY+
Subjt:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI

Query:  RCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        RCS WLSSYK HIQAP+KES F+HEV+QH+DWSLLQQQPPQTLD LCLRDL+SR WI+YPTETEE+VTI+AESVLRELIIESVVYLG+
Subjt:  RCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

A0A5A7TH74 DUF3741 domain-containing protein/DUF4378 domain-containing protein (E-value: 0.0e+00; % identity: 84.9)
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF
        MGTKHMQSNSSMVGRVSKSHKM CKAVDRPSK+LQQPSPKSLV  SSK LDSTASTRVACCRS+RFCTCKSCMEY RHNEISL+LVQKNEA+EPFSSKKF
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF

Query:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER
        VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIED+T+SRNRKQQMMTFFDSRLSENKIREVGE +EP   + LKPCDRLP EDSDDSLSLER
Subjt:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER

Query:  IVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYNN
        IVVLKPN TSSL AAVGTN+C SLKSHSS  KNGQSDK TLFSFRQIKRKMKQAMRVG+KE ECLS+NGM K+TP + RAPKDDGKQ V+ AT RSSY+ 
Subjt:  IVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYNN

Query:  IQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPSN
        IQTDDK ISSSFQDSLERDQ D+AFYSRNGDKTASTSEST KKV Q AV SNLKRQKSKK EGDKEVSRKMKAKPWGW MCFSDDDILPSNKPGCHT   
Subjt:  IQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPSN

Query:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCEESSR
         RYSHL NKKFI+EKK++ QND EQ CKTPEMVK+GA FAEA RDDDQLH STTELNVSPVIF +  VDQDPIIEGSVK +K++ T+ QER++FCE  SR
Subjt:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCEESSR

Query:  FDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE
        FDN  NT YCQRTNK +GF EKGNPELSK NLPLEVQPSA SVD FPSSSLQFQTVEDPNG CDR VQPLPE I +DQLL+DATSSNLA T GTAEPSSE
Subjt:  FDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSE

Query:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI
        ALPINFE+DQCTGLARLQEVL+PAIASF+CC S SQC+LELLQVSKQNWNELS+DCHSSTWLQISFVDKVK+FSSQLCGDCVLLFDYFNEVLEDVFHCY+
Subjt:  ALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYI

Query:  RCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        RCS WLSSYK HIQAP+KES F+HEV+QH+DWSLLQQQPPQTLD LCLRDL+SR WI+YPTETEE+VTI+AESVLRELIIESVVYLG+
Subjt:  RCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

A0A6J1E2K4 uncharacterized protein LOC111025905 (E-value: 0.0e+00; % identity: 74.97)
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSL-VTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKK
        MG KHM SNS+MVG+VS+SHK+L KAVDRP+KEL+ PSPK L  T+S+K LD TASTRVACCRSRRFCTCKSC EYG+HNEISL+LVQKNEA EPFSSKK
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSL-VTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKK

Query:  FVGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLE
        FV VAD+QCKQLLDALGIFNSNKELF+NLLQDPNSLLI+ IEDSTDS + KQQM TF D RLSENK REVGEYEEP Y + LKPCDRLP+E+SDDS SLE
Subjt:  FVGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLE

Query:  RIVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYN
        RIVVLKPNPTSS  +  GTN+C SL+SHSSFI N QSDKR+ FSFRQIKRKM+QAM VGKKE ECLS + M+KKTP V            MEATG+SS N
Subjt:  RIVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYN

Query:  NIQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPS
        NIQT  K ISSS QDSL+RDQ+D+AFYSRNGDKTASTSEST K VGQSAV S+LKRQKSKK EGDKEVSRKMK KPWGW MCFSDDDI+PSNKPG H   
Subjt:  NIQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPS

Query:  NTRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCE-ES
        + RYS LSNKKF+YEKKSK QN+ ++SC+ P+M K+ A  +E RRDDDQLH S TELN+S VIF DVKVD+D  IEGSVK +KDIATIHQE N+FCE  S
Subjt:  NTRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCE-ES

Query:  SRFDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPS
        S FD+ C T +CQRTNK  GF E+GN ELSKLN PLE QPSA SVD FPS S +FQ VEDPNGL DREVQP+PETIH DQLL+DATSSN  F P  AE S
Subjt:  SRFDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPS

Query:  SEALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHC
         EALPI+FE+  C GLARLQE L+P IASFNCC SISQ +LELLQ SKQNW+ELS++CHSS WL+I FVDK K+F SQLCGDCVLLFDYFNEVLEDVFHC
Subjt:  SEALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHC

Query:  YIRCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDL-RSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        YIRCSPWLSSYKAH QAPD ESA +HE++QHVDW LLQQQ PQTL+ L LRDL RSR WI+Y TETE++VTI+AES+LREL IESVVYLG+
Subjt:  YIRCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDL-RSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

A0A6J1K828 uncharacterized protein LOC111491010 (E-value: 9.0e-300; % identity: 70.89)
Query:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF
        MG KHMQSNSSMVG+ SKSHKM+CKA++RPS+E QQPSPKSL  ASS         RVACCR++R CTCK CME+GRHNEISL+LVQKNEAAEPF SKKF
Subjt:  MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKF

Query:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER
        VGVADKQCKQLLDALGIFNSN+ELFVNLL DPNSLLIKRIEDSTDSR+RKQQ  T+F+ RLSENKI+EVGEYEEP +S  LKPC      DSDDSLSLER
Subjt:  VGVADKQCKQLLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLER

Query:  IVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYNN
        IVVLKPNP+SSL AAVGTN C SLKSHSSFIKN +S+K TLFSFRQIKRKMKQAMRVG+KE ECLSTNGM+ KTP V+R PKD                 
Subjt:  IVVLKPNPTSSLPAAVGTNHCPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYNN

Query:  IQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPSN
                          DQLD+ FYSRNGDKTASTSES   KVGQSAVTS+LKRQKSKKREGDKEV RKMKAKPWGW MCFSDDDILPSNKPGC T ++
Subjt:  IQTDDKIISSSFQDSLERDQLDRAFYSRNGDKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPSN

Query:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCE-ESS
        TR+S LSNKKFIYE KSKSQND +Q C+TP+M         ARR+DDQ   S  ELNVS V+  DVKV++ P IEGS+K VKDIATI QE N FCE  SS
Subjt:  TRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFAEARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCE-ESS

Query:  RFDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSS
         FD           NK  GF EK + ELSK NL LE Q S        SS  +FQTVEDPNGLCDREVQPLPET H   L+  A SS+LAFT  TA+ S+
Subjt:  RFDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSSLQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSS

Query:  -EALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHC
         EALPI+FE+ QCTG A LQEV++PAI++FNCC S+S+ VLELLQVS QNWNELS+DCHSS WL+I FVDKVK+F +QLCGDCVL+FDYFNEVLEDVFHC
Subjt:  -EALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSSTWLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHC

Query:  YIRCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV
        YIRCSPWLSSYKAHIQAP+KE+  +HEV+QH+DW LLQQQPPQTLD LCLRDLR R WINYPTETEE+VTI+AESVLREL+IESVVYLG+
Subjt:  YIRCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIVAESVLRELIIESVVYLGV

SwissProt top hits (E-value, % identity, alignment)
No hits found
Arabidopsis top hits (E-value, % identity, alignment)
AT2G20170.1 Protein of Unknown Function (DUF239) (E-value: 3.2e-55; % identity: 35.75)
Query:  EDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPKGLIG-DTSKVERLLQDLPNINSCPPGSVPIRRTTREDLIA
        ++E+ EL   L  +NKPAIKSF+T+ G I+DC+DI KQ + DHPLLKNH+IQ+KPT IPK     +T K   L        SCP G+V I+RTT EDLI 
Subjt:  EDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPKGLIG-DTSKVERLLQDLPNINSCPPGSVPIRRTTREDLIA

Query:  ARSFKPLWSDQATDNLRPSTTIDTAGYHLATLNYRAKVYGAKSQINVWNPIPSVDQYSSASMWLSQGFRDQQNTIQVGW---------------------
         +  K L     T   +    ++  G H A   Y    YGA   IN+W+P  + DQ+S AS+++  GFRD   +I  GW                     
Subjt:  ARSFKPLWSDQATDNLRPSTTIDTAGYHLATLNYRAKVYGAKSQINVWNPIPSVDQYSSASMWLSQGFRDQQNTIQVGW---------------------

Query:  ---------------------------------------------GDASTGNWLFMLGDIYIGYWPKGL--LPGLENGATTAAWGGEIYSPTTEA-GPAM
                                                      D  TGNW F++ +  IGYWPK L  + GL  GA+   WGGE++S   ++  P M
Subjt:  ---------------------------------------------GDASTGNWLFMLGDIYIGYWPKGL--LPGLENGATTAAWGGEIYSPTTEA-GPAM

Query:  GSGHFPEEGFSKSAFVNQIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC
        GSGHFP+EGF K+AFVN ++V +  I +    PV   L +  + P C+ +  K      W   IF+GGP GC
Subjt:  GSGHFPEEGFSKSAFVNQIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC

AT2G20170.2 Protein of Unknown Function (DUF239) (E-value: 3.2e-55; % identity: 35.75)
Query:  EDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPKGLIG-DTSKVERLLQDLPNINSCPPGSVPIRRTTREDLIA
        ++E+ EL   L  +NKPAIKSF+T+ G I+DC+DI KQ + DHPLLKNH+IQ+KPT IPK     +T K   L        SCP G+V I+RTT EDLI 
Subjt:  EDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPKGLIG-DTSKVERLLQDLPNINSCPPGSVPIRRTTREDLIA

Query:  ARSFKPLWSDQATDNLRPSTTIDTAGYHLATLNYRAKVYGAKSQINVWNPIPSVDQYSSASMWLSQGFRDQQNTIQVGW---------------------
         +  K L     T   +    ++  G H A   Y    YGA   IN+W+P  + DQ+S AS+++  GFRD   +I  GW                     
Subjt:  ARSFKPLWSDQATDNLRPSTTIDTAGYHLATLNYRAKVYGAKSQINVWNPIPSVDQYSSASMWLSQGFRDQQNTIQVGW---------------------

Query:  ---------------------------------------------GDASTGNWLFMLGDIYIGYWPKGL--LPGLENGATTAAWGGEIYSPTTEA-GPAM
                                                      D  TGNW F++ +  IGYWPK L  + GL  GA+   WGGE++S   ++  P M
Subjt:  ---------------------------------------------GDASTGNWLFMLGDIYIGYWPKGL--LPGLENGATTAAWGGEIYSPTTEA-GPAM

Query:  GSGHFPEEGFSKSAFVNQIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC
        GSGHFP+EGF K+AFVN ++V +  I +    PV   L +  + P C+ +  K      W   IF+GGP GC
Subjt:  GSGHFPEEGFSKSAFVNQIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC

AT2G20170.3 Protein of Unknown Function (DUF239) (E-value: 3.4e-57; % identity: 37.46)
Query:  EDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPKGLIG-DTSKVERLLQDLPNINSCPPGSVPIRRTTREDLIA
        ++E+ EL   L  +NKPAIKSF+T+ G I+DC+DI KQ + DHPLLKNH+IQ+KPT IPK     +T K   L        SCP G+V I+RTT EDLI 
Subjt:  EDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPKGLIG-DTSKVERLLQDLPNINSCPPGSVPIRRTTREDLIA

Query:  ARSFKPLWSDQATDNLRPSTTIDTAGYHLATLNYRAKVYGAKSQINVWNPIPSVDQYSSASMWLSQGFRDQQNTIQVGW---------------------
         +  K L     T   +    ++  G H A   Y    YGA   IN+W+P  + DQ+S AS+++  GFRD   +I  GW                     
Subjt:  ARSFKPLWSDQATDNLRPSTTIDTAGYHLATLNYRAKVYGAKSQINVWNPIPSVDQYSSASMWLSQGFRDQQNTIQVGW---------------------

Query:  ----------------------------GDASTGNWLFMLGDIYIGYWPKGL--LPGLENGATTAAWGGEIYSPTTEA-GPAMGSGHFPEEGFSKSAFVN
                                     D  TGNW F++ +  IGYWPK L  + GL  GA+   WGGE++S   ++  P MGSGHFP+EGF K+AFVN
Subjt:  ----------------------------GDASTGNWLFMLGDIYIGYWPKGL--LPGLENGATTAAWGGEIYSPTTEA-GPAMGSGHFPEEGFSKSAFVN

Query:  QIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC
         ++V +  I +    PV   L +  + P C+ +  K      W   IF+GGP GC
Subjt:  QIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC

AT4G23380.1 Protein of Unknown Function (DUF239) (E-value: 8.1e-51; % identity: 34.43)
Query:  VLRELIIESVVYLGVDALTQLQTLSEDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPKGLI---GDTSKVERL
        VLR L+  S+V +   A       SE+E+ E+  QLK +NKPAIKSFKTE  +I DC+DI+KQ + DH LL+NH++++KPT++PK  I       KV  +
Subjt:  VLRELIIESVVYLGVDALTQLQTLSEDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPKGLI---GDTSKVERL

Query:  LQDLPNINSCPPGSVPIRRTTREDLIAARSFKPLWSDQATDNLRPSTTIDTAGYHLATLNY-RAKVYGAKSQINVWNPIPSVDQYSSASMWLSQGFR-DQ
           L  I SCP G+V ++RTT +DLI ++  K +  +     L   + ID +G+H AT +Y    V G    IN+W+P  S DQ S A+M ++ G + +Q
Subjt:  LQDLPNINSCPPGSVPIRRTTREDLIAARSFKPLWSDQATDNLRPSTTIDTAGYHLATLNY-RAKVYGAKSQINVWNPIPSVDQYSSASMWLSQGFR-DQ

Query:  QNTIQVGW--------------------GDASTGNW-----------------------------------------LFMLGDIYIGYWPKGLL--PGLE
          +I VGW                    G   TG +                                            +  + +GYWP+ L    GL 
Subjt:  QNTIQVGW--------------------GDASTGNW-----------------------------------------LFMLGDIYIGYWPKGLL--PGLE

Query:  NGATTAAWGGEIYSPTTEAGPAMGSGHFPEEGFSKSAFVNQIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC
         GA  A+WGG++YSP TE  P MGSGHFP+EGF K+AFVN I +         + P    +      P C+       ED  W   ++FGGP GC
Subjt:  NGATTAAWGGEIYSPTTEAGPAMGSGHFPEEGFSKSAFVNQIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC

AT4G23390.1 Protein of Unknown Function (DUF239); e value 8.4e-56; % identity 35.47
Query:  SEDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPK-----GLIGDTSKVERLLQDLPNINSCPPGSVPIRRTTR
        S++E+ E+   L  LNKPA+KSF+TE G I DC+DI KQ + DHPLLKNH+I++KPTTIPK          TS +     D+    SCP G+V ++R   
Subjt:  SEDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPK-----GLIGDTSKVERLLQDLPNINSCPPGSVPIRRTTR

Query:  EDLIAARSFKPLWSDQATDNLRPSTTIDTAGYHLATLNYRAKVYGAKSQINVWNPIPSVDQYSSASM--------------------------------W
        EDLI A+  + L  +           ID  G+H AT++Y+   YGAK  INVWNP  S DQ+S A+M                                W
Subjt:  EDLIAARSFKPLWSDQATDNLRPSTTIDTAGYHLATLNYRAKVYGAKSQINVWNPIPSVDQYSSASM--------------------------------W

Query:  LSQG--------------------------------FRDQQNTIQVG-WGDASTGNWLFMLGDIYIGYWPKGLL--PGLENGATTAAWGGEIYSPTTEAG
         + G                                +  QQ  ++V  + D  T +W F+L +  IGYWPK L    GL +GA+   WGGE+YS   E  
Subjt:  LSQG--------------------------------FRDQQNTIQVG-WGDASTGNWLFMLGDIYIGYWPKGLL--PGLENGATTAAWGGEIYSPTTEAG

Query:  PAMGSGHFPEEGFSKSAFVNQIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC
        P+MGSGHFP+EGF K+A+VN +++  + I++    P+ S L      P C+ +         W   I FGGP GC
Subjt:  PAMGSGHFPEEGFSKSAFVNQIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC


Sequences
CDS sequence
ATGGGAACAAAGCATATGCAATCTAATTCTAGCATGGTTGGAAGAGTCTCAAAAAGCCACAAAATGCTGTGCAAAGCTGTTGATAGACCTAGCAAAGAACTTCAACAGCC
ATCTCCTAAAAGTTTGGTGACTGCATCATCAAAGAATCTAGATTCAACAGCATCAACAAGAGTAGCTTGCTGCAGAAGTCGAAGATTTTGTACTTGTAAAAGTTGTATGG
AGTATGGCCGACATAATGAGATTAGCCTAAGATTGGTTCAGAAGAATGAAGCGGCTGAGCCATTTTCAAGTAAAAAGTTTGTTGGTGTGGCTGATAAACAGTGTAAACAA
TTATTAGATGCATTGGGAATTTTCAATTCAAATAAGGAATTGTTTGTAAATCTACTACAAGACCCAAATTCTCTGTTAATTAAACGTATTGAAGACTCTACTGATTCAAG
GAACAGAAAACAGCAGATGATGACTTTCTTTGATAGCAGGTTGTCTGAAAACAAGATAAGAGAAGTGGGGGAATATGAGGAGCCTGACTACAGTAAAAAATTGAAGCCCT
GTGATAGATTACCCACTGAGGATAGCGATGATTCTCTATCCTTGGAAAGAATAGTGGTATTAAAGCCAAATCCAACTAGCTCACTACCTGCGGCTGTGGGAACCAATCAT
TGCCCCTCTCTGAAATCTCATTCTAGTTTCATAAAGAATGGGCAAAGTGACAAGAGAACTCTTTTTTCTTTTAGACAAATAAAGAGGAAGATGAAGCAAGCAATGAGGGT
AGGGAAAAAAGAACCTGAATGCCTATCAACTAATGGTATGGCCAAGAAAACTCCACCAGTTTTTAGGGCCCCAAAAGACGATGGTAAACAGATGGTTATGGAGGCAACTG
GAAGAAGTTCCTACAATAATATTCAAACAGATGATAAAATAATTTCTAGTTCATTTCAAGATTCCCTGGAAAGGGATCAACTAGACAGGGCATTTTACTCTAGAAATGGG
GACAAGACGGCTTCAACCAGTGAAAGTACTGACAAAAAAGTAGGCCAGTCAGCTGTGACAAGTAATCTCAAACGGCAGAAATCTAAGAAGCGTGAAGGGGACAAAGAGGT
CTCAAGAAAAATGAAAGCAAAACCATGGGGGTGGGCGATGTGCTTTTCTGATGATGACATATTGCCATCAAATAAACCTGGATGTCATACTCCAAGCAATACGAGATATT
CCCACCTTAGCAATAAGAAGTTCATTTACGAGAAGAAGTCAAAATCTCAGAATGACGTGGAACAAAGTTGCAAAACACCAGAAATGGTTAAAATAGGAGCTCCTTTTGCA
GAGGCAAGGAGAGATGATGACCAATTGCACACCTCAACTACAGAGTTGAATGTGTCACCTGTCATTTTTTCTGATGTCAAAGTGGATCAAGATCCGATTATTGAAGGGTC
TGTGAAGTTCGTAAAAGACATTGCCACAATACACCAGGAAAGAAATAGTTTTTGTGAAGAGTCATCTAGATTTGATAACGATTGCAACACATGTTATTGTCAAAGAACCA
ACAAGATCAAGGGCTTTGAAGAGAAAGGAAATCCGGAGCTCTCTAAACTGAATTTGCCTTTGGAGGTTCAACCATCAGCTCTTTCAGTAGATCCATTTCCATCCAGCTCA
TTACAATTTCAGACAGTGGAAGATCCTAATGGTTTGTGTGATAGAGAAGTGCAGCCTCTACCAGAAACTATTCATGACGACCAACTTTTGATAGATGCTACCTCTAGTAA
TCTTGCTTTCACCCCAGGAACAGCTGAGCCATCTAGTGAAGCACTCCCCATTAATTTTGAAAAGGACCAGTGTACTGGTTTGGCAAGGTTGCAAGAGGTTCTCAATCCCG
CCATCGCTTCCTTTAACTGTTGTAGCTCCATCTCTCAGTGTGTACTTGAGCTGCTGCAAGTCTCAAAACAGAATTGGAATGAATTGTCATTGGATTGTCATTCTTCAACT
TGGCTGCAGATATCATTTGTTGACAAAGTGAAGATATTTAGTAGCCAGTTATGTGGTGATTGTGTGCTTCTTTTCGACTATTTTAATGAAGTCCTTGAGGATGTTTTCCA
CTGTTATATTAGATGCTCCCCATGGTTATCATCTTATAAGGCACACATTCAAGCACCTGATAAGGAAAGCGCTTTCCATCACGAGGTTATCCAACATGTGGATTGGTCAC
TTCTGCAGCAGCAGCCACCACAAACACTGGACCAACTTTGTTTAAGAGACTTGAGATCTAGAAAGTGGATCAATTATCCAACTGAAACTGAAGAGCTCGTTACCATTGTA
GCGGAATCGGTTTTAAGAGAATTAATCATTGAAAGTGTTGTTTACCTTGGTGTAGATGCTCTCACACAACTCCAAACGTTGTCTGAAGATGAACAGTTGGAACTCAACAC
ACAGCTGAAACAACTCAACAAACCTGCAATCAAGAGTTTCAAGACAGAATTTGGTGATATTATAGATTGTGTTGACATCTACAAACAACCTTCTCTTGATCATCCTTTGC
TCAAAAACCATGCAATCCAGATGAAGCCAACAACAATCCCGAAAGGGCTGATAGGCGATACATCAAAAGTCGAGAGGCTGCTGCAGGATCTTCCAAACATTAACAGCTGC
CCACCAGGATCAGTGCCAATCAGAAGGACTACAAGGGAAGATCTGATAGCAGCAAGAAGTTTCAAGCCTTTGTGGTCAGATCAAGCAACAGATAATCTCCGGCCAAGCAC
TACGATCGACACCGCCGGTTATCATCTTGCAACACTCAACTATAGGGCCAAAGTCTATGGAGCAAAATCACAAATCAATGTATGGAACCCAATTCCATCAGTGGATCAAT
ATAGTTCTGCTAGTATGTGGCTATCTCAAGGCTTTAGAGATCAACAGAACACCATACAAGTTGGTTGGGGAGATGCGAGCACGGGGAACTGGTTGTTCATGCTTGGAGAC
ATATACATTGGGTACTGGCCAAAGGGACTGCTGCCAGGCTTGGAAAATGGAGCAACTACTGCAGCATGGGGAGGGGAAATTTACAGCCCTACAACAGAAGCAGGGCCAGC
CATGGGGAGTGGCCATTTTCCTGAAGAGGGTTTCAGTAAAAGTGCTTTTGTGAATCAGATTCAAGTGGCAGAGTCTAGTATTTCAAGGGGATTTGTTGATCCAGTGGGTT
CACAGCTCAGTATTGTTTTGGACAAACCTATCTGTTTTGGGCTCATTAATAAGTTTACTGAAGATGGGAACTGGGGACATCATATCTTCTTTGGAGGGCCAAGTGGCTGC
AGGTGA
mRNA sequence
CCCAATTTCAAATTTCAAAAACAGAGTAATAATAAAAAAACTGAACCAATTGCCTACAAGCTTCCCCACTGGTCTTTGTCCTCACTCAAAATTGCTCATCAATGGCAGGT
CCTGTAAGCAATAACACGACGCCGTTTCCTCAAATACCGAGACCCTCTACACCCCAGTACGACGTCGTACGCAGCACAATCCAAGCCCAAAACTGCGAAAGAGTAACATG
GAGTTGGGGCAGAGCGAGGGAAAGCAACCTTACAATTGCTTGAGTGTAAAGCTTTCTTCCTGCAAATTGGCAGTCCATTTCGCCATTAATCAATCCAAAATCCAATACCC
ATAATTCAACTTTCTTCTCTTTTCGTTTTCTATATTTGTCCCTTTTCAATTCGCTTTTGTTTCTCCTTCTCTCTCTTCCCCCACCAGTTTCCTCTTCGTCTTCTTCGTCT
TTCCCTGTTATTCCTCTCTTTTACATAGCTGTTTTCCCTGTTCATGTACGAGTAATGGGTCCATCTCCATTTTCTCCTTTTGCTTCTTGCGGATTCAAGCTGACCCAGGG
TCATCAACAAGACTAGTATCACGGGAGAAAAGAACCGATCGGCATGGAGCAGAGGAAGCTGTTCTTGGACCACATGTCCTTGAATTATTTTCATCTGGGCAATAGATTAT
GAAACAACCGCAGTTGGTCAAACAAACAAGGGGTTTCTATAGAACAACGCATAAATAGATTCTGCAATGGGAACAAAGCATATGCAATCTAATTCTAGCATGGTTGGAAG
AGTCTCAAAAAGCCACAAAATGCTGTGCAAAGCTGTTGATAGACCTAGCAAAGAACTTCAACAGCCATCTCCTAAAAGTTTGGTGACTGCATCATCAAAGAATCTAGATT
CAACAGCATCAACAAGAGTAGCTTGCTGCAGAAGTCGAAGATTTTGTACTTGTAAAAGTTGTATGGAGTATGGCCGACATAATGAGATTAGCCTAAGATTGGTTCAGAAG
AATGAAGCGGCTGAGCCATTTTCAAGTAAAAAGTTTGTTGGTGTGGCTGATAAACAGTGTAAACAATTATTAGATGCATTGGGAATTTTCAATTCAAATAAGGAATTGTT
TGTAAATCTACTACAAGACCCAAATTCTCTGTTAATTAAACGTATTGAAGACTCTACTGATTCAAGGAACAGAAAACAGCAGATGATGACTTTCTTTGATAGCAGGTTGT
CTGAAAACAAGATAAGAGAAGTGGGGGAATATGAGGAGCCTGACTACAGTAAAAAATTGAAGCCCTGTGATAGATTACCCACTGAGGATAGCGATGATTCTCTATCCTTG
GAAAGAATAGTGGTATTAAAGCCAAATCCAACTAGCTCACTACCTGCGGCTGTGGGAACCAATCATTGCCCCTCTCTGAAATCTCATTCTAGTTTCATAAAGAATGGGCA
AAGTGACAAGAGAACTCTTTTTTCTTTTAGACAAATAAAGAGGAAGATGAAGCAAGCAATGAGGGTAGGGAAAAAAGAACCTGAATGCCTATCAACTAATGGTATGGCCA
AGAAAACTCCACCAGTTTTTAGGGCCCCAAAAGACGATGGTAAACAGATGGTTATGGAGGCAACTGGAAGAAGTTCCTACAATAATATTCAAACAGATGATAAAATAATT
TCTAGTTCATTTCAAGATTCCCTGGAAAGGGATCAACTAGACAGGGCATTTTACTCTAGAAATGGGGACAAGACGGCTTCAACCAGTGAAAGTACTGACAAAAAAGTAGG
CCAGTCAGCTGTGACAAGTAATCTCAAACGGCAGAAATCTAAGAAGCGTGAAGGGGACAAAGAGGTCTCAAGAAAAATGAAAGCAAAACCATGGGGGTGGGCGATGTGCT
TTTCTGATGATGACATATTGCCATCAAATAAACCTGGATGTCATACTCCAAGCAATACGAGATATTCCCACCTTAGCAATAAGAAGTTCATTTACGAGAAGAAGTCAAAA
TCTCAGAATGACGTGGAACAAAGTTGCAAAACACCAGAAATGGTTAAAATAGGAGCTCCTTTTGCAGAGGCAAGGAGAGATGATGACCAATTGCACACCTCAACTACAGA
GTTGAATGTGTCACCTGTCATTTTTTCTGATGTCAAAGTGGATCAAGATCCGATTATTGAAGGGTCTGTGAAGTTCGTAAAAGACATTGCCACAATACACCAGGAAAGAA
ATAGTTTTTGTGAAGAGTCATCTAGATTTGATAACGATTGCAACACATGTTATTGTCAAAGAACCAACAAGATCAAGGGCTTTGAAGAGAAAGGAAATCCGGAGCTCTCT
AAACTGAATTTGCCTTTGGAGGTTCAACCATCAGCTCTTTCAGTAGATCCATTTCCATCCAGCTCATTACAATTTCAGACAGTGGAAGATCCTAATGGTTTGTGTGATAG
AGAAGTGCAGCCTCTACCAGAAACTATTCATGACGACCAACTTTTGATAGATGCTACCTCTAGTAATCTTGCTTTCACCCCAGGAACAGCTGAGCCATCTAGTGAAGCAC
TCCCCATTAATTTTGAAAAGGACCAGTGTACTGGTTTGGCAAGGTTGCAAGAGGTTCTCAATCCCGCCATCGCTTCCTTTAACTGTTGTAGCTCCATCTCTCAGTGTGTA
CTTGAGCTGCTGCAAGTCTCAAAACAGAATTGGAATGAATTGTCATTGGATTGTCATTCTTCAACTTGGCTGCAGATATCATTTGTTGACAAAGTGAAGATATTTAGTAG
CCAGTTATGTGGTGATTGTGTGCTTCTTTTCGACTATTTTAATGAAGTCCTTGAGGATGTTTTCCACTGTTATATTAGATGCTCCCCATGGTTATCATCTTATAAGGCAC
ACATTCAAGCACCTGATAAGGAAAGCGCTTTCCATCACGAGGTTATCCAACATGTGGATTGGTCACTTCTGCAGCAGCAGCCACCACAAACACTGGACCAACTTTGTTTA
AGAGACTTGAGATCTAGAAAGTGGATCAATTATCCAACTGAAACTGAAGAGCTCGTTACCATTGTAGCGGAATCGGTTTTAAGAGAATTAATCATTGAAAGTGTTGTTTA
CCTTGGTGTAGATGCTCTCACACAACTCCAAACGTTGTCTGAAGATGAACAGTTGGAACTCAACACACAGCTGAAACAACTCAACAAACCTGCAATCAAGAGTTTCAAGA
CAGAATTTGGTGATATTATAGATTGTGTTGACATCTACAAACAACCTTCTCTTGATCATCCTTTGCTCAAAAACCATGCAATCCAGATGAAGCCAACAACAATCCCGAAA
GGGCTGATAGGCGATACATCAAAAGTCGAGAGGCTGCTGCAGGATCTTCCAAACATTAACAGCTGCCCACCAGGATCAGTGCCAATCAGAAGGACTACAAGGGAAGATCT
GATAGCAGCAAGAAGTTTCAAGCCTTTGTGGTCAGATCAAGCAACAGATAATCTCCGGCCAAGCACTACGATCGACACCGCCGGTTATCATCTTGCAACACTCAACTATA
GGGCCAAAGTCTATGGAGCAAAATCACAAATCAATGTATGGAACCCAATTCCATCAGTGGATCAATATAGTTCTGCTAGTATGTGGCTATCTCAAGGCTTTAGAGATCAA
CAGAACACCATACAAGTTGGTTGGGGAGATGCGAGCACGGGGAACTGGTTGTTCATGCTTGGAGACATATACATTGGGTACTGGCCAAAGGGACTGCTGCCAGGCTTGGA
AAATGGAGCAACTACTGCAGCATGGGGAGGGGAAATTTACAGCCCTACAACAGAAGCAGGGCCAGCCATGGGGAGTGGCCATTTTCCTGAAGAGGGTTTCAGTAAAAGTG
CTTTTGTGAATCAGATTCAAGTGGCAGAGTCTAGTATTTCAAGGGGATTTGTTGATCCAGTGGGTTCACAGCTCAGTATTGTTTTGGACAAACCTATCTGTTTTGGGCTC
ATTAATAAGTTTACTGAAGATGGGAACTGGGGACATCATATCTTCTTTGGAGGGCCAAGTGGCTGCAGGTGAAGAAAAACTGATTCCAT
Protein sequence
MGTKHMQSNSSMVGRVSKSHKMLCKAVDRPSKELQQPSPKSLVTASSKNLDSTASTRVACCRSRRFCTCKSCMEYGRHNEISLRLVQKNEAAEPFSSKKFVGVADKQCKQ
LLDALGIFNSNKELFVNLLQDPNSLLIKRIEDSTDSRNRKQQMMTFFDSRLSENKIREVGEYEEPDYSKKLKPCDRLPTEDSDDSLSLERIVVLKPNPTSSLPAAVGTNH
CPSLKSHSSFIKNGQSDKRTLFSFRQIKRKMKQAMRVGKKEPECLSTNGMAKKTPPVFRAPKDDGKQMVMEATGRSSYNNIQTDDKIISSSFQDSLERDQLDRAFYSRNG
DKTASTSESTDKKVGQSAVTSNLKRQKSKKREGDKEVSRKMKAKPWGWAMCFSDDDILPSNKPGCHTPSNTRYSHLSNKKFIYEKKSKSQNDVEQSCKTPEMVKIGAPFA
EARRDDDQLHTSTTELNVSPVIFSDVKVDQDPIIEGSVKFVKDIATIHQERNSFCEESSRFDNDCNTCYCQRTNKIKGFEEKGNPELSKLNLPLEVQPSALSVDPFPSSS
LQFQTVEDPNGLCDREVQPLPETIHDDQLLIDATSSNLAFTPGTAEPSSEALPINFEKDQCTGLARLQEVLNPAIASFNCCSSISQCVLELLQVSKQNWNELSLDCHSST
WLQISFVDKVKIFSSQLCGDCVLLFDYFNEVLEDVFHCYIRCSPWLSSYKAHIQAPDKESAFHHEVIQHVDWSLLQQQPPQTLDQLCLRDLRSRKWINYPTETEELVTIV
AESVLRELIIESVVYLGVDALTQLQTLSEDEQLELNTQLKQLNKPAIKSFKTEFGDIIDCVDIYKQPSLDHPLLKNHAIQMKPTTIPKGLIGDTSKVERLLQDLPNINSC
PPGSVPIRRTTREDLIAARSFKPLWSDQATDNLRPSTTIDTAGYHLATLNYRAKVYGAKSQINVWNPIPSVDQYSSASMWLSQGFRDQQNTIQVGWGDASTGNWLFMLGD
IYIGYWPKGLLPGLENGATTAAWGGEIYSPTTEAGPAMGSGHFPEEGFSKSAFVNQIQVAESSISRGFVDPVGSQLSIVLDKPICFGLINKFTEDGNWGHHIFFGGPSGC
R
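
The protein sequence can be checked against the CDS sequence by reading the CDS codon by codon. The following is a minimal sketch of that check, assuming the standard nuclear genetic code applies to this gene (the page does not state which translation table was used). The function name `translate_cds` is hypothetical and is not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear table applies to this gene.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS into a protein string, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence lines
    from the page can be pasted in directly. Codons that contain characters
    other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    # Walk the sequence in complete codons only; a trailing partial codon is skipped.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS sequence listed above.
print(translate_cds("ATGGGAACAAAGCATATGCAA"))
```

Passing the full CDS sequence to `translate_cds` and comparing the result with the protein sequence is a quick way to confirm that the two listings agree.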