CuGenDBv2

Tan0000416 (gene) of Snake gourd v1 genome

Gene ID: Tan0000416
Organism: Trichosanthes anguina (Snake gourd v1)
Description: UV-B-induced protein At3g17800, chloroplastic-like
Genome location: LG06:13460470..13463941
RNA-Seq Expression: Tan0000416
Synteny: Tan0000416
Gene Ontology terms: NA
InterPro domains: IPR008479 - Protein of unknown function DUF760
                  IPR038925 - UV-B-induced protein At3g17800-like
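
The genome location field above follows a `SEQID:START..END` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the field can be split and the genomic span computed as follows. The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG06:13460470..13463941")
print(seqid, start, end, span_length(start, end))
# prints: LG06 13460470 13463941 3472
```

Under that assumption the gene spans 3,472 bp on LG06, which is longer than the mRNA listed in the Sequences section.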


Homology
GenBank top hits (e value, %identity, alignment)
KAG6593011.1 UV-B-induced protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.3e-198; %identity: 90.84)
Query:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSR--SLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELK
        MDHCLS HKSFPLKS P  SSIPK KPHPFSSSDF+ RSR RSR++  SL+VLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSV+DELK
Subjt:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSR--SLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELK

Query:  RLADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITT
        RLA DRDAALNRM+L + SDEALLHRRIAQLKEHECQ+AVQDVMYMLIFY+FSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITT
Subjt:  RLADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITT

Query:  VIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQ
        VIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSA+LRH+LEQ+LAIPNTHRNG QKT LQFPE+CL GFRNLLSGRLS+MLSVP  QVLNSNQ
Subjt:  VIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQ

Query:  ETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN
        ETE EKLKRFLTGFDS+ALQRCAKLKSKEALNLIENHSCALLGN+EAGFFESDEVIVTSF+SLKRLVLEAVAFGSFLWDAEEYVD IYKLKEN
Subjt:  ETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN

XP_022960014.1 UV-B-induced protein At3g17800, chloroplastic-like [Cucurbita moschata] (e value: 1.9e-198; %identity: 91.09)
Query:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSR--SLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELK
        MDHCLS HKSFPLKS P  SSIPK KPHPFSSSDF+ RSR RSR++  SL+VLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSV+DELK
Subjt:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSR--SLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELK

Query:  RLADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITT
        RLA DRDAALNRM+L A SDEALLHRRIAQLKEHECQ+AVQDVMYMLIFY+FSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITT
Subjt:  RLADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITT

Query:  VIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQ
        VIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSA+LRH+LEQ+LAIPNTHRNG QKT LQFPE+CL GFRNLLSGRLS+MLSVP  QVLNSNQ
Subjt:  VIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQ

Query:  ETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN
        ETE EKLKRFLTGFDS+ALQRCAKLKSKEALNLIENHSCALLGN+EAGFFESDEVIVTSF+SLKRLVLEAVAFGSFLWDAEEYVD IYKLKEN
Subjt:  ETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN

XP_023004723.1 UV-B-induced protein At3g17800, chloroplastic-like [Cucurbita maxima] (e value: 2.1e-197; %identity: 90.33)
Query:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSR--SLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELK
        MDHCL  HKSFPLKS P  SSIPK KPHPFSSSDF+ RSR RSR++  SL+VLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSV+DELK
Subjt:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSR--SLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELK

Query:  RLADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITT
        RLA DRDAALNRM+L + SDEALLHRRIAQLKEHECQ+AVQDVMYMLIFY+FSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITT
Subjt:  RLADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITT

Query:  VIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQ
        VIGLRADSSVTDNWAMTNIRQ HLGRVYVASILYGYFLKSA+LRHHLEQ+LAIPNTHRNG +KTFLQFPE+CL GFRNLLSGRLS+MLSVP  QVLNSNQ
Subjt:  VIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQ

Query:  ETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN
        ETE EKLKRFLTGFDS+ALQRCAK+KSKEALNLIENHSCALLGN EAGFFESDEVIVTSF+SLKRLVLEAVAFGSFLWDAEEYVD IYKLKEN
Subjt:  ETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN

XP_023513957.1 UV-B-induced protein At3g17800, chloroplastic-like [Cucurbita pepo subsp. pepo] (e value: 1.5e-198; %identity: 90.84)
Query:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSR--SLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELK
        MDHCLS HKSFPLKS P  SSIPK KPHPFSSSDF+ RSR RSR++  SL+VLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSV+DELK
Subjt:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSR--SLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELK

Query:  RLADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITT
        RLA DRDAALNRM+L + SDEALLHRRIAQLKEHECQ+AVQDVMYMLIFY+FSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITT
Subjt:  RLADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITT

Query:  VIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQ
        VIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSA+LRHHLEQ+LAIPNTHRNG QKT LQFPE+CL GFRNLLSGRLS+MLSVP  QVLNSNQ
Subjt:  VIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQ

Query:  ETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN
        ETE EKLKRFLTGFDS+ALQRCAKLKSKEALNLIENHSCALLGN+EAGFFESDEV VTSF+SLKRLVLEAVAFGSFLWDAEEYVD IYKLKEN
Subjt:  ETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN

XP_038898285.1 UV-B-induced protein At3g17800, chloroplastic [Benincasa hispida] (e value: 7.8e-192; %identity: 89.2)
Query:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSR------SRSRSLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQ
        MDHCLSLHKSFPLKSLPSS+     K +PF SSDFA     +      +RS S VVLASAGASHCEFGSLNTPL+P+SS GKHLSRVLQNYRQLFHVSV+
Subjt:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSR------SRSRSLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQ

Query:  DELKRLADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKE
        DELKRLAD+RDAAL+RMLL+A SDEALLHRRIAQLKE ECQIAVQDVMYMLIFY+FSEIRVNLVPKLSRCVYNGRLEI PCKDWELESIYELEVLEMIKE
Subjt:  DELKRLADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKE

Query:  HITTVIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNG-NQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQV
        HITTVIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNG NQKTFLQFPE+CL GFRN LSGRLSNMLSVPHNQV
Subjt:  HITTVIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNG-NQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQV

Query:  LNSNQETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN
        L+SNQETE EKLKRFLTGFDS+ALQRCAKLKSKEALNLIENHSCALLGNEE GFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN
Subjt:  LNSNQETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN

TrEMBL top hits (e value, %identity, alignment)
A0A0A0K582 Uncharacterized protein (e value: 8.2e-187; %identity: 87.76)
Query:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSRSLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELKRL
        MDHCLSLHKSFPL SLPSSSS PKFKP+  S           SRS S +VLAS GASHCEFGSLNTPLDP+SS GKHLSRVLQNYR LFHVSV+DELK L
Subjt:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSRSLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELKRL

Query:  ADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTVI
        AD RDAAL+RML++A SDEALLHRRIAQLKEHECQIAVQDVMYMLIFY+FSEIRVNLVPKLSRCVYNGRLEI PCKDWELESIYELEVL MIKEHITTVI
Subjt:  ADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTVI

Query:  GLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRN-GNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQE
        GLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQ+LAIPNTHRN G+ KTFLQFPE+CL GFRNLLSGRLSNMLSVPHNQVL+S+QE
Subjt:  GLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRN-GNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQE

Query:  TEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN
        TE EKLKRFLTGFDS+ALQRCAKLKSKEALNLIENHS ALLGNEE GFFE++EVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVD IYKLKEN
Subjt:  TEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN

A0A1S3C721 UV-B-induced protein At3g17800, chloroplastic (e value: 4.8e-187; %identity: 88.8)
Query:  MDHCLSLHKSFPLKSLP-SSSSIPKFKPHPFSSSDFAARSRSRSRSRSLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELKR
        MDHCLSLHKSFPL SLP SSSS PK KP+ FSS     +    SRS SLVVLAS GASHCEFGSLNTPLDP+SS GKHLSRVLQNYR LFHVSV+DELKR
Subjt:  MDHCLSLHKSFPLKSLP-SSSSIPKFKPHPFSSSDFAARSRSRSRSRSLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELKR

Query:  LADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTV
        LADDR AAL+RMLL A SDEALLHRRIAQLKEHECQIAVQDVMYMLIFY+FSEIRVNLVPKLSRCVYNGRLEI PCKDWELESIYELEVLEMIKEHITTV
Subjt:  LADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTV

Query:  IGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRN-GNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQ
        IGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQ+LAIPN HRN G+ KTFLQFPE+CL GFRNLLSGRLSNMLSVPHNQVLNS+Q
Subjt:  IGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRN-GNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQ

Query:  ETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN
        ETE EKLKRFLTGFDS+ALQRCAKLKSKEALNLIENHS ALLGNEE GFFE++EV+VTSFSSLKRLVLEAVAFGSFLWDAEEYVD IYKLKEN
Subjt:  ETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN

A0A5A7SN39 UV-B-induced protein (e value: 4.8e-187; %identity: 88.8)
Query:  MDHCLSLHKSFPLKSLP-SSSSIPKFKPHPFSSSDFAARSRSRSRSRSLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELKR
        MDHCLSLHKSFPL SLP SSSS PK KP+ FSS     +    SRS SLVVLAS GASHCEFGSLNTPLDP+SS GKHLSRVLQNYR LFHVSV+DELKR
Subjt:  MDHCLSLHKSFPLKSLP-SSSSIPKFKPHPFSSSDFAARSRSRSRSRSLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELKR

Query:  LADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTV
        LADDR AAL+RMLL A SDEALLHRRIAQLKEHECQIAVQDVMYMLIFY+FSEIRVNLVPKLSRCVYNGRLEI PCKDWELESIYELEVLEMIKEHITTV
Subjt:  LADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTV

Query:  IGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRN-GNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQ
        IGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQ+LAIPN HRN G+ KTFLQFPE+CL GFRNLLSGRLSNMLSVPHNQVLNS+Q
Subjt:  IGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRN-GNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQ

Query:  ETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN
        ETE EKLKRFLTGFDS+ALQRCAKLKSKEALNLIENHS ALLGNEE GFFE++EV+VTSFSSLKRLVLEAVAFGSFLWDAEEYVD IYKLKEN
Subjt:  ETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN

A0A6J1H6H2 UV-B-induced protein At3g17800, chloroplastic-like (e value: 9.3e-199; %identity: 91.09)
Query:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSR--SLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELK
        MDHCLS HKSFPLKS P  SSIPK KPHPFSSSDF+ RSR RSR++  SL+VLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSV+DELK
Subjt:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSR--SLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELK

Query:  RLADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITT
        RLA DRDAALNRM+L A SDEALLHRRIAQLKEHECQ+AVQDVMYMLIFY+FSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITT
Subjt:  RLADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITT

Query:  VIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQ
        VIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSA+LRH+LEQ+LAIPNTHRNG QKT LQFPE+CL GFRNLLSGRLS+MLSVP  QVLNSNQ
Subjt:  VIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQ

Query:  ETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN
        ETE EKLKRFLTGFDS+ALQRCAKLKSKEALNLIENHSCALLGN+EAGFFESDEVIVTSF+SLKRLVLEAVAFGSFLWDAEEYVD IYKLKEN
Subjt:  ETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN

A0A6J1L0E3 UV-B-induced protein At3g17800, chloroplastic-like (e value: 1.0e-197; %identity: 90.33)
Query:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSR--SLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELK
        MDHCL  HKSFPLKS P  SSIPK KPHPFSSSDF+ RSR RSR++  SL+VLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSV+DELK
Subjt:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSR--SLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELK

Query:  RLADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITT
        RLA DRDAALNRM+L + SDEALLHRRIAQLKEHECQ+AVQDVMYMLIFY+FSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITT
Subjt:  RLADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITT

Query:  VIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQ
        VIGLRADSSVTDNWAMTNIRQ HLGRVYVASILYGYFLKSA+LRHHLEQ+LAIPNTHRNG +KTFLQFPE+CL GFRNLLSGRLS+MLSVP  QVLNSNQ
Subjt:  VIGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQ

Query:  ETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN
        ETE EKLKRFLTGFDS+ALQRCAK+KSKEALNLIENHSCALLGN EAGFFESDEVIVTSF+SLKRLVLEAVAFGSFLWDAEEYVD IYKLKEN
Subjt:  ETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN

SwissProt top hits (e value, %identity, alignment)
Q9LVJ0 UV-B-induced protein At3g17800, chloroplastic (e value: 2.0e-44; %identity: 33.97)
Query:  SSSDFAARSRSRSRSRSLVVLASAGASHCEFGSLNTPLDP---RSSAGKHLSRVLQNYRQLFHVSVQDELKRLADDRDA-ALNRMLLTATSDEALLHRRI
        SS    + + +    RS VV AS+ ++    GS   P+ P   +S AG+ LS++L ++  L   +V+ +L++L  DRD+   N+   +    + +L+RRI
Subjt:  SSSDFAARSRSRSRSRSLVVLASAGASHCEFGSLNTPLDP---RSSAGKHLSRVLQNYRQLFHVSVQDELKRLADDRDA-ALNRMLLTATSDEALLHRRI

Query:  AQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLS-RCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTVIGLRADSSVTDNWAMTNIRQAHLGRV
        A+LKE+E +  +++++Y L+  KF E  V+LVP +S     +GR++ WP K  +LE ++  E+ EMI  H+  ++G    S + D  ++  I +  +G+V
Subjt:  AQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLS-RCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTVIGLRADSSVTDNWAMTNIRQAHLGRV

Query:  YVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQETEAEKLKRFLTGFDSDALQRCAKLKS
        Y AS++YGYFLK    R  LE+ + I     +   KT ++  E     ++  +S                   E +  +L+ ++  FD++ LQR A ++S
Subjt:  YVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQETEAEKLKRFLTGFDSDALQRCAKLKS

Query:  KEALNLIENHSCALLGNEE-----AGFFES--DEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIY
        +EA+ +IE H+ AL G  E      G  +S  DE I  SF  +KRLVLEAV FGSFLWD E +VDA Y
Subjt:  KEALNLIENHSCALLGNEE-----AGFFES--DEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIY

Arabidopsis top hits (e value, %identity, alignment)
AT1G48450.1 Protein of unknown function (DUF760) (e value: 2.4e-45; %identity: 34.74)
Query:  SFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSRSLVVLASAGASHCEFGSLNT----PLDPRSSAGKHLSRVLQNYRQLFHVSVQDELKRLADDRD
        S P   L  +SS+ KF   PF S     +SR     RS VV ASA       G  +T    PL  +S  G+ LS++L ++  L   +V+ +L++L  DRD
Subjt:  SFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSRSLVVLASAGASHCEFGSLNT----PLDPRSSAGKHLSRVLQNYRQLFHVSVQDELKRLADDRD

Query:  A-ALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKL--SRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTVIGL
        A   ++   +    + +L+RRIA++KE E + A+++++Y L+  KF +  V LVP +  S    +GR++ WP  D ELE ++  EV EMI+ H++ ++  
Subjt:  A-ALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKL--SRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTVIGL

Query:  RADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAI-PNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNS-----
        R D    D  A+  I +  +G+VY AS++YGYFLK    R  LE+ + I P     G         ++     RN          +V  NQ + S     
Subjt:  RADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAI-PNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNS-----

Query:  ------NQETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEE-----AGFFES--DEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVD
              + + +  +LK ++  FD + LQR A ++S+E++ +IE H+ AL G  E      G  +S  DE I  SF  LKRLVLEAV FGSFLWD E +VD
Subjt:  ------NQETEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEE-----AGFFES--DEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVD

Query:  AIY
        + Y
Subjt:  AIY

AT3G07310.1 Protein of unknown function (DUF760) (e value: 1.1e-103; %identity: 55.36)
Query:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSRSLVVLASAGASHCEFG-SLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELKR
        MD CLS      L+ LPS S     +        F   ++ + +  S+VV+A+AG S CE G SLN PL+PRS+ G+ L  VL N RQLFH +  DELK+
Subjt:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSRSLVVLASAGASHCEFG-SLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELKR

Query:  LADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTV
        LADDR+AAL RM L++ SDEA LHRRIA+LKE  C+ AVQD+MYMLIFYK+SEIRV LVPKLSRC+YNGRLEIWP KDWELESIY  + LE+IKEH++ V
Subjt:  LADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTV

Query:  IGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQE
        IGLR +S VTDNWA T I++ HL +VY ASILYGYFLKSA LRH LE   ++ + H +G    +L+ P I  C F                     + Q 
Subjt:  IGLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQE

Query:  TEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN
        +  ++L+ +++ FD + LQRCAK +++EA NLIE  S AL G E     ESDE IVTSFSSLKRLVLEAVAFG+FLWD E YVD  YKLKEN
Subjt:  TEAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN

AT3G17800.1 Protein of unknown function (DUF760) (e value: 1.4e-45; %identity: 33.97)
Query:  SSSDFAARSRSRSRSRSLVVLASAGASHCEFGSLNTPLDP---RSSAGKHLSRVLQNYRQLFHVSVQDELKRLADDRDA-ALNRMLLTATSDEALLHRRI
        SS    + + +    RS VV AS+ ++    GS   P+ P   +S AG+ LS++L ++  L   +V+ +L++L  DRD+   N+   +    + +L+RRI
Subjt:  SSSDFAARSRSRSRSRSLVVLASAGASHCEFGSLNTPLDP---RSSAGKHLSRVLQNYRQLFHVSVQDELKRLADDRDA-ALNRMLLTATSDEALLHRRI

Query:  AQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLS-RCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTVIGLRADSSVTDNWAMTNIRQAHLGRV
        A+LKE+E +  +++++Y L+  KF E  V+LVP +S     +GR++ WP K  +LE ++  E+ EMI  H+  ++G    S + D  ++  I +  +G+V
Subjt:  AQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLS-RCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTVIGLRADSSVTDNWAMTNIRQAHLGRV

Query:  YVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQETEAEKLKRFLTGFDSDALQRCAKLKS
        Y AS++YGYFLK    R  LE+ + I     +   KT ++  E     ++  +S                   E +  +L+ ++  FD++ LQR A ++S
Subjt:  YVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQETEAEKLKRFLTGFDSDALQRCAKLKS

Query:  KEALNLIENHSCALLGNEE-----AGFFES--DEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIY
        +EA+ +IE H+ AL G  E      G  +S  DE I  SF  +KRLVLEAV FGSFLWD E +VDA Y
Subjt:  KEALNLIENHSCALLGNEE-----AGFFES--DEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIY

AT3G17800.2 Protein of unknown function (DUF760) (e value: 1.4e-45; %identity: 33.97)
Query:  SSSDFAARSRSRSRSRSLVVLASAGASHCEFGSLNTPLDP---RSSAGKHLSRVLQNYRQLFHVSVQDELKRLADDRDA-ALNRMLLTATSDEALLHRRI
        SS    + + +    RS VV AS+ ++    GS   P+ P   +S AG+ LS++L ++  L   +V+ +L++L  DRD+   N+   +    + +L+RRI
Subjt:  SSSDFAARSRSRSRSRSLVVLASAGASHCEFGSLNTPLDP---RSSAGKHLSRVLQNYRQLFHVSVQDELKRLADDRDA-ALNRMLLTATSDEALLHRRI

Query:  AQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLS-RCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTVIGLRADSSVTDNWAMTNIRQAHLGRV
        A+LKE+E +  +++++Y L+  KF E  V+LVP +S     +GR++ WP K  +LE ++  E+ EMI  H+  ++G    S + D  ++  I +  +G+V
Subjt:  AQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLS-RCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTVIGLRADSSVTDNWAMTNIRQAHLGRV

Query:  YVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQETEAEKLKRFLTGFDSDALQRCAKLKS
        Y AS++YGYFLK    R  LE+ + I     +   KT ++  E     ++  +S                   E +  +L+ ++  FD++ LQR A ++S
Subjt:  YVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQETEAEKLKRFLTGFDSDALQRCAKLKS

Query:  KEALNLIENHSCALLGNEE-----AGFFES--DEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIY
        +EA+ +IE H+ AL G  E      G  +S  DE I  SF  +KRLVLEAV FGSFLWD E +VDA Y
Subjt:  KEALNLIENHSCALLGNEE-----AGFFES--DEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIY

AT5G48590.1 Protein of unknown function (DUF760) (e value: 4.0e-85; %identity: 49.62)
Query:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSRSLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELKRL
        MD  LS H      SLP   S   F             SR + R  SLVV+++A +      S++ PL PRS  G+ LS VL   RQLFH +V D LK+L
Subjt:  MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSRSLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELKRL

Query:  ADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTVI
        ADD++A+L+RM L+  SDEA LHRRIAQLKE +CQIA++D+MYMLI YKFSEIRV LVPKL  C+YNGRLEI P KDWELESI+  +VLE+IKEH   VI
Subjt:  ADDRDAALNRMLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTVI

Query:  GLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQET
         LR +SS+TD+ A T I +  L +VY AS+LYGYFLKSA LRH LE                         C                     L+ +  +
Subjt:  GLRADSSVTDNWAMTNIRQAHLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQET

Query:  EAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN
          ++L+ +++ FD   L+RCAK +S EA +LIE  S AL G EE+    S E IVTSFSSLKRL+LEAVAFG+FLWD EEYVD  +KLKEN
Subjt:  EAEKLKRFLTGFDSDALQRCAKLKSKEALNLIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN
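
In BLAST-style pairwise output such as the alignments above, the unlabeled middle line marks identical positions with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. The sketch below, which assumes that convention holds for these alignments, estimates percent identity and percent positives from the middle line. The function name `midline_stats` and the ten-residue example are illustrative only; the values reported in the hit lists are computed by the search tool over the full alignment and may differ from a per-block estimate.

```python
def midline_stats(query_line: str, midline: str):
    """Return (percent_identity, percent_positives) for one alignment block.

    query_line: the aligned query row, gaps ('-') included.
    midline:    the match row under it; it may be shorter than the query
                row if trailing spaces were stripped.
    """
    aligned_length = len(query_line)
    if aligned_length == 0:
        raise ValueError("empty alignment block")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return (100.0 * identities / aligned_length,
            100.0 * positives / aligned_length)


# Ten-column toy block: nine identical positions, one mismatch.
identity, positives = midline_stats("MDHCLSLHKS", "MDHCLS HKS")
print(identity, positives)
```

For a multi-block alignment, the identity and positive counts would be summed across blocks before dividing by the total aligned length.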


Sequences
CDS sequence
ATGGACCATTGTCTCTCTCTCCACAAATCCTTCCCTCTCAAATCTCTTCCTTCTTCTTCTTCCATTCCCAAATTCAAGCCCCATCCCTTTTCCTCTTCCGATTTCGCCGC
CAGGTCCAGGTCCAGGTCCAGGTCCAGGTCCTTGGTTGTTCTTGCCAGTGCCGGAGCCAGTCACTGCGAGTTCGGCAGTTTGAATACTCCTTTGGACCCGAGGTCATCTG
CCGGGAAGCATCTCAGTAGAGTTTTGCAGAACTATAGGCAGTTGTTCCATGTTTCCGTCCAGGATGAGTTGAAGCGCTTGGCTGATGATCGTGATGCTGCTCTTAATCGT
ATGCTGCTCACCGCTACCTCTGATGAAGCCCTGCTGCATAGGAGGATTGCACAATTGAAGGAGCACGAGTGCCAAATAGCAGTTCAAGATGTCATGTACATGCTCATTTT
CTACAAATTTTCTGAAATCAGAGTTAATTTGGTACCCAAGCTCTCCAGATGTGTTTATAATGGAAGACTGGAAATTTGGCCATGCAAGGACTGGGAACTGGAATCCATTT
ATGAGTTGGAGGTCCTGGAGATGATTAAAGAACACATAACCACAGTGATTGGTTTGAGAGCAGATTCGAGTGTCACAGACAACTGGGCTATGACTAACATTAGACAAGCA
CATCTCGGCCGTGTTTATGTGGCCTCCATCTTATATGGCTACTTCTTGAAGTCTGCTATATTGAGGCATCACCTGGAGCAAAGACTAGCCATACCAAACACACACCGTAA
TGGCAATCAGAAAACTTTCCTTCAGTTCCCTGAGATATGTCTTTGTGGATTTAGAAATCTTCTCTCCGGTCGTCTTAGCAACATGCTTTCGGTGCCGCATAACCAAGTGT
TGAACAGCAACCAGGAGACAGAGGCAGAGAAGCTAAAGCGTTTCTTGACAGGGTTCGATTCCGATGCATTGCAGAGATGCGCAAAGCTGAAATCTAAGGAAGCTTTGAAT
CTGATCGAGAACCATAGCTGTGCATTGCTGGGAAATGAGGAAGCTGGCTTTTTCGAGAGCGATGAGGTGATAGTGACTTCATTTTCAAGCCTGAAGAGATTGGTTTTGGA
GGCTGTTGCTTTTGGTTCTTTCCTTTGGGATGCAGAGGAGTATGTGGACGCCATATATAAGCTCAAGGAGAACTAA
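
One way to check that the CDS above agrees with the protein sequence listed on this page is to translate it codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code; `translate` is a hypothetical helper written for this illustration, and only the first eight and last six codons of the CDS are used as input.

```python
# Standard genetic code, built in TCAG order (assumed to apply to this gene).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper().replace("U", "T")
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGGACCATTGTCTCTCTCTCCAC"))  # first eight codons of the CDS
print(translate("AAGCTCAAGGAGAACTAA"))        # last six codons, including the stop
```

The two fragments translate to `MDHCLSLH` and `KLKEN`, matching the start and end of the protein sequence shown for this gene.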
mRNA sequence
CCGTACTATATCTTTCCCATCTTCACTCTCACCCTCAGCCCTATCCATTTCTCTCTCTCTCTCTTTTCCTCTCTGAACTTATCTTCTCATTTTTTTGCAGAGCTTCAACA
TTCCTGTTGAATGGACCATTGTCTCTCTCTCCACAAATCCTTCCCTCTCAAATCTCTTCCTTCTTCTTCTTCCATTCCCAAATTCAAGCCCCATCCCTTTTCCTCTTCCG
ATTTCGCCGCCAGGTCCAGGTCCAGGTCCAGGTCCAGGTCCTTGGTTGTTCTTGCCAGTGCCGGAGCCAGTCACTGCGAGTTCGGCAGTTTGAATACTCCTTTGGACCCG
AGGTCATCTGCCGGGAAGCATCTCAGTAGAGTTTTGCAGAACTATAGGCAGTTGTTCCATGTTTCCGTCCAGGATGAGTTGAAGCGCTTGGCTGATGATCGTGATGCTGC
TCTTAATCGTATGCTGCTCACCGCTACCTCTGATGAAGCCCTGCTGCATAGGAGGATTGCACAATTGAAGGAGCACGAGTGCCAAATAGCAGTTCAAGATGTCATGTACA
TGCTCATTTTCTACAAATTTTCTGAAATCAGAGTTAATTTGGTACCCAAGCTCTCCAGATGTGTTTATAATGGAAGACTGGAAATTTGGCCATGCAAGGACTGGGAACTG
GAATCCATTTATGAGTTGGAGGTCCTGGAGATGATTAAAGAACACATAACCACAGTGATTGGTTTGAGAGCAGATTCGAGTGTCACAGACAACTGGGCTATGACTAACAT
TAGACAAGCACATCTCGGCCGTGTTTATGTGGCCTCCATCTTATATGGCTACTTCTTGAAGTCTGCTATATTGAGGCATCACCTGGAGCAAAGACTAGCCATACCAAACA
CACACCGTAATGGCAATCAGAAAACTTTCCTTCAGTTCCCTGAGATATGTCTTTGTGGATTTAGAAATCTTCTCTCCGGTCGTCTTAGCAACATGCTTTCGGTGCCGCAT
AACCAAGTGTTGAACAGCAACCAGGAGACAGAGGCAGAGAAGCTAAAGCGTTTCTTGACAGGGTTCGATTCCGATGCATTGCAGAGATGCGCAAAGCTGAAATCTAAGGA
AGCTTTGAATCTGATCGAGAACCATAGCTGTGCATTGCTGGGAAATGAGGAAGCTGGCTTTTTCGAGAGCGATGAGGTGATAGTGACTTCATTTTCAAGCCTGAAGAGAT
TGGTTTTGGAGGCTGTTGCTTTTGGTTCTTTCCTTTGGGATGCAGAGGAGTATGTGGACGCCATATATAAGCTCAAGGAGAACTAATCATCTGGCAAATTTGGAGGTATT
TCGTAGATATAATAACACTTTTCTCTGTATATAGGATGAGTTGCAGACACATTTCAGGAAAGTTCACCATTTGTATAGAGTTCTATTTCTATATATATAATTTCTTATGA
ATCTGAGATGATGAATTCTCTTAAGCAATTGTTACATATGAGTTTATCAGTGAGTTCCTTGAGCCTTGACTTATTTGCCTGCTTCAAATTTCTTTTTGGTATTGGCTGAA
ATTGAAGAAATTTAAAGCATCTTGTAGATTGCTCAACTGGATTGAAATAACTTGAGAGAATGTGAAACCTGAATGGTGAAAAAATAGGGTGGAT
Protein sequence
MDHCLSLHKSFPLKSLPSSSSIPKFKPHPFSSSDFAARSRSRSRSRSLVVLASAGASHCEFGSLNTPLDPRSSAGKHLSRVLQNYRQLFHVSVQDELKRLADDRDAALNR
MLLTATSDEALLHRRIAQLKEHECQIAVQDVMYMLIFYKFSEIRVNLVPKLSRCVYNGRLEIWPCKDWELESIYELEVLEMIKEHITTVIGLRADSSVTDNWAMTNIRQA
HLGRVYVASILYGYFLKSAILRHHLEQRLAIPNTHRNGNQKTFLQFPEICLCGFRNLLSGRLSNMLSVPHNQVLNSNQETEAEKLKRFLTGFDSDALQRCAKLKSKEALN
LIENHSCALLGNEEAGFFESDEVIVTSFSSLKRLVLEAVAFGSFLWDAEEYVDAIYKLKEN