; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g07380 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g07380
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptiontudor domain-containing protein 3
Genome locationchr6:5360152..5370495
RNA-Seq ExpressionMoc06g07380
SyntenyMoc06g07380
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR002794 - Protein of unknown function DUF92, TMEM19
IPR013894 - RecQ mediated genome instability protein, N-terminal
IPR033472 - RecQ mediated genome instability protein, DUF1767
IPR042470 - RecQ mediated genome instability protein, N-terminal, subdomain 2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589889.1 Protein PGR, partial [Cucurbita argyrosperma subsp. sororia]4.0e-29775.03Show/hide
Query:  DNSVAVLETLRARGWSFGDLDEVRGVIMISSALADDPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASS
        D SV+VLETLR RGW+FGDLDEVR VIMI SALADDP+SVVDSVE EL+NMDLRS GGKSLP+ SLLRKSSR+LGP+VLQISSVRDISRSSLDGM KAS+
Subjt:  DNSVAVLETLRARGWSFGDLDEVRGVIMISSALADDPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASS

Query:  GHRLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFE
          RLLRFGL+DGH+EITAIEYSHIPS+ +DIPPGTKVRLE K+PVY GI+CLSSKGLTVLGG+VPTLYEEWKMNQKYSGLSRAS+RLSQGGDVDGPPPF 
Subjt:  GHRLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFE

Query:  KLQVGAP-RKFSQKEKSSYQQESSSKSNTPTADSGNIGSKSTTLQQSIDVKATNSVNSASGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLHKMSQ
        K QVGAP +KFSQK  SSYQ ESSSKSN P+ADSGNI  KST  QQS D KA NSVNSAS VEK+EEKPSSSETRP+EVVEAVPVQNQAASQKLLHKMSQ
Subjt:  KLQVGAP-RKFSQKEKSSYQQESSSKSNTPTADSGNIGSKSTTLQQSIDVKATNSVNSASGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLHKMSQ

Query:  QDGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSHI------------IR-----------------
        QDGN RHF NR HRGKGRMEDP VYTLEEYERRKSGT+Q+PK  SS T+ DE+LA +LQ QFDLE+SH+            IR                 
Subjt:  QDGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSHI------------IR-----------------

Query:  -----------------------------------------------------------LKIKMEY---NLIQPSVAVLISSIIALRAYRRKSLDLSGAL
                                                                   LKI+ME+   NLIQ SVAV+ISSII++ AYRRKSL+LSGAL
Subjt:  -----------------------------------------------------------LKIKMEY---NLIQPSVAVLISSIIALRAYRRKSLDLSGAL

Query:  AGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNVSGWQDKCLDSKDSALVTGLIGGILGHY
        AGFIVMSTHFAISYRYGAVLLVFF TSSKLTKVG EKKRV++ADFKEGGQRNWIQVL NSGIATVLAV+IW ++GWQDKCLDSKDSA+VT LIGGILGHY
Subjt:  AGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNVSGWQDKCLDSKDSALVTGLIGGILGHY

Query:  SCCNGDTWSSELGILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYGTALKQLLVIPLAAMAGLCGSVIDSLLGATVQ
        SCCNGDTWSSELGILSDATPRLITTFKPVRKGTNGAVT AGLLAA AAG VIGL FVL+GFFT +CAYGTALKQLLVIPLAA+AGLCGSVIDSLLGAT+Q
Subjt:  SCCNGDTWSSELGILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYGTALKQLLVIPLAAMAGLCGSVIDSLLGATVQ

Query:  FSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF
        FSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLT++LT+I+CIYIF
Subjt:  FSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF

OMO57993.1 hypothetical protein COLO4_34948 [Corchorus olitorius]6.8e-22863.32Show/hide
Query:  AVLETLRARGWSFGDLDEVRGVIMISSALAD--DPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASSGH
        AV+ETLRARGW FGDLD+VR VI++ +AL+D  D  SV DS E EL+NMDLRS GGKSLPE S LRK S I+GP VLQISSVRDISRSS++     SS  
Subjt:  AVLETLRARGWSFGDLDEVRGVIMISSALAD--DPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASSGH

Query:  RLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFEKL
        RLLR  L+DGHSEITA+EYSH+P++ +++ PGTK+RLENK+ V+GGI+CL+ K + +LGG+V +LYEEW+MNQKYSG SR+S+R +Q     GPP FEKL
Subjt:  RLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFEKL

Query:  QVGAP--RKFSQKEKSSYQQESSSKSNTPT-ADSGNIGSKSTTLQQSIDVKATNSVNSA---SGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLHK
        Q+ AP   +F Q  KSSY  ES  K + PT A S     + +   Q+++ K+ N  N     S  EK EE PSSSETRPKEV E+VP+QNQAASQKLL K
Subjt:  QVGAP--RKFSQKEKSSYQQESSSKSNTPT-ADSGNIGSKSTTLQQSIDVKATNSVNSA---SGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLHK

Query:  MSQQDGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSH---IIRLK---------------------
        MS  + + RH   RK+RGKG+ E+P V TL+E+E+ K+G     ++    TS DE LAWQLQ Q DLED H   + ++K                     
Subjt:  MSQQDGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSH---IIRLK---------------------

Query:  ---------IKMEYNLIQPSVAVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQ
                  +ME  LIQP +AVLISS+IA+R+YRRKSLDLSGAL+G IVM+ HFA+ YR+GA+LL FF +SSKLTKVGEEKKR VDADFKEGGQRNWIQ
Subjt:  ---------IKMEYNLIQPSVAVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQ

Query:  VLSNSGIATVLAVVIWNVSGWQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLA
        VL NSGIATVL+V+IWN++G +DKCLDSK+S ++T LIGGI+GHYSCCNGDTWSSE+G+LS+  PRLITTFKPVR+GTNG VT  GLLAA AAG VIGL 
Subjt:  VLSNSGIATVLAVVIWNVSGWQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLA

Query:  FVLIGFFTTECAYGTALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF
        FVL+GFFTT C+ G A+KQLLVIPL+A+AGL GS+IDSLLGAT+QFSGFCTVRNKVVGKPGPTVKKISGL+ILDNNAVNLVSVLLTTLLT+++C+YIF
Subjt:  FVLIGFFTTECAYGTALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF

OMO80812.1 hypothetical protein CCACVL1_12732 [Corchorus capsularis]2.2e-23164.33Show/hide
Query:  AVLETLRARGWSFGDLDEVRGVIMISSALAD--DPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASSGH
        AV+ETLRARGW FGDLD+VR VI++ +AL+D  D  SV DS E EL+NMDLR+ GGKSLPE S LRK S I+GP VLQI SVRDISRSS++     SS  
Subjt:  AVLETLRARGWSFGDLDEVRGVIMISSALAD--DPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASSGH

Query:  RLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFEKL
        RLLR  L+DGHSEITA+EYSH+P++ +++ PGTK+RLENK+ ++GGI+CL+ K + +LGG+V +LYEEW+MNQKYSG SR+S+R +Q     GPPPFEKL
Subjt:  RLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFEKL

Query:  QVGAP--RKFSQKEKSSYQQESSSKSNTPT-ADSGNIGSKSTTLQQSIDVKATNSVNSA---SGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLHK
        Q+ AP   +  Q  KSSY  ES  K + PT A S     + +   Q+++ K+ N+ N     S  EK EE PSSSETRPKEV E+VP+QNQAASQKLL K
Subjt:  QVGAP--RKFSQKEKSSYQQESSSKSNTPT-ADSGNIGSKSTTLQQSIDVKATNSVNSA---SGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLHK

Query:  MSQQDGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSHIIRLK-------------------IKMEY
        MS  + + RH   RK RGKG+ E+P V TL+E+E+ K+G     ++    TS DE LAWQLQ Q DLEDSH+ R+                     +ME 
Subjt:  MSQQDGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSHIIRLK-------------------IKMEY

Query:  NLIQPSVAVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVV
         LIQP +AVLISS+IA+R+YRRKSLDLSGAL+G IVM+ HFA+ YR+GA+LL FFF+SSKLTKVGEEKKR VDADFKEGGQRNWIQVL NSGIATVL+V+
Subjt:  NLIQPSVAVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVV

Query:  IWNVSGWQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYG
        IWN++GW+DKCLDSK+S ++T LIGGI+GHYSCCNGDTWSSE+G+LS+  PRLITTFKPVR+GTNG VT  GLLAA AAG VIGL FVL+GFFTT C+ G
Subjt:  IWNVSGWQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYG

Query:  TALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF
         A+KQLLVIPL+A+AGL GS+IDSLLGAT+QFSGFCTVRNKVVGKPG TVKKISGL+ILDNNAVNLVS+LLTTLLT+++C+YIF
Subjt:  TALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF

RLM92926.1 tudor domain-containing protein 3 isoform X1 [Panicum miliaceum]3.5e-16848.91Show/hide
Query:  VLETLRARGWSFGD-LDEVRGVIMISSALADDPSSVVDSVELELLNMDLRSFGGKSLPE----PSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASS
        +L++L  RGW F D  DE    ++++S     P+   ++VE EL++MDLR+FGGKSLP+     +  ++ S + GPIVLQ+ S+RDI  SS+D   K   
Subjt:  VLETLRARGWSFGD-LDEVRGVIMISSALADDPSSVVDSVELELLNMDLRSFGGKSLPE----PSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASS

Query:  GHRLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFE
           LLRFGL+DG  E  AIE+S IP + E+I PGTK+RLENK P+  GI+CLS+K ++V+GG V +LYEEW+MNQKYSGLSR S+RLSQ  D  GPPPFE
Subjt:  GHRLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFE

Query:  KLQVGAPRKFSQKEKSSYQQESSSKSNTPTADSGNIGSKSTTLQQSIDVKATNSVNSASGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLHKMSQQ
        KL + A        +++  Q   ++    T D   + S    + +      +N V+  +   K+E K  + ++RPKEV E VPVQNQAA+QKLL KMSQ 
Subjt:  KLQVGAPRKFSQKEKSSYQQESSSKSNTPTADSGNIGSKSTTLQQSIDVKATNSVNSASGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLHKMSQQ

Query:  DGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKS-GTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSHI---------------IRLKIKMEYN------
            R   + + +GKGR ED  V+TL+E+E++K+ G+    ++    TS DE+LA QLQ Q DLED H+               I L   +++N      
Subjt:  DGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKS-GTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSHI---------------IRLKIKMEYN------

Query:  ---------LIQPSVAVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWI-------
                  I+ +VAV     IA RA RRKS++ S    G   M  H    YR+  +LLVFFFTSS++T+VGE +KR +D +FKEGGQRNW        
Subjt:  ---------LIQPSVAVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWI-------

Query:  -------------------------------------QVLSNSGIATVLAVVIWNVSGWQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSD
                                             QVLSNSGIA++  V+I +V+G  D+CLDSK+S LVT LIGG++GHY+CCNGDTWSSELGILS 
Subjt:  -------------------------------------QVLSNSGIATVLAVVIWNVSGWQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSD

Query:  ATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYGTALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGP
        A PR+ITTFK VRKGTNG VT  GLLAAAAAG  IGLAFVLIGF TT+CA     +QLLVIPLA  AGLCGS+IDS+LGATVQ+SGFC+VR KVVG  GP
Subjt:  ATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYGTALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGP

Query:  TVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF
        TV +ISG+NILDNN VN+VSV LTT+LTA++C YIF
Subjt:  TVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF

XP_022145032.1 tudor domain-containing protein 3 [Momordica charantia]1.9e-20199.73Show/hide
Query:  METTTDNSVAVLETLRARGWSFGDLDEVRGVIMISSALADDPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGM
        METTTDNSVAVLETLRARGWSFGDLDEVRGVIMISSALADDPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGM
Subjt:  METTTDNSVAVLETLRARGWSFGDLDEVRGVIMISSALADDPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGM

Query:  LKASSGHRLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDG
        LKASSGHRLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDG
Subjt:  LKASSGHRLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDG

Query:  PPPFEKLQVGAPRKFSQKEKSSYQQESSSKSNTPTADSGNIGSKSTTLQQSIDVKATNSVNSASGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLH
        PPPFEKLQVGAPRKFSQKEKSSYQQESSSKSNTPTADSGNIGSKSTTLQQSIDVKATNSVNSASGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLH
Subjt:  PPPFEKLQVGAPRKFSQKEKSSYQQESSSKSNTPTADSGNIGSKSTTLQQSIDVKATNSVNSASGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLH

Query:  KMSQQDGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSHI
        KMSQQDGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSH+
Subjt:  KMSQQDGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSHI

TrEMBL top hitse value%identityAlignment
A0A1R3GIS4 DUF1767 domain-containing protein3.3e-22863.32Show/hide
Query:  AVLETLRARGWSFGDLDEVRGVIMISSALAD--DPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASSGH
        AV+ETLRARGW FGDLD+VR VI++ +AL+D  D  SV DS E EL+NMDLRS GGKSLPE S LRK S I+GP VLQISSVRDISRSS++     SS  
Subjt:  AVLETLRARGWSFGDLDEVRGVIMISSALAD--DPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASSGH

Query:  RLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFEKL
        RLLR  L+DGHSEITA+EYSH+P++ +++ PGTK+RLENK+ V+GGI+CL+ K + +LGG+V +LYEEW+MNQKYSG SR+S+R +Q     GPP FEKL
Subjt:  RLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFEKL

Query:  QVGAP--RKFSQKEKSSYQQESSSKSNTPT-ADSGNIGSKSTTLQQSIDVKATNSVNSA---SGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLHK
        Q+ AP   +F Q  KSSY  ES  K + PT A S     + +   Q+++ K+ N  N     S  EK EE PSSSETRPKEV E+VP+QNQAASQKLL K
Subjt:  QVGAP--RKFSQKEKSSYQQESSSKSNTPT-ADSGNIGSKSTTLQQSIDVKATNSVNSA---SGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLHK

Query:  MSQQDGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSH---IIRLK---------------------
        MS  + + RH   RK+RGKG+ E+P V TL+E+E+ K+G     ++    TS DE LAWQLQ Q DLED H   + ++K                     
Subjt:  MSQQDGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSH---IIRLK---------------------

Query:  ---------IKMEYNLIQPSVAVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQ
                  +ME  LIQP +AVLISS+IA+R+YRRKSLDLSGAL+G IVM+ HFA+ YR+GA+LL FF +SSKLTKVGEEKKR VDADFKEGGQRNWIQ
Subjt:  ---------IKMEYNLIQPSVAVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQ

Query:  VLSNSGIATVLAVVIWNVSGWQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLA
        VL NSGIATVL+V+IWN++G +DKCLDSK+S ++T LIGGI+GHYSCCNGDTWSSE+G+LS+  PRLITTFKPVR+GTNG VT  GLLAA AAG VIGL 
Subjt:  VLSNSGIATVLAVVIWNVSGWQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLA

Query:  FVLIGFFTTECAYGTALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF
        FVL+GFFTT C+ G A+KQLLVIPL+A+AGL GS+IDSLLGAT+QFSGFCTVRNKVVGKPGPTVKKISGL+ILDNNAVNLVSVLLTTLLT+++C+YIF
Subjt:  FVLIGFFTTECAYGTALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF

A0A1R3IDZ3 DUF1767 domain-containing protein1.1e-23164.33Show/hide
Query:  AVLETLRARGWSFGDLDEVRGVIMISSALAD--DPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASSGH
        AV+ETLRARGW FGDLD+VR VI++ +AL+D  D  SV DS E EL+NMDLR+ GGKSLPE S LRK S I+GP VLQI SVRDISRSS++     SS  
Subjt:  AVLETLRARGWSFGDLDEVRGVIMISSALAD--DPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASSGH

Query:  RLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFEKL
        RLLR  L+DGHSEITA+EYSH+P++ +++ PGTK+RLENK+ ++GGI+CL+ K + +LGG+V +LYEEW+MNQKYSG SR+S+R +Q     GPPPFEKL
Subjt:  RLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFEKL

Query:  QVGAP--RKFSQKEKSSYQQESSSKSNTPT-ADSGNIGSKSTTLQQSIDVKATNSVNSA---SGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLHK
        Q+ AP   +  Q  KSSY  ES  K + PT A S     + +   Q+++ K+ N+ N     S  EK EE PSSSETRPKEV E+VP+QNQAASQKLL K
Subjt:  QVGAP--RKFSQKEKSSYQQESSSKSNTPT-ADSGNIGSKSTTLQQSIDVKATNSVNSA---SGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLHK

Query:  MSQQDGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSHIIRLK-------------------IKMEY
        MS  + + RH   RK RGKG+ E+P V TL+E+E+ K+G     ++    TS DE LAWQLQ Q DLEDSH+ R+                     +ME 
Subjt:  MSQQDGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSHIIRLK-------------------IKMEY

Query:  NLIQPSVAVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVV
         LIQP +AVLISS+IA+R+YRRKSLDLSGAL+G IVM+ HFA+ YR+GA+LL FFF+SSKLTKVGEEKKR VDADFKEGGQRNWIQVL NSGIATVL+V+
Subjt:  NLIQPSVAVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVV

Query:  IWNVSGWQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYG
        IWN++GW+DKCLDSK+S ++T LIGGI+GHYSCCNGDTWSSE+G+LS+  PRLITTFKPVR+GTNG VT  GLLAA AAG VIGL FVL+GFFTT C+ G
Subjt:  IWNVSGWQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYG

Query:  TALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF
         A+KQLLVIPL+A+AGL GS+IDSLLGAT+QFSGFCTVRNKVVGKPG TVKKISGL+ILDNNAVNLVS+LLTTLLT+++C+YIF
Subjt:  TALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF

A0A3L6R1P4 Tudor domain-containing protein 3 isoform X11.7e-16848.91Show/hide
Query:  VLETLRARGWSFGD-LDEVRGVIMISSALADDPSSVVDSVELELLNMDLRSFGGKSLPE----PSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASS
        +L++L  RGW F D  DE    ++++S     P+   ++VE EL++MDLR+FGGKSLP+     +  ++ S + GPIVLQ+ S+RDI  SS+D   K   
Subjt:  VLETLRARGWSFGD-LDEVRGVIMISSALADDPSSVVDSVELELLNMDLRSFGGKSLPE----PSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASS

Query:  GHRLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFE
           LLRFGL+DG  E  AIE+S IP + E+I PGTK+RLENK P+  GI+CLS+K ++V+GG V +LYEEW+MNQKYSGLSR S+RLSQ  D  GPPPFE
Subjt:  GHRLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFE

Query:  KLQVGAPRKFSQKEKSSYQQESSSKSNTPTADSGNIGSKSTTLQQSIDVKATNSVNSASGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLHKMSQQ
        KL + A        +++  Q   ++    T D   + S    + +      +N V+  +   K+E K  + ++RPKEV E VPVQNQAA+QKLL KMSQ 
Subjt:  KLQVGAPRKFSQKEKSSYQQESSSKSNTPTADSGNIGSKSTTLQQSIDVKATNSVNSASGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLHKMSQQ

Query:  DGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKS-GTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSHI---------------IRLKIKMEYN------
            R   + + +GKGR ED  V+TL+E+E++K+ G+    ++    TS DE+LA QLQ Q DLED H+               I L   +++N      
Subjt:  DGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKS-GTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSHI---------------IRLKIKMEYN------

Query:  ---------LIQPSVAVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWI-------
                  I+ +VAV     IA RA RRKS++ S    G   M  H    YR+  +LLVFFFTSS++T+VGE +KR +D +FKEGGQRNW        
Subjt:  ---------LIQPSVAVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWI-------

Query:  -------------------------------------QVLSNSGIATVLAVVIWNVSGWQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSD
                                             QVLSNSGIA++  V+I +V+G  D+CLDSK+S LVT LIGG++GHY+CCNGDTWSSELGILS 
Subjt:  -------------------------------------QVLSNSGIATVLAVVIWNVSGWQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSD

Query:  ATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYGTALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGP
        A PR+ITTFK VRKGTNG VT  GLLAAAAAG  IGLAFVLIGF TT+CA     +QLLVIPLA  AGLCGS+IDS+LGATVQ+SGFC+VR KVVG  GP
Subjt:  ATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYGTALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGP

Query:  TVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF
        TV +ISG+NILDNN VN+VSV LTT+LTA++C YIF
Subjt:  TVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF

A0A6J1CVD2 tudor domain-containing protein 39.0e-20299.73Show/hide
Query:  METTTDNSVAVLETLRARGWSFGDLDEVRGVIMISSALADDPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGM
        METTTDNSVAVLETLRARGWSFGDLDEVRGVIMISSALADDPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGM
Subjt:  METTTDNSVAVLETLRARGWSFGDLDEVRGVIMISSALADDPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGM

Query:  LKASSGHRLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDG
        LKASSGHRLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDG
Subjt:  LKASSGHRLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDG

Query:  PPPFEKLQVGAPRKFSQKEKSSYQQESSSKSNTPTADSGNIGSKSTTLQQSIDVKATNSVNSASGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLH
        PPPFEKLQVGAPRKFSQKEKSSYQQESSSKSNTPTADSGNIGSKSTTLQQSIDVKATNSVNSASGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLH
Subjt:  PPPFEKLQVGAPRKFSQKEKSSYQQESSSKSNTPTADSGNIGSKSTTLQQSIDVKATNSVNSASGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLH

Query:  KMSQQDGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSHI
        KMSQQDGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSH+
Subjt:  KMSQQDGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSHI

A5BMK2 RMI1_N domain-containing protein8.8e-16557.09Show/hide
Query:  ITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFEKLQVGAP-RKFSQKE
        +TAIEYS IP++ +++ PGTKVRLE K+ ++ GI+CL+ K +TVLGG+V +LYEEW+MNQKYSG SR+S+RLSQ     GPPPFEKLQ+GAP R+ S++ 
Subjt:  ITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFEKLQVGAP-RKFSQKE

Query:  KSSYQQESSSKSNTPT-ADSGNIGSKSTTLQQSIDVKATNS---VNSASGVEKLEEKPSSSETRPKE--------------VVEAVPVQNQAASQKLLHK
        + S    S+SK+  PT A++  +G        ++ +KA N+   + +AS +E+ EEKPSSSE RPKE              V E+VPVQNQAA+QKLL K
Subjt:  KSSYQQESSSKSNTPT-ADSGNIGSKSTTLQQSIDVKATNS---VNSASGVEKLEEKPSSSETRPKE--------------VVEAVPVQNQAASQKLLHK

Query:  MSQQDGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSHIIRLKIKMEYNLIQPSVAVLISSIIALRA
        M+  + + RH   RKHRGKG+ E+  VYTL+E+E+RK+G     K+  +  S DE LAWQLQNQ D+ED +++R   K E   I+ S+          + 
Subjt:  MSQQDGNHRHFNNRKHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSHIIRLKIKMEYNLIQPSVAVLISSIIALRA

Query:  YRRKSLDLSGA--------LAGFIV--MSTHFAI-----SYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNV
         R +                  ++V    T F +     + RYGA+LL FF TSSKLTK GEEKKR+VDADFKEGGQRNW QVL NSGI+ VLA+++W +
Subjt:  YRRKSLDLSGA--------LAGFIV--MSTHFAI-----SYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNV

Query:  SGWQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSDATPRLITTFK------------PVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGF
        +GWQDKCLDSK+S+L+T LIGGI+GHYSCCNGDTWSSELGILSD+ PRLITTFK            PVRKGTNG VT  GLLAA AAGGVIGL FVLIGF
Subjt:  SGWQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSDATPRLITTFK------------PVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGF

Query:  FTTECAYGTALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF
        FTT+CA   ALKQLLVIPL+A+AGLCGS+IDSLLGAT+Q+SGFC+VRNKVVGKPGPTV+KISG++ILDNN VNLVS+LLT++LT+I+C+YIF
Subjt:  FTTECAYGTALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF

SwissProt top hitse value%identityAlignment
Q0P4L9 Transmembrane protein 193.1e-4240.88Show/hide
Query:  VAVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNVSG
        V+VL   II     +++SLD SGAL G +V       +Y + + LL FFF SSKLTK   E K+  D+++KEGGQRNW+QV  N G+   LA++    +G
Subjt:  VAVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNVSG

Query:  WQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELG-ILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAY----GT
          +  +D       + +   +LG  +   GDTW+SE+G +LS ++PRLITT++ V  GTNG VT  GL+++   G  +G+A     +F T+  +      
Subjt:  WQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELG-ILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAY----GT

Query:  ALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLL
        A  Q  ++    MAGL GS+IDS LGA +Q+SG+     K+V  P    K I G  ILDNNAVNL S +L  LL
Subjt:  ALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLL

Q0WP96 Protein PGR1.0e-11473.65Show/hide
Query:  AVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNVSGW
        AV+ISS+IA R+Y+RKSLDLSG +AGF+VM+ HF   +RYGA+LLVFF TSSKLTKVGE+KKR VD +FKEGGQRNW+QVL NSGIA+VL V+   ++GW
Subjt:  AVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNVSGW

Query:  QDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYGTALKQLL
        +DKCLDSK S +VT LIGGI+GHY+CCNGDTWSSELG+LSDA PRLITTFKPV+KGTNG VT AGLLAA AAG  +GL F++ G FT  CA   ALKQLL
Subjt:  QDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYGTALKQLL

Query:  VIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF
        VIPL+A+AGLCGS+IDS+LGAT+QFSGFC+VRNKVVGKPGPTVKKISG++ILDNN VN VS+LLT+ LT+I+ +YIF
Subjt:  VIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF

Q6IR76 Transmembrane protein 193.7e-4341.24Show/hide
Query:  VAVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNVSG
        V++L   II     +++SLD SGAL G +V       +Y + + LL FFF SSKLTK   E K+  D+++KEGGQRNW+QV  N G+   LA++    +G
Subjt:  VAVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNVSG

Query:  WQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELG-ILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAY----GT
          +  +D       + +   +LG  SC  GDTW+SE+G +LS + PRLITT++ V  GTNG VT  GL+++   G  +G+A     +F T+  +      
Subjt:  WQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELG-ILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAY----GT

Query:  ALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLL
        A  Q  ++    MAGL GS+IDS LGA +Q+SG+     K+V  P    K I G  ILDNNAVNL S +L  LL
Subjt:  ALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLL

Q6P726 Transmembrane protein 191.6e-4341.03Show/hide
Query:  AVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNVSGW
        +V++S +I+   +++KSLD SGAL G +V       ++ +   LL+FF TSSKLTK   E K+ +D+++KEGGQRNW+QV  N G+ T LA++    +G 
Subjt:  AVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNVSGW

Query:  QDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELG-ILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYGTAL---
         +  +D       + +   +L   +   GDTW+SE+  +LS ++PRLITT++ V  GTNG VT  GL+++   G  +GLA     +F T+  +   L   
Subjt:  QDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELG-ILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYGTAL---

Query:  -KQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLL
          Q  +I    +AGL GS++DS LGAT+QFSG       VV  P    K ISG  ILDNNAVNL S +L  LL
Subjt:  -KQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLL

Q96HH6 Transmembrane protein 192.9e-4038.66Show/hide
Query:  AVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNVSGW
        +V++  +I     ++KSLD SGAL G +V       ++ +   LL+FF +SSKLTK   E K+ +D+++KEGGQRNW+QV  N  + T LA++    +G 
Subjt:  AVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNVSGW

Query:  QDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELG-ILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYGTALKQL
         +  +D       + +   +L   +C  GDTW+SE+G +LS ++PRLITT++ V  GTNG VT  GL+++   G  +G+A+ L            +  Q 
Subjt:  QDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELG-ILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYGTALKQL

Query:  LVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLL
         +I    +AGL GS++DS LGAT+Q++G       VV  P    + I+G  ILDNNAVNL S +L  LL
Subjt:  LVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLL

Arabidopsis top hitse value%identityAlignment
AT5G19930.1 Protein of unknown function DUF92, transmembrane7.4e-11673.65Show/hide
Query:  AVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNVSGW
        AV+ISS+IA R+Y+RKSLDLSG +AGF+VM+ HF   +RYGA+LLVFF TSSKLTKVGE+KKR VD +FKEGGQRNW+QVL NSGIA+VL V+   ++GW
Subjt:  AVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNVSGW

Query:  QDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYGTALKQLL
        +DKCLDSK S +VT LIGGI+GHY+CCNGDTWSSELG+LSDA PRLITTFKPV+KGTNG VT AGLLAA AAG  +GL F++ G FT  CA   ALKQLL
Subjt:  QDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYGTALKQLL

Query:  VIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF
        VIPL+A+AGLCGS+IDS+LGAT+QFSGFC+VRNKVVGKPGPTVKKISG++ILDNN VN VS+LLT+ LT+I+ +YIF
Subjt:  VIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF

AT5G19930.2 Protein of unknown function DUF92, transmembrane7.4e-11673.65Show/hide
Query:  AVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNVSGW
        AV+ISS+IA R+Y+RKSLDLSG +AGF+VM+ HF   +RYGA+LLVFF TSSKLTKVGE+KKR VD +FKEGGQRNW+QVL NSGIA+VL V+   ++GW
Subjt:  AVLISSIIALRAYRRKSLDLSGALAGFIVMSTHFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNVSGW

Query:  QDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYGTALKQLL
        +DKCLDSK S +VT LIGGI+GHY+CCNGDTWSSELG+LSDA PRLITTFKPV+KGTNG VT AGLLAA AAG  +GL F++ G FT  CA   ALKQLL
Subjt:  QDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSELGILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYGTALKQLL

Query:  VIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF
        VIPL+A+AGLCGS+IDS+LGAT+QFSGFC+VRNKVVGKPGPTVKKISG++ILDNN VN VS+LLT+ LT+I+ +YIF
Subjt:  VIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGPTVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF

AT5G19950.1 Domain of unknown function (DUF1767)3.1e-7744.82Show/hide
Query:  VLETLRARGWSFGDLDEVRGVIMISSAL---ADDPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASSGH
        V+  L +RGW F D++ ++ ++   S+L    +   ++V+SVE ELLNMD++  GGKSLP+P+ LR+ S + GP VLQIS VRD++RSS +  + +S+G 
Subjt:  VLETLRARGWSFGDLDEVRGVIMISSAL---ADDPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASSGH

Query:  RLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFEKL
        R+L+F L+DG +EI+A+EYSHIP++  D+ PGTKVRLENK+ +  G+VCL+ K +TVLGG V +L EEW+M +KY+ L+R+  + S+ G  DGPPPFE+L
Subjt:  RLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFEKL

Query:  QVGAPRKFSQKEKSSYQQESSSKSNTPTADSGNI----GSKSTTLQQSIDVKATN---SVNSASGVEKL------EEKPSSSETRPKEVVEAVPVQNQAA
        ++          K      ++S++N P A   ++    G K  + +   + + TN     N A    K+      +EK SSS+TRPK+VVEAVP+QNQAA
Subjt:  QVGAPRKFSQKEKSSYQQESSSKSNTPTADSGNI----GSKSTTLQQSIDVKATN---SVNSASGVEKL------EEKPSSSETRPKEVVEAVPVQNQAA

Query:  SQKLLHKMSQQDGNHRHFNNRKHRGKGR-------MEDPVVYTLEEYERRKSGTNQIP-KNASSYTSHDEQLAWQLQNQFDLEDSH
        +Q LL KM     N R +  R+ RG+GR        ED  V+TL+E+E+R +G   +P  N  S T+ DE LAWQLQNQFDLEDS+
Subjt:  SQKLLHKMSQQDGNHRHFNNRKHRGKGR-------MEDPVVYTLEEYERRKSGTNQIP-KNASSYTSHDEQLAWQLQNQFDLEDSH

AT5G19950.2 Domain of unknown function (DUF1767)3.1e-7744.82Show/hide
Query:  VLETLRARGWSFGDLDEVRGVIMISSAL---ADDPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASSGH
        V+  L +RGW F D++ ++ ++   S+L    +   ++V+SVE ELLNMD++  GGKSLP+P+ LR+ S + GP VLQIS VRD++RSS +  + +S+G 
Subjt:  VLETLRARGWSFGDLDEVRGVIMISSAL---ADDPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASSGH

Query:  RLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFEKL
        R+L+F L+DG +EI+A+EYSHIP++  D+ PGTKVRLENK+ +  G+VCL+ K +TVLGG V +L EEW+M +KY+ L+R+  + S+ G  DGPPPFE+L
Subjt:  RLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFEKL

Query:  QVGAPRKFSQKEKSSYQQESSSKSNTPTADSGNI----GSKSTTLQQSIDVKATN---SVNSASGVEKL------EEKPSSSETRPKEVVEAVPVQNQAA
        ++          K      ++S++N P A   ++    G K  + +   + + TN     N A    K+      +EK SSS+TRPK+VVEAVP+QNQAA
Subjt:  QVGAPRKFSQKEKSSYQQESSSKSNTPTADSGNI----GSKSTTLQQSIDVKATN---SVNSASGVEKL------EEKPSSSETRPKEVVEAVPVQNQAA

Query:  SQKLLHKMSQQDGNHRHFNNRKHRGKGR-------MEDPVVYTLEEYERRKSGTNQIP-KNASSYTSHDEQLAWQLQNQFDLEDSH
        +Q LL KM     N R +  R+ RG+GR        ED  V+TL+E+E+R +G   +P  N  S T+ DE LAWQLQNQFDLEDS+
Subjt:  SQKLLHKMSQQDGNHRHFNNRKHRGKGR-------MEDPVVYTLEEYERRKSGTNQIP-KNASSYTSHDEQLAWQLQNQFDLEDSH

AT5G19950.3 Domain of unknown function (DUF1767)3.1e-7744.82Show/hide
Query:  VLETLRARGWSFGDLDEVRGVIMISSAL---ADDPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASSGH
        V+  L +RGW F D++ ++ ++   S+L    +   ++V+SVE ELLNMD++  GGKSLP+P+ LR+ S + GP VLQIS VRD++RSS +  + +S+G 
Subjt:  VLETLRARGWSFGDLDEVRGVIMISSAL---ADDPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASSGH

Query:  RLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFEKL
        R+L+F L+DG +EI+A+EYSHIP++  D+ PGTKVRLENK+ +  G+VCL+ K +TVLGG V +L EEW+M +KY+ L+R+  + S+ G  DGPPPFE+L
Subjt:  RLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFEKL

Query:  QVGAPRKFSQKEKSSYQQESSSKSNTPTADSGNI----GSKSTTLQQSIDVKATN---SVNSASGVEKL------EEKPSSSETRPKEVVEAVPVQNQAA
        ++          K      ++S++N P A   ++    G K  + +   + + TN     N A    K+      +EK SSS+TRPK+VVEAVP+QNQAA
Subjt:  QVGAPRKFSQKEKSSYQQESSSKSNTPTADSGNI----GSKSTTLQQSIDVKATN---SVNSASGVEKL------EEKPSSSETRPKEVVEAVPVQNQAA

Query:  SQKLLHKMSQQDGNHRHFNNRKHRGKGR-------MEDPVVYTLEEYERRKSGTNQIP-KNASSYTSHDEQLAWQLQNQFDLEDSH
        +Q LL KM     N R +  R+ RG+GR        ED  V+TL+E+E+R +G   +P  N  S T+ DE LAWQLQNQFDLEDS+
Subjt:  SQKLLHKMSQQDGNHRHFNNRKHRGKGR-------MEDPVVYTLEEYERRKSGTNQIP-KNASSYTSHDEQLAWQLQNQFDLEDSH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGACGACGACGGATAATTCCGTCGCTGTGCTTGAAACATTACGAGCGAGAGGATGGAGTTTCGGCGATTTGGACGAAGTTAGGGGCGTAATCATGATCAGT
AGCGCCTTGGCCGATGATCCGAGCTCGGTGGTTGATTCGGTGGAGTTGGAGCTCTTAAACATGGACCTCAGGTCATTTGGGGGCAAGTCATTGCCTGAACCTTCT
CTTCTTCGCAAGTCTTCTCGTATTCTCGGCCCTATTGTTCTCCAGATATCATCTGTAAGAGACATATCGCGAAGCAGCCTAGATGGTATGTTGAAGGCTTCTAGT
GGTCACCGTCTCCTACGGTTCGGTCTCAGTGATGGACACTCTGAGATTACTGCTATAGAGTATTCTCATATACCATCTGTTCTTGAAGATATTCCTCCTGGCACC
AAGGTCCGTCTGGAAAATAAATCTCCTGTGTATGGTGGAATTGTATGTTTAAGTTCAAAGGGGTTAACTGTGCTTGGAGGTATGGTTCCAACACTTTATGAAGAA
TGGAAAATGAACCAAAAATACTCTGGTTTATCTCGTGCATCTGTAAGGTTATCTCAAGGAGGAGATGTCGATGGTCCTCCTCCATTTGAGAAGTTGCAAGTTGGG
GCTCCACGTAAATTTAGTCAGAAGGAAAAATCTTCATATCAACAGGAGTCGTCCTCAAAGAGCAACACGCCGACTGCTGATTCAGGGAATATTGGAAGTAAATCA
ACTACATTGCAGCAAAGTATAGATGTGAAAGCTACCAATTCTGTCAATTCTGCTTCCGGTGTAGAAAAGCTTGAAGAAAAACCTAGTAGCTCTGAAACAAGACCA
AAGGAAGTTGTGGAAGCTGTTCCTGTTCAAAATCAGGCCGCCTCTCAGAAACTACTCCACAAAATGAGTCAACAAGATGGAAACCATCGGCATTTCAATAATAGA
AAGCACAGGGGAAAGGGCAGAATGGAAGATCCGGTGGTCTATACTCTAGAAGAATATGAAAGGAGAAAATCTGGGACAAATCAAATACCAAAAAATGCATCTTCC
TATACGAGTCATGATGAGCAACTTGCATGGCAGCTTCAAAATCAATTTGATTTGGAAGATTCTCATATAATCCGCTTGAAGATCAAAATGGAGTATAATCTGATT
CAGCCCTCTGTTGCGGTTCTAATCTCATCGATAATCGCTCTTAGGGCATATCGAAGGAAATCCTTGGACCTCTCTGGAGCTCTAGCGGGATTTATTGTTATGTCA
ACACACTTCGCCATCAGTTACAGATACGGAGCCGTGCTTTTGGTATTCTTTTTCACTTCCTCTAAGCTTACCAAGGTTGGGGAAGAGAAGAAACGAGTCGTTGAT
GCCGATTTTAAGGAAGGTGGTCAAAGAAATTGGATTCAAGTTCTTTCTAATAGTGGTATTGCTACAGTTTTGGCTGTGGTTATTTGGAACGTTTCAGGATGGCAA
GATAAATGCCTGGACTCTAAAGACTCGGCTCTTGTCACTGGCCTCATTGGCGGGATTCTTGGGCACTACTCCTGCTGCAATGGGGACACGTGGTCTTCTGAGCTT
GGAATTCTTAGTGATGCAACACCTCGATTGATCACAACCTTCAAGCCTGTTCGTAAGGGTACAAATGGTGCTGTTACAAATGCAGGGCTCCTTGCAGCTGCAGCT
GCAGGTGGTGTCATAGGATTGGCGTTTGTTCTCATCGGTTTTTTCACTACAGAATGTGCTTATGGCACAGCACTGAAACAGCTATTGGTAATTCCCCTGGCAGCC
ATGGCTGGACTTTGTGGAAGTGTCATAGACTCTCTATTGGGAGCAACAGTGCAATTCAGTGGATTCTGCACTGTTCGTAATAAGGTCGTTGGAAAACCAGGACCA
ACAGTAAAAAAGATATCAGGTCTCAACATTCTTGACAACAATGCTGTCAACCTTGTCTCAGTATTATTAACCACACTGCTCACTGCAATTTCATGCATCTACATT
TTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGACGACGACGGATAATTCCGTCGCTGTGCTTGAAACATTACGAGCGAGAGGATGGAGTTTCGGCGATTTGGACGAAGTTAGGGGCGTAATCATGATCAGT
AGCGCCTTGGCCGATGATCCGAGCTCGGTGGTTGATTCGGTGGAGTTGGAGCTCTTAAACATGGACCTCAGGTCATTTGGGGGCAAGTCATTGCCTGAACCTTCT
CTTCTTCGCAAGTCTTCTCGTATTCTCGGCCCTATTGTTCTCCAGATATCATCTGTAAGAGACATATCGCGAAGCAGCCTAGATGGTATGTTGAAGGCTTCTAGT
GGTCACCGTCTCCTACGGTTCGGTCTCAGTGATGGACACTCTGAGATTACTGCTATAGAGTATTCTCATATACCATCTGTTCTTGAAGATATTCCTCCTGGCACC
AAGGTCCGTCTGGAAAATAAATCTCCTGTGTATGGTGGAATTGTATGTTTAAGTTCAAAGGGGTTAACTGTGCTTGGAGGTATGGTTCCAACACTTTATGAAGAA
TGGAAAATGAACCAAAAATACTCTGGTTTATCTCGTGCATCTGTAAGGTTATCTCAAGGAGGAGATGTCGATGGTCCTCCTCCATTTGAGAAGTTGCAAGTTGGG
GCTCCACGTAAATTTAGTCAGAAGGAAAAATCTTCATATCAACAGGAGTCGTCCTCAAAGAGCAACACGCCGACTGCTGATTCAGGGAATATTGGAAGTAAATCA
ACTACATTGCAGCAAAGTATAGATGTGAAAGCTACCAATTCTGTCAATTCTGCTTCCGGTGTAGAAAAGCTTGAAGAAAAACCTAGTAGCTCTGAAACAAGACCA
AAGGAAGTTGTGGAAGCTGTTCCTGTTCAAAATCAGGCCGCCTCTCAGAAACTACTCCACAAAATGAGTCAACAAGATGGAAACCATCGGCATTTCAATAATAGA
AAGCACAGGGGAAAGGGCAGAATGGAAGATCCGGTGGTCTATACTCTAGAAGAATATGAAAGGAGAAAATCTGGGACAAATCAAATACCAAAAAATGCATCTTCC
TATACGAGTCATGATGAGCAACTTGCATGGCAGCTTCAAAATCAATTTGATTTGGAAGATTCTCATATAATCCGCTTGAAGATCAAAATGGAGTATAATCTGATT
CAGCCCTCTGTTGCGGTTCTAATCTCATCGATAATCGCTCTTAGGGCATATCGAAGGAAATCCTTGGACCTCTCTGGAGCTCTAGCGGGATTTATTGTTATGTCA
ACACACTTCGCCATCAGTTACAGATACGGAGCCGTGCTTTTGGTATTCTTTTTCACTTCCTCTAAGCTTACCAAGGTTGGGGAAGAGAAGAAACGAGTCGTTGAT
GCCGATTTTAAGGAAGGTGGTCAAAGAAATTGGATTCAAGTTCTTTCTAATAGTGGTATTGCTACAGTTTTGGCTGTGGTTATTTGGAACGTTTCAGGATGGCAA
GATAAATGCCTGGACTCTAAAGACTCGGCTCTTGTCACTGGCCTCATTGGCGGGATTCTTGGGCACTACTCCTGCTGCAATGGGGACACGTGGTCTTCTGAGCTT
GGAATTCTTAGTGATGCAACACCTCGATTGATCACAACCTTCAAGCCTGTTCGTAAGGGTACAAATGGTGCTGTTACAAATGCAGGGCTCCTTGCAGCTGCAGCT
GCAGGTGGTGTCATAGGATTGGCGTTTGTTCTCATCGGTTTTTTCACTACAGAATGTGCTTATGGCACAGCACTGAAACAGCTATTGGTAATTCCCCTGGCAGCC
ATGGCTGGACTTTGTGGAAGTGTCATAGACTCTCTATTGGGAGCAACAGTGCAATTCAGTGGATTCTGCACTGTTCGTAATAAGGTCGTTGGAAAACCAGGACCA
ACAGTAAAAAAGATATCAGGTCTCAACATTCTTGACAACAATGCTGTCAACCTTGTCTCAGTATTATTAACCACACTGCTCACTGCAATTTCATGCATCTACATT
TTTTGA
Protein sequenceShow/hide protein sequence
METTTDNSVAVLETLRARGWSFGDLDEVRGVIMISSALADDPSSVVDSVELELLNMDLRSFGGKSLPEPSLLRKSSRILGPIVLQISSVRDISRSSLDGMLKASS
GHRLLRFGLSDGHSEITAIEYSHIPSVLEDIPPGTKVRLENKSPVYGGIVCLSSKGLTVLGGMVPTLYEEWKMNQKYSGLSRASVRLSQGGDVDGPPPFEKLQVG
APRKFSQKEKSSYQQESSSKSNTPTADSGNIGSKSTTLQQSIDVKATNSVNSASGVEKLEEKPSSSETRPKEVVEAVPVQNQAASQKLLHKMSQQDGNHRHFNNR
KHRGKGRMEDPVVYTLEEYERRKSGTNQIPKNASSYTSHDEQLAWQLQNQFDLEDSHIIRLKIKMEYNLIQPSVAVLISSIIALRAYRRKSLDLSGALAGFIVMS
THFAISYRYGAVLLVFFFTSSKLTKVGEEKKRVVDADFKEGGQRNWIQVLSNSGIATVLAVVIWNVSGWQDKCLDSKDSALVTGLIGGILGHYSCCNGDTWSSEL
GILSDATPRLITTFKPVRKGTNGAVTNAGLLAAAAAGGVIGLAFVLIGFFTTECAYGTALKQLLVIPLAAMAGLCGSVIDSLLGATVQFSGFCTVRNKVVGKPGP
TVKKISGLNILDNNAVNLVSVLLTTLLTAISCIYIF