CuGenDBv2

Lag0033946 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0033946
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Nuclear localized protein
Genome location: chr3:3175374..3178544
RNA-Seq Expression: Lag0033946
Synteny: Lag0033946
Gene Ontology terms: GO:0000725 - recombinational repair (biological process)
InterPro domains: IPR028045 - Homologous recombination OB-fold protein


Homology
GenBank top hits (e value, % identity, alignment)
XP_004137555.1 uncharacterized protein C17orf53 homolog isoform X1 [Cucumis sativus] (e value: 2.0e-175; identity: 74.08%)
Query:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLPSQS----NNRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL
        MEPWEALDLDYSDVHSLLRPLKRHRSPQPLS    P++A+TSTLSLPLLE+CS  PSQS    +N QS  + S Q+ ICRSQR+STELEA  PS AS R+
Subjt:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLPSQS----NNRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL

Query:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKS
        IPGPAGAVQ AMQRRTRGD+   VGDEEPVPTQEYIRRV+ENGDEEDDDFN + W+CALDFVRG+GAM+GNGAVS TPL SIKNGF  EKV  V AIIKS
Subjt:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKS

Query:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG
        CTSNGLGGMMV LKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTR VHVLNVT +NV+KVISKD GP  + N+PTAIRQ DSI G+ HG
Subjt:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG

Query:  EVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHT
         VHMPQ N DVSRE+TQNIMNNL+QNSKL+G GL DLQT  G AASS N KWN    ++Q+ +EK+G V+D+G+ KGT ++GCNT+ VD+DQ  GS E  
Subjt:  EVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHT

Query:  NHCTDTVLPSQAKENDAAF-TTQVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD
        NH   T     AKEN AA  T Q+PNN E ETIN MKKTV RTQ PLLPQWTDEQLDELFVFD
Subjt:  NHCTDTVLPSQAKENDAAF-TTQVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD

XP_022921729.1 uncharacterized protein C17orf53 homolog, partial [Cucurbita moschata] (e value: 3.9e-174; identity: 73.12%)
Query:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLP----SQSNNRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL
        MEPWEALDLDYSDVHSLLRPLKRHRSPQPLS      AA TSTLSLPLLE+CSR P    SQS N    S+FSSQ+PICR+QR+STELE+P PS AS R+
Subjt:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLP----SQSNNRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL

Query:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKS
        IPGPAGAVQAAMQRRTRG+  FYVGDEEP+PTQEYIRRV+ENGD+EDDDFNGNPW+CALDFVRGL AMDG G ++ TPL+SIKN FNAEKVALV AIIKS
Subjt:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKS

Query:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG
        CTSN LGGMMV LKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTR VHVLNVT++NV+KVISKD GPP + N+PTAI +PDS   E+HG
Subjt:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG

Query:  EVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHT
        EVH+PQ NFDVSRE TQNIM NLRQNSKL+  G+GDLQT  GNAASS N K NG N ++Q         +DL   KGTS++GCNT++VD+DQ+TG  E  
Subjt:  EVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHT

Query:  NH--CTDTVLPSQAKENDAAF-TTQVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD
        NH   TD+   SQAKEN AA  T Q PNN EAE I        RTQ PLLPQWT+EQLDELF FD
Subjt:  NH--CTDTVLPSQAKENDAAF-TTQVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD

XP_022988449.1 uncharacterized protein LOC111485693, partial [Cucurbita maxima] (e value: 1.2e-175; identity: 73.98%)
Query:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLPSQSN----NRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL
        MEPWEALDLDYSDVHSLLRPLKRHRSPQPLS      AA TSTLSLPLLE+CS  PSQ      N  S S+FSSQ+PICRSQR+S+ELE+P PS AS R+
Subjt:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLPSQSN----NRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL

Query:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKS
        IPGPAGAVQAAMQRRTRGD  FY GDEEPVPTQEYIRRV+ENGDEED DFNGNPW+CALDFVRGLGAMDG G ++ TPL+SIKN FNAEKVALV AIIKS
Subjt:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKS

Query:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG
        CTSN LGGMMV LKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFS TR VHVLNVT++NV+KVISKD GPP + N+PTAI +PDS   E HG
Subjt:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG

Query:  EVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHT
        EVH+ Q NFDVSRE TQNIMNNLRQNSKL+  GLGDLQT  GNAASS N K NG N ++Q         +DL   KGTS++GCNT++VD+DQ+TG  E  
Subjt:  EVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHT

Query:  NH--CTDTVLPSQAKENDAAF-TTQVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD
        NH   TD+   SQAKEN AA  T + PNN EAE IN M KT  RTQ PLLPQWT+EQLDELF FD
Subjt:  NH--CTDTVLPSQAKENDAAF-TTQVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD

XP_023516131.1 uncharacterized protein C17orf53 [Cucurbita pepo subsp. pepo] (e value: 3.3e-173; identity: 72.9%)
Query:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLPSQSNNRQSH----SNFSSQSPICRSQRVSTELEAPSPSAASARL
        MEPWEALDLDYSDVHSLLRPLKRHRSP+ LS      AA TSTLSLPLLE+CS  PSQ  ++  +    S+FSSQ+PICRSQR+STELE+P PS AS R+
Subjt:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLPSQSNNRQSH----SNFSSQSPICRSQRVSTELEAPSPSAASARL

Query:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKS
        IPGPAGAVQAAMQRRTRG+  FYVGDEEPVPTQEYIRRV+ENGDEED DFNGNPW+CALDFVRGLGAMDG G ++ TPL+SIKN FNAEKVALV AIIKS
Subjt:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKS

Query:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG
        CTSN LGGMMV LKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTR VHVLNVT++NV+KVISKD GPP + N+PTAI QPDS   E HG
Subjt:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG

Query:  EVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHT
        EVH+PQ N DVSRE TQNIMNNLRQNSKL+  GLGDLQT  GNAASS N K NG N ++Q         +DL   KGTS++GCNT++VD+DQ+TG  E  
Subjt:  EVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHT

Query:  NH--CTDTVLPSQAKENDAAF-TTQVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD
        NH   TD+   SQAKE  AA  T Q PNN EAE I        RTQ PLLPQWT+EQLDELF FD
Subjt:  NH--CTDTVLPSQAKENDAAF-TTQVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD

XP_038879213.1 uncharacterized protein LOC120071176 isoform X1 [Benincasa hispida] (e value: 3.1e-179; identity: 75.54%)
Query:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLPSQ----SNNRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL
        MEPWEALDLDYSDVHSLLRPLKRHRSPQPLS       AATS LSLPLLE+C R P Q    S+N QS SNF SQ+ ICRSQR+STE E P PS AS+R+
Subjt:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLPSQ----SNNRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL

Query:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGL-GAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIK
        IPGPAGAVQAAMQRRTRGDN  YVGDEEPVPTQEYIRRV+ENGDEEDDDFN +PW+CALDFVRGL GAMDGNGA+S TPL SIKNGFN EKV LV AIIK
Subjt:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGL-GAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIK

Query:  SCTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENH
        SCTSNGLGGMMV LKDPTGTIDASIHHRVI+EGNFGKD+SVGAVLILQKVAVFSPTRFVHVLNVT +N++KVISKD G   + N PTAIRQ D I G+  
Subjt:  SCTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENH

Query:  GEVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEH
        GE+HMPQ N DVSRETTQNIMNNLRQ +KL+G  LGDLQT  GNAASSSN K NG  R++ ++VEK+ AV+D+G  KGTS++G  T+ VD+DQETG  E 
Subjt:  GEVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEH

Query:  TNH--CTDTVLPSQAKENDAAF-TTQVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD
         NH   TD    SQAKEN AA  T QVPNN EAETIN MKKTV RTQ P+LPQWTDEQLDELFVFD
Subjt:  TNH--CTDTVLPSQAKENDAAF-TTQVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LSW2 Uncharacterized protein (e value: 9.9e-176; identity: 74.08%)
Query:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLPSQS----NNRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL
        MEPWEALDLDYSDVHSLLRPLKRHRSPQPLS    P++A+TSTLSLPLLE+CS  PSQS    +N QS  + S Q+ ICRSQR+STELEA  PS AS R+
Subjt:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLPSQS----NNRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL

Query:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKS
        IPGPAGAVQ AMQRRTRGD+   VGDEEPVPTQEYIRRV+ENGDEEDDDFN + W+CALDFVRG+GAM+GNGAVS TPL SIKNGF  EKV  V AIIKS
Subjt:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKS

Query:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG
        CTSNGLGGMMV LKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTR VHVLNVT +NV+KVISKD GP  + N+PTAIRQ DSI G+ HG
Subjt:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG

Query:  EVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHT
         VHMPQ N DVSRE+TQNIMNNL+QNSKL+G GL DLQT  G AASS N KWN    ++Q+ +EK+G V+D+G+ KGT ++GCNT+ VD+DQ  GS E  
Subjt:  EVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHT

Query:  NHCTDTVLPSQAKENDAAF-TTQVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD
        NH   T     AKEN AA  T Q+PNN E ETIN MKKTV RTQ PLLPQWTDEQLDELFVFD
Subjt:  NHCTDTVLPSQAKENDAAF-TTQVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD

A0A1S3BVT0 uncharacterized protein C17orf53 homolog isoform X2 (e value: 2.1e-170; identity: 72.14%)
Query:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLPSQS----NNRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL
        MEPWEALDLDYSDVHSLLRPLKRHRSPQPLS    P++ ATSTLSLPLLE+CS  PS+S    +N QS  + S Q+ +CRSQR+ST LEA  PS AS R+
Subjt:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLPSQS----NNRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL

Query:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKS
        IPGPAGAVQ AMQRRTRGD+   VGDEEPVPTQEYIRRVMENGDEEDDDFN +PW+CALDFVR +GAM+GNGAVS TPL SIKNGF  EKV LV AIIKS
Subjt:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKS

Query:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG
        CTSNGLGGMMV LKDPTGTIDASIHHRVISEG FGKDLSVGAVLILQKVAVFSPTR VHVLNVT +NV+KVISKD GP  + N+PT IR  D I G+ HG
Subjt:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG

Query:  EVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHT
        EVHM Q N DVSRE+TQNIMNNLRQ+SKL+   LGDL+T  G AASSSN   N    S+Q++VEK+G V+D+G+ K T ++GCN + VD+DQ  GS E  
Subjt:  EVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHT

Query:  NHCTDTVLPSQAKENDAAFTT-QVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD
        NH   T      KEN AA +T Q+PNN EAETIN MKKTV RTQ PLLPQWTDEQLDELF FD
Subjt:  NHCTDTVLPSQAKENDAAFTT-QVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD

A0A5A7USK6 Uncharacterized protein (e value: 4.0e-169; identity: 72.14%)
Query:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLPSQS----NNRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL
        MEPWEALDLDYSDVHSLLRPLKRHRSPQPLS    P++ ATSTLSLPLLE+CS  PS+S    +N QS  + S Q+ +CRSQR+ST LEA  PS AS R+
Subjt:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLPSQS----NNRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL

Query:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKS
        IPGPAGAVQ AMQRRTRGD+   VGDEEPVPTQEYIRRVMENGDEEDDDFN +PW+CALDFVR +GAM+GNGAVS TPL SIKNGF  EKV LV AIIKS
Subjt:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKS

Query:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG
        CTSNGLGGMMV LKDPTGTIDASIHHRVISEG FGKDLSVGAVLILQKVAVFSPTR VHVLNVT +NV+KVISKD GP  + N+PT IR  D I G+ HG
Subjt:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG

Query:  EVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHT
        EVHM Q N DVSRE+TQNIMNNLRQ+SK +   LGDL+T  G AASSSN   N    S+Q++VEK+G V+D+G+ K T ++GCN + VD+DQ  GS E  
Subjt:  EVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHT

Query:  NHCTDTVLPSQAKENDAAFTT-QVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD
        NH   T     AKEN AA +T Q+PNN EAETIN MKKTV RTQ PLLPQWTDEQLDELF FD
Subjt:  NHCTDTVLPSQAKENDAAFTT-QVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD

A0A6J1E1C3 uncharacterized protein C17orf53 homolog (e value: 1.9e-174; identity: 73.12%)
Query:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLP----SQSNNRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL
        MEPWEALDLDYSDVHSLLRPLKRHRSPQPLS      AA TSTLSLPLLE+CSR P    SQS N    S+FSSQ+PICR+QR+STELE+P PS AS R+
Subjt:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLP----SQSNNRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL

Query:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKS
        IPGPAGAVQAAMQRRTRG+  FYVGDEEP+PTQEYIRRV+ENGD+EDDDFNGNPW+CALDFVRGL AMDG G ++ TPL+SIKN FNAEKVALV AIIKS
Subjt:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKS

Query:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG
        CTSN LGGMMV LKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTR VHVLNVT++NV+KVISKD GPP + N+PTAI +PDS   E+HG
Subjt:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG

Query:  EVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHT
        EVH+PQ NFDVSRE TQNIM NLRQNSKL+  G+GDLQT  GNAASS N K NG N ++Q         +DL   KGTS++GCNT++VD+DQ+TG  E  
Subjt:  EVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHT

Query:  NH--CTDTVLPSQAKENDAAF-TTQVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD
        NH   TD+   SQAKEN AA  T Q PNN EAE I        RTQ PLLPQWT+EQLDELF FD
Subjt:  NH--CTDTVLPSQAKENDAAF-TTQVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD

A0A6J1JLK0 uncharacterized protein LOC111485693 (e value: 5.8e-176; identity: 73.98%)
Query:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLPSQSN----NRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL
        MEPWEALDLDYSDVHSLLRPLKRHRSPQPLS      AA TSTLSLPLLE+CS  PSQ      N  S S+FSSQ+PICRSQR+S+ELE+P PS AS R+
Subjt:  MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLPSQSN----NRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL

Query:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKS
        IPGPAGAVQAAMQRRTRGD  FY GDEEPVPTQEYIRRV+ENGDEED DFNGNPW+CALDFVRGLGAMDG G ++ TPL+SIKN FNAEKVALV AIIKS
Subjt:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKS

Query:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG
        CTSN LGGMMV LKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFS TR VHVLNVT++NV+KVISKD GPP + N+PTAI +PDS   E HG
Subjt:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG

Query:  EVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHT
        EVH+ Q NFDVSRE TQNIMNNLRQNSKL+  GLGDLQT  GNAASS N K NG N ++Q         +DL   KGTS++GCNT++VD+DQ+TG  E  
Subjt:  EVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHT

Query:  NH--CTDTVLPSQAKENDAAF-TTQVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD
        NH   TD+   SQAKEN AA  T + PNN EAE IN M KT  RTQ PLLPQWT+EQLDELF FD
Subjt:  NH--CTDTVLPSQAKENDAAF-TTQVPNNPEAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD

SwissProt top hits (e value, % identity, alignment)
Q32P12 Homologous recombination OB-fold protein (e value: 3.2e-06; identity: 26.95%)
Query:  SPQPLSATSAPAAAATSTLSLPLLESCSRLPSQSNNRQSHSNFSSQSPICRSQRVSTELEAPS-PS-AASARLIPGPAGAVQAAMQRRTRGDNCFYVGDE
        S  P+S+  +P +   +T S P+ +   + P  +N+       ++++P           + PS PS  A  R  PGPAG     +  +  G+N   +   
Subjt:  SPQPLSATSAPAAAATSTLSLPLLESCSRLPSQSNNRQSHSNFSSQSPICRSQRVSTELEAPS-PS-AASARLIPGPAGAVQAAMQRRTRGDNCFYVGDE

Query:  EP-VPTQEYIRR-----VMENGDEEDDDFNGNPWLCALDFVRGLGAMDGN--------GAVSGTPLTSIKNGFNAEKVALVAAIIKSCTSNGLGGMMVTL
         P  PT   + +        +    ++DF   PW   L     LG  +G+          V      ++K      KV  +A +IKS T + +   +V  
Subjt:  EP-VPTQEYIRR-----VMENGDEEDDDFNGNPWLCALDFVRGLGAMDGN--------GAVSGTPLTSIKNGFNAEKVALVAAIIKSCTSNGLGGMMVTL

Query:  KDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPP--TEPNNPTAIRQPDSIAGENHGEVHMPQKNFDV
        KDPTG +  ++ HRV+ E     +L  G+VL+L+++ VFSP+   H LNVT NN++ + S D G     EP  P     P  + G +HG +       DV
Subjt:  KDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPP--TEPNNPTAIRQPDSIAGENHGEVHMPQKNFDV

Query:  SRETTQNI
        + E T+ +
Subjt:  SRETTQNI

Q6GX86 Homologous recombination OB-fold protein (e value: 1.9e-06; identity: 26.96%)
Query:  HSLLRPLKRHRSPQ--PLSATSAPAAAATSTLSLPLLESCSRLPSQSNNRQSHSNFSSQSPICRSQRVSTELEAPS-PS-AASARLIPGPAGAVQAAMQR
        HS      R  SP   P+S+  +P +   ST S  + +   + P  +N+       ++++P           + PS PS  A  R  PGPAG     +  
Subjt:  HSLLRPLKRHRSPQ--PLSATSAPAAAATSTLSLPLLESCSRLPSQSNNRQSHSNFSSQSPICRSQRVSTELEAPS-PS-AASARLIPGPAGAVQAAMQR

Query:  RTRGDNCFYVGDEEP-VPTQEYIRR-----VMENGDEEDDDFNGNPWLCALDFVRGLGAMDGN--------GAVSGTPLTSIKNGFNAEKVALVAAIIKS
        +  G+N   +    P  PT   + +     V  +    ++DF   PW   L     LG  +G+          V      ++K      KV  +A +IKS
Subjt:  RTRGDNCFYVGDEEP-VPTQEYIRR-----VMENGDEEDDDFNGNPWLCALDFVRGLGAMDGN--------GAVSGTPLTSIKNGFNAEKVALVAAIIKS

Query:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG
         T + +   +V  KDPTG +  ++ HRV+ E     +L  G+VL+L+++ VFSP+   H LNVT NN++ + S D G       P  + +     G +HG
Subjt:  CTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHG

Query:  EVHMPQKNFDVSRETTQNI
         +       DV+ E TQ +
Subjt:  EVHMPQKNFDVSRETTQNI

Q8N3J3 Homologous recombination OB-fold protein (e value: 6.9e-09; identity: 26.58%)
Query:  ASARLIPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRR-----VMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNA--
        A  R  PGPAG +      R+  D    +      PT   + +     V  +    ++DF   PWL  +    GL   D +  +    +  +     A  
Subjt:  ASARLIPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRR-----VMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNA--

Query:  ----EKVALVAAIIKSCTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPT--EP
             KV  +A +IKS T + +   +V  KDPTG +  ++H  ++       +L  G+VL+L+++ VFSP+   H LNVT NN++ + S D G  +  +P
Subjt:  ----EKVALVAAIIKSCTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPT--EP

Query:  NNPTAIRQPDSIAGENHGEVHMPQKNFDVSRETTQNI
        + P     P       H     P++ F     T QN+
Subjt:  NNPTAIRQPDSIAGENHGEVHMPQKNFDVSRETTQNI

Arabidopsis top hits (e value, % identity, alignment)
AT1G48580.1 unknown protein (e value: 2.9e-58; identity: 35.61%)
Query:  MEPWEALDLDYSDVHSLLRPLKRH---RSPQPLSATSAPAAAATS-TLSLPLLESCSRLPSQSNNRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL
        ++ WEALDL  S++ S LRP KR    RS QP +    P A   S T   P L  CS          S   F  +S                    S  L
Subjt:  MEPWEALDLDYSDVHSLLRPLKRH---RSPQPLSATSAPAAAATS-TLSLPLLESCSRLPSQSNNRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARL

Query:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNA-EKVALVAAIIK
        IPGPAG VQ A++R+   D   +    EP+PTQE++R+  E  D ED DF+ +PW+  +D++R  G +   G   GTP++ IK   ++  KV  V AI+K
Subjt:  IPGPAGAVQAAMQRRTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNA-EKVALVAAIIK

Query:  SCTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENH
        +CT NGLG +MVTLKDPTGTIDAS+H +VISE  FG+D+ VGAV+IL++VAV +P+R    LN+T  N+ KVI+KD   P  PN     +    ++ +NH
Subjt:  SCTSNGLGGMMVTLKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENH

Query:  GEVH-MPQKN-------------------------FDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSN-------CKWNGNNRSQQTMVEKD
          V+ +P++N                         F V + TTQ IMNNLRQN+K   + L D++    N A  S         K +   R +QT++ K 
Subjt:  GEVH-MPQKN-------------------------FDVSRETTQNIMNNLRQNSKLKGKGLGDLQTDTGNAASSSN-------CKWNGNNRSQQTMVEKD

Query:  GAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHTNHCTDTVLPSQAKENDAAFTTQVPNNP-EAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD
         ++                +  D   ET  A+           SQ++ ++      V  NP E  T   + K+        LPQWTDEQL+ELF FD
Subjt:  GAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHTNHCTDTVLPSQAKENDAAFTTQVPNNP-EAETINRMKKTVIRTQGPLLPQWTDEQLDELFVFD


Sequences
CDS sequence
ATGGAACCATGGGAAGCTCTTGATCTCGATTACTCCGACGTTCACTCTCTTCTCCGCCCTTTGAAGCGCCACCGCAGCCCCCAACCCCTCTCCGCCACTTCCGCACCCGC
CGCCGCCGCTACTTCCACTCTCTCTCTGCCGCTTCTTGAATCCTGCTCTCGCCTCCCTTCCCAATCTAATAATCGCCAATCGCACTCGAACTTCTCTTCCCAGAGTCCAA
TTTGTCGATCCCAACGAGTTTCCACTGAACTTGAGGCCCCGTCTCCGTCCGCTGCGTCTGCGCGTCTCATTCCTGGCCCTGCCGGAGCGGTTCAAGCGGCGATGCAGCGT
AGAACCCGCGGTGATAATTGCTTCTATGTCGGCGATGAAGAACCCGTTCCTACGCAGGAGTATATAAGGAGGGTTATGGAAAATGGTGATGAAGAGGACGATGATTTTAA
TGGCAATCCGTGGCTTTGCGCTTTGGATTTTGTCCGCGGCCTAGGTGCGATGGATGGTAATGGAGCTGTGTCTGGAACTCCTTTGACTTCTATCAAGAACGGATTTAATG
CTGAGAAAGTTGCTCTGGTCGCTGCCATTATCAAATCTTGTACCTCAAATGGTCTGGGTGGTATGATGGTAACTTTGAAGGATCCAACAGGTACAATAGACGCTAGCATC
CACCATAGAGTCATTTCTGAAGGGAATTTTGGGAAGGACTTATCTGTCGGTGCAGTTTTAATATTGCAGAAGGTTGCTGTGTTTTCTCCGACACGTTTTGTACATGTGCT
CAATGTAACATCAAACAATGTCATCAAGGTTATCTCCAAGGACGGTGGACCTCCTACAGAGCCCAATAATCCTACAGCAATCAGACAGCCTGATTCTATAGCTGGAGAAA
ACCATGGAGAAGTACACATGCCGCAGAAGAATTTTGATGTGTCCCGTGAAACAACTCAAAATATCATGAACAATCTAAGGCAAAATTCTAAATTGAAAGGTAAAGGACTA
GGTGATCTACAAACAGACACAGGAAATGCTGCATCATCTAGCAATTGCAAATGGAACGGAAACAACAGAAGCCAACAGACTATGGTCGAGAAAGATGGGGCTGTGATGGA
TCTGGGTATGTTCAAAGGAACCTCAAATTTGGGTTGTAACACAATCGTTGTCGATCGAGATCAAGAAACAGGGTCGGCTGAGCATACCAACCATTGTACAGATACTGTCT
TACCGTCGCAGGCCAAAGAAAATGATGCTGCATTCACTACACAAGTTCCAAATAATCCAGAAGCTGAAACAATCAACAGAATGAAGAAGACAGTAATACGAACACAAGGA
CCATTACTCCCACAATGGACAGATGAGCAGTTGGATGAGCTCTTTGTATTTGACTGA
mRNA sequence
ATGGAACCATGGGAAGCTCTTGATCTCGATTACTCCGACGTTCACTCTCTTCTCCGCCCTTTGAAGCGCCACCGCAGCCCCCAACCCCTCTCCGCCACTTCCGCACCCGC
CGCCGCCGCTACTTCCACTCTCTCTCTGCCGCTTCTTGAATCCTGCTCTCGCCTCCCTTCCCAATCTAATAATCGCCAATCGCACTCGAACTTCTCTTCCCAGAGTCCAA
TTTGTCGATCCCAACGAGTTTCCACTGAACTTGAGGCCCCGTCTCCGTCCGCTGCGTCTGCGCGTCTCATTCCTGGCCCTGCCGGAGCGGTTCAAGCGGCGATGCAGCGT
AGAACCCGCGGTGATAATTGCTTCTATGTCGGCGATGAAGAACCCGTTCCTACGCAGGAGTATATAAGGAGGGTTATGGAAAATGGTGATGAAGAGGACGATGATTTTAA
TGGCAATCCGTGGCTTTGCGCTTTGGATTTTGTCCGCGGCCTAGGTGCGATGGATGGTAATGGAGCTGTGTCTGGAACTCCTTTGACTTCTATCAAGAACGGATTTAATG
CTGAGAAAGTTGCTCTGGTCGCTGCCATTATCAAATCTTGTACCTCAAATGGTCTGGGTGGTATGATGGTAACTTTGAAGGATCCAACAGGTACAATAGACGCTAGCATC
CACCATAGAGTCATTTCTGAAGGGAATTTTGGGAAGGACTTATCTGTCGGTGCAGTTTTAATATTGCAGAAGGTTGCTGTGTTTTCTCCGACACGTTTTGTACATGTGCT
CAATGTAACATCAAACAATGTCATCAAGGTTATCTCCAAGGACGGTGGACCTCCTACAGAGCCCAATAATCCTACAGCAATCAGACAGCCTGATTCTATAGCTGGAGAAA
ACCATGGAGAAGTACACATGCCGCAGAAGAATTTTGATGTGTCCCGTGAAACAACTCAAAATATCATGAACAATCTAAGGCAAAATTCTAAATTGAAAGGTAAAGGACTA
GGTGATCTACAAACAGACACAGGAAATGCTGCATCATCTAGCAATTGCAAATGGAACGGAAACAACAGAAGCCAACAGACTATGGTCGAGAAAGATGGGGCTGTGATGGA
TCTGGGTATGTTCAAAGGAACCTCAAATTTGGGTTGTAACACAATCGTTGTCGATCGAGATCAAGAAACAGGGTCGGCTGAGCATACCAACCATTGTACAGATACTGTCT
TACCGTCGCAGGCCAAAGAAAATGATGCTGCATTCACTACACAAGTTCCAAATAATCCAGAAGCTGAAACAATCAACAGAATGAAGAAGACAGTAATACGAACACAAGGA
CCATTACTCCCACAATGGACAGATGAGCAGTTGGATGAGCTCTTTGTATTTGACTGA
Protein sequence
MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSATSAPAAAATSTLSLPLLESCSRLPSQSNNRQSHSNFSSQSPICRSQRVSTELEAPSPSAASARLIPGPAGAVQAAMQR
RTRGDNCFYVGDEEPVPTQEYIRRVMENGDEEDDDFNGNPWLCALDFVRGLGAMDGNGAVSGTPLTSIKNGFNAEKVALVAAIIKSCTSNGLGGMMVTLKDPTGTIDASI
HHRVISEGNFGKDLSVGAVLILQKVAVFSPTRFVHVLNVTSNNVIKVISKDGGPPTEPNNPTAIRQPDSIAGENHGEVHMPQKNFDVSRETTQNIMNNLRQNSKLKGKGL
GDLQTDTGNAASSSNCKWNGNNRSQQTMVEKDGAVMDLGMFKGTSNLGCNTIVVDRDQETGSAEHTNHCTDTVLPSQAKENDAAFTTQVPNNPEAETINRMKKTVIRTQG
PLLPQWTDEQLDELFVFD