; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G038060 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G038060
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionDNA repair protein RAD51 homolog 4
Genome locationCicolChr02:33494608..33498241
RNA-Seq ExpressionCcUC02G038060
SyntenyCcUC02G038060
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0016444 - somatic cell DNA recombination (biological process)
GO:1900426 - positive regulation of defense response to bacterium (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0008094 - DNA-dependent ATPase activity (molecular function)
InterPro domainsIPR003593 - AAA+ ATPase domain
IPR013632 - DNA recombination and repair protein Rad51-like, C-terminal
IPR020588 - DNA recombination and repair protein RecA-like, ATP-binding domain
IPR027417 - P-loop containing nucleoside triphosphate hydrolase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008448361.1 PREDICTED: DNA repair protein RAD51 homolog 4 isoform X2 [Cucumis melo]1.4e-16090.37Show/hide
Query:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL
        MAPLK LE++ P IDSNF TFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIID  ERQPW+NGLELLEDA ENKH+LS GFE VDVL
Subjt:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL

Query:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE
        LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVA NYNAEVFY+DTGNSFSPQR+SGFVNWKPGTALD +EQSMLQQVMSSISCHSVFDIFA+FDVLH+LE
Subjt:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE

Query:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSRGAGSNVC
        FNLRSQ CK DRRVQ LI+DSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSR AGSNV 
Subjt:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSRGAGSNVC

Query:  QASILKHSSMPSGMTVRFVIYE
        QASILKHSSM SG   RFV+YE
Subjt:  QASILKHSSMPSGMTVRFVIYE

XP_031743457.1 DNA repair protein RAD51 homolog 4 isoform X2 [Cucumis sativus]2.9e-16187.02Show/hide
Query:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL
        MAPLKSLE++CP IDSNF TFCASHGIFTVEDFLI DLYVLAAFAEQQPASEKLKQGITQILSIIDA ERQPW+NGLELLEDA ENK++LS GFEGVDVL
Subjt:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL

Query:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALD-----------------RTEQSMLQQVMSSISC
        LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVA NY AEVFY+DTGNSFSPQR+SGFVNWKPGTALD                 ++EQSMLQ+VM+SISC
Subjt:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALD-----------------RTEQSMLQQVMSSISC

Query:  HSVFDIFAMFDVLHQLEFNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKS
        HSVF+IFA+FDVLHQLEFNLRSQ CK DRRVQLLI+DSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKS
Subjt:  HSVFDIFAMFDVLHQLEFNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKS

Query:  VPHVRLQLSRGAGSNVCQASILKHSSMPSGMTVRFVIYE
        VPHVRLQLSR AGSNVCQASILKHSSM SGMT RFVIYE
Subjt:  VPHVRLQLSRGAGSNVCQASILKHSSMPSGMTVRFVIYE

XP_031743458.1 DNA repair protein RAD51 homolog 4 isoform X3 [Cucumis sativus]1.1e-16087.02Show/hide
Query:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL
        MAPLKSLE++CP IDSNF TFCASHGIFTVEDFLI DLYVLAAFAEQQPASEKLKQGITQILSIIDA ERQPW+NGLELLEDA ENK++LS GFEGVDVL
Subjt:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL

Query:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE
        LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVA NY AEVFY+DTGNSFSPQR+SGFVNWKPGTALD +EQSMLQ+VM+SISCHSVF+IFA+FDVLHQLE
Subjt:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE

Query:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVL-----------------VTNHTVGGDRGTSKPALGESWKS
        FNLRSQ CK DRRVQLLI+DSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVL                 VTNHTVGGDRGTSKPALGESWKS
Subjt:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVL-----------------VTNHTVGGDRGTSKPALGESWKS

Query:  VPHVRLQLSRGAGSNVCQASILKHSSMPSGMTVRFVIYE
        VPHVRLQLSR AGSNVCQASILKHSSM SGMT RFVIYE
Subjt:  VPHVRLQLSRGAGSNVCQASILKHSSMPSGMTVRFVIYE

XP_038901757.1 DNA repair protein RAD51 homolog 4 isoform X1 [Benincasa hispida]2.5e-16591.69Show/hide
Query:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL
        MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLY LAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDA ENKH+LSTGFEG+DVL
Subjt:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL

Query:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE
        LGGGLREGQLTEIVGPSSSGKTQVCLRAASNV+TNY A+VFYLD+GNSFSPQR+SGFVNWKPGTALDRTEQSMLQQV+SSISCHSVF+IF MFD LHQLE
Subjt:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE

Query:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSS---QGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSRGAGS
        FNLRSQMCK  +RVQLLIVDSISSL+TPILGGS S   QGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSR AGS
Subjt:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSS---QGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSRGAGS

Query:  NVCQASILKHSSMPSGMTVRFVIYE
        NVCQASILKHSSM SG   +FVIYE
Subjt:  NVCQASILKHSSMPSGMTVRFVIYE

XP_038901758.1 DNA repair protein RAD51 homolog 4 isoform X2 [Benincasa hispida]6.0e-16792.55Show/hide
Query:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL
        MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLY LAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDA ENKH+LSTGFEG+DVL
Subjt:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL

Query:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE
        LGGGLREGQLTEIVGPSSSGKTQVCLRAASNV+TNY A+VFYLD+GNSFSPQR+SGFVNWKPGTALDRTEQSMLQQV+SSISCHSVF+IF MFD LHQLE
Subjt:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE

Query:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSRGAGSNVC
        FNLRSQMCK  +RVQLLIVDSISSL+TPILGGS SQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSR AGSNVC
Subjt:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSRGAGSNVC

Query:  QASILKHSSMPSGMTVRFVIYE
        QASILKHSSM SG   +FVIYE
Subjt:  QASILKHSSMPSGMTVRFVIYE

TrEMBL top hitse value%identityAlignment
A0A0A0KCU0 RECA_2 domain-containing protein3.0e-16491.61Show/hide
Query:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL
        MAPLKSLE++CP IDSNF TFCASHGIFTVEDFLI DLYVLAAFAEQQPASEKLKQGITQILSIIDA ERQPW+NGLELLEDA ENK++LS GFEGVDVL
Subjt:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL

Query:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE
        LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVA NY AEVFY+DTGNSFSPQR+SGFVNWKPGTALD +EQSMLQ+VM+SISCHSVF+IFA+FDVLHQLE
Subjt:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE

Query:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSRGAGSNVC
        FNLRSQ CK DRRVQLLI+DSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSR AGSNVC
Subjt:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSRGAGSNVC

Query:  QASILKHSSMPSGMTVRFVIYE
        QASILKHSSM SGMT RFVIYE
Subjt:  QASILKHSSMPSGMTVRFVIYE

A0A1S3BIX1 DNA repair protein RAD51 homolog 4 isoform X11.8e-15683.38Show/hide
Query:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLK---------------------------QGITQILSIIDAAERQPW
        MAPLK LE++ P IDSNF TFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLK                           QGITQILSIID  ERQPW
Subjt:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLK---------------------------QGITQILSIIDAAERQPW

Query:  MNGLELLEDAGENKHVLSTGFEGVDVLLGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSM
        +NGLELLEDA ENKH+LS GFE VDVLLGGGLREGQLTEIVGPSSSGKTQVCLRAASNVA NYNAEVFY+DTGNSFSPQR+SGFVNWKPGTALD +EQSM
Subjt:  MNGLELLEDAGENKHVLSTGFEGVDVLLGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSM

Query:  LQQVMSSISCHSVFDIFAMFDVLHQLEFNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTS
        LQQVMSSISCHSVFDIFA+FDVLH+LEFNLRSQ CK DRRVQ LI+DSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTS
Subjt:  LQQVMSSISCHSVFDIFAMFDVLHQLEFNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTS

Query:  KPALGESWKSVPHVRLQLSRGAGSNVCQASILKHSSMPSGMTVRFVIYE
        KPALGESWKSVPHVRLQLSR AGSNV QASILKHSSM SG   RFV+YE
Subjt:  KPALGESWKSVPHVRLQLSRGAGSNVCQASILKHSSMPSGMTVRFVIYE

A0A1S3BJI2 DNA repair protein RAD51 homolog 4 isoform X27.0e-16190.37Show/hide
Query:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL
        MAPLK LE++ P IDSNF TFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIID  ERQPW+NGLELLEDA ENKH+LS GFE VDVL
Subjt:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL

Query:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE
        LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVA NYNAEVFY+DTGNSFSPQR+SGFVNWKPGTALD +EQSMLQQVMSSISCHSVFDIFA+FDVLH+LE
Subjt:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE

Query:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSRGAGSNVC
        FNLRSQ CK DRRVQ LI+DSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSR AGSNV 
Subjt:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSRGAGSNVC

Query:  QASILKHSSMPSGMTVRFVIYE
        QASILKHSSM SG   RFV+YE
Subjt:  QASILKHSSMPSGMTVRFVIYE

A0A6J1G6S8 DNA repair protein RAD51 homolog 48.0e-15788.51Show/hide
Query:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL
        MAPLKSLEQ+ P+ID+ FQ+FCASHGIFTVEDFLIHDLYVLAAFAEQQP SEKLKQGITQILS+I+ AERQ WMNGLELLEDA ENKHVLSTGFEGVD L
Subjt:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL

Query:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE
        LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNA+VFYLDTGNSFSPQR+SGFVNWK G ++ RT Q+MLQQVM+SISCHSV+DIF  FDVLHQLE
Subjt:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE

Query:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSRGAGSNVC
        FNLRSQ CK DRRVQLLIVDSISSLITPILGGSSSQGHALM+SAG+LLKKIAHE+NIAVLV NHTVGGDRGTSKPALGESWKSVPHVRLQLSR  GS+VC
Subjt:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSRGAGSNVC

Query:  QASILKHSSMPSGMTVRFVIYE
        QASILKHSSM SGMT RFVIYE
Subjt:  QASILKHSSMPSGMTVRFVIYE

A0A6J1L7F6 DNA repair protein RAD51 homolog 43.2e-15889.13Show/hide
Query:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL
        MAPLKSLEQ+ P+ID+ FQ+FCASHGIFTVEDFLIHDLYVLAAFAEQQP SEKLKQGITQILS+I+ AERQPWMNGLELLEDA ENKH LSTGFEGVD L
Subjt:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL

Query:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE
        LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNA+VFYLDTGNSFSPQR+SGFVNWK G ++ RT Q+MLQQVMSSISCHSV+DIFA FDVLHQLE
Subjt:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE

Query:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSRGAGSNVC
        FNLRSQ CK DRRVQLLIVDSISSLITPILGGSSSQGHALM+SAG+LLKKIAHE+NIAVLV NHTVGGDRGTSKPALG+SWKSVPHVRLQLSR  GSNVC
Subjt:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSRGAGSNVC

Query:  QASILKHSSMPSGMTVRFVIYE
        QASILKHSSM SGMT RFVIYE
Subjt:  QASILKHSSMPSGMTVRFVIYE

SwissProt top hitse value%identityAlignment
O55230 DNA repair protein RAD51 homolog 42.0e-3236.84Show/hide
Query:  MNGLELLEDAGENKHVLSTGFEGVDVLLGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSM
        +NG +L E+   +  +LSTG   +D LL  GL  G++TEIVG   SGKTQVCL  A+NVA +    V Y+D+    +  R+   +  +  T  +  + S 
Subjt:  MNGLELLEDAGENKHVLSTGFEGVDVLLGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSM

Query:  LQQVMSSISCHSVFDIFAMFDVLHQLEFNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNH-TVGGDRGT
        LQ++    S    FDIF M D+L  L   +  Q       V+++IVDS+++++ P+LGG   +G ALM+     LK +A +  +AV+VTNH T   D   
Subjt:  LQQVMSSISCHSVFDIFAMFDVLHQLEFNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNH-TVGGDRGT

Query:  SKPALGESWKSVPHVR--LQLSRGA---GSNVCQASILKHSSMPSGM
         KPALG SW  VP  R  L ++ GA   GS+     + K    P+G+
Subjt:  SKPALGESWKSVPHVR--LQLSRGA---GSNVCQASILKHSSMPSGM

O75771 DNA repair protein RAD51 homolog 43.0e-3634.63Show/hide
Query:  VCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVLLGGGLREGQ
        +CP +         SH I TV D +  D   L   A++   S K    + ++L    +A     +NG +L E+   +  +LSTG   +D LL  GL  G+
Subjt:  VCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVLLGGGLREGQ

Query:  LTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLEFNLRSQMCK
        +TEIVG   SGKTQVCL  A+NVA      V Y+D+    +  R+   +  K     D  EQ+   + +  I     FDIF M DVL +L   +  Q+  
Subjt:  LTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLEFNLRSQMCK

Query:  RDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNH-TVGGDRGTSKPALGESWKSVPHVRLQL----SRGAGSNVCQASI
            V++++VDS++++++P+LGG   +G ALM+     LK +A +  +AV+VTNH T   D G  KPALG SW  VP  R+ L      GA      A +
Subjt:  RDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNH-TVGGDRGTSKPALGESWKSVPHVRLQL----SRGAGSNVCQASI

Query:  LKHSSMPSG
         K S  P+G
Subjt:  LKHSSMPSG

Q1ZXF0 Probable DNA repair protein RAD51 homolog 43.0e-2831.2Show/hide
Query:  MNGLELLEDAGENKHVLSTGFEGVDVLLGG-GLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRV-----SGFV---------
        +NG +   D  E K   S+G + +D LLGG G   G++ E+VG +S GKTQ+ +  + N++  YN+ + Y+D+ NSFSP R+     S ++         
Subjt:  MNGLELLEDAGENKHVLSTGFEGVDVLLGG-GLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRV-----SGFV---------

Query:  ------NWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLEFNL----------RSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMI
              + K     D+ EQ  + +++  I   + FD   + ++L  ++  L           +Q  K    ++++++DSI +L+ PI+GG  +QGH  M+
Subjt:  ------NWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLEFNL----------RSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMI

Query:  SAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLS
            L+K IA  + I  L+TN+TVGG+   +K ALGE+W  VP+ +L ++
Subjt:  SAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLS

Q2HJ51 DNA repair protein RAD51 homolog 41.3e-3433.97Show/hide
Query:  VCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVLLGGGLREGQ
        +CP +  +      S GI TV D +  D   L   A++   S K    + ++L    +A      NG +L E+   +  +LSTG   +D LL  GL  G+
Subjt:  VCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVLLGGGLREGQ

Query:  LTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLEFNLRSQMCK
        +TEIVG   SGKTQVCL  A++VA      V Y+D+    +  R+   +        D  EQ+     +  I     FDIF M DVL  L   +  Q+  
Subjt:  LTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLEFNLRSQMCK

Query:  RDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNH-TVGGDRGTSKPALGESWKSVPHVRLQL-------SRGAGSNVCQ
            +++++VDS+++++ P+LGG   +G ALM+     LK +A + ++AVLVTNH T   D G  KPALG SW  VP  RL L       S G+   VC 
Subjt:  RDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNH-TVGGDRGTSKPALGESWKSVPHVRLQL-------SRGAGSNVCQ

Query:  ASILKHSSMPSG
          + K   +P+G
Subjt:  ASILKHSSMPSG

Q9LQQ2 DNA repair protein RAD51 homolog 46.6e-10058.62Show/hide
Query:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL
        MAPLK LE+  P+ID+ FQ FCASHGI T+EDFL+HDLY L AF+++Q  +++LK+GIT ILS+I+  + +P +NGL+LLED   NKH LSTG +  D L
Subjt:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL

Query:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE
        L GG REGQLTE+VGPSSSGKTQ C++AA++VA N+   V YLDTGNSFS +R++ F+          ++ ++ Q+VMS I CH+V+DI+ +FD L  LE
Subjt:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE

Query:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTV--GGDRGTSKPALGESWKSVPHVRLQLSRGAGSN
          LR QM   + R++LL+VDSISSLITPILGGS SQG ALM++ G LLKK+AHEH+IA+LVTNHTV  GG+ G +KPALGE+WKS+PHVRL LSR   ++
Subjt:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTV--GGDRGTSKPALGESWKSVPHVRLQLSRGAGSN

Query:  VCQASILKHSSMPSGMTVR
         C  SILKH+S+PSG   +
Subjt:  VCQASILKHSSMPSGMTVR

Arabidopsis top hitse value%identityAlignment
AT1G07745.1 homolog of RAD51 D4.7e-10158.62Show/hide
Query:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL
        MAPLK LE+  P+ID+ FQ FCASHGI T+EDFL+HDLY L AF+++Q  +++LK+GIT ILS+I+  + +P +NGL+LLED   NKH LSTG +  D L
Subjt:  MAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVL

Query:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE
        L GG REGQLTE+VGPSSSGKTQ C++AA++VA N+   V YLDTGNSFS +R++ F+          ++ ++ Q+VMS I CH+V+DI+ +FD L  LE
Subjt:  LGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLE

Query:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTV--GGDRGTSKPALGESWKSVPHVRLQLSRGAGSN
          LR QM   + R++LL+VDSISSLITPILGGS SQG ALM++ G LLKK+AHEH+IA+LVTNHTV  GG+ G +KPALGE+WKS+PHVRL LSR   ++
Subjt:  FNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTV--GGDRGTSKPALGESWKSVPHVRLQLSRGAGSN

Query:  VCQASILKHSSMPSGMTVR
         C  SILKH+S+PSG   +
Subjt:  VCQASILKHSSMPSGMTVR

AT1G07745.2 homolog of RAD51 D3.2e-8957.93Show/hide
Query:  VEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVLLGGGLREGQLTEIVGPSSSGKTQVCLRAA
        VEDFL+HDLY L AF+++Q  +++LK+GIT ILS+I+  + +P +NGL+LLED   NKH LSTG +  D LL GG REGQLTE+VGPSSSGKTQ C++AA
Subjt:  VEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLELLEDAGENKHVLSTGFEGVDVLLGGGLREGQLTEIVGPSSSGKTQVCLRAA

Query:  SNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLEFNLRSQMCKRDRRVQLLIVDSISSLITPI
        ++VA N+   V YLDTGNSFS +R++ F+          ++ ++ Q+VMS I CH+V+DI+ +FD L  LE  LR QM   + R++LL+VDSISSLITPI
Subjt:  SNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLEFNLRSQMCKRDRRVQLLIVDSISSLITPI

Query:  LGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTV--GGDRGTSKPALGESWKSVPHVRLQLSRGAGSNVCQASILKHSSMPSGMTVR
        LGGS SQG ALM++ G LLKK+AHEH+IA+LVTNHTV  GG+ G +KPALGE+WKS+PHVRL LSR   ++ C  SILKH+S+PSG   +
Subjt:  LGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTV--GGDRGTSKPALGESWKSVPHVRLQLSRGAGSNVCQASILKHSSMPSGMTVR

AT2G28560.1 DNA repair (Rad51) family protein5.9e-1124.38Show/hide
Query:  ITQILSIIDAAERQPWMNGLELLEDAGENKHV---LSTGFEGVDVLLGGGLREGQLTEIVGPSSSGKTQVCLRAASNVA-----TNYNAEVFYLDTGNSF
        I   +S I  A   P  +   LLE   EN+H+   L T  +G+D  L GG+  G LTE+VGP   GK+Q C++ A + +        +  V Y+D  + F
Subjt:  ITQILSIIDAAERQPWMNGLELLEDAGENKHV---LSTGFEGVDVLLGGGLREGQLTEIVGPSSSGKTQVCLRAASNVA-----TNYNAEVFYLDTGNSF

Query:  SPQRV--SGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLEFNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAG--
        S +RV   G  ++     L    + M Q++   I       +    + + +L+ ++         +V+LL++DS    +T +L G +  G       G  
Subjt:  SPQRV--SGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLEFNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAG--

Query:  -TLLKKIAHEHNIAVLVTNHTVGGDRGTSK------------------------PALGESWKSVPHVRLQLSRGAGSNVCQAS
         + LK +A    I ++VTN     +R  +                          ALG +W     +RL L   +G  + + +
Subjt:  -TLLKKIAHEHNIAVLVTNHTVGGDRGTSK------------------------PALGESWKSVPHVRLQLSRGAGSNVCQAS

AT2G28560.2 DNA repair (Rad51) family protein5.9e-1124.38Show/hide
Query:  ITQILSIIDAAERQPWMNGLELLEDAGENKHV---LSTGFEGVDVLLGGGLREGQLTEIVGPSSSGKTQVCLRAASNVA-----TNYNAEVFYLDTGNSF
        I   +S I  A   P  +   LLE   EN+H+   L T  +G+D  L GG+  G LTE+VGP   GK+Q C++ A + +        +  V Y+D  + F
Subjt:  ITQILSIIDAAERQPWMNGLELLEDAGENKHV---LSTGFEGVDVLLGGGLREGQLTEIVGPSSSGKTQVCLRAASNVA-----TNYNAEVFYLDTGNSF

Query:  SPQRV--SGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLEFNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAG--
        S +RV   G  ++     L    + M Q++   I       +    + + +L+ ++         +V+LL++DS    +T +L G +  G       G  
Subjt:  SPQRV--SGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFDIFAMFDVLHQLEFNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAG--

Query:  -TLLKKIAHEHNIAVLVTNHTVGGDRGTSK------------------------PALGESWKSVPHVRLQLSRGAGSNVCQAS
         + LK +A    I ++VTN     +R  +                          ALG +W     +RL L   +G  + + +
Subjt:  -TLLKKIAHEHNIAVLVTNHTVGGDRGTSK------------------------PALGESWKSVPHVRLQLSRGAGSNVCQAS

AT3G22880.1 DNA repair (Rad51) family protein2.6e-1129.31Show/hide
Query:  LSTGFEGVDVLLGGGLREGQLTEIVGPSSSGKTQVC--LRAASNVATNY---NAEVFYLDTGNSFSPQRVSGFV---NWKPGTALDRTEQSMLQQVMSSI
        ++TG + +D LLGGG+    +TE  G   SGKTQ+   L   + + TN    N +V Y+DT  +F P R+          PG  LD              
Subjt:  LSTGFEGVDVLLGGGLREGQLTEIVGPSSSGKTQVC--LRAASNVATNY---NAEVFYLDTGNSFSPQRVSGFV---NWKPGTALDRTEQSMLQQVMSSI

Query:  SCHSVFDIFAMFDVLHQLEFNLRSQMCKRDRRVQLLIVDSISSLITPILGG-----SSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRG-----
          + ++     ++  + L   L ++M +   R+  LIVDSI +L      G        Q  A M+S    L KIA E N+AV +TN  +    G     
Subjt:  SCHSVFDIFAMFDVLHQLEFNLRSQMCKRDRRVQLLIVDSISSLITPILGG-----SSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRG-----

Query:  -TSKPALGESWKSVPHVRLQLSRGAG-SNVCQ
           KPA G        +RL   +G G + VC+
Subjt:  -TSKPALGESWKSVPHVRLQLSRGAG-SNVCQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCATTTTTATCAAATTCCCGATGTTAAAAGCTACGTAGTTTTGCATTCGAACCGAACGCAGCCCCAAAATCTTCGGAGCGCGAAGAAGAAAATGGCGCCATTGAA
GTCTTTGGAGCAGGTTTGCCCACTCATAGACTCTAATTTTCAGACATTTTGTGCTTCGCACGGCATTTTCACTGTGGAGGATTTCCTCATCCATGACCTATACGTTTTAG
CTGCTTTTGCAGAGCAACAGCCTGCATCCGAGAAATTGAAGCAGGGGATTACCCAAATCCTTTCAATTATTGATGCTGCTGAGCGACAACCATGGATGAACGGTTTGGAG
CTTTTGGAAGATGCTGGAGAAAACAAGCATGTTCTGTCCACTGGTTTTGAAGGGGTTGATGTTTTGCTTGGAGGTGGACTACGCGAGGGACAATTGACTGAAATTGTTGG
GCCATCGTCTTCTGGTAAAACTCAGGTTTGTCTCCGAGCTGCTTCAAATGTGGCAACGAACTACAATGCTGAGGTTTTCTACTTGGACACGGGGAATTCCTTTTCACCCC
AACGCGTCTCAGGCTTTGTTAATTGGAAGCCTGGAACTGCTTTAGATCGGACTGAGCAGAGCATGCTTCAACAAGTAATGAGCAGTATTTCATGTCACTCTGTGTTCGAC
ATTTTCGCAATGTTTGATGTCTTGCATCAGCTGGAATTCAATTTGAGATCCCAGATGTGCAAAAGGGATCGGAGAGTGCAGTTACTGATTGTAGATTCGATTTCTTCGCT
AATTACACCCATCCTTGGTGGAAGTAGTTCACAGGGACATGCTTTGATGATATCTGCTGGAACACTACTGAAGAAAATAGCTCATGAACATAATATAGCAGTACTGGTAA
CGAATCACACTGTGGGTGGAGATAGAGGTACTTCAAAACCTGCCCTAGGAGAGAGTTGGAAGAGTGTTCCACACGTGAGGCTTCAGCTTTCCCGAGGTGCAGGAAGCAAT
GTCTGCCAAGCTTCAATATTAAAACACTCATCCATGCCATCTGGTATGACTGTAAGATTCGTAATTTATGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCATTTTTATCAAATTCCCGATGTTAAAAGCTACGTAGTTTTGCATTCGAACCGAACGCAGCCCCAAAATCTTCGGAGCGCGAAGAAGAAAATGGCGCCATTGAA
GTCTTTGGAGCAGGTTTGCCCACTCATAGACTCTAATTTTCAGACATTTTGTGCTTCGCACGGCATTTTCACTGTGGAGGATTTCCTCATCCATGACCTATACGTTTTAG
CTGCTTTTGCAGAGCAACAGCCTGCATCCGAGAAATTGAAGCAGGGGATTACCCAAATCCTTTCAATTATTGATGCTGCTGAGCGACAACCATGGATGAACGGTTTGGAG
CTTTTGGAAGATGCTGGAGAAAACAAGCATGTTCTGTCCACTGGTTTTGAAGGGGTTGATGTTTTGCTTGGAGGTGGACTACGCGAGGGACAATTGACTGAAATTGTTGG
GCCATCGTCTTCTGGTAAAACTCAGGTTTGTCTCCGAGCTGCTTCAAATGTGGCAACGAACTACAATGCTGAGGTTTTCTACTTGGACACGGGGAATTCCTTTTCACCCC
AACGCGTCTCAGGCTTTGTTAATTGGAAGCCTGGAACTGCTTTAGATCGGACTGAGCAGAGCATGCTTCAACAAGTAATGAGCAGTATTTCATGTCACTCTGTGTTCGAC
ATTTTCGCAATGTTTGATGTCTTGCATCAGCTGGAATTCAATTTGAGATCCCAGATGTGCAAAAGGGATCGGAGAGTGCAGTTACTGATTGTAGATTCGATTTCTTCGCT
AATTACACCCATCCTTGGTGGAAGTAGTTCACAGGGACATGCTTTGATGATATCTGCTGGAACACTACTGAAGAAAATAGCTCATGAACATAATATAGCAGTACTGGTAA
CGAATCACACTGTGGGTGGAGATAGAGGTACTTCAAAACCTGCCCTAGGAGAGAGTTGGAAGAGTGTTCCACACGTGAGGCTTCAGCTTTCCCGAGGTGCAGGAAGCAAT
GTCTGCCAAGCTTCAATATTAAAACACTCATCCATGCCATCTGGTATGACTGTAAGATTCGTAATTTATGAATGA
Protein sequenceShow/hide protein sequence
MAHFYQIPDVKSYVVLHSNRTQPQNLRSAKKKMAPLKSLEQVCPLIDSNFQTFCASHGIFTVEDFLIHDLYVLAAFAEQQPASEKLKQGITQILSIIDAAERQPWMNGLE
LLEDAGENKHVLSTGFEGVDVLLGGGLREGQLTEIVGPSSSGKTQVCLRAASNVATNYNAEVFYLDTGNSFSPQRVSGFVNWKPGTALDRTEQSMLQQVMSSISCHSVFD
IFAMFDVLHQLEFNLRSQMCKRDRRVQLLIVDSISSLITPILGGSSSQGHALMISAGTLLKKIAHEHNIAVLVTNHTVGGDRGTSKPALGESWKSVPHVRLQLSRGAGSN
VCQASILKHSSMPSGMTVRFVIYE