; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039301 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039301
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionplastid division protein CDP1, chloroplastic-like
Genome locationchr2:40953401..40970462
RNA-Seq ExpressionLag0039301
SyntenyLag0039301
Gene Ontology termsGO:0010020 - chloroplast fission (biological process)
GO:0015074 - DNA integration (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0009528 - plastid inner membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0043621 - protein self-association (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025344 - Protein ACCUMULATION AND REPLICATION OF CHLOROPLASTS 6-like, IMS domain
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR044685 - Protein ACCUMULATION AND REPLICATION OF CHLOROPLASTS 6-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607554.1 Plastid division protein CDP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0087.9Show/hide
Query:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE
        +DLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVG+AK+VLDIG+ V+Q P+AK Y+HDILLSMVLAECAIAKIGFEKN VSQGFE
Subjt:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE

Query:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW
        ALARAQYLLR QTSL KLKLLSQIEESLEELAPACTLELLGM SL TNTERR GAIAALRELLRQGLDVET CQVQDWPCFL+QALGRLM AE+VDLLPW
Subjt:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW

Query:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP
        DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQT+LIEKAKTICECLIASEGVDLKLEEAFC FLLGQCSDSEVFEKL QSTLNSKPAMP
Subjt:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP

Query:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKIN-QQQSIIHPNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNI
        TRLSNSGMEKKNAENTYQSLEIWLKD VLGVFKDTRDCSLTL RFF SEKK +A+KKIN   QSI+H NNRPI+SSS SEWRDVEDSFP+L+++QNLGNI
Subjt:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKIN-QQQSIIHPNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNI

Query:  VRRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGDED
        VRRL PTNLPSQLGT KKT D NSSSVQLKRDLRINKWKI+ELWL RGSLVKNMK+L +VGCISFACFKLTS MIK+  VPTW PHK SLNTSSLF DE 
Subjt:  VRRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGDED

Query:  LSADDVIAPPNMKSSSNLRSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQALA
        LS D+VIA PNMK SSNL SLK LL KLMRKGR LSG S++PL SAITAP  KLMS+EEAEALVNQWQ IKAEALGPNY+ +RL EILDGTMLFQWQALA
Subjt:  LSADDVIAPPNMKSSSNLRSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQALA

Query:  DAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILVPT
        DAAKAKSCYWKFVLLQ SVLRAE LSDKFGATTLEIEVHLEEAAELVNEAEPK+P+YYSNYKVRYVVKRQQDGSWKF E DILVPT
Subjt:  DAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILVPT

XP_022932466.1 plastid division protein CDP1, chloroplastic-like [Cucurbita moschata]0.0e+0087.9Show/hide
Query:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE
        +DLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVG+AK+VLDIG+ V+Q P+AKPY+HDILLSMVLAECAIAKIGFEKN VSQGFE
Subjt:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE

Query:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW
        ALARAQYLLR QTSL KLKLLSQIEESLEELAPACTLELLGM SL TNTERR GAIAALRELLRQGLDVET CQVQDWPCFL+QALGRLM AE+VDLLPW
Subjt:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW

Query:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP
        DELA IRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQT+LIEKAKTICECLIASEGVDLKLEEAFC FLLGQCSDSEVFEKL QSTLN KPAMP
Subjt:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP

Query:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKIN-QQQSIIHPNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNI
        TRLSNSGMEKKNAENTYQSLEIWLKD VLGVFKDTRDCSLTL RFF SEKK +A+KKIN   QSI+H NNRPI+SSS SEWRDVEDSFPNL++SQNLGNI
Subjt:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKIN-QQQSIIHPNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNI

Query:  VRRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGDED
        VRRL PTNLPSQLGT KKT D NSSSVQLKRDLRINKWKI+ELWL RGSLVKNMK+L +VGCISFACFKLTS MIK+  VPTW PHK SLNTSSLF DE 
Subjt:  VRRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGDED

Query:  LSADDVIAPPNMKSSSNLRSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQALA
        LS D+VIA PNMK SSNL SLK LL KLMRKGR LSG S++PL SAITAP  KLMS+EEAEALVNQWQ IKAEALGPNY+ +RL EILDGTMLFQWQALA
Subjt:  LSADDVIAPPNMKSSSNLRSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQALA

Query:  DAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILVPT
        DAAKAKSCYWKFVLLQ SVLRA+ LSDKFGATTLEIEVHLEEAAELVNEAEPK+P+YYSNYKVRYVVKRQQDGSWKF E DILVPT
Subjt:  DAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILVPT

XP_022973447.1 plastid division protein CDP1, chloroplastic-like [Cucurbita maxima]0.0e+0087.35Show/hide
Query:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE
        +DLLMDVRDKLLFEP YAGNMKENIPPKSSIRIPWAWLPGALCLLQEVG+AK+VLDIG+ V+Q P+AKPY+HDILLSMVLAECAIAKIGFEKN VSQGFE
Subjt:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE

Query:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW
        ALARAQYLLR QTSL KLKLLSQIEESLEELAPACTLELLGM SL TNTERR GAIAALRELLRQGLDVET CQVQDWPCFLSQALGRLMAAE+VDLLPW
Subjt:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW

Query:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP
        DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQT+L+EKAKTICECLIASEGVDLKLEEAFC FLLGQCSDSEVFEKL QSTLNSKPAMP
Subjt:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP

Query:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKIN-QQQSIIHPNNRPIASSSV--SEWRDVEDSFPNLNSSQNLG
        TRLSNSGMEKK AENTYQSLEIWLKD VLGVFKDTRDCSLTL RFF SEKK +A+KKIN   QSI+H NNRPI+SSS   SEWRDVEDSFPNL++SQNLG
Subjt:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKIN-QQQSIIHPNNRPIASSSV--SEWRDVEDSFPNLNSSQNLG

Query:  NIVRRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGD
        NIVRRL PTNLPSQLGT KKT D NSSSVQ KRDL INKWKI+ELWL RG+LVKNMK+L +VGCISFACFKLTS MIK+  VPTW PHK SLNTSSLF D
Subjt:  NIVRRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGD

Query:  EDLSADDVIAPPNMKSSSNLRSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQA
        + LS D+VIA PNMK SSNL SLK LL KLMRKGR LSG S++PL SAITAP  KLMS+EEAEALVNQWQ IKAEALGPNY+ +RL EILDGTMLFQWQA
Subjt:  EDLSADDVIAPPNMKSSSNLRSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQA

Query:  LADAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILVPT
        LADAAKAKSCYWKFVLLQ SVLRA+ LSDKFGATTLEIEVHLEEAAELVNEAEPK+P+YYSNYKVRYVVKRQQDGSWKF EGDILVPT
Subjt:  LADAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILVPT

XP_023523807.1 plastid division protein CDP1, chloroplastic-like [Cucurbita pepo subsp. pepo]0.0e+0088.19Show/hide
Query:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE
        +DLLMDVRDKLLFEPHYAGNMKENI PKSSIRIPWAWLPGALCLLQEVG+AK+VLDIG+ V+Q P+AKPY+HDILLSMVLAECAIAKIGFEKN VSQGFE
Subjt:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE

Query:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW
        ALARAQYLLR QTSL KLKLLSQIEESLEELAPACTLELLGM SL TNTERR GAIAALRELLRQGLDVET CQVQDWPCFLSQALGRLMAAE+VDLLPW
Subjt:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW

Query:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP
        DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQT+LIEKAKTICECLIASEGVDLKLEEAFC FLLGQCSDSEVFEKL QSTLNSKPAMP
Subjt:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP

Query:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKIN-QQQSIIHPNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNI
        TRLSNSGMEKKNAENTYQSLEIWLKD VLGVFKDTRDCSLTL RFF SEKK +A+KKIN   QSI+H NNRPI+SSSVSEWRDVEDSFPNL++SQNLGN+
Subjt:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKIN-QQQSIIHPNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNI

Query:  VRRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGDED
        VRRL PTNLPSQLGT KKT D NSSSVQLKRDLRINKWKI+ELWL RGSLVKNMK+L +VGCISFACFKLTS MIK+  VPTW PHK SLNTSSLF DE 
Subjt:  VRRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGDED

Query:  LSADDVIAPPNMKSSSNLRSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQALA
        LS D+VIA PNMK SSNL SLK LL K+MRKGR LSG S++PL SAITAP  KLMS+EEAEALVNQWQ IKAEALGPNY+ +RL EILDGTMLFQWQALA
Subjt:  LSADDVIAPPNMKSSSNLRSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQALA

Query:  DAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILVPT
        DAAKAKSCYWKFVLLQ SVLRA+ LSDKFGATTLEIEVHLEEAAELVNEAEPK+P+YYSNYKVRYVVKR QDGSWKF EGDILVPT
Subjt:  DAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILVPT

XP_038885039.1 plastid division protein CDP1, chloroplastic isoform X2 [Benincasa hispida]0.0e+0086.79Show/hide
Query:  YRDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGF
        Y+DLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQ VVQ PMAKPY+HDILLSMVLAECAIAK+GFEKNMVSQGF
Subjt:  YRDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGF

Query:  EALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLP
        EALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELL + +L  N ERR GAIAALRELLRQGLDVET CQVQDWPCFLSQALGRLMAAE+VDLLP
Subjt:  EALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLP

Query:  WDELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAM
        WDELALIRKNKKSIESQNQRVVVDF+CF+MAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFC FLLGQCSDSEVFEKLQQS LNSKPAM
Subjt:  WDELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAM

Query:  PTRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKIN--QQQSIIHPNNRPIASSSVSEWRDVEDSFPNLNSSQNLG
        PTR SN  MEKK+AENTYQ LEIWLKD VLGVFKDTRDCSLTLV F   EKKMDA+KKIN  QQQ I+  NNRPI++SS+SEWRDVE+SF N NSSQNLG
Subjt:  PTRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKIN--QQQSIIHPNNRPIASSSVSEWRDVEDSFPNLNSSQNLG

Query:  NIVRRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGD
        NI+RRL PTNLPSQLGTGKK +D NSSSVQLKRDLRI +WKI+ELW ARGSLV  MK+LV++GCISFA F L STMIK+K  PTW PHKASLNTSS+F D
Subjt:  NIVRRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGD

Query:  EDLSADDVIAPPNMKSSSNL-RSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQ
        E LS D+VI PPN KS +NL  SLK LLSKLMRKGRNL+GTS+M LSSAITA +QKLM VEEAEALV QWQTIKAEALGPNYQ +RLA+ILDGTML QWQ
Subjt:  EDLSADDVIAPPNMKSSSNL-RSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQ

Query:  ALADAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILVPT
        ALADAAKAKSCYW+FVLLQLSVLRAELLSDKFGA TLEIEVHLEEAAELVNEAEPK+PSYYSNYKVRY+VKRQQDGSWKF EGDILVPT
Subjt:  ALADAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILVPT

TrEMBL top hitse value%identityAlignment
A0A6J1DS05 plastid division protein CDP1, chloroplastic0.0e+0085.94Show/hide
Query:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE
        +DLLMDVRDKLLFEPHYAGNMKENIPPKSSIR+PWAWLPGALCLLQEVGEAK+VLDIGQ VVQ PMAKPYIHDILLSM LAECAIAKIGFEKNMVSQGFE
Subjt:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE

Query:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW
        ALARAQYLLRSQTSLGKLKLL QIEESLEELAPACTLELLGM +L  NTERR GAIAALRELLRQGLDVET CQVQDWPCFLSQALGRLMAAE+VDLLPW
Subjt:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW

Query:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP
         +LAL+RKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSD+EVFEKL+QSTLNSK AM 
Subjt:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP

Query:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKINQQQ-SIIHPNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNI
        TR SNSGMEKKNAENTYQ LEIWLKDAVLG FKDT+DCSLTLV FFH EKK  A+KKIN  Q SI+H NNRPI+SSSVSEWRD+EDSFP LNSSQNLGNI
Subjt:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKINQQQ-SIIHPNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNI

Query:  VRRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGDED
        VRRL PTNLPSQLGT KK SD NSSSVQLKRDLR NKWKI+ELWL R S VKN++ LV+VGCISFACFKLTSTMI +KF  TW PHKASLN+S+L  + D
Subjt:  VRRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGDED

Query:  LSADDVIAPPNMKSSSNLRSLKGLLSKLMRKGRNLSGTSNM-------PLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTML
        LS D+VIAP +MKSS++L SLK LLS+LMRKGRNLSGT +M        LSS +TA +QKLMSV+EAEALV  WQ+IKAEALGPNYQ +RLAEILDGTML
Subjt:  LSADDVIAPPNMKSSSNLRSLKGLLSKLMRKGRNLSGTSNM-------PLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTML

Query:  FQWQALADAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDIL
         QWQALADAA+AKSCYWKFVLLQLSVLRAELLSDKFGA T+EIEVHLEEAAELVNEAEPK+PSYYSNYKVRYVVKRQ+DGSWKFCEGDIL
Subjt:  FQWQALADAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDIL

A0A6J1EWF8 plastid division protein CDP1, chloroplastic-like0.0e+0087.9Show/hide
Query:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE
        +DLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVG+AK+VLDIG+ V+Q P+AKPY+HDILLSMVLAECAIAKIGFEKN VSQGFE
Subjt:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE

Query:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW
        ALARAQYLLR QTSL KLKLLSQIEESLEELAPACTLELLGM SL TNTERR GAIAALRELLRQGLDVET CQVQDWPCFL+QALGRLM AE+VDLLPW
Subjt:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW

Query:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP
        DELA IRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQT+LIEKAKTICECLIASEGVDLKLEEAFC FLLGQCSDSEVFEKL QSTLN KPAMP
Subjt:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP

Query:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKIN-QQQSIIHPNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNI
        TRLSNSGMEKKNAENTYQSLEIWLKD VLGVFKDTRDCSLTL RFF SEKK +A+KKIN   QSI+H NNRPI+SSS SEWRDVEDSFPNL++SQNLGNI
Subjt:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKIN-QQQSIIHPNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNI

Query:  VRRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGDED
        VRRL PTNLPSQLGT KKT D NSSSVQLKRDLRINKWKI+ELWL RGSLVKNMK+L +VGCISFACFKLTS MIK+  VPTW PHK SLNTSSLF DE 
Subjt:  VRRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGDED

Query:  LSADDVIAPPNMKSSSNLRSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQALA
        LS D+VIA PNMK SSNL SLK LL KLMRKGR LSG S++PL SAITAP  KLMS+EEAEALVNQWQ IKAEALGPNY+ +RL EILDGTMLFQWQALA
Subjt:  LSADDVIAPPNMKSSSNLRSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQALA

Query:  DAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILVPT
        DAAKAKSCYWKFVLLQ SVLRA+ LSDKFGATTLEIEVHLEEAAELVNEAEPK+P+YYSNYKVRYVVKRQQDGSWKF E DILVPT
Subjt:  DAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILVPT

A0A6J1EXU2 plastid division protein CDP1, chloroplastic-like0.0e+0081.38Show/hide
Query:  IGVPFAESVKNPIRIFSSLVHMLVPFGRPFSLLSTGPLHYRDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQK
        IGVP        ++    L ++ +  G     +S+     +DLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAK VLDIGQ 
Subjt:  IGVPFAESVKNPIRIFSSLVHMLVPFGRPFSLLSTGPLHYRDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQK

Query:  VVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFEALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALR
        V+Q PMAKP++HDILLSMVLAECAIAKIGFEKNMVSQGFEALARAQYLLRSQTSL KL+LLSQIEESLEELAPACTLELLGM SL TNTERR GAIAALR
Subjt:  VVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFEALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALR

Query:  ELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPWDELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIAS
        ELLRQGLDVE+ CQVQDWPCFLSQALGRLMAAE+VDLLPWDELALIRKNKKSIESQNQRVV+DF+CF MAFKAHLALGFS+RQTELIEKAKTICECL++S
Subjt:  ELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPWDELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIAS

Query:  EGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMPTRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKINQ
        EGVDLKLEEAF +FLLGQCSDSEVFEKLQQSTLNSKPAMPTRL N GMEKKNAENTYQ LEIWLKD VL VFKDTRDCSLTLV F H +KKMDA+KK+N 
Subjt:  EGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMPTRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKINQ

Query:  QQSIIHPNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNIVRRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVG
         Q  I  NNRPI+SS VSEWRDVE+SFPNL SSQNLGNI+R+L PTNLPSQLGT K+ +D NSSSVQLKR+LR+NKWKI+E WLAR SLV NMK+LV+VG
Subjt:  QQSIIHPNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNIVRRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVG

Query:  CISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGDEDLSADDVIAPPNMKSSSNL-RSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKLMSVEEA
        CISFA FKL STMIK K VP W PH ASLN SSLF  E LS D+VI  PN KS SNL  SLK LLS +MRKGRNLSGTS+ PL SAI+A HQK MSVEEA
Subjt:  CISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGDEDLSADDVIAPPNMKSSSNL-RSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKLMSVEEA

Query:  EALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQALADAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSN
        EALV QWQ IKAEALGPNYQ +RLAEILDG MLFQWQALADAAKAKSCYWKFVLL+LSVLRAELLSDK GA TLEIEVHLEEAAELVNEAEPK+PSYYSN
Subjt:  EALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQALADAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSN

Query:  YKVRYVVKRQQDGSWKFCEGDILVP
        Y VRY+ KRQQDGSWKFCEG+I VP
Subjt:  YKVRYVVKRQQDGSWKFCEGDILVP

A0A6J1I7J5 plastid division protein CDP1, chloroplastic-like0.0e+0087.35Show/hide
Query:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE
        +DLLMDVRDKLLFEP YAGNMKENIPPKSSIRIPWAWLPGALCLLQEVG+AK+VLDIG+ V+Q P+AKPY+HDILLSMVLAECAIAKIGFEKN VSQGFE
Subjt:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE

Query:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW
        ALARAQYLLR QTSL KLKLLSQIEESLEELAPACTLELLGM SL TNTERR GAIAALRELLRQGLDVET CQVQDWPCFLSQALGRLMAAE+VDLLPW
Subjt:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW

Query:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP
        DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQT+L+EKAKTICECLIASEGVDLKLEEAFC FLLGQCSDSEVFEKL QSTLNSKPAMP
Subjt:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP

Query:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKIN-QQQSIIHPNNRPIASSSV--SEWRDVEDSFPNLNSSQNLG
        TRLSNSGMEKK AENTYQSLEIWLKD VLGVFKDTRDCSLTL RFF SEKK +A+KKIN   QSI+H NNRPI+SSS   SEWRDVEDSFPNL++SQNLG
Subjt:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKIN-QQQSIIHPNNRPIASSSV--SEWRDVEDSFPNLNSSQNLG

Query:  NIVRRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGD
        NIVRRL PTNLPSQLGT KKT D NSSSVQ KRDL INKWKI+ELWL RG+LVKNMK+L +VGCISFACFKLTS MIK+  VPTW PHK SLNTSSLF D
Subjt:  NIVRRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGD

Query:  EDLSADDVIAPPNMKSSSNLRSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQA
        + LS D+VIA PNMK SSNL SLK LL KLMRKGR LSG S++PL SAITAP  KLMS+EEAEALVNQWQ IKAEALGPNY+ +RL EILDGTMLFQWQA
Subjt:  EDLSADDVIAPPNMKSSSNLRSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQA

Query:  LADAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILVPT
        LADAAKAKSCYWKFVLLQ SVLRA+ LSDKFGATTLEIEVHLEEAAELVNEAEPK+P+YYSNYKVRYVVKRQQDGSWKF EGDILVPT
Subjt:  LADAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILVPT

A0A6J1IGB5 plastid division protein CDP1, chloroplastic-like0.0e+0085.26Show/hide
Query:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE
        +DLLMDVRDKLLFEP+YAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAK VLDIGQ V+Q PMAKP++HDILLSMVLAECAIAKIGFEKNMVSQGFE
Subjt:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE

Query:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW
        ALARAQYLLRSQTSL KLKLLSQIEESLEELAPACTLELLGM SL TNTERR GAIAALRELLRQGLDVET CQVQDWPCFLSQALGRLMAAE+VDLLPW
Subjt:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW

Query:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP
        DELALIRKNKKSIESQNQRVV+DF+CF MAFKAHLALGFS+RQTELIEKAKTICECL++SEGVDLKLEEAF +FLLGQCSDSEVFEKLQQSTLNSKPAMP
Subjt:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP

Query:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKINQQQSIIHPNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNIV
        TRL N GMEKKNAENT Q LEIWLKD VL VFKDTRDCSLTLV F H +KKMDA+KKIN  Q  I  NNRPI+SS VSEWRDVE+SFPNL+SSQNLGNI+
Subjt:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKINQQQSIIHPNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNIV

Query:  RRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGDEDL
        R+L PTNLPSQLGT K+ +D N+SSVQLKR+LR+NKWKI+E WLAR SLV NMK+LV+VGCISFA FKL ST +K K VP W PH ASLN SSLF DE L
Subjt:  RRLAPTNLPSQLGTGKKTSDGNSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGDEDL

Query:  SADDVIAPPNMKSSSNL-RSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQALA
        SAD+VI  PN KS SNL  SLK LLS LMRKGRNLSGTS+ P+ SAI+A HQK MSVEEAEALV QWQ IKAEALGPNYQ +RL EILDGTMLFQWQALA
Subjt:  SADDVIAPPNMKSSSNL-RSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQALA

Query:  DAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILVP
        DAAKAKSCYWKFVLL+LSVLRAELLSDK GA TLEIEVHLEEAAELVNEAEPK+PSYYSNYKVRY+ KRQQDGSWKFCEG+I VP
Subjt:  DAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILVP

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.9e-2923.69Show/hide
Query:  WKKQDSLISSWLLGSMSESILEQVIHCKTTKEIWSCLFQIFNSRNRAQIMRMKSKLQSLQKGS-LSLNDYFSQVKKCVDALAAVGKEIDSEDHIMYILNG
        WKK +    S ++  +S+S L       T ++I   L  ++  ++ A  + ++ +L SL+  S +SL  +F    + +  L A G +I+  D I ++L  
Subjt:  WKKQDSLISSWLLGSMSESILEQVIHCKTTKEIWSCLFQIFNSRNRAQIMRMKSKLQSLQKGS-LSLNDYFSQVKKCVDALAAVGKEIDSEDHIMYILNG

Query:  LGPEYESMVSALTTSTDQQSVQDVMAYLFTHENRIESKLKAVNADGTQPMANLMTQNHHSVENDPQKNNNSYEGNHGFNSRARGRSNRGGGRWNNRNKPQ
        L   Y+ +++A+ T +++      +A++       E K+K  + D ++ + N +  N          NNN+Y+ N   N   + +    G   N++ K +
Subjt:  LGPEYESMVSALTTSTDQQSVQDVMAYLFTHENRIESKLKAVNADGTQPMANLMTQNHHSVENDPQKNNNSYEGNHGFNSRARGRSNRGGGRWNNRNKPQ

Query:  CQICFKFGHTAIKCYS----LNGRFDQNRGSFPLNNSPITTMLTATDLN---QDNVWYP-DSGATNHLTNNFNNLVVGAEYSGGSQMQVG-NGTGLPISH
        C  C + GH    C+     LN +  +N        S     +     N    DN  +  DSGA++HL N+ +      E     ++ V   G  +  + 
Subjt:  CQICFKFGHTAIKCYS----LNGRFDQNRGSFPLNNSPITTMLTATDLN---QDNVWYP-DSGATNHLTNNFNNLVVGAEYSGGSQMQVG-NGTGLPISH

Query:  CGYTSFSSPNRIFHLNNLLHVPHITKNLISINQFAKDNLVYFEFHPNFCYVKDSLTGQTLLQGPLHDGLYRFDISPQSSSSLAKSSSKPQCLLSHFSHIS
         G     + + I  L ++L       NL+S+ +  +                    G ++           FD S  + S       K   +L++   I+
Subjt:  CGYTSFSSPNRIFHLNNLLHVPHITKNLISINQFAKDNLVYFEFHPNFCYVKDSLTGQTLLQGPLHDGLYRFDISPQSSSSLAKSSSKPQCLLSHFSHIS

Query:  ----SVNTNASNSMLDVWHRRLGHPAFSIV-----KYVAKSCSPALSLNKHVAFCNACAVGKSHALPFP--ISTTQYTSPLQLIVADVCGPSYTMSRNRF
            S+N    N+   +WH R GH +   +     K +    S   +L      C  C  GK   LPF      T    PL ++ +DVCGP   ++ +  
Subjt:  ----SVNTNASNSMLDVWHRRLGHPAFSIV-----KYVAKSCSPALSLNKHVAFCNACAVGKSHALPFP--ISTTQYTSPLQLIVADVCGPSYTMSRNRF

Query:  RYYISFVDVFSRYTWVYFLQTKVEALQTFLSFKTSVERLLGCRILRFQSNGGGEF--NFFTPLLHKLSIEHRFSCPYTSQQNGIVERKHRQIVDVGLTLL
         Y++ FVD F+ Y   Y ++ K +    F  F    E     +++    + G E+  N       K  I +  + P+T Q NG+ ER  R I +   T++
Subjt:  RYYISFVDVFSRYTWVYFLQTKVEALQTFLSFKTSVERLLGCRILRFQSNGGGEF--NFFTPLLHKLSIEHRFSCPYTSQQNGIVERKHRQIVDVGLTLL

Query:  SQASMPLTYWDDAFSTATYLINRLPSTTL
        S A +  ++W +A  TATYLINR+PS  L
Subjt:  SQASMPLTYWDDAFSTATYLINRLPSTTL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-4124.4Show/hide
Query:  GNKISTVKLNNDN-FLMWKVQIEFALEGHSLGQFIAKDCDPPSEKIPVSEGSTITKINPEFVK---WKKQDSLISSWLLGSMSESILEQVIHCKTTKEIW
        G K    K N DN F  W+ ++   L    + Q + K  D  S+K             P+ +K   W   D   +S +   +S+ ++  +I   T + IW
Subjt:  GNKISTVKLNNDN-FLMWKVQIEFALEGHSLGQFIAKDCDPPSEKIPVSEGSTITKINPEFVK---WKKQDSLISSWLLGSMSESILEQVIHCKTTKEIW

Query:  SCLFQIFNSRNRAQIMRMKSKLQSLQKG-SLSLNDYFSQVKKCVDALAAVGKEIDSEDHIMYILNGLGPEYESMVSALTTSTDQQSVQDVMAYLFTHE--
        + L  ++ S+     + +K +L +L      +   + +     +  LA +G +I+ ED  + +LN L   Y+++ + +        ++DV + L  +E  
Subjt:  SCLFQIFNSRNRAQIMRMKSKLQSLQKG-SLSLNDYFSQVKKCVDALAAVGKEIDSEDHIMYILNGLGPEYESMVSALTTSTDQQSVQDVMAYLFTHE--

Query:  -NRIESKLKAVNADGTQPMANLMTQNHHSVENDPQKNNNSYEGNHGFNSRARGRS-NRGGGRWNNRNKPQCQICFKFGHTAIKC--------YSLNGRFD
          + E++ +A+  +G                   Q+++N+Y       S ARG+S NR   R  N     C  C + GH    C         +   + D
Subjt:  -NRIESKLKAVNADGTQPMANLMTQNHHSVENDPQKNNNSYEGNHGFNSRARGRS-NRGGGRWNNRNKPQCQICFKFGHTAIKC--------YSLNGRFD

Query:  QNRGSFPLNNSPITTMLTATD-----LNQDNVWYPDSGATNHLT---NNFNNLVVGAEYSGGSQMQVGNGTGLPISHCGYTSF-SSPNRIFHLNNLLHVP
         N  +   NN  +   +   +        ++ W  D+ A++H T   + F   V G        +++GN +   I+  G     ++      L ++ HVP
Subjt:  QNRGSFPLNNSPITTMLTATD-----LNQDNVWYPDSGATNHLT---NNFNNLVVGAEYSGGSQMQVGNGTGLPISHCGYTSF-SSPNRIFHLNNLLHVP

Query:  HITKNLISINQFAKDNLVYFEFHPNFCYVKDSLTGQTLLQGPLHDGLYRFDISPQSSSSLAKSSSKPQCLLSHFSHISSVNTNASNSMLDVWHRRLGHPA
         +  NLIS     +D    +  +  +   K SL    + +G     LYR      +++ + +                 +N       +D+WH+R+GH +
Subjt:  HITKNLISINQFAKDNLVYFEFHPNFCYVKDSLTGQTLLQGPLHDGLYRFDISPQSSSSLAKSSSKPQCLLSHFSHISSVNTNASNSMLDVWHRRLGHPA

Query:  FSIVKYVAKSCSPALSLNKHVAFCNACAVGKSHALPFPISTTQYTSPLQLIVADVCGPSYTMSRNRFRYYISFVDVFSRYTWVYFLQTKVEALQTFLSFK
           ++ +AK    + +    V  C+ C  GK H + F  S+ +  + L L+ +DVCGP    S    +Y+++F+D  SR  WVY L+TK +  Q F  F 
Subjt:  FSIVKYVAKSCSPALSLNKHVAFCNACAVGKSHALPFPISTTQYTSPLQLIVADVCGPSYTMSRNRFRYYISFVDVFSRYTWVYFLQTKVEALQTFLSFK

Query:  TSVERLLGCRILRFQSNGGGEFNF--FTPLLHKLSIEHRFSCPYTSQQNGIVERKHRQIVDVGLTLLSQASMPLTYWDDAFSTATYLINRLPSTTLNGPI
          VER  G ++ R +S+ GGE+    F        I H  + P T Q NG+ ER +R IV+   ++L  A +P ++W +A  TA YLINR PS  L   I
Subjt:  TSVERLLGCRILRFQSNGGGEFNF--FTPLLHKLSIEHRFSCPYTSQQNGIVERKHRQIVDVGLTLLSQASMPLTYWDDAFSTATYLINRLPSTTLNGPI

Query:  HPTASPTSKISQS
                ++S S
Subjt:  HPTASPTSKISQS

Q8VY16 Plastid division protein CDP1, chloroplastic1.1e-18953.15Show/hide
Query:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE
        +DLLMDVRDKLLFE  YAGN+KE I PKS +RIPWAWLPGALCLLQEVG+ K+VLDIG+  ++   +KPYIHDI LSM LAECAIAK  FE N VSQGFE
Subjt:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE

Query:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW
        ALARAQ  L+S+ +LGKL LL+QIEESLEELAP CTL+LLG+     N ERR GAIAALRELLRQGL VE  CQ+QDWPCFLSQA+ RL+A E+VDLLPW
Subjt:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW

Query:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP
        D+LA+ RKNKKS+ES NQRVV+DFNCFYM    H+A+GFS +Q E I KAKTICECLIASEGVDLK EEAFC FLL Q S++E  EKL+Q   NS  A+ 
Subjt:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP

Query:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKINQQQSIIH-PNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNI
         R S  G E ++   T  SLE WL ++VL  F DTR CS +L  FF +EKK    KK+     + H  N RP++++              +NSSQ+L   
Subjt:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKINQQQSIIH-PNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNI

Query:  VRRLAPTNLPSQLGTGKKTSDGNSS--SVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMI-KIKFVP---TWAPHKASLNTSS
        V +L PT+L S + + K   + ++S  SVQLKR+L ++K KI + WL++ SL+  + ++ L+GC  F   KL+     +++ +P   +  PH  S   S 
Subjt:  VRRLAPTNLPSQLGTGKKTSDGNSS--SVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMI-KIKFVP---TWAPHKASLNTSS

Query:  LFGDEDLSADDVIAPPNMKS-SSNLRSLKGLLSKLMRKGRN-------LSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAE
        L+  E  +    +   N      N++ L  +L   M  G +        SG S   LS + +  H++ M  EEAE LV QW+ +KAEALGP +Q + L+E
Subjt:  LFGDEDLSADDVIAPPNMKS-SSNLRSLKGLLSKLMRKGRN-------LSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAE

Query:  ILDGTMLFQWQALADAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILV
        +LD +ML QWQ LA  A+AKSCYW+FVLL L VL+A +  D       EIE  LEEAAELV+E++PK+  YYS YK+RY++K+Q+DG WKFC+ DI +
Subjt:  ILDGTMLFQWQALADAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.2e-10035.05Show/hide
Query:  NKISTVKLNNDNFLMWKVQIEFALEGHSLGQFIAKDCDPPSEKIPVSEGSTITKINPEFVKWKKQDSLISSWLLGSMSESILEQVIHCKTTKEIWSCLFQ
        N  +  KL + N+LMW  Q+    +G+ L  F+      P   I         ++NP++ +WK+QD LI S +LG++S S+   V    T  +IW  L +
Subjt:  NKISTVKLNNDNFLMWKVQIEFALEGHSLGQFIAKDCDPPSEKIPVSEGSTITKINPEFVKWKKQDSLISSWLLGSMSESILEQVIHCKTTKEIWSCLFQ

Query:  IFNSRNRAQIMRMKSKLQSLQKGSLSLNDYFSQVKKCVDALAAVGKEIDSEDHIMYILNGLGPEYESMVSALTTSTDQQSVQDVMAYLFTHENRIESKLK
        I+ + +   + +++++L+   KG+ +++DY   +    D LA +GK +D ++ +  +L  L  EY+ ++  +       ++ ++   L  H    ESK+ 
Subjt:  IFNSRNRAQIMRMKSKLQSLQKGSLSLNDYFSQVKKCVDALAAVGKEIDSEDHIMYILNGLGPEYESMVSALTTSTDQQSVQDVMAYLFTHENRIESKLK

Query:  AVNADGTQPM-ANLMTQNHHSVENDPQ--KNNNSYEG-NHGFNSRARGRSNRGGGRWNNRNKP---QCQICFKFGHTAIKCYSLNGRFDQNRGSFPLNNS
        AV++    P+ AN ++  + +  N+      NN Y+  N+  NS+   +S+      NN++KP   +CQIC   GH+A +C  L           P   S
Subjt:  AVNADGTQPM-ANLMTQNHHSVENDPQ--KNNNSYEG-NHGFNSRARGRSNRGGGRWNNRNKP---QCQICFKFGHTAIKCYSLNGRFDQNRGSFPLNNS

Query:  PITTMLTATDL-----NQDNVWYPDSGATNHLTNNFNNLVVGAEYSGGSQMQVGNGTGLPISHCGYTSFSSPNRIFHLNNLLHVPHITKNLISINQFAKD
        P T      +L        N W  DSGAT+H+T++FNNL +   Y+GG  + V +G+ +PISH G TS S+ +R  +L+N+L+VP+I KNLIS+ +    
Subjt:  PITTMLTATDL-----NQDNVWYPDSGATNHLTNNFNNLVVGAEYSGGSQMQVGNGTGLPISHCGYTSFSSPNRIFHLNNLLHVPHITKNLISINQFAKD

Query:  NLVYFEFHPNFCYVKDSLTGQTLLQGPLHDGLYRFDISPQSSSSLAKSSSKPQCLLSHFSHISSVNTNASNSMLDVWHRRLGHPAFSIVKYVAKSCS-PA
        N V  EF P    VKD  TG  LLQG   D LY + I          +SS+P   +S F+  SS  T++S      WH RLGHPA SI+  V  + S   
Subjt:  NLVYFEFHPNFCYVKDSLTGQTLLQGPLHDGLYRFDISPQSSSSLAKSSSKPQCLLSHFSHISSVNTNASNSMLDVWHRRLGHPAFSIVKYVAKSCS-PA

Query:  LSLNKHVAFCNACAVGKSHALPFPISTTQYTSPLQLIVADVCGPSYTMSRNRFRYYISFVDVFSRYTWVYFLQTKVEALQTFLSFKTSVERLLGCRILRF
        L+ +     C+ C + KS+ +PF  ST   T PL+ I +DV   S  +S + +RYY+ FVD F+RYTW+Y L+ K +  +TF++FK  +E     RI  F
Subjt:  LSLNKHVAFCNACAVGKSHALPFPISTTQYTSPLQLIVADVCGPSYTMSRNRFRYYISFVDVFSRYTWVYFLQTKVEALQTFLSFKTSVERLLGCRILRF

Query:  QSNGGGEFNFFTPLLHKLSIEHRFSCPYTSQQNGIVERKHRQIVDVGLTLLSQASMPLTYWDDAFSTATYLINRLPSTTLNGPIHPTASPTSKISQSLP
         S+ GGEF        +  I H  S P+T + NG+ ERKHR IV+ GLTLLS AS+P TYW  AF+ A YLINRLP+     P+    SP  K+  + P
Subjt:  QSNGGGEFNFFTPLLHKLSIEHRFSCPYTSQQNGIVERKHRQIVDVGLTLLSQASMPLTYWDDAFSTATYLINRLPSTTLNGPIHPTASPTSKISQSLP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.6e-9233.19Show/hide
Query:  LVILEIVNPGNKISTVKLNNDNFLMWKVQIEFALEGHSLGQFIAKDCDPPSEKIPVSEGSTITKINPEFVKWKKQDSLISSWLLGSMSESILEQVIHCKT
        LV   I+N  N  +  KL + N+LMW  Q+    +G+ L  F+      P   I       + ++NP++ +W++QD LI S +LG++S S+   V    T
Subjt:  LVILEIVNPGNKISTVKLNNDNFLMWKVQIEFALEGHSLGQFIAKDCDPPSEKIPVSEGSTITKINPEFVKWKKQDSLISSWLLGSMSESILEQVIHCKT

Query:  TKEIWSCLFQIFNSRNRAQIMRMKSKLQSLQKGSLSLNDYFSQVKKCVDALAAVGKEIDSEDHIMYILNGLGPEYESMVSALTTSTDQQSVQDVMAYLFT
          +IW  L +I+ + +   + +++   +                    D LA +GK +D ++ +  +L  L  +Y+ ++  +       S+ ++      
Subjt:  TKEIWSCLFQIFNSRNRAQIMRMKSKLQSLQKGSLSLNDYFSQVKKCVDALAAVGKEIDSEDHIMYILNGLGPEYESMVSALTTSTDQQSVQDVMAYLFT

Query:  HENRI--ESKLKAVNADGTQPM-ANLMT-QNHHSVENDPQKNNNSYEGNHGFNSRARGRSNRGGGRWNNRNKP---QCQICFKFGHTAIKC---YSLNGR
        HE  I  ESKL A+N+    P+ AN++T +N ++  N   + +N    N+   S +   S+ G    N + KP   +CQIC   GH+A +C   +     
Subjt:  HENRI--ESKLKAVNADGTQPM-ANLMT-QNHHSVENDPQKNNNSYEGNHGFNSRARGRSNRGGGRWNNRNKP---QCQICFKFGHTAIKC---YSLNGR

Query:  FDQNRGSFPLNNSPITTMLTATDLNQDNVWYPDSGATNHLTNNFNNLVVGAEYSGGSQMQVGNGTGLPISHCGYTSFSSPNRIFHLNNLLHVPHITKNLI
         +Q + + P         L        N W  DSGAT+H+T++FNNL     Y+GG  + + +G+ +PI+H G  S  + +R   LN +L+VP+I KNLI
Subjt:  FDQNRGSFPLNNSPITTMLTATDLNQDNVWYPDSGATNHLTNNFNNLVVGAEYSGGSQMQVGNGTGLPISHCGYTSFSSPNRIFHLNNLLHVPHITKNLI

Query:  SINQFAKDNLVYFEFHPNFCYVKDSLTGQTLLQGPLHDGLYRFDISPQSSSSLAKSSSKPQCLLSHFSHISSVNTNASNSMLDVWHRRLGHPAFSIVKYV
        S+ +    N V  EF P    VKD  TG  LLQG   D LY + I+   + S+             F+   S  T++S      WH RLGHP+ +I+  V
Subjt:  SINQFAKDNLVYFEFHPNFCYVKDSLTGQTLLQGPLHDGLYRFDISPQSSSSLAKSSSKPQCLLSHFSHISSVNTNASNSMLDVWHRRLGHPAFSIVKYV

Query:  AKSCS-PALSLNKHVAFCNACAVGKSHALPFPISTTQYTSPLQLIVADVCGPSYTMSRNRFRYYISFVDVFSRYTWVYFLQTKVEALQTFLSFKTSVERL
          + S P L+ +  +  C+ C + KSH +PF  ST   + PL+ I +DV   S  +S + +RYY+ FVD F+RYTW+Y L+ K +   TF+ FK+ VE  
Subjt:  AKSCS-PALSLNKHVAFCNACAVGKSHALPFPISTTQYTSPLQLIVADVCGPSYTMSRNRFRYYISFVDVFSRYTWVYFLQTKVEALQTFLSFKTSVERL

Query:  LGCRILRFQSNGGGEFNFFTPLLHKLSIEHRFSCPYTSQQNGIVERKHRQIVDVGLTLLSQASMPLTYWDDAFSTATYLINRLPSTTLNGPIHPTASPTS
           RI    S+ GGEF      L +  I H  S P+T + NG+ ERKHR IV++GLTLLS AS+P TYW  AFS A YLINRLP+     P+    SP  
Subjt:  LGCRILRFQSNGGGEFNFFTPLLHKLSIEHRFSCPYTSQQNGIVERKHRQIVDVGLTLLSQASMPLTYWDDAFSTATYLINRLPSTTLNGPIHPTASPTS

Query:  KI
        K+
Subjt:  KI

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)6.3e-1225Show/hide
Query:  TITKINPEFVKWKKQDSLISSWLLGSMSESILE-QVIHCKTTKEIWSCLFQIFNSRNRAQIMRMKSKLQSLQKGSLSLNDYFSQVKKCVDALAAVGKEID
        T+   N   V W+K+D ++   L G+++    +   +   T+++IW  +   F +   A+ +R+ S+L++   G + + DY+ ++KK  D+L  V   + 
Subjt:  TITKINPEFVKWKKQDSLISSWLLGSMSESILE-QVIHCKTTKEIWSCLFQIFNSRNRAQIMRMKSKLQSLQKGSLSLNDYFSQVKKCVDALAAVGKEID

Query:  SEDHIMYILNGLGPEYESMVSALTTSTDQQSVQDVMAYLFTHENRIESKLK--AVNADGTQPMANLMTQNHHSVENDPQKNNNSYEGNHGFNSRARGRSN
          + +MY+LNGL P+++++++ +       S  D    L   E+R++  +K    + D +     L      +    P   N    G +    R RGR N
Subjt:  SEDHIMYILNGLGPEYESMVSALTTSTDQQSVQDVMAYLFTHENRIESKLK--AVNADGTQPMANLMTQNHHSVENDPQKNNNSYEGNHGFNSRARGRSN

Query:  ---RG-GGRWNNRNKP
           RG GGR++  N P
Subjt:  ---RG-GGRWNNRNKP

AT3G19180.1 paralog of ARC67.6e-19153.15Show/hide
Query:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE
        +DLLMDVRDKLLFE  YAGN+KE I PKS +RIPWAWLPGALCLLQEVG+ K+VLDIG+  ++   +KPYIHDI LSM LAECAIAK  FE N VSQGFE
Subjt:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE

Query:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW
        ALARAQ  L+S+ +LGKL LL+QIEESLEELAP CTL+LLG+     N ERR GAIAALRELLRQGL VE  CQ+QDWPCFLSQA+ RL+A E+VDLLPW
Subjt:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW

Query:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP
        D+LA+ RKNKKS+ES NQRVV+DFNCFYM    H+A+GFS +Q E I KAKTICECLIASEGVDLK EEAFC FLL Q S++E  EKL+Q   NS  A+ 
Subjt:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP

Query:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKINQQQSIIH-PNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNI
         R S  G E ++   T  SLE WL ++VL  F DTR CS +L  FF +EKK    KK+     + H  N RP++++              +NSSQ+L   
Subjt:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKINQQQSIIH-PNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNI

Query:  VRRLAPTNLPSQLGTGKKTSDGNSS--SVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMI-KIKFVP---TWAPHKASLNTSS
        V +L PT+L S + + K   + ++S  SVQLKR+L ++K KI + WL++ SL+  + ++ L+GC  F   KL+     +++ +P   +  PH  S   S 
Subjt:  VRRLAPTNLPSQLGTGKKTSDGNSS--SVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMI-KIKFVP---TWAPHKASLNTSS

Query:  LFGDEDLSADDVIAPPNMKS-SSNLRSLKGLLSKLMRKGRN-------LSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAE
        L+  E  +    +   N      N++ L  +L   M  G +        SG S   LS + +  H++ M  EEAE LV QW+ +KAEALGP +Q + L+E
Subjt:  LFGDEDLSADDVIAPPNMKS-SSNLRSLKGLLSKLMRKGRN-------LSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAE

Query:  ILDGTMLFQWQALADAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILV
        +LD +ML QWQ LA  A+AKSCYW+FVLL L VL+A +  D       EIE  LEEAAELV+E++PK+  YYS YK+RY++K+Q+DG WKFC+ DI +
Subjt:  ILDGTMLFQWQALADAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILV

AT3G19180.2 paralog of ARC63.2e-15752.55Show/hide
Query:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE
        +DLLMDVRDKLLFE  YAGN+KE I PKS +RIPWAWLPGALCLLQEVG+ K+VLDIG+  ++   +KPYIHDI LSM LAECAIAK  FE N VSQGFE
Subjt:  RDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNMVSQGFE

Query:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW
        ALARAQ  L+S+ +LGKL LL+QIEESLEELAP CTL+LLG+     N ERR GAIAALRELLRQGL VE  CQ+QDWPCFLSQA+ RL+A E+VDLLPW
Subjt:  ALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPW

Query:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP
        D+LA+ RKNKKS+ES NQRVV+DFNCFYM    H+A+GFS +Q E I KAKTICECLIASEGVDLK EEAFC FLL Q S++E  EKL+Q   NS  A+ 
Subjt:  DELALIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMP

Query:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKINQQQSIIH-PNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNI
         R S  G E ++   T  SLE WL ++VL  F DTR CS +L  FF +EKK    KK+     + H  N RP++++              +NSSQ+L   
Subjt:  TRLSNSGMEKKNAENTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKINQQQSIIH-PNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNI

Query:  VRRLAPTNLPSQLGTGKKTSDGNSS--SVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMI-KIKFVP---TWAPHKASLNTSS
        V +L PT+L S + + K   + ++S  SVQLKR+L ++K KI + WL++ SL+  + ++ L+GC  F   KL+     +++ +P   +  PH  S   S 
Subjt:  VRRLAPTNLPSQLGTGKKTSDGNSS--SVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMI-KIKFVP---TWAPHKASLNTSS

Query:  LFGDEDLSADDVIAPPNMKS-SSNLRSLKGLLSKLMRKGRN-------LSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAE
        L+  E  +    +   N      N++ L  +L   M  G +        SG S   LS + +  H++ M  EEAE LV QW+ +KAEALGP +Q + L+E
Subjt:  LFGDEDLSADDVIAPPNMKS-SSNLRSLKGLLSKLMRKGRN-------LSGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAE

Query:  ILDGTMLFQ
        +LD +ML Q
Subjt:  ILDGTMLFQ

AT5G42480.1 Chaperone DnaJ-domain superfamily protein2.9e-2523.36Show/hide
Query:  IPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAK--IGFEKNMVSQGFEALARAQYLLRSQ--TSLGKLKLLSQIEESL
        +PW  +PGALC+LQE GE ++VL +G+ +++  + K +  D++L M LA   +++  +  +      G+E +  A  LL+ +  +SL    L +QI+E+L
Subjt:  IPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAK--IGFEKNMVSQGFEALARAQYLLRSQ--TSLGKLKLLSQIEESL

Query:  EELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQ--GLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPWDELALIRKNKKSIESQNQRVVVDFNC
        EE+ P   LELLG+        +R   ++ +R +L    G              F+++A  R+ AAE VDL       +  ++ +  E     V   F  
Subjt:  EELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQ--GLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPWDELALIRKNKKSIESQNQRVVVDFNC

Query:  FYMAFKAHLALGFSSRQTELIEKAKTICECLIA-------SEGVDLKLEEAFCVFLLGQCSDSEVFEKL--QQSTLNSKPAMPTRLSNSGM-EKKNAENT
          +  K HL L  + +Q + +++AK +   + A       +  +D  LE   C  L+G+  +  ++  L  + S   +   +   L NS   +  +    
Subjt:  FYMAFKAHLALGFSSRQTELIEKAKTICECLIA-------SEGVDLKLEEAFCVFLLGQCSDSEVFEKL--QQSTLNSKPAMPTRLSNSGM-EKKNAENT

Query:  YQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKINQQQSIIHPNNRPIASSS-----------VSEWRDVEDSFPNLNSSQNLG--NIVRRL
         + LE WL   V   F+DT+D    L  ++     +   +++   Q        P+A+++            S  + ++  FP+  + +N      V+  
Subjt:  YQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKINQQQSIIHPNNRPIASSS-----------VSEWRDVEDSFPNLNSSQNLG--NIVRRL

Query:  APTNLPSQLGTGKKTSDG--NSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGDEDLS
          +  P     G+    G   + +V+   +   N + I    ++  S+ +    + +   +  A  K+ +  + I  +             SLF  +   
Subjt:  APTNLPSQLGTGKKTSDG--NSSSVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGDEDLS

Query:  ADDVIAPPNMKSSSNLRSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKL---MSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQAL
                 +KSSS+ +          RK    S  S++    ++ A   +    M    AE +V++WQ IK+ A GP+++   L E+LDG ML  W   
Subjt:  ADDVIAPPNMKSSSNLRSLKGLLSKLMRKGRNLSGTSNMPLSSAITAPHQKL---MSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQAL

Query:  ADAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPK-HPSYYSNYKVRYVVKRQQDGSWKFCEGDIL
        A         + + LL+LSV    + +D    T   +E  LEE+A L +   P+ + +    Y  RY V   + G WK  EG +L
Subjt:  ADAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAELVNEAEPK-HPSYYSNYKVRYVVKRQQDGSWKFCEGDIL

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)7.2e-1626.39Show/hide
Query:  TVKLNNDNFLMWKVQIE-----FALEGHSLGQFIAKDCDPPSEKIPVSEGSTITKINPEFVKWKKQDSLISSWLLGSMSESILEQVIHCK-TTKEIWSCL
        T+ LN  N+ +W+   E     F + GH          D  S   P++E            +WK++D L+  W+ G++++S+L+ +I    T +++W  L
Subjt:  TVKLNNDNFLMWKVQIE-----FALEGHSLGQFIAKDCDPPSEKIPVSEGSTITKINPEFVKWKKQDSLISSWLLGSMSESILEQVIHCK-TTKEIWSCL

Query:  FQIFNSRNRAQIMRMKSKLQSLQKGSLSLNDYFSQVKKCVDALAAVGKEIDSEDHIMYILNGLGPEYESMVSALTTSTDQQSVQDVMAYLFTHENRIESK
          +F     A+ ++ +++L++     LS+++Y  ++K   D L  V   I     +M++LNGL  +Y+ +++ +   +   S  +  + L   E+R+ +K
Subjt:  FQIFNSRNRAQIMRMKSKLQSLQKGSLSLNDYFSQVKKCVDALAAVGKEIDSEDHIMYILNGLGPEYESMVSALTTSTDQQSVQDVMAYLFTHENRIESK

Query:  LKAVNADGTQP-MANLMTQNHHSVENDPQK--NNNSYEGNHGFNSRARGRS---NRGG----GRWNNRN
         K+  +    P ++N++       E  PQ+  NNNS  G        RGRS   NRGG    GR+NN N
Subjt:  LKAVNADGTQP-MANLMTQNHHSVENDPQK--NNNSYEGNHGFNSRARGRS---NRGG----GRWNNRN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGTACAAAGTAATTGCAAGGATGCTGCTGTCCGACAGGTTGAAGAGGGTCTTGCTGTCCACCATCACAGAATACCTGGTCCAAGGTGTAAACAATTCTGTGGATG
GCCACATGAAAAATTTAACCTTCTTGGAGAGATGTTTAAGAATTTGTTATGGCATGGAAGTAAAGGTAAAGGTGGAATGCACCTCTATGGCATAAAGTACAATTTCCATC
CAAGGAGAGGAGATCTCCTTGGATTTCCATCTCTCCAAATTGGTGTACCCTTTGCTGAAAGTGTGAAGAATCCAATCAGAATCTTTTCATCACTTGTCCATATGCTGGTG
CCTTTTGGAAGGCCATTCTCCTTGCTTTCAACTGGCCCATTGCACTACCGGGATCTTTTAATGGATGTGAGAGACAAGCTTCTATTTGAACCACATTATGCTGGTAACAT
GAAGGAAAACATCCCACCTAAGTCTTCCATTCGAATTCCTTGGGCTTGGTTGCCAGGTGCTCTTTGCCTACTTCAAGAGGTTGGAGAAGCAAAAATGGTGCTTGACATTG
GACAGAAAGTTGTTCAATATCCAATGGCTAAGCCTTATATCCATGATATACTGCTTTCCATGGTATTAGCGGAGTGTGCAATTGCAAAGATTGGTTTTGAGAAGAACATG
GTGTCTCAAGGATTTGAAGCTCTTGCACGTGCACAATATCTACTAAGAAGTCAAACATCTCTTGGGAAACTAAAATTGTTATCTCAGATTGAAGAATCTTTGGAGGAACT
TGCACCTGCTTGCACATTGGAGTTATTGGGTATGACAAGCTTATCCACGAATACTGAACGTAGAGGAGGAGCAATTGCAGCATTACGGGAATTGCTGAGACAAGGTCTTG
ATGTGGAAACATTTTGTCAGGTTCAAGATTGGCCATGCTTCTTAAGCCAAGCTCTTGGTAGACTAATGGCTGCAGAGATGGTTGATCTTCTTCCATGGGATGAATTAGCT
CTTATAAGAAAGAATAAGAAATCCATTGAGTCACAGAATCAAAGGGTTGTGGTTGATTTTAATTGCTTCTATATGGCTTTCAAAGCTCATCTTGCCCTTGGGTTTTCAAG
CAGGCAGACAGAGTTGATTGAAAAAGCAAAAACTATATGTGAATGTCTGATAGCATCCGAGGGTGTCGATCTGAAACTTGAGGAAGCTTTCTGTGTTTTTCTTCTTGGTC
AGTGCAGTGATTCTGAGGTTTTTGAAAAGCTTCAGCAGTCTACTTTGAATTCAAAACCAGCTATGCCTACTCGATTGTCAAATTCAGGAATGGAGAAAAAGAACGCAGAA
AACACATACCAATCATTGGAAATATGGTTGAAGGATGCTGTACTTGGTGTCTTTAAAGATACACGGGATTGCTCTCTGACACTGGTTAGATTTTTCCACAGCGAGAAGAA
AATGGATGCAAGAAAGAAAATTAACCAACAGCAGAGTATAATTCACCCAAATAACAGGCCCATAGCGTCTTCCTCTGTATCAGAGTGGAGGGATGTTGAGGACTCCTTTC
CTAATTTGAATTCTTCCCAAAATCTTGGGAATATTGTTAGACGGTTGGCACCTACTAACTTGCCAAGTCAATTAGGAACGGGCAAAAAAACGAGTGATGGCAACTCATCA
TCAGTTCAATTGAAAAGGGACCTTCGCATAAACAAATGGAAAATTACAGAATTATGGTTGGCCAGAGGCAGTCTTGTCAAGAACATGAAAATTCTTGTTCTAGTTGGATG
TATTAGTTTTGCTTGCTTCAAGCTGACGAGCACAATGATAAAAATAAAATTTGTTCCTACATGGGCTCCACATAAAGCAAGCCTGAATACCAGCTCTCTTTTCGGTGATG
AGGATTTGTCTGCAGATGATGTTATAGCACCTCCAAATATGAAGAGCAGTTCAAATCTTAGGAGTCTTAAAGGGCTTTTGTCGAAGTTAATGAGGAAGGGCAGGAACTTA
TCAGGCACAAGCAATATGCCACTGTCATCTGCAATTACAGCTCCGCACCAGAAGCTGATGTCAGTTGAAGAAGCTGAAGCCCTTGTGAACCAATGGCAAACGATTAAAGC
TGAAGCTTTGGGACCTAACTATCAATTTCATAGACTTGCTGAAATTCTTGATGGAACAATGCTTTTCCAGTGGCAAGCTCTAGCTGATGCTGCAAAAGCTAAATCATGCT
ATTGGAAATTTGTTTTGCTGCAATTGTCTGTCCTACGAGCTGAACTTTTGTCAGATAAATTTGGAGCAACGACATTAGAAATTGAGGTTCATCTAGAGGAAGCAGCTGAG
CTTGTCAATGAAGCTGAACCAAAGCACCCAAGCTATTATAGCAATTATAAAGTTCGTTATGTGGTAAAGAGGCAACAGGATGGTTCTTGGAAGTTCTGTGAAGGAGATAT
ACTAGTACCAACTCAGATCTTGAGCCTTCGATGGATTCCTCAAGCACAGAAATCCAAACAGTTGGTGATCTTGGAAATCGTCAATCCGGGTAACAAAATCTCTACTGTTA
AGCTTAACAATGACAATTTCCTTATGTGGAAAGTTCAGATTGAATTTGCCCTAGAGGGCCACAGTCTTGGACAATTCATTGCGAAAGATTGCGATCCACCGTCCGAGAAA
ATACCTGTAAGTGAAGGCTCGACTATTACAAAAATTAATCCTGAATTTGTTAAGTGGAAGAAACAGGATAGTCTTATTTCGTCTTGGCTTCTTGGATCCATGTCTGAAAG
TATACTAGAACAAGTAATACACTGCAAAACTACAAAAGAGATTTGGAGTTGTTTGTTTCAGATTTTTAATTCAAGAAATAGAGCTCAAATAATGCGGATGAAATCAAAAT
TACAATCTCTTCAGAAAGGATCTTTATCCCTAAATGACTATTTCTCACAAGTTAAGAAGTGTGTCGATGCACTAGCGGCTGTAGGCAAAGAAATTGATTCAGAAGACCAC
ATCATGTATATTCTAAACGGTTTAGGGCCTGAGTATGAATCTATGGTATCTGCACTAACCACCTCTACAGATCAACAAAGCGTCCAGGATGTTATGGCTTACCTTTTCAC
TCATGAAAATCGAATTGAAAGTAAGTTGAAAGCTGTCAATGCAGATGGAACACAACCTATGGCAAATCTAATGACTCAGAACCATCACAGTGTAGAAAACGACCCTCAGA
AAAATAACAATTCATACGAGGGAAATCATGGCTTCAATTCAAGAGCCAGAGGTCGTTCGAATAGAGGTGGTGGTCGTTGGAATAATAGGAACAAACCACAGTGCCAAATT
TGTTTCAAGTTTGGTCACACTGCCATAAAGTGTTACTCACTCAATGGCCGTTTTGATCAAAACCGTGGATCTTTTCCTCTAAATAACTCACCGATTACAACCATGCTGAC
AGCTACTGATCTCAATCAAGATAATGTCTGGTATCCAGACTCTGGGGCTACGAATCACCTAACCAACAACTTCAATAATCTTGTTGTTGGTGCTGAATACTCGGGAGGCA
GTCAGATGCAAGTTGGGAATGGTACAGGTCTTCCTATATCTCATTGTGGCTATACTTCATTTTCTTCACCTAATCGTATCTTTCACCTAAATAATCTTCTCCATGTGCCT
CACATAACTAAAAACCTAATAAGTATCAACCAATTTGCTAAAGATAACTTAGTTTACTTTGAATTTCATCCTAATTTTTGTTATGTGAAGGACTCCCTTACTGGCCAAAC
ACTCCTTCAAGGACCACTCCATGATGGGTTGTACCGGTTCGATATATCACCACAATCATCCTCCTCATTGGCGAAAAGTAGCTCCAAACCCCAGTGTTTGTTATCTCATT
TCTCTCATATATCCTCTGTTAATACCAATGCTTCAAATTCTATGTTGGATGTTTGGCATAGGAGATTAGGACATCCTGCCTTTTCCATTGTTAAGTATGTTGCTAAAAGT
TGTAGTCCTGCCTTATCATTGAATAAACATGTTGCATTTTGTAATGCTTGTGCTGTTGGGAAGAGTCACGCTTTACCCTTTCCTATATCTACTACCCAGTACACATCTCC
TTTACAGCTCATTGTTGCTGATGTTTGCGGGCCCTCTTACACTATGTCTAGAAACAGATTTAGATACTACATAAGCTTTGTTGATGTCTTTTCCCGATACACCTGGGTAT
ATTTTTTACAAACCAAAGTTGAAGCCCTCCAAACATTCTTATCATTTAAAACTTCAGTTGAGAGACTTTTAGGTTGCCGTATTCTACGTTTTCAATCTAATGGGGGAGGA
GAGTTCAACTTTTTCACACCTCTTCTCCACAAACTGAGCATAGAACATCGTTTTTCCTGCCCCTACACCTCTCAACAAAATGGTATAGTAGAGCGCAAGCATAGGCAAAT
TGTGGATGTGGGCCTTACTCTCCTATCACAAGCTTCTATGCCACTGACCTACTGGGATGATGCCTTCAGTACAGCAACCTATCTCATTAACCGTCTACCCTCAACAACCC
TTAATGGACCTATTCATCCTACTGCCTCTCCTACTTCTAAGATATCTCAATCTCTTCCCTTGGTATCTTCCCCTGTTACTTCAACACCTTCCTCTTCAGACTTATCTCCT
ACGGTTGCACCAATTCTTTCTCCTTTGTCTTTCAATTCTGATCCTGTACCATCTCCCGTGGCATCTGAATCCTCTTCTACAGCTGTCTCCTCCGAGTCTATTACTCAAGC
CTCTGGTGATAATCAAACACATACTGCTCCTACGATTGAGAATGTTGCATCTGCTCTGGCAGGGAAATTTAATCACTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTATGTACAAAGTAATTGCAAGGATGCTGCTGTCCGACAGGTTGAAGAGGGTCTTGCTGTCCACCATCACAGAATACCTGGTCCAAGGTGTAAACAATTCTGTGGATG
GCCACATGAAAAATTTAACCTTCTTGGAGAGATGTTTAAGAATTTGTTATGGCATGGAAGTAAAGGTAAAGGTGGAATGCACCTCTATGGCATAAAGTACAATTTCCATC
CAAGGAGAGGAGATCTCCTTGGATTTCCATCTCTCCAAATTGGTGTACCCTTTGCTGAAAGTGTGAAGAATCCAATCAGAATCTTTTCATCACTTGTCCATATGCTGGTG
CCTTTTGGAAGGCCATTCTCCTTGCTTTCAACTGGCCCATTGCACTACCGGGATCTTTTAATGGATGTGAGAGACAAGCTTCTATTTGAACCACATTATGCTGGTAACAT
GAAGGAAAACATCCCACCTAAGTCTTCCATTCGAATTCCTTGGGCTTGGTTGCCAGGTGCTCTTTGCCTACTTCAAGAGGTTGGAGAAGCAAAAATGGTGCTTGACATTG
GACAGAAAGTTGTTCAATATCCAATGGCTAAGCCTTATATCCATGATATACTGCTTTCCATGGTATTAGCGGAGTGTGCAATTGCAAAGATTGGTTTTGAGAAGAACATG
GTGTCTCAAGGATTTGAAGCTCTTGCACGTGCACAATATCTACTAAGAAGTCAAACATCTCTTGGGAAACTAAAATTGTTATCTCAGATTGAAGAATCTTTGGAGGAACT
TGCACCTGCTTGCACATTGGAGTTATTGGGTATGACAAGCTTATCCACGAATACTGAACGTAGAGGAGGAGCAATTGCAGCATTACGGGAATTGCTGAGACAAGGTCTTG
ATGTGGAAACATTTTGTCAGGTTCAAGATTGGCCATGCTTCTTAAGCCAAGCTCTTGGTAGACTAATGGCTGCAGAGATGGTTGATCTTCTTCCATGGGATGAATTAGCT
CTTATAAGAAAGAATAAGAAATCCATTGAGTCACAGAATCAAAGGGTTGTGGTTGATTTTAATTGCTTCTATATGGCTTTCAAAGCTCATCTTGCCCTTGGGTTTTCAAG
CAGGCAGACAGAGTTGATTGAAAAAGCAAAAACTATATGTGAATGTCTGATAGCATCCGAGGGTGTCGATCTGAAACTTGAGGAAGCTTTCTGTGTTTTTCTTCTTGGTC
AGTGCAGTGATTCTGAGGTTTTTGAAAAGCTTCAGCAGTCTACTTTGAATTCAAAACCAGCTATGCCTACTCGATTGTCAAATTCAGGAATGGAGAAAAAGAACGCAGAA
AACACATACCAATCATTGGAAATATGGTTGAAGGATGCTGTACTTGGTGTCTTTAAAGATACACGGGATTGCTCTCTGACACTGGTTAGATTTTTCCACAGCGAGAAGAA
AATGGATGCAAGAAAGAAAATTAACCAACAGCAGAGTATAATTCACCCAAATAACAGGCCCATAGCGTCTTCCTCTGTATCAGAGTGGAGGGATGTTGAGGACTCCTTTC
CTAATTTGAATTCTTCCCAAAATCTTGGGAATATTGTTAGACGGTTGGCACCTACTAACTTGCCAAGTCAATTAGGAACGGGCAAAAAAACGAGTGATGGCAACTCATCA
TCAGTTCAATTGAAAAGGGACCTTCGCATAAACAAATGGAAAATTACAGAATTATGGTTGGCCAGAGGCAGTCTTGTCAAGAACATGAAAATTCTTGTTCTAGTTGGATG
TATTAGTTTTGCTTGCTTCAAGCTGACGAGCACAATGATAAAAATAAAATTTGTTCCTACATGGGCTCCACATAAAGCAAGCCTGAATACCAGCTCTCTTTTCGGTGATG
AGGATTTGTCTGCAGATGATGTTATAGCACCTCCAAATATGAAGAGCAGTTCAAATCTTAGGAGTCTTAAAGGGCTTTTGTCGAAGTTAATGAGGAAGGGCAGGAACTTA
TCAGGCACAAGCAATATGCCACTGTCATCTGCAATTACAGCTCCGCACCAGAAGCTGATGTCAGTTGAAGAAGCTGAAGCCCTTGTGAACCAATGGCAAACGATTAAAGC
TGAAGCTTTGGGACCTAACTATCAATTTCATAGACTTGCTGAAATTCTTGATGGAACAATGCTTTTCCAGTGGCAAGCTCTAGCTGATGCTGCAAAAGCTAAATCATGCT
ATTGGAAATTTGTTTTGCTGCAATTGTCTGTCCTACGAGCTGAACTTTTGTCAGATAAATTTGGAGCAACGACATTAGAAATTGAGGTTCATCTAGAGGAAGCAGCTGAG
CTTGTCAATGAAGCTGAACCAAAGCACCCAAGCTATTATAGCAATTATAAAGTTCGTTATGTGGTAAAGAGGCAACAGGATGGTTCTTGGAAGTTCTGTGAAGGAGATAT
ACTAGTACCAACTCAGATCTTGAGCCTTCGATGGATTCCTCAAGCACAGAAATCCAAACAGTTGGTGATCTTGGAAATCGTCAATCCGGGTAACAAAATCTCTACTGTTA
AGCTTAACAATGACAATTTCCTTATGTGGAAAGTTCAGATTGAATTTGCCCTAGAGGGCCACAGTCTTGGACAATTCATTGCGAAAGATTGCGATCCACCGTCCGAGAAA
ATACCTGTAAGTGAAGGCTCGACTATTACAAAAATTAATCCTGAATTTGTTAAGTGGAAGAAACAGGATAGTCTTATTTCGTCTTGGCTTCTTGGATCCATGTCTGAAAG
TATACTAGAACAAGTAATACACTGCAAAACTACAAAAGAGATTTGGAGTTGTTTGTTTCAGATTTTTAATTCAAGAAATAGAGCTCAAATAATGCGGATGAAATCAAAAT
TACAATCTCTTCAGAAAGGATCTTTATCCCTAAATGACTATTTCTCACAAGTTAAGAAGTGTGTCGATGCACTAGCGGCTGTAGGCAAAGAAATTGATTCAGAAGACCAC
ATCATGTATATTCTAAACGGTTTAGGGCCTGAGTATGAATCTATGGTATCTGCACTAACCACCTCTACAGATCAACAAAGCGTCCAGGATGTTATGGCTTACCTTTTCAC
TCATGAAAATCGAATTGAAAGTAAGTTGAAAGCTGTCAATGCAGATGGAACACAACCTATGGCAAATCTAATGACTCAGAACCATCACAGTGTAGAAAACGACCCTCAGA
AAAATAACAATTCATACGAGGGAAATCATGGCTTCAATTCAAGAGCCAGAGGTCGTTCGAATAGAGGTGGTGGTCGTTGGAATAATAGGAACAAACCACAGTGCCAAATT
TGTTTCAAGTTTGGTCACACTGCCATAAAGTGTTACTCACTCAATGGCCGTTTTGATCAAAACCGTGGATCTTTTCCTCTAAATAACTCACCGATTACAACCATGCTGAC
AGCTACTGATCTCAATCAAGATAATGTCTGGTATCCAGACTCTGGGGCTACGAATCACCTAACCAACAACTTCAATAATCTTGTTGTTGGTGCTGAATACTCGGGAGGCA
GTCAGATGCAAGTTGGGAATGGTACAGGTCTTCCTATATCTCATTGTGGCTATACTTCATTTTCTTCACCTAATCGTATCTTTCACCTAAATAATCTTCTCCATGTGCCT
CACATAACTAAAAACCTAATAAGTATCAACCAATTTGCTAAAGATAACTTAGTTTACTTTGAATTTCATCCTAATTTTTGTTATGTGAAGGACTCCCTTACTGGCCAAAC
ACTCCTTCAAGGACCACTCCATGATGGGTTGTACCGGTTCGATATATCACCACAATCATCCTCCTCATTGGCGAAAAGTAGCTCCAAACCCCAGTGTTTGTTATCTCATT
TCTCTCATATATCCTCTGTTAATACCAATGCTTCAAATTCTATGTTGGATGTTTGGCATAGGAGATTAGGACATCCTGCCTTTTCCATTGTTAAGTATGTTGCTAAAAGT
TGTAGTCCTGCCTTATCATTGAATAAACATGTTGCATTTTGTAATGCTTGTGCTGTTGGGAAGAGTCACGCTTTACCCTTTCCTATATCTACTACCCAGTACACATCTCC
TTTACAGCTCATTGTTGCTGATGTTTGCGGGCCCTCTTACACTATGTCTAGAAACAGATTTAGATACTACATAAGCTTTGTTGATGTCTTTTCCCGATACACCTGGGTAT
ATTTTTTACAAACCAAAGTTGAAGCCCTCCAAACATTCTTATCATTTAAAACTTCAGTTGAGAGACTTTTAGGTTGCCGTATTCTACGTTTTCAATCTAATGGGGGAGGA
GAGTTCAACTTTTTCACACCTCTTCTCCACAAACTGAGCATAGAACATCGTTTTTCCTGCCCCTACACCTCTCAACAAAATGGTATAGTAGAGCGCAAGCATAGGCAAAT
TGTGGATGTGGGCCTTACTCTCCTATCACAAGCTTCTATGCCACTGACCTACTGGGATGATGCCTTCAGTACAGCAACCTATCTCATTAACCGTCTACCCTCAACAACCC
TTAATGGACCTATTCATCCTACTGCCTCTCCTACTTCTAAGATATCTCAATCTCTTCCCTTGGTATCTTCCCCTGTTACTTCAACACCTTCCTCTTCAGACTTATCTCCT
ACGGTTGCACCAATTCTTTCTCCTTTGTCTTTCAATTCTGATCCTGTACCATCTCCCGTGGCATCTGAATCCTCTTCTACAGCTGTCTCCTCCGAGTCTATTACTCAAGC
CTCTGGTGATAATCAAACACATACTGCTCCTACGATTGAGAATGTTGCATCTGCTCTGGCAGGGAAATTTAATCACTTCTAA
Protein sequenceShow/hide protein sequence
MYVQSNCKDAAVRQVEEGLAVHHHRIPGPRCKQFCGWPHEKFNLLGEMFKNLLWHGSKGKGGMHLYGIKYNFHPRRGDLLGFPSLQIGVPFAESVKNPIRIFSSLVHMLV
PFGRPFSLLSTGPLHYRDLLMDVRDKLLFEPHYAGNMKENIPPKSSIRIPWAWLPGALCLLQEVGEAKMVLDIGQKVVQYPMAKPYIHDILLSMVLAECAIAKIGFEKNM
VSQGFEALARAQYLLRSQTSLGKLKLLSQIEESLEELAPACTLELLGMTSLSTNTERRGGAIAALRELLRQGLDVETFCQVQDWPCFLSQALGRLMAAEMVDLLPWDELA
LIRKNKKSIESQNQRVVVDFNCFYMAFKAHLALGFSSRQTELIEKAKTICECLIASEGVDLKLEEAFCVFLLGQCSDSEVFEKLQQSTLNSKPAMPTRLSNSGMEKKNAE
NTYQSLEIWLKDAVLGVFKDTRDCSLTLVRFFHSEKKMDARKKINQQQSIIHPNNRPIASSSVSEWRDVEDSFPNLNSSQNLGNIVRRLAPTNLPSQLGTGKKTSDGNSS
SVQLKRDLRINKWKITELWLARGSLVKNMKILVLVGCISFACFKLTSTMIKIKFVPTWAPHKASLNTSSLFGDEDLSADDVIAPPNMKSSSNLRSLKGLLSKLMRKGRNL
SGTSNMPLSSAITAPHQKLMSVEEAEALVNQWQTIKAEALGPNYQFHRLAEILDGTMLFQWQALADAAKAKSCYWKFVLLQLSVLRAELLSDKFGATTLEIEVHLEEAAE
LVNEAEPKHPSYYSNYKVRYVVKRQQDGSWKFCEGDILVPTQILSLRWIPQAQKSKQLVILEIVNPGNKISTVKLNNDNFLMWKVQIEFALEGHSLGQFIAKDCDPPSEK
IPVSEGSTITKINPEFVKWKKQDSLISSWLLGSMSESILEQVIHCKTTKEIWSCLFQIFNSRNRAQIMRMKSKLQSLQKGSLSLNDYFSQVKKCVDALAAVGKEIDSEDH
IMYILNGLGPEYESMVSALTTSTDQQSVQDVMAYLFTHENRIESKLKAVNADGTQPMANLMTQNHHSVENDPQKNNNSYEGNHGFNSRARGRSNRGGGRWNNRNKPQCQI
CFKFGHTAIKCYSLNGRFDQNRGSFPLNNSPITTMLTATDLNQDNVWYPDSGATNHLTNNFNNLVVGAEYSGGSQMQVGNGTGLPISHCGYTSFSSPNRIFHLNNLLHVP
HITKNLISINQFAKDNLVYFEFHPNFCYVKDSLTGQTLLQGPLHDGLYRFDISPQSSSSLAKSSSKPQCLLSHFSHISSVNTNASNSMLDVWHRRLGHPAFSIVKYVAKS
CSPALSLNKHVAFCNACAVGKSHALPFPISTTQYTSPLQLIVADVCGPSYTMSRNRFRYYISFVDVFSRYTWVYFLQTKVEALQTFLSFKTSVERLLGCRILRFQSNGGG
EFNFFTPLLHKLSIEHRFSCPYTSQQNGIVERKHRQIVDVGLTLLSQASMPLTYWDDAFSTATYLINRLPSTTLNGPIHPTASPTSKISQSLPLVSSPVTSTPSSSDLSP
TVAPILSPLSFNSDPVPSPVASESSSTAVSSESITQASGDNQTHTAPTIENVASALAGKFNHF