; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G00530 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G00530
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionIntegrase catalytic domain-containing protein
Genome locationClcChr08:993191..994540
RNA-Seq ExpressionClc08G00530
SyntenyClc08G00530
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN72141.1 hypothetical protein VITISV_017108 [Vitis vinifera]7.9e-12651.12Show/hide
Query:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP
        MIE QFQTKI IL SDNGT++FN+ L TF + KGI+HQ++  DTPQQNG+A+RKN+HLLE+ARA+MF M++PKYLWGDA+LT +YLINRMP K+L + TP
Subjt:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP

Query:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWD
        L+ LK+ FP  R+  ELPLK+FGCT YVH    S+ KLDPRA KCVFVGY P KK YKCF+ LT +++ +MDVSF+EN  +F+   LQ E  L+E NFW+
Subjt:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWD

Query:  -TSPLPNII-----------------SPEIMSSSPSISSMENFSTGGETL-------------------QTDPTVNGPENSGMSLSPSS-----HNTLSN
           P P++I                   EI  S   I  M+      E++                      P   G  +  +S +P S     H + S+
Subjt:  -TSPLPNII-----------------SPEIMSSSPSISSMENFSTGGETL-------------------QTDPTVNGPENSGMSLSPSS-----HNTLSN

Query:  VS--------------------------------DLDIPIAQRKGTRQCTKYPIANYLSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEM
        V+                                DLD+PIA RKGT+ CTK+ IA Y+SY  LSDNH+AFT+ I+ L VP+NIQEAL++ +WKLAV +EM
Subjt:  VS--------------------------------DLDIPIAQRKGTRQCTKYPIANYLSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEM

Query:  NALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRILLFVAVNLDWPLYQLDI
        NALK++G W+ VDLP +KK VGCK VFTIK   DGS+ERYKA+LVAKGFTQTYGI YQETFA VAKINSIR+LL + VN +WPL+QLD+
Subjt:  NALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRILLFVAVNLDWPLYQLDI

RVX10668.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]8.8e-11751.2Show/hide
Query:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP
        M++TQFQ KI++L +DN  E+++  L ++L + GI+HQ++  DTPQQNGVA+RKNRHL+E+AR+LM + +VPK LWG+A LT  YLINRMP ++L FKTP
Subjt:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP

Query:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDES-SLLEENFW
         Q L   +P+ R+ S +P+KVFGCTA+VH     +SKLDP A KC+F+GY P +K YKC+   T K++ SMDV+F ENQ F+  T++Q E+ S  E  FW
Subjt:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDES-SLLEENFW

Query:  DTSPLPNIISPEIMSSSPSISSMENFSTGGETLQTDPTVNGPENSGMSLSPSSHNTLSNVSDLDIPIAQRKGTRQCTKYPIANYLSYHRLSDNHKAFTSK
        +T         EI ++SP  SS+          QTD T++ PEN+ + +   + N  ++  DLD PIA RKG R CT++PI N++SY +LS N +AF + 
Subjt:  DTSPLPNIISPEIMSSSPSISSMENFSTGGETLQTDPTVNGPENSGMSLSPSSHNTLSNVSDLDIPIAQRKGTRQCTKYPIANYLSYHRLSDNHKAFTSK

Query:  ITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRIL
        + +  +P NIQEAL    WK  V EE+ AL+++G W+I +LPE K+ +GCK +FT+K N+DGSI R+KA+LVAKGFTQ+Y I Y+ETFA VAK+NSIR+L
Subjt:  ITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRIL

Query:  LFVAVNLDWPLYQLDI
        L VA+NLDW L+QLD+
Subjt:  LFVAVNLDWPLYQLDI

XP_024044151.1 uncharacterized protein LOC18046468 isoform X1 [Citrus clementina]1.3e-12050.57Show/hide
Query:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP
        MI+TQFQ KI++  +DNG E+F   L  +  + GI+HQ++  DTPQQNGVA+RKNRHLLE+AR+LMF+  VPK  WG+A+LT +YLINRMP ++ NF++P
Subjt:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP

Query:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWD
        L    + +P  ++F+ LP K+FGC A+VH    ++SKLDPRA+KCVF+GY P +K YKC+D L+NK+F +MDV+F EN+SFF  TSLQ E    E++FW+
Subjt:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWD

Query:  TS-PLPNIIS--PEIMSSSPSI----------------------SSMENFSTGGETLQTDPTVNGPENSG-MSLSPSSHNTLSNVSDLDIPIAQRKGTRQ
         S P+P I+S  P + S+ PSI                      S   N  +      +DP     E +G +  +P S + L   +DLD+PIAQRKGTR 
Subjt:  TS-PLPNIIS--PEIMSSSPSI----------------------SSMENFSTGGETLQTDPTVNGPENSG-MSLSPSSHNTLSNVSDLDIPIAQRKGTRQ

Query:  CTKYPIANYLSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKG
        CT +PI+ Y+SYHRLS   +AFT+ ++ + +PK++Q+AL+   W+ AV  EM AL+++  W++V LPE+KK VGCK +FT+K   DGS+ERYKA+LVAKG
Subjt:  CTKYPIANYLSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKG

Query:  FTQTYGIGYQETFASVAKINSIRILLFVAVNLDWPLYQLDI
        FTQTYGI YQETFA VAK+NSIR+LL +A +L W L QLD+
Subjt:  FTQTYGIGYQETFASVAKINSIRILLFVAVNLDWPLYQLDI

XP_024044152.1 uncharacterized protein LOC18046468 isoform X2 [Citrus clementina]1.3e-12050.57Show/hide
Query:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP
        MI+TQFQ KI++  +DNG E+F   L  +  + GI+HQ++  DTPQQNGVA+RKNRHLLE+AR+LMF+  VPK  WG+A+LT +YLINRMP ++ NF++P
Subjt:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP

Query:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWD
        L    + +P  ++F+ LP K+FGC A+VH    ++SKLDPRA+KCVF+GY P +K YKC+D L+NK+F +MDV+F EN+SFF  TSLQ E    E++FW+
Subjt:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWD

Query:  TS-PLPNIIS--PEIMSSSPSI----------------------SSMENFSTGGETLQTDPTVNGPENSG-MSLSPSSHNTLSNVSDLDIPIAQRKGTRQ
         S P+P I+S  P + S+ PSI                      S   N  +      +DP     E +G +  +P S + L   +DLD+PIAQRKGTR 
Subjt:  TS-PLPNIIS--PEIMSSSPSI----------------------SSMENFSTGGETLQTDPTVNGPENSG-MSLSPSSHNTLSNVSDLDIPIAQRKGTRQ

Query:  CTKYPIANYLSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKG
        CT +PI+ Y+SYHRLS   +AFT+ ++ + +PK++Q+AL+   W+ AV  EM AL+++  W++V LPE+KK VGCK +FT+K   DGS+ERYKA+LVAKG
Subjt:  CTKYPIANYLSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKG

Query:  FTQTYGIGYQETFASVAKINSIRILLFVAVNLDWPLYQLDI
        FTQTYGI YQETFA VAK+NSIR+LL +A +L W L QLD+
Subjt:  FTQTYGIGYQETFASVAKINSIRILLFVAVNLDWPLYQLDI

XP_024044153.1 uncharacterized protein LOC18046468 isoform X3 [Citrus clementina]1.3e-12050.57Show/hide
Query:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP
        MI+TQFQ KI++  +DNG E+F   L  +  + GI+HQ++  DTPQQNGVA+RKNRHLLE+AR+LMF+  VPK  WG+A+LT +YLINRMP ++ NF++P
Subjt:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP

Query:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWD
        L    + +P  ++F+ LP K+FGC A+VH    ++SKLDPRA+KCVF+GY P +K YKC+D L+NK+F +MDV+F EN+SFF  TSLQ E    E++FW+
Subjt:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWD

Query:  TS-PLPNIIS--PEIMSSSPSI----------------------SSMENFSTGGETLQTDPTVNGPENSG-MSLSPSSHNTLSNVSDLDIPIAQRKGTRQ
         S P+P I+S  P + S+ PSI                      S   N  +      +DP     E +G +  +P S + L   +DLD+PIAQRKGTR 
Subjt:  TS-PLPNIIS--PEIMSSSPSI----------------------SSMENFSTGGETLQTDPTVNGPENSG-MSLSPSSHNTLSNVSDLDIPIAQRKGTRQ

Query:  CTKYPIANYLSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKG
        CT +PI+ Y+SYHRLS   +AFT+ ++ + +PK++Q+AL+   W+ AV  EM AL+++  W++V LPE+KK VGCK +FT+K   DGS+ERYKA+LVAKG
Subjt:  CTKYPIANYLSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKG

Query:  FTQTYGIGYQETFASVAKINSIRILLFVAVNLDWPLYQLDI
        FTQTYGI YQETFA VAK+NSIR+LL +A +L W L QLD+
Subjt:  FTQTYGIGYQETFASVAKINSIRILLFVAVNLDWPLYQLDI

TrEMBL top hitse value%identityAlignment
A0A438F419 Retrovirus-related Pol polyprotein from transposon TNT 1-945.6e-11750.93Show/hide
Query:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP
        MI+TQFQ+KI+IL SDN  ++FN  L  FL  +GI+H ++  DTPQQNG+A+RKNRHLLE+AR+LMFSM+VPK  WG AVLT AYLINRMP +VL F+TP
Subjt:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP

Query:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWD
         Q L + FPT RL S +P K+FGC+ +VH     +SKLDPR++KC+F+GY   +K YKC+  +T K++ SMDV+F E Q ++    +Q E+S  E  FWD
Subjt:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWD

Query:  -----TSPL--PNIISPEIMSSSPSISSM-------ENFSTGGETLQTDPTVNGPENSGMSLSPSSHNTLSNVSDLDI---PIAQRKGTRQCTKYPIANY
              SP+   N I PE  +   SI  +       E       + QT     GP  S +  + +   T+ +  + DI   PIA RKG R CT++PI N+
Subjt:  -----TSPL--PNIISPEIMSSSPSISSM-------ENFSTGGETLQTDPTVNGPENSGMSLSPSSHNTLSNVSDLDI---PIAQRKGTRQCTKYPIANY

Query:  LSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGY
        +SY +LS   +AFTS IT + VP+NI EA     WK AV EE+ AL+++G W+I DLP DKK VGCK +FT+K   DG+++RYKA+LVAKGFTQ+YGI Y
Subjt:  LSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGY

Query:  QETFASVAKINSIRILLFVAVNLDWPLYQLDI
        QETFA VAK+N++R+LL +A NLDW L+QLD+
Subjt:  QETFASVAKINSIRILLFVAVNLDWPLYQLDI

A0A438JNX2 Retrovirus-related Pol polyprotein from transposon TNT 1-944.3e-11751.2Show/hide
Query:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP
        M++TQFQ KI++L +DN  E+++  L ++L + GI+HQ++  DTPQQNGVA+RKNRHL+E+AR+LM + +VPK LWG+A LT  YLINRMP ++L FKTP
Subjt:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP

Query:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDES-SLLEENFW
         Q L   +P+ R+ S +P+KVFGCTA+VH     +SKLDP A KC+F+GY P +K YKC+   T K++ SMDV+F ENQ F+  T++Q E+ S  E  FW
Subjt:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDES-SLLEENFW

Query:  DTSPLPNIISPEIMSSSPSISSMENFSTGGETLQTDPTVNGPENSGMSLSPSSHNTLSNVSDLDIPIAQRKGTRQCTKYPIANYLSYHRLSDNHKAFTSK
        +T         EI ++SP  SS+          QTD T++ PEN+ + +   + N  ++  DLD PIA RKG R CT++PI N++SY +LS N +AF + 
Subjt:  DTSPLPNIISPEIMSSSPSISSMENFSTGGETLQTDPTVNGPENSGMSLSPSSHNTLSNVSDLDIPIAQRKGTRQCTKYPIANYLSYHRLSDNHKAFTSK

Query:  ITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRIL
        + +  +P NIQEAL    WK  V EE+ AL+++G W+I +LPE K+ +GCK +FT+K N+DGSI R+KA+LVAKGFTQ+Y I Y+ETFA VAK+NSIR+L
Subjt:  ITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRIL

Query:  LFVAVNLDWPLYQLDI
        L VA+NLDW L+QLD+
Subjt:  LFVAVNLDWPLYQLDI

A0A438JUV3 Retrovirus-related Pol polyprotein from transposon TNT 1-944.7e-11650Show/hide
Query:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP
        MI+TQFQ+KI+IL SDN  ++FN  L  FL  +GI+H ++  DTPQQNG+A+RKNRHLLE+AR+LMFSM+VPK  WG AVLT AYLINRMP +VL F+TP
Subjt:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP

Query:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWD
         Q L + FPT RL S +P K+FGC+ +VH     +SKLDPR++KC+F+GY   +K YKC+  +T K++ SMDV+F E Q ++    +Q E+S  E  FWD
Subjt:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWD

Query:  -----TSPL--PNIISPEIMSSSPSISSM----------ENFSTGGETLQTDPTVNGPENSGMSLSPSSHNTLSNVSDLDIPIAQRKGTRQCTKYPIANY
              SP+   N I PE  +   SI  +          E  +   +T + +P  N  +  G +    + ++      L++PIA RKG R CT++PI N+
Subjt:  -----TSPL--PNIISPEIMSSSPSISSM----------ENFSTGGETLQTDPTVNGPENSGMSLSPSSHNTLSNVSDLDIPIAQRKGTRQCTKYPIANY

Query:  LSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGY
        +SY +LS   +AFTS IT + VP+NI EA     WK AV EE+ AL+++G W+I DLP  KK VGCK +FT+K   DG+++RYKA+LVAKGFTQ+YGI Y
Subjt:  LSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGY

Query:  QETFASVAKINSIRILLFVAVNLDWPLYQLDI
        QETFA VAK+N++R+LL +A NLDW L+QLD+
Subjt:  QETFASVAKINSIRILLFVAVNLDWPLYQLDI

A5AGT0 Integrase catalytic domain-containing protein1.4e-11550Show/hide
Query:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP
        MI+TQFQ+KI+IL SDN  ++FN  L  FL  +GI+H ++  DTPQQNG+A+RKNRHLLE+AR+LMFSM+VPK  WG AVLT AYLINRM  +VL F+TP
Subjt:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP

Query:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWD
         Q L + FPT RL S +P K+FGC+ +VH     +SKLDPR++KC+F+GY   +K YKC+  +T K++ SMDV+F E Q ++    +Q E+S  E  FWD
Subjt:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWD

Query:  -----TSPL--PNIISPEIMSSSPSISSM----------ENFSTGGETLQTDPTVNGPENSGMSLSPSSHNTLSNVSDLDIPIAQRKGTRQCTKYPIANY
              SP+   N I PE  +   SI  +          E  +   +T +  P  N  +  G +    + ++      L++PIA RKG R CT++PI N+
Subjt:  -----TSPL--PNIISPEIMSSSPSISSM----------ENFSTGGETLQTDPTVNGPENSGMSLSPSSHNTLSNVSDLDIPIAQRKGTRQCTKYPIANY

Query:  LSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGY
        +SY +LS   +AFTS IT + VP+NIQEA     WK AV EE+ AL+++G W+I DLP  KK VGCK +FT+K   DG+++RYKA+LVAKGFTQ+YGI Y
Subjt:  LSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGY

Query:  QETFASVAKINSIRILLFVAVNLDWPLYQLDI
        QETFA VAK+N++R+LL +A NLDW L+QLD+
Subjt:  QETFASVAKINSIRILLFVAVNLDWPLYQLDI

A5B9Y8 Integrase catalytic domain-containing protein3.8e-12651.12Show/hide
Query:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP
        MIE QFQTKI IL SDNGT++FN+ L TF + KGI+HQ++  DTPQQNG+A+RKN+HLLE+ARA+MF M++PKYLWGDA+LT +YLINRMP K+L + TP
Subjt:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP

Query:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWD
        L+ LK+ FP  R+  ELPLK+FGCT YVH    S+ KLDPRA KCVFVGY P KK YKCF+ LT +++ +MDVSF+EN  +F+   LQ E  L+E NFW+
Subjt:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWD

Query:  -TSPLPNII-----------------SPEIMSSSPSISSMENFSTGGETL-------------------QTDPTVNGPENSGMSLSPSS-----HNTLSN
           P P++I                   EI  S   I  M+      E++                      P   G  +  +S +P S     H + S+
Subjt:  -TSPLPNII-----------------SPEIMSSSPSISSMENFSTGGETL-------------------QTDPTVNGPENSGMSLSPSS-----HNTLSN

Query:  VS--------------------------------DLDIPIAQRKGTRQCTKYPIANYLSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEM
        V+                                DLD+PIA RKGT+ CTK+ IA Y+SY  LSDNH+AFT+ I+ L VP+NIQEAL++ +WKLAV +EM
Subjt:  VS--------------------------------DLDIPIAQRKGTRQCTKYPIANYLSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEM

Query:  NALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRILLFVAVNLDWPLYQLDI
        NALK++G W+ VDLP +KK VGCK VFTIK   DGS+ERYKA+LVAKGFTQTYGI YQETFA VAKINSIR+LL + VN +WPL+QLD+
Subjt:  NALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRILLFVAVNLDWPLYQLDI

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.3e-4129.77Show/hide
Query:  ETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVL--NFKTP
        E  F  K+  L+ DNG E+ +  +  F   KGI +  T   TPQ NGV++R  R + E AR ++    + K  WG+AVLT  YLINR+P + L  + KTP
Subjt:  ETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVL--NFKTP

Query:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSF------FSLTSLQDESSLL
         +      P ++      L+VFG T YVH     Q K D ++ K +FVGY P    +K +D++  K+  + DV   E          F    L+D     
Subjt:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSF------FSLTSLQDESSLL

Query:  EENFWDTS------PLPN----------------------------IISPEIMSSS----------PSISSMENFSTGGETLQTDPTVNGPENSGMSLSP
         +NF + S        PN                            II  E  + S           S  S + F    +  + D  +N  + SG     
Subjt:  EENFWDTS------PLPN----------------------------IISPEIMSSS----------PSISSMENFSTGGETLQTDPTVNGPENSGMSLSP

Query:  SSHNTLSNVSDL---------DIPIAQRKGTRQCTKYPIA---NYLSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIV
            T  ++ ++          I I  R+  R  TK  I+      S +++  N     + + N F    IQ   + S+W+ A+  E+NA K +  W I 
Subjt:  SSHNTLSNVSDL---------DIPIAQRKGTRQCTKYPIA---NYLSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIV

Query:  DLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRILLFVAVNLDWPLYQLDI
          PE+K  V  + VF++K N  G+  RYKA+LVA+GFTQ Y I Y+ETFA VA+I+S R +L + +  +  ++Q+D+
Subjt:  DLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRILLFVAVNLDWPLYQLDI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.0e-4830.62Show/hide
Query:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP
        ++E +   K++ L SDNG E+ +     +    GI H+ T   TPQ NGVA+R NR ++E  R+++    +PK  WG+AV T  YLINR P   L F+ P
Subjt:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP

Query:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWD
             E   T +  S   LKVFGC A+ H     ++KLD ++I C+F+GY   +  Y+ +D +  K   S DV F E++    + +  D S  ++     
Subjt:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWD

Query:  TSPLPNIISPEIMSSSPSISSMENFSTGGETLQTDPTVNGPENSGMSLSPSSHNTLSNVSDLDIPIAQRKGTRQCTKYPIANYLSYHRLSDNHKAFTSKI
           +PN ++    S++P+ +         +  Q    +   E     +    H T        +  ++R    +  +YP   Y+    +SD+ +      
Subjt:  TSPLPNIISPEIMSSSPSISSMENFSTGGETLQTDPTVNGPENSGMSLSPSSHNTLSNVSDLDIPIAQRKGTRQCTKYPIANYLSYHRLSDNHKAFTSKI

Query:  TNLFVPKNIQEALN---DSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIR
             P++++E L+    +    A+ EEM +L+++G + +V+LP+ K+ + CK VF +K + D  + RYKA+LV KGF Q  GI + E F+ V K+ SIR
Subjt:  TNLFVPKNIQEALN---DSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIR

Query:  ILLFVAVNLDWPLYQLDI
         +L +A +LD  + QLD+
Subjt:  ILLFVAVNLDWPLYQLDI

P92520 Uncharacterized mitochondrial protein AtMg008201.2e-1542.16Show/hide
Query:  PKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRILLFVAVN
        PK++  AL D  W  A+ EE++AL ++  W +V  P ++  +GCK VF  K + DG+++R KA+LVAKGF Q  GI + ET++ V +  +IR +L VA  
Subjt:  PKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRILLFVAVN

Query:  LD
        L+
Subjt:  LD

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.2e-4529.52Show/hide
Query:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP
        ++E +FQT+I   +SDNG EF    L  +    GI H  +   TP+ NG+++RK+RH++E    L+    +PK  W  A     YLINR+P  +L  ++P
Subjt:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP

Query:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVEN----QSFFSLTSLQDESSLLEE
         Q L   F T   + +  L+VFGC  Y      +Q KLD ++ +CVF+GY   + AY C    T++ + S  V F EN     ++ +  S   E      
Subjt:  LQHLKEFFPTVRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVEN----QSFFSLTSLQDESSLLEE

Query:  NFWD----------TSPLPNIISPEIMSSSPS----------ISSMENFSTGGETLQTDPTVNGPENSG----------MSLSPSSHNTLSN--------
          W             P P+   P   ++ PS          +SS    S+   +  + P    P  +G           + + SS NT  N        
Subjt:  NFWD----------TSPLPNIISPEIMSSSPS----------ISSMENFSTGGETLQTDPTVNGPENSG----------MSLSPSSHNTLSN--------

Query:  --VSDLDIPIAQRKGTRQCTKY-------------------PIANY--------LSYHRLSDNHKAFTSKITNLF----------VPKNIQEALNDSNWK
             L  P      +   T                     P+A          L+ H +    KA   K    +           P+   +AL D  W+
Subjt:  --VSDLDIPIAQRKGTRQCTKY-------------------PIANY--------LSYHRLSDNHKAFTSKITNLF----------VPKNIQEALNDSNWK

Query:  LAVMEEMNALKQSGAWDIVDLPEDK-KAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRILLFVAVNLDWPLYQLDIS
         A+  E+NA   +  WD+V  P      VGC+ +FT K N DGS+ RYKA+LVAKG+ Q  G+ Y ETF+ V K  SIRI+L VAV+  WP+ QLD++
Subjt:  LAVMEEMNALKQSGAWDIVDLPEDK-KAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRILLFVAVNLDWPLYQLDIS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.6e-4730.43Show/hide
Query:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP
        ++E +FQT+I  L+SDNG EF    L  +L   GI H  +   TP+ NG+++RK+RH++E+   L+    VPK  W  A     YLINR+P  +L  ++P
Subjt:  MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTP

Query:  LQHLKEFFPTVRLFSELP----LKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSL-----QDES
         Q         +LF + P    LKVFGC  Y      ++ KL+ ++ +C F+GY   + AY C    T + + S  V F E    FS T+      Q++ 
Subjt:  LQHLKEFFPTVRLFSELP----LKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSL-----QDES

Query:  SLLEENF----------------------WDTSPLP---------------NIISPEIMSSSPSISSMENFSTGGETLQTDPTVNGPENSGM-------S
        S    N+                       DTSP P               N+ S  I S S S  +  + +    T Q   T N   NS +       S
Subjt:  SLLEENF----------------------WDTSPLP---------------NIISPEIMSSSPSISSMENFSTGGETLQTDPTVNGPENSGM-------S

Query:  LSPSSHNTLS-----------------NVSDLDIPIAQRKGT-------------RQCTKYPIANYLSYHRLSD-----NHK-AFTSKITNLFVPKNIQE
         SP+S N  S                 ++S+ + P +    T             +   + P+  +    R  D     N K ++ + +     P+   +
Subjt:  LSPSSHNTLS-----------------NVSDLDIPIAQRKGT-------------RQCTKYPIANYLSYHRLSD-----NHK-AFTSKITNLFVPKNIQE

Query:  ALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDK-KAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRILLFVAVNLDWPL
        A+ D  W+ A+  E+NA   +  WD+V  P      VGC+ +FT K N DGS+ RYKA+LVAKG+ Q  G+ Y ETF+ V K  SIRI+L VAV+  WP+
Subjt:  ALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDK-KAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRILLFVAVNLDWPL

Query:  YQLDIS
         QLD++
Subjt:  YQLDIS

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.3e-2541.13Show/hide
Query:  TKYPIANYLSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGF
        T + I+ +LSY ++S  + +F   I     P    EA     W  A+ +E+ A++ +  W+I  LP +KK +GCK V+ IK N DG+IERYKA+LVAKG+
Subjt:  TKYPIANYLSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGF

Query:  TQTYGIGYQETFASVAKINSIRILLFVAVNLDWPLYQLDIS
        TQ  GI + ETF+ V K+ S++++L ++   ++ L+QLDIS
Subjt:  TQTYGIGYQETFASVAKINSIRILLFVAVNLDWPLYQLDIS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)8.4e-1742.16Show/hide
Query:  PKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRILLFVAVN
        PK++  AL D  W  A+ EE++AL ++  W +V  P ++  +GCK VF  K + DG+++R KA+LVAKGF Q  GI + ET++ V +  +IR +L VA  
Subjt:  PKNIQEALNDSNWKLAVMEEMNALKQSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRILLFVAVN

Query:  LD
        L+
Subjt:  LD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGAGACTCAATTTCAAACTAAAATTCGCATTCTTCATTCTGATAATGGGACTGAATTTTTTAACGAACCACTAAGCACCTTCTTGCATGATAAGGGCATCATTCA
CCAAGCTACATATTGTGATACCCCTCAACAAAATGGTGTTGCTAAACGGAAAAATCGACACTTGCTTGAAATTGCTCGTGCCCTTATGTTTTCGATGCATGTTCCAAAAT
ATTTGTGGGGGGATGCAGTCCTAACAGATGCTTACCTAATCAATAGAATGCCTATTAAGGTGTTGAATTTTAAAACCCCTCTACAACACCTCAAAGAGTTTTTTCCTACT
GTCCGATTGTTCTCAGAGTTACCTTTAAAAGTTTTTGGGTGTACTGCTTATGTTCATCGAACCCTTCTTTCCCAATCCAAATTGGACCCTCGGGCTATTAAATGTGTTTT
TGTAGGCTATGTTCCTTTTAAAAAGGCCTACAAATGTTTTGACTCCCTAACTAACAAGTATTTTGAGAGTATGGATGTGTCCTTTGTGGAAAATCAATCGTTTTTTAGCC
TAACTTCTCTTCAGGATGAGTCATCTCTACTTGAAGAGAATTTTTGGGACACTTCACCTCTCCCAAACATCATTAGTCCTGAAATTATGAGCTCTAGTCCTTCGATCTCA
AGCATGGAAAATTTTTCAACAGGGGGAGAAACACTACAAACAGATCCAACAGTAAATGGTCCTGAAAATTCGGGTATGTCTCTTAGTCCTTCCTCTCATAATACGTTGTC
TAATGTCTCTGATCTTGATATTCCAATTGCCCAGAGAAAAGGTACCCGCCAATGTACAAAATATCCCATTGCGAACTATCTCTCCTATCATAGATTGTCTGATAATCATA
AAGCTTTTACATCCAAAATAACCAATCTATTTGTTCCAAAGAATATACAGGAAGCTCTAAATGATTCGAATTGGAAATTAGCAGTGATGGAAGAGATGAATGCGCTGAAA
CAAAGTGGTGCTTGGGATATAGTTGATCTACCAGAAGACAAGAAAGCAGTGGGATGTAAGGGGGTTTTCACGATAAAATGTAATGTTGATGGTAGTATCGAAAGGTACAA
GGCCAAACTAGTGGCTAAGGGATTCACCCAGACCTATGGAATTGGTTATCAAGAGACATTTGCCTCTGTAGCTAAAATTAACTCAATTAGAATTTTGCTCTTTGTTGCAG
TTAATTTAGATTGGCCACTGTATCAACTAGATATTTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATTGAGACTCAATTTCAAACTAAAATTCGCATTCTTCATTCTGATAATGGGACTGAATTTTTTAACGAACCACTAAGCACCTTCTTGCATGATAAGGGCATCATTCA
CCAAGCTACATATTGTGATACCCCTCAACAAAATGGTGTTGCTAAACGGAAAAATCGACACTTGCTTGAAATTGCTCGTGCCCTTATGTTTTCGATGCATGTTCCAAAAT
ATTTGTGGGGGGATGCAGTCCTAACAGATGCTTACCTAATCAATAGAATGCCTATTAAGGTGTTGAATTTTAAAACCCCTCTACAACACCTCAAAGAGTTTTTTCCTACT
GTCCGATTGTTCTCAGAGTTACCTTTAAAAGTTTTTGGGTGTACTGCTTATGTTCATCGAACCCTTCTTTCCCAATCCAAATTGGACCCTCGGGCTATTAAATGTGTTTT
TGTAGGCTATGTTCCTTTTAAAAAGGCCTACAAATGTTTTGACTCCCTAACTAACAAGTATTTTGAGAGTATGGATGTGTCCTTTGTGGAAAATCAATCGTTTTTTAGCC
TAACTTCTCTTCAGGATGAGTCATCTCTACTTGAAGAGAATTTTTGGGACACTTCACCTCTCCCAAACATCATTAGTCCTGAAATTATGAGCTCTAGTCCTTCGATCTCA
AGCATGGAAAATTTTTCAACAGGGGGAGAAACACTACAAACAGATCCAACAGTAAATGGTCCTGAAAATTCGGGTATGTCTCTTAGTCCTTCCTCTCATAATACGTTGTC
TAATGTCTCTGATCTTGATATTCCAATTGCCCAGAGAAAAGGTACCCGCCAATGTACAAAATATCCCATTGCGAACTATCTCTCCTATCATAGATTGTCTGATAATCATA
AAGCTTTTACATCCAAAATAACCAATCTATTTGTTCCAAAGAATATACAGGAAGCTCTAAATGATTCGAATTGGAAATTAGCAGTGATGGAAGAGATGAATGCGCTGAAA
CAAAGTGGTGCTTGGGATATAGTTGATCTACCAGAAGACAAGAAAGCAGTGGGATGTAAGGGGGTTTTCACGATAAAATGTAATGTTGATGGTAGTATCGAAAGGTACAA
GGCCAAACTAGTGGCTAAGGGATTCACCCAGACCTATGGAATTGGTTATCAAGAGACATTTGCCTCTGTAGCTAAAATTAACTCAATTAGAATTTTGCTCTTTGTTGCAG
TTAATTTAGATTGGCCACTGTATCAACTAGATATTTCTTAA
Protein sequenceShow/hide protein sequence
MIETQFQTKIRILHSDNGTEFFNEPLSTFLHDKGIIHQATYCDTPQQNGVAKRKNRHLLEIARALMFSMHVPKYLWGDAVLTDAYLINRMPIKVLNFKTPLQHLKEFFPT
VRLFSELPLKVFGCTAYVHRTLLSQSKLDPRAIKCVFVGYVPFKKAYKCFDSLTNKYFESMDVSFVENQSFFSLTSLQDESSLLEENFWDTSPLPNIISPEIMSSSPSIS
SMENFSTGGETLQTDPTVNGPENSGMSLSPSSHNTLSNVSDLDIPIAQRKGTRQCTKYPIANYLSYHRLSDNHKAFTSKITNLFVPKNIQEALNDSNWKLAVMEEMNALK
QSGAWDIVDLPEDKKAVGCKGVFTIKCNVDGSIERYKAKLVAKGFTQTYGIGYQETFASVAKINSIRILLFVAVNLDWPLYQLDIS