CuGenDBv2

ClCG05G022270 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G022270
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: tetratricopeptide repeat protein 5-like
Genome location: CG_Chr05:34237038..34254980
RNA-Seq Expression: ClCG05G022270
Synteny: ClCG05G022270
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat
IPR032076 - Tetratricopeptide repeat protein 5, OB fold domain
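The genome location above uses a `sequence:start..end` notation. A minimal sketch of how such a string might be parsed is shown below; the `Location` type and the `parse_location` helper are hypothetical names introduced only for illustration, and the span length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Normalize so that start <= end even if the input lists them reversed.
    if start > end:
        start, end = end, start
    return Location(seqid, start, end)


if __name__ == "__main__":
    loc = parse_location("CG_Chr05:34237038..34254980")
    print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the reported span covers 17,943 positions on CG_Chr05.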


Homology
GenBank top hits (e value, % identity, alignment)
KAA0034605.1 tetratricopeptide repeat protein 5-like isoform X3 [Cucumis melo var. makuwa] | e value: 3.0e-151 | % identity: 77.78
Query:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK
        MCSEA ED FDKATAAVE+LY IRDTFFPVNPDDKTSKLRELSDLALKILDSIPP +       A       + Y      D+   + K   D     VK
Subjt:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK

Query:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
        LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGK            CTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
Subjt:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN

Query:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR
        LGNACLTSFFVTGAWDHSKLLQSLKAYQNA                  VNKYLENY RALSGFEAAALKDPSLSATREVHKMV LLDKLDNMLKAHAK R
Subjt:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR

Query:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY
        +GAS PSSVDA+SSNFSYKR T+DHLSEGLNK VAV GKVLFFIKHDSLAPLYYL CDSNQTCFVLSLYGMRNDT AY
Subjt:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY

KAG6601256.1 Tetratricopeptide repeat protein 5, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.6e-189 | % identity: 70.49
Query:  FHCPSSRRRRPSDHRRAPRCHGGVGDEENQKDIVEANLMVLRARMEDLRKKERRILPPARRGGLEFDNGWRYLSISDDAKFDRDFDCKILKNFALISECL
        F+ PS  RRRPSDHRRAPRCHGGVGDE NQKDIVEANL VL+ARMEDLRKKERRI  P R GG+E DNGWRY S + DAK     D K++KNFALIS CL
Subjt:  FHCPSSRRRRPSDHRRAPRCHGGVGDEENQKDIVEANLMVLRARMEDLRKKERRILPPARRGGLEFDNGWRYLSISDDAKFDRDFDCKILKNFALISECL

Query:  EVVTTVGSAVGLVFVGGSLGICLVSLLLNDLPTQYLPASRLVGVAPRIVAAGRLTEEMCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELS
        E                                 +LPAS LVG APR VA GRLTEEMCSE  E+ FDKATAAVE+LYHIRDTFFPVNPDDK SKLRELS
Subjt:  EVVTTVGSAVGLVFVGGSLGICLVSLLLNDLPTQYLPASRLVGVAPRIVAAGRLTEEMCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELS

Query:  DLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLC
        DLA+KILDSI P +       A       + Y      D+   + K   D     VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLAL+KRPEKKLLC
Subjt:  DLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLC

Query:  QLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYNLGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG-----------
        QLSMLERKMAQG              TENEAKLVEESIQHAKEAVTLDVKDGNSWYNLGNACLTSFFVTGAWDH+KLLQSLKAYQNA             
Subjt:  QLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYNLGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG-----------

Query:  ---VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIRRGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFF
             VNKYLENY+RALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAK RR  S  SSVD VSSN SYKRATVD LSEGLNK+VAVIGKVLFF
Subjt:  ---VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIRRGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFF

Query:  IKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT
        IKHDSLAPLYYL+CDSNQ CFV+SLYGMRNDT
Subjt:  IKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT

KAG7032049.1 Tetratricopeptide repeat protein 5, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.8e-156 | % identity: 73.29
Query:  LPTQYLPASRLVGVAPRIVAAGRLTEEMCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEF
        + TQ+LPAS LVG APR VA GRLTEEMCSE  E+ FDKATAAVE+LYHIRDTFFPVNPDDK SKLRELSDLA+KILDSI P +       A       +
Subjt:  LPTQYLPASRLVGVAPRIVAAGRLTEEMCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEF

Query:  AYCNSSAWDLIILFCK--FDCFKLQVKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVT-----
         Y      D+   + K   D     VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLAL+KRPEKKLLCQLSMLERKMAQGKC A SSSS++VT     
Subjt:  AYCNSSAWDLIILFCK--FDCFKLQVKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVT-----

Query:  -C---------------TENEAKLVEESIQHAKEAVTLDVKDGNSWYNLGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKY
         C               TENEAKLVEESIQHAKEAVTLDVKDGNSWYNLGNACLTSFFVTGAWDH+KLLQSLKAYQNA                  VNKY
Subjt:  -C---------------TENEAKLVEESIQHAKEAVTLDVKDGNSWYNLGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKY

Query:  LENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIRRGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPL
        LENY+RALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAK RR  S  SSVD V +N SYKRATVD LSEGLNK+VAV+GKVLFFIKHDSLAPL
Subjt:  LENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIRRGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPL

Query:  YYLVCDSNQTCFVLSLYGMRNDT
        YYLVCDSNQ CFV+SLYGMRNDT
Subjt:  YYLVCDSNQTCFVLSLYGMRNDT

TYK09157.1 tetratricopeptide repeat protein 5-like isoform X3 [Cucumis melo var. makuwa] | e value: 1.7e-149 | % identity: 77.25
Query:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK
        MCSEA ED FDKATAAVE+LY IRDTFFPVNPDDKTSKLRELSDLALKILDSIPP +       A       + Y      D+   + K   D     VK
Subjt:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK

Query:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
        LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQG              TENEAKLVEESIQHAKEAVTLDVKDGNSWYN
Subjt:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN

Query:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR
        LGNACLTSFFVTGAWDHSKLLQSLKAYQNA                  VNKYLENY RALSGFEAAALKDPSLSATREVHKMV LLDKLDNMLKAHAK R
Subjt:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR

Query:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY
        +GAS PSSVDA+SSNFSYKR T+DHLSEGLNK VAV GKVLFFIKHDSLAPLYYL CDSNQTCFVLSLYGMRNDT AY
Subjt:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY

XP_038891369.1 tetratricopeptide repeat protein 5-like [Benincasa hispida] | e value: 8.2e-149 | % identity: 77.6
Query:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK
        MCSEA ED FDKATAAVE+LYHIRDTFFPVNPDDKTSK RELSDLALKILDSIPP +       A       + Y      D+   + K   D     VK
Subjt:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK

Query:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
        LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNK PEKKLLCQLSMLERKMAQG              TENEAKLVEESIQHAKEAVTLDVKDGNSWYN
Subjt:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN

Query:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR
        LGNACLTSFFVTGAWDHSKLLQSLKAYQNA                  VNKYLENYDRALSGFEAAALKDPSLSA REV KMVNLLDKLDNMLKAH K R
Subjt:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR

Query:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT
        RGASLPSSVDA+SSNFSYKRATVD LSEGLNK VAV GKVLFFIKHDSLAPLYYLVCDSNQTCF+LSLYGMRNDT
Subjt:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BF62 tetratricopeptide repeat protein 5-like isoform X2 | e value: 8.9e-149 | % identity: 77.33
Query:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK
        MCSEA ED FDKATAAVE+LY IRDTFFPVNPDDKTSKLRELSDLALKILDSIPP +       A       + Y      D+   + K   D     VK
Subjt:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK

Query:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
        LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQG              TENEAKLVEESIQHAKEAVTLDVKDGNSWYN
Subjt:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN

Query:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR
        LGNACLTSFFVTGAWDHSKLLQSLKAYQNA                  VNKYLENY RALSGFEAAALKDPSLSATREVHKMV LLDKLDNMLKAHAK R
Subjt:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR

Query:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT
        +GAS PSSVDA+SSNFSYKR T+DHLSEGLNK VAV GKVLFFIKHDSLAPLYYL CDSNQTCFVLSLYGMRNDT
Subjt:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT

A0A1S3BFP3 tetratricopeptide repeat protein 5-like isoform X3 | e value: 4.9e-147 | % identity: 76.52
Query:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK
        MCSEA ED FDKATAAVE+LY IRDTFFPVNPDDKTSKLRELSDLALKILDSIPP +       A       + Y      D+   + K   D     VK
Subjt:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK

Query:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
        LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQG              TENEAKLVEESIQHAKEAVTLDVKDGNSWYN
Subjt:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN

Query:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLK----AH
        LGNACLTSFFVTGAWDHSKLLQSLKAYQNA                  VNKYLENY RALSGFEAAALKDPSLSATREVHKMV LLDKLDNMLK    AH
Subjt:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLK----AH

Query:  AKIRRGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT
        AK R+GAS PSSVDA+SSNFSYKR T+DHLSEGLNK VAV GKVLFFIKHDSLAPLYYL CDSNQTCFVLSLYGMRNDT
Subjt:  AKIRRGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT

A0A1S3BGG9 tetratricopeptide repeat protein 5-like isoform X1 | e value: 4.9e-147 | % identity: 76.52
Query:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK
        MCSEA ED FDKATAAVE+LY IRDTFFPVNPDDKTSKLRELSDLALKILDSIPP +       A       + Y      D+   + K   D     VK
Subjt:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK

Query:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
        LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQG              TENEAKLVEESIQHAKEAVTLDVKDGNSWYN
Subjt:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN

Query:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLK----AH
        LGNACLTSFFVTGAWDHSKLLQSLKAYQNA                  VNKYLENY RALSGFEAAALKDPSLSATREVHKMV LLDKLDNMLK    AH
Subjt:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLK----AH

Query:  AKIRRGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT
        AK R+GAS PSSVDA+SSNFSYKR T+DHLSEGLNK VAV GKVLFFIKHDSLAPLYYL CDSNQTCFVLSLYGMRNDT
Subjt:  AKIRRGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT

A0A5A7SZG5 Tetratricopeptide repeat protein 5-like isoform X3 | e value: 1.5e-151 | % identity: 77.78
Query:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK
        MCSEA ED FDKATAAVE+LY IRDTFFPVNPDDKTSKLRELSDLALKILDSIPP +       A       + Y      D+   + K   D     VK
Subjt:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK

Query:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
        LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGK            CTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
Subjt:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN

Query:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR
        LGNACLTSFFVTGAWDHSKLLQSLKAYQNA                  VNKYLENY RALSGFEAAALKDPSLSATREVHKMV LLDKLDNMLKAHAK R
Subjt:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR

Query:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY
        +GAS PSSVDA+SSNFSYKR T+DHLSEGLNK VAV GKVLFFIKHDSLAPLYYL CDSNQTCFVLSLYGMRNDT AY
Subjt:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY

A0A5D3CAY6 Tetratricopeptide repeat protein 5-like isoform X3 | e value: 8.0e-150 | % identity: 77.25
Query:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK
        MCSEA ED FDKATAAVE+LY IRDTFFPVNPDDKTSKLRELSDLALKILDSIPP +       A       + Y      D+   + K   D     VK
Subjt:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK

Query:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
        LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQG              TENEAKLVEESIQHAKEAVTLDVKDGNSWYN
Subjt:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN

Query:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR
        LGNACLTSFFVTGAWDHSKLLQSLKAYQNA                  VNKYLENY RALSGFEAAALKDPSLSATREVHKMV LLDKLDNMLKAHAK R
Subjt:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR

Query:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY
        +GAS PSSVDA+SSNFSYKR T+DHLSEGLNK VAV GKVLFFIKHDSLAPLYYL CDSNQTCFVLSLYGMRNDT AY
Subjt:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY

SwissProt top hits (e value, % identity, alignment)
Q0P5H9 Tetratricopeptide repeat protein 5 | e value: 4.1e-26 | % identity: 29.84
Query:  VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSW
        VKL P L +AW  LG   WKKGD+++A  CF+ AL     K  L  LSM+ R++                  +  ++ V +S++ AK AV +D+ DG SW
Subjt:  VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSW

Query:  YNLGNACLTSFFVTGAWDHSKL-LQSLKAYQNAVGV---------------LVNKYLENYDRALSGFEAAALKDPSL-SATREVHKMVNLLDKLDNMLKA
        Y LGNA L+ +F TG   + K+  Q+L AY  A  V                ++KY ENY  AL GF  AA  DP+     +   ++++ L +L + L++
Subjt:  YNLGNACLTSFFVTGAWDHSKL-LQSLKAYQNAVGV---------------LVNKYLENYDRALSGFEAAALKDPSL-SATREVHKMVNLLDKLDNMLKA

Query:  HAKIR-----------RGASLPSSVDAVSSNFSYKRATVDH-----LSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAYI
          K++           R A L    D    + S ++ T++      L  G+N    V+GKV+F +  +   P  + + DS+  C+ + +Y M       I
Subjt:  HAKIR-----------RGASLPSSVDAVSSNFSYKRATVDH-----LSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAYI

Query:  KEAFA
         ++ A
Subjt:  KEAFA

Q5BK48 Tetratricopeptide repeat protein 5 | e value: 1.0e-24 | % identity: 29.51
Query:  VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSW
        VKL P L +AW  LG   WKKGD+++A  CF+ AL     K  L  LSM+ R++                  +  ++ V +S++ AK AV +DV DG SW
Subjt:  VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSW

Query:  YNLGNACLTSFFVTGAWDHSKL-LQSLKAYQNAVGV---------------LVNKYLENYDRALSGFEAAALKDPSLSATRE-VHKMVNLLDKLDNMLKA
        Y LGNA L+ +F TG   + K+  Q+L AY  A  V                ++KY E+Y  AL GF  AA  DP+    ++   +++  L +L N+L +
Subjt:  YNLGNACLTSFFVTGAWDHSKL-LQSLKAYQNAVGV---------------LVNKYLENYDRALSGFEAAALKDPSLSATRE-VHKMVNLLDKLDNMLKA

Query:  HAKIR-----------RGASLPSSVD-----AVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAYI
          K +           R A L    D     A     + +   +  L  G+N    V+GKV+F +  +   P  + + DS+  C+ + +Y +       I
Subjt:  HAKIR-----------RGASLPSSVD-----AVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAYI

Query:  KEAFA
         ++ A
Subjt:  KEAFA

Q8N0Z6 Tetratricopeptide repeat protein 5 | e value: 8.3e-27 | % identity: 30.72
Query:  VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENE-AKLVEESIQHAKEAVTLDVKDGNS
        VKL P L +AW  LG   WKKGD+++A  CF+ AL     K  L  LSM+ R++               T TE+E +  V +S++ AK AV +DV DG S
Subjt:  VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENE-AKLVEESIQHAKEAVTLDVKDGNS

Query:  WYNLGNACLTSFFVTGAWDHSKL-LQSLKAYQNAVGV---------------LVNKYLENYDRALSGFEAAALKDPSLSATRE-VHKMVNLLDKLDNMLK
        WY LGN+ L+ +F TG   + K+  Q+L AY  A  V                ++KY E+Y  AL GF  AA  DP+    R+   +++  LD+L ++L+
Subjt:  WYNLGNACLTSFFVTGAWDHSKL-LQSLKAYQNAVGV---------------LVNKYLENYDRALSGFEAAALKDPSLSATRE-VHKMVNLLDKLDNMLK

Query:  AHAKIR-----------RGASLPSSVDAVSSNFSYKRATVD-----HLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY
        +  K++           R A L    D    + S ++ T++      L  G+N    ++GKV+F +  +   P  + + DS+  C+ + +Y +       
Subjt:  AHAKIR-----------RGASLPSSVDAVSSNFSYKRATVD-----HLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY

Query:  IKEAFA
        I ++ A
Subjt:  IKEAFA

Q99LG4 Tetratricopeptide repeat protein 5 | e value: 7.8e-25 | % identity: 29.84
Query:  VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSW
        VKL P L +AW  LG   WKKGD++SA  CF+ AL     K  L  LSM+ R++                  +  ++ V +S++ AK AV +DV DG SW
Subjt:  VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSW

Query:  YNLGNACLTSFFVTGAWDHSKL-LQSLKAYQNAVGV---------------LVNKYLENYDRALSGFEAAALKDPSLSATRE-VHKMVNLLDKLDNMLKA
        Y LGNA L+ +F TG   + K+  Q+L AY  A  V                ++KY E+Y  AL GF  AA  DP     ++   +++  L +L ++L++
Subjt:  YNLGNACLTSFFVTGAWDHSKL-LQSLKAYQNAVGV---------------LVNKYLENYDRALSGFEAAALKDPSLSATRE-VHKMVNLLDKLDNMLKA

Query:  HAKIR-----------RGASLPSSVDAVSSNFSYKRATVD-----HLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAYI
          K +           R A L    D    + S ++ T++      L  G+N    V+GKV+F +  +   P  + + DS+  C+ + +Y +       I
Subjt:  HAKIR-----------RGASLPSSVDAVSSNFSYKRATVD-----HLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAYI

Query:  KEAFA
         ++ A
Subjt:  KEAFA

Arabidopsis top hits (e value, % identity, alignment)
AT2G01100.1 unknown protein | e value: 2.2e-27 | % identity: 42.68
Query:  QFIQLVEEKKKRALEKKEAPLKWEQKLEAAAKAKADAEAKARKIKASKHKRRSISDSDTDSESDSHDGGRKAGKRSHKKHRKHSHSDSGDIEKRKDRKSK
        +FIQ+VEEKK+R LEK+EAPLKWEQKLEAAA AKAD E K ++ K  K K+R+ S+S   SESDS    R+  +RSH KHR+H+HSDS D ++RK++KS+
Subjt:  QFIQLVEEKKKRALEKKEAPLKWEQKLEAAAKAKADAEAKARKIKASKHKRRSISDSDTDSESDSHDGGRKAGKRSHKKHRKHSHSDSGDIEKRKDRKSK

Query:  RKLKRHRSSHDDSSDEFDHSSED--RRKKRNHRRHAHDNSNSDESYSSSSGDDVETTKRSHSRHHRHHRRIDYSSSDDSSSEDDTALRRKKRVRHHRPHH
        R+ +R  S  DDS+ +++  SED  R K ++HRRH        + +SS    D ++T+    +H +HHRR +  +S DS  E      R++R ++HR H+
Subjt:  RKLKRHRSSHDDSSDEFDHSSED--RRKKRNHRRHAHDNSNSDESYSSSSGDDVETTKRSHSRHHRHHRRIDYSSSDDSSSEDDTALRRKKRVRHHRPHH

Query:  RHVQSHRSCSIDSSDRNYWRCDARSQSSGKSSDDNHEESAILESRH
        R   S      DS      R   R + +  SS+   E+ A+  +RH
Subjt:  RHVQSHRSCSIDSSDRNYWRCDARSQSSGKSSDDNHEESAILESRH

AT2G01100.2 unknown protein | e value: 2.2e-27 | % identity: 42.68
Query:  QFIQLVEEKKKRALEKKEAPLKWEQKLEAAAKAKADAEAKARKIKASKHKRRSISDSDTDSESDSHDGGRKAGKRSHKKHRKHSHSDSGDIEKRKDRKSK
        +FIQ+VEEKK+R LEK+EAPLKWEQKLEAAA AKAD E K ++ K  K K+R+ S+S   SESDS    R+  +RSH KHR+H+HSDS D ++RK++KS+
Subjt:  QFIQLVEEKKKRALEKKEAPLKWEQKLEAAAKAKADAEAKARKIKASKHKRRSISDSDTDSESDSHDGGRKAGKRSHKKHRKHSHSDSGDIEKRKDRKSK

Query:  RKLKRHRSSHDDSSDEFDHSSED--RRKKRNHRRHAHDNSNSDESYSSSSGDDVETTKRSHSRHHRHHRRIDYSSSDDSSSEDDTALRRKKRVRHHRPHH
        R+ +R  S  DDS+ +++  SED  R K ++HRRH        + +SS    D ++T+    +H +HHRR +  +S DS  E      R++R ++HR H+
Subjt:  RKLKRHRSSHDDSSDEFDHSSED--RRKKRNHRRHAHDNSNSDESYSSSSGDDVETTKRSHSRHHRHHRRIDYSSSDDSSSEDDTALRRKKRVRHHRPHH

Query:  RHVQSHRSCSIDSSDRNYWRCDARSQSSGKSSDDNHEESAILESRH
        R   S      DS      R   R + +  SS+   E+ A+  +RH
Subjt:  RHVQSHRSCSIDSSDRNYWRCDARSQSSGKSSDDNHEESAILESRH

AT2G01100.3 unknown protein | e value: 2.2e-27 | % identity: 42.68
Query:  QFIQLVEEKKKRALEKKEAPLKWEQKLEAAAKAKADAEAKARKIKASKHKRRSISDSDTDSESDSHDGGRKAGKRSHKKHRKHSHSDSGDIEKRKDRKSK
        +FIQ+VEEKK+R LEK+EAPLKWEQKLEAAA AKAD E K ++ K  K K+R+ S+S   SESDS    R+  +RSH KHR+H+HSDS D ++RK++KS+
Subjt:  QFIQLVEEKKKRALEKKEAPLKWEQKLEAAAKAKADAEAKARKIKASKHKRRSISDSDTDSESDSHDGGRKAGKRSHKKHRKHSHSDSGDIEKRKDRKSK

Query:  RKLKRHRSSHDDSSDEFDHSSED--RRKKRNHRRHAHDNSNSDESYSSSSGDDVETTKRSHSRHHRHHRRIDYSSSDDSSSEDDTALRRKKRVRHHRPHH
        R+ +R  S  DDS+ +++  SED  R K ++HRRH        + +SS    D ++T+    +H +HHRR +  +S DS  E      R++R ++HR H+
Subjt:  RKLKRHRSSHDDSSDEFDHSSED--RRKKRNHRRHAHDNSNSDESYSSSSGDDVETTKRSHSRHHRHHRRIDYSSSDDSSSEDDTALRRKKRVRHHRPHH

Query:  RHVQSHRSCSIDSSDRNYWRCDARSQSSGKSSDDNHEESAILESRH
        R   S      DS      R   R + +  SS+   E+ A+  +RH
Subjt:  RHVQSHRSCSIDSSDRNYWRCDARSQSSGKSSDDNHEESAILESRH


Sequences
CDS sequence
ATGGCTTCCGCTCGTCTCCCGTCGTTCCATTTCCACTGTCCGTCTTCTCGGCGGCGAAGACCATCCGACCACCGTAGAGCTCCGAGATGCCATGGCGGCGTCGGCGATGA
AGAGAACCAGAAGGACATCGTGGAAGCGAATTTGATGGTGTTGAGAGCGAGAATGGAGGATTTGAGGAAGAAGGAGCGGCGGATTTTGCCTCCGGCACGGCGCGGAGGAC
TTGAATTCGATAATGGCTGGAGGTATTTGTCGATAAGCGACGACGCGAAGTTCGATCGCGATTTTGATTGCAAGATTTTGAAGAATTTTGCCCTAATTTCAGAGTGTTTG
GAGGTGGTGACCACGGTTGGCAGTGCAGTCGGGCTTGTTTTTGTTGGTGGATCTTTAGGCATCTGTTTGGTTTCTCTTTTGCTCAACGATCTGCCAACACAGTATTTGCC
TGCCTCCCGTCTGGTGGGCGTTGCTCCGAGAATTGTCGCCGCCGGCCGATTGACGGAGGAGATGTGCAGCGAAGCCATGGAAGACATATTCGACAAAGCCACAGCTGCGG
TCGAGAAGCTATATCACATCAGAGACACCTTTTTTCCTGTAAATCCCGATGACAAAACTTCCAAATTACGGGAGCTATCGGATCTTGCTCTCAAGATCCTCGATTCAATT
CCTCCAGGTCGATTCTGCTTTCATTTCTCTTACGCCTTTCATTTCAGCTTCTCCGAATTTGCTTACTGCAATAGCTCGGCTTGGGATTTAATCATTCTCTTCTGTAAATT
TGACTGTTTCAAGCTCCAGGTTAAGTTGAATCCCTCTCTTGCCGATGCCTGGCTATGTTTAGGCAACTGCATTTGGAAAAAAGGAGATCTATCTTCAGCAAAGAACTGCT
TTACTTTAGCATTAAACAAGCGCCCTGAGAAAAAGTTACTTTGTCAGTTATCGATGCTTGAAAGGAAAATGGCTCAAGGCAAGTGTTGTGCTTCTTCATCATCTTCTTAC
ATTGTAACCTGTACTGAGAATGAGGCAAAACTTGTAGAGGAAAGCATTCAACATGCAAAAGAAGCAGTTACTTTGGATGTGAAGGATGGAAACTCTTGGTATAATCTAGG
AAATGCATGCCTCACAAGTTTTTTTGTTACTGGAGCATGGGATCATAGTAAGCTTCTGCAATCTTTGAAGGCATATCAAAATGCGGTGGGTGTCTTAGTTAACAAATATC
TGGAGAACTATGACAGGGCCCTTAGTGGATTTGAGGCTGCTGCTTTGAAGGATCCTAGTCTTAGTGCCACTAGGGAGGTACACAAGATGGTGAATCTTCTTGACAAATTG
GATAACATGTTGAAGGCACATGCTAAAATACGGAGAGGTGCGTCTTTGCCATCATCAGTGGATGCTGTTAGTTCGAACTTCTCATACAAGAGAGCCACTGTAGACCATCT
GTCAGAAGGTCTGAACAAAGCAGTTGCAGTAATTGGGAAGGTGTTGTTCTTCATTAAGCATGATAGTCTTGCCCCACTATATTATTTGGTGTGTGATTCAAACCAAACAT
GCTTTGTTTTATCGTTGTATGGTATGCGGAATGATACGAACGCGTATATAAAAGAGGCATTTGCTCCCCAATTTGGCCTTCCGCTGGAGCTTTCTGGAAGAGTAGAAGTC
GGAGTCGACTCAAAGTCAAAGAAAAGCCCCAACCACTCTCTTCTTCTTCTAGGGCTCCTTCCTCTCATCCTTCCATCCTCCTTTTTCTATTGCAATCTTGTTTCTCCTCT
CCTTCAGATTCAGTTCATCCAATTGGTTGAGGAGAAGAAGAAGAGGGCTTTGGAGAAGAAGGAAGCCCCATTGAAATGGGAACAGAAACTTGAAGCTGCTGCCAAGGCGA
AGGCTGATGCTGAAGCCAAAGCAAGGAAGATCAAGGCTTCCAAGCATAAAAGAAGATCCATATCAGATTCTGATACTGATTCAGAAAGTGATAGCCATGATGGAGGAAGG
AAAGCAGGTAAGAGAAGTCATAAGAAGCACAGGAAACATAGCCACTCTGATTCGGGTGACATTGAGAAGAGAAAAGATAGAAAATCTAAGAGAAAACTGAAACGACATCG
CTCATCTCATGATGACAGCAGTGACGAATTTGACCACTCTAGTGAAGATAGGAGGAAGAAGAGAAACCATAGAAGACATGCTCATGACAATTCAAATTCGGATGAAAGTT
ACTCTAGTTCTAGTGGTGATGATGTTGAGACGACAAAAAGAAGTCATTCCAGGCATCACAGACATCATAGGCGAATTGACTATTCATCATCTGACGATTCTAGCAGTGAG
GATGATACTGCTTTGCGAAGGAAAAAGCGTGTCAGACACCACAGGCCCCACCATCGCCACGTGCAGTCTCATCGATCATGTAGTATTGATTCATCAGACCGTAATTACTG
GAGGTGTGATGCCAGAAGCCAATCCTCAGGGAAGTCATCTGATGATAATCATGAAGAATCAGCAATATTAGAATCCAGGCACAAGAGTAGCCACCATATCAAACCCCGGC
ACCATCATTCAGGTGCCAATGGTTTGACCCAGATCGATGAAAAACATGCTGACAATGATGCTGATGAAAATGACCATGATCGTGCTAAGGATTCTCACTAA
mRNA sequence
AGCATGGTGCAAAATGGCTTCCGCTCGTCTCCCGTCGTTCCATTTCCACTGTCCGTCTTCTCGGCGGCGAAGACCATCCGACCACCGTAGAGCTCCGAGATGCCATGGCG
GCGTCGGCGATGAAGAGAACCAGAAGGACATCGTGGAAGCGAATTTGATGGTGTTGAGAGCGAGAATGGAGGATTTGAGGAAGAAGGAGCGGCGGATTTTGCCTCCGGCA
CGGCGCGGAGGACTTGAATTCGATAATGGCTGGAGGTATTTGTCGATAAGCGACGACGCGAAGTTCGATCGCGATTTTGATTGCAAGATTTTGAAGAATTTTGCCCTAAT
TTCAGAGTGTTTGGAGGTGGTGACCACGGTTGGCAGTGCAGTCGGGCTTGTTTTTGTTGGTGGATCTTTAGGCATCTGTTTGGTTTCTCTTTTGCTCAACGATCTGCCAA
CACAGTATTTGCCTGCCTCCCGTCTGGTGGGCGTTGCTCCGAGAATTGTCGCCGCCGGCCGATTGACGGAGGAGATGTGCAGCGAAGCCATGGAAGACATATTCGACAAA
GCCACAGCTGCGGTCGAGAAGCTATATCACATCAGAGACACCTTTTTTCCTGTAAATCCCGATGACAAAACTTCCAAATTACGGGAGCTATCGGATCTTGCTCTCAAGAT
CCTCGATTCAATTCCTCCAGGTCGATTCTGCTTTCATTTCTCTTACGCCTTTCATTTCAGCTTCTCCGAATTTGCTTACTGCAATAGCTCGGCTTGGGATTTAATCATTC
TCTTCTGTAAATTTGACTGTTTCAAGCTCCAGGTTAAGTTGAATCCCTCTCTTGCCGATGCCTGGCTATGTTTAGGCAACTGCATTTGGAAAAAAGGAGATCTATCTTCA
GCAAAGAACTGCTTTACTTTAGCATTAAACAAGCGCCCTGAGAAAAAGTTACTTTGTCAGTTATCGATGCTTGAAAGGAAAATGGCTCAAGGCAAGTGTTGTGCTTCTTC
ATCATCTTCTTACATTGTAACCTGTACTGAGAATGAGGCAAAACTTGTAGAGGAAAGCATTCAACATGCAAAAGAAGCAGTTACTTTGGATGTGAAGGATGGAAACTCTT
GGTATAATCTAGGAAATGCATGCCTCACAAGTTTTTTTGTTACTGGAGCATGGGATCATAGTAAGCTTCTGCAATCTTTGAAGGCATATCAAAATGCGGTGGGTGTCTTA
GTTAACAAATATCTGGAGAACTATGACAGGGCCCTTAGTGGATTTGAGGCTGCTGCTTTGAAGGATCCTAGTCTTAGTGCCACTAGGGAGGTACACAAGATGGTGAATCT
TCTTGACAAATTGGATAACATGTTGAAGGCACATGCTAAAATACGGAGAGGTGCGTCTTTGCCATCATCAGTGGATGCTGTTAGTTCGAACTTCTCATACAAGAGAGCCA
CTGTAGACCATCTGTCAGAAGGTCTGAACAAAGCAGTTGCAGTAATTGGGAAGGTGTTGTTCTTCATTAAGCATGATAGTCTTGCCCCACTATATTATTTGGTGTGTGAT
TCAAACCAAACATGCTTTGTTTTATCGTTGTATGGTATGCGGAATGATACGAACGCGTATATAAAAGAGGCATTTGCTCCCCAATTTGGCCTTCCGCTGGAGCTTTCTGG
AAGAGTAGAAGTCGGAGTCGACTCAAAGTCAAAGAAAAGCCCCAACCACTCTCTTCTTCTTCTAGGGCTCCTTCCTCTCATCCTTCCATCCTCCTTTTTCTATTGCAATC
TTGTTTCTCCTCTCCTTCAGATTCAGTTCATCCAATTGGTTGAGGAGAAGAAGAAGAGGGCTTTGGAGAAGAAGGAAGCCCCATTGAAATGGGAACAGAAACTTGAAGCT
GCTGCCAAGGCGAAGGCTGATGCTGAAGCCAAAGCAAGGAAGATCAAGGCTTCCAAGCATAAAAGAAGATCCATATCAGATTCTGATACTGATTCAGAAAGTGATAGCCA
TGATGGAGGAAGGAAAGCAGGTAAGAGAAGTCATAAGAAGCACAGGAAACATAGCCACTCTGATTCGGGTGACATTGAGAAGAGAAAAGATAGAAAATCTAAGAGAAAAC
TGAAACGACATCGCTCATCTCATGATGACAGCAGTGACGAATTTGACCACTCTAGTGAAGATAGGAGGAAGAAGAGAAACCATAGAAGACATGCTCATGACAATTCAAAT
TCGGATGAAAGTTACTCTAGTTCTAGTGGTGATGATGTTGAGACGACAAAAAGAAGTCATTCCAGGCATCACAGACATCATAGGCGAATTGACTATTCATCATCTGACGA
TTCTAGCAGTGAGGATGATACTGCTTTGCGAAGGAAAAAGCGTGTCAGACACCACAGGCCCCACCATCGCCACGTGCAGTCTCATCGATCATGTAGTATTGATTCATCAG
ACCGTAATTACTGGAGGTGTGATGCCAGAAGCCAATCCTCAGGGAAGTCATCTGATGATAATCATGAAGAATCAGCAATATTAGAATCCAGGCACAAGAGTAGCCACCAT
ATCAAACCCCGGCACCATCATTCAGGTGCCAATGGTTTGACCCAGATCGATGAAAAACATGCTGACAATGATGCTGATGAAAATGACCATGATCGTGCTAAGGATTCTCA
CTAA
Protein sequence
MASARLPSFHFHCPSSRRRRPSDHRRAPRCHGGVGDEENQKDIVEANLMVLRARMEDLRKKERRILPPARRGGLEFDNGWRYLSISDDAKFDRDFDCKILKNFALISECL
EVVTTVGSAVGLVFVGGSLGICLVSLLLNDLPTQYLPASRLVGVAPRIVAAGRLTEEMCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSI
PPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCKFDCFKLQVKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSY
IVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYNLGNACLTSFFVTGAWDHSKLLQSLKAYQNAVGVLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKL
DNMLKAHAKIRRGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVLFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAYIKEAFAPQFGLPLELSGRVEV
GVDSKSKKSPNHSLLLLGLLPLILPSSFFYCNLVSPLLQIQFIQLVEEKKKRALEKKEAPLKWEQKLEAAAKAKADAEAKARKIKASKHKRRSISDSDTDSESDSHDGGR
KAGKRSHKKHRKHSHSDSGDIEKRKDRKSKRKLKRHRSSHDDSSDEFDHSSEDRRKKRNHRRHAHDNSNSDESYSSSSGDDVETTKRSHSRHHRHHRRIDYSSSDDSSSE
DDTALRRKKRVRHHRPHHRHVQSHRSCSIDSSDRNYWRCDARSQSSGKSSDDNHEESAILESRHKSSHHIKPRHHHSGANGLTQIDEKHADNDADENDHDRAKDSH