; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G24540 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G24540
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptiontetratricopeptide repeat protein 5-like
Genome locationClcChr05:32492155..32509861
RNA-Seq ExpressionClc05G24540
SyntenyClc05G24540
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat
IPR032076 - Tetratricopeptide repeat protein 5, OB fold domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034605.1 tetratricopeptide repeat protein 5-like isoform X3 [Cucumis melo var. makuwa]1.5e-15077.51Show/hide
Query:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK
        MCSEA ED FDKATAAVE+LY IRDTFFPVNPDDKTSKLRELSDLALKILDSIPP +       A       + Y      D+   + K   D     VK
Subjt:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK

Query:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
        LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGK            CTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
Subjt:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN

Query:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR
        LGNACLTSFFVTGAWDHSKLLQSLKAYQNA                  VNKYLENY RALSGFEAAALKDPSLSATREVHKMV LLDKLDNMLKAHAK R
Subjt:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR

Query:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY
        +GAS PSSVDA+SSNFSYKR T+DHLSEGLNK VAV GKV FFIKHDSLAPLYYL CDSNQTCFVLSLYGMRNDT AY
Subjt:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY

KAG6601256.1 Tetratricopeptide repeat protein 5, partial [Cucurbita argyrosperma subsp. sororia]8.1e-18970.3Show/hide
Query:  FHCPSSRRRRPSDHRRAPRCHGGVGDEENQKDIVEANLMVLRARMEDLRKKERRILPPARRGGLEFDNGWRYLSISDDAKFDRDFDCKILKNFALISECL
        F+ PS  RRRPSDHRRAPRCHGGVGDE NQKDIVEANL VL+ARMEDLRKKERRI  P R GG+E DNGWRY S + DAK     D K++KNFALIS CL
Subjt:  FHCPSSRRRRPSDHRRAPRCHGGVGDEENQKDIVEANLMVLRARMEDLRKKERRILPPARRGGLEFDNGWRYLSISDDAKFDRDFDCKILKNFALISECL

Query:  EVVTTVGSAVGLVFVGGSLGICLVSLLLNDLPTQYLPASRLVGVAPRIVAAGRLTEEMCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELS
        E                                 +LPAS LVG APR VA GRLTEEMCSE  E+ FDKATAAVE+LYHIRDTFFPVNPDDK SKLRELS
Subjt:  EVVTTVGSAVGLVFVGGSLGICLVSLLLNDLPTQYLPASRLVGVAPRIVAAGRLTEEMCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELS

Query:  DLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLC
        DLA+KILDSI P +       A       + Y      D+   + K   D     VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLAL+KRPEKKLLC
Subjt:  DLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLC

Query:  QLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYNLGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG-----------
        QLSMLERKMAQG              TENEAKLVEESIQHAKEAVTLDVKDGNSWYNLGNACLTSFFVTGAWDH+KLLQSLKAYQNA             
Subjt:  QLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYNLGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG-----------

Query:  ---VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIRRGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFF
             VNKYLENY+RALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAK RR  S  SSVD VSSN SYKRATVD LSEGLNK+VAVIGKV FF
Subjt:  ---VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIRRGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFF

Query:  IKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT
        IKHDSLAPLYYL+CDSNQ CFV+SLYGMRNDT
Subjt:  IKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT

KAG7032049.1 Tetratricopeptide repeat protein 5, partial [Cucurbita argyrosperma subsp. argyrosperma]9.1e-15673.05Show/hide
Query:  LPTQYLPASRLVGVAPRIVAAGRLTEEMCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEF
        + TQ+LPAS LVG APR VA GRLTEEMCSE  E+ FDKATAAVE+LYHIRDTFFPVNPDDK SKLRELSDLA+KILDSI P +       A       +
Subjt:  LPTQYLPASRLVGVAPRIVAAGRLTEEMCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEF

Query:  AYCNSSAWDLIILFCK--FDCFKLQVKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVT-----
         Y      D+   + K   D     VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLAL+KRPEKKLLCQLSMLERKMAQGKC A SSSS++VT     
Subjt:  AYCNSSAWDLIILFCK--FDCFKLQVKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVT-----

Query:  -C---------------TENEAKLVEESIQHAKEAVTLDVKDGNSWYNLGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKY
         C               TENEAKLVEESIQHAKEAVTLDVKDGNSWYNLGNACLTSFFVTGAWDH+KLLQSLKAYQNA                  VNKY
Subjt:  -C---------------TENEAKLVEESIQHAKEAVTLDVKDGNSWYNLGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKY

Query:  LENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIRRGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPL
        LENY+RALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAK RR  S  SSVD V +N SYKRATVD LSEGLNK+VAV+GKV FFIKHDSLAPL
Subjt:  LENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIRRGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPL

Query:  YYLVCDSNQTCFVLSLYGMRNDT
        YYLVCDSNQ CFV+SLYGMRNDT
Subjt:  YYLVCDSNQTCFVLSLYGMRNDT

TYK09157.1 tetratricopeptide repeat protein 5-like isoform X3 [Cucumis melo var. makuwa]8.2e-14976.98Show/hide
Query:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK
        MCSEA ED FDKATAAVE+LY IRDTFFPVNPDDKTSKLRELSDLALKILDSIPP +       A       + Y      D+   + K   D     VK
Subjt:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK

Query:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
        LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQG              TENEAKLVEESIQHAKEAVTLDVKDGNSWYN
Subjt:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN

Query:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR
        LGNACLTSFFVTGAWDHSKLLQSLKAYQNA                  VNKYLENY RALSGFEAAALKDPSLSATREVHKMV LLDKLDNMLKAHAK R
Subjt:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR

Query:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY
        +GAS PSSVDA+SSNFSYKR T+DHLSEGLNK VAV GKV FFIKHDSLAPLYYL CDSNQTCFVLSLYGMRNDT AY
Subjt:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY

XP_038891369.1 tetratricopeptide repeat protein 5-like [Benincasa hispida]3.1e-14877.33Show/hide
Query:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK
        MCSEA ED FDKATAAVE+LYHIRDTFFPVNPDDKTSK RELSDLALKILDSIPP +       A       + Y      D+   + K   D     VK
Subjt:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK

Query:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
        LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNK PEKKLLCQLSMLERKMAQG              TENEAKLVEESIQHAKEAVTLDVKDGNSWYN
Subjt:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN

Query:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR
        LGNACLTSFFVTGAWDHSKLLQSLKAYQNA                  VNKYLENYDRALSGFEAAALKDPSLSA REV KMVNLLDKLDNMLKAH K R
Subjt:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR

Query:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT
        RGASLPSSVDA+SSNFSYKRATVD LSEGLNK VAV GKV FFIKHDSLAPLYYLVCDSNQTCF+LSLYGMRNDT
Subjt:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT

TrEMBL top hitse value%identityAlignment
A0A1S3BF62 tetratricopeptide repeat protein 5-like isoform X23.4e-14877.07Show/hide
Query:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK
        MCSEA ED FDKATAAVE+LY IRDTFFPVNPDDKTSKLRELSDLALKILDSIPP +       A       + Y      D+   + K   D     VK
Subjt:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK

Query:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
        LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQG              TENEAKLVEESIQHAKEAVTLDVKDGNSWYN
Subjt:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN

Query:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR
        LGNACLTSFFVTGAWDHSKLLQSLKAYQNA                  VNKYLENY RALSGFEAAALKDPSLSATREVHKMV LLDKLDNMLKAHAK R
Subjt:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR

Query:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT
        +GAS PSSVDA+SSNFSYKR T+DHLSEGLNK VAV GKV FFIKHDSLAPLYYL CDSNQTCFVLSLYGMRNDT
Subjt:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT

A0A1S3BFP3 tetratricopeptide repeat protein 5-like isoform X31.8e-14676.25Show/hide
Query:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK
        MCSEA ED FDKATAAVE+LY IRDTFFPVNPDDKTSKLRELSDLALKILDSIPP +       A       + Y      D+   + K   D     VK
Subjt:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK

Query:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
        LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQG              TENEAKLVEESIQHAKEAVTLDVKDGNSWYN
Subjt:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN

Query:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLK----AH
        LGNACLTSFFVTGAWDHSKLLQSLKAYQNA                  VNKYLENY RALSGFEAAALKDPSLSATREVHKMV LLDKLDNMLK    AH
Subjt:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLK----AH

Query:  AKIRRGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT
        AK R+GAS PSSVDA+SSNFSYKR T+DHLSEGLNK VAV GKV FFIKHDSLAPLYYL CDSNQTCFVLSLYGMRNDT
Subjt:  AKIRRGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT

A0A1S3BGG9 tetratricopeptide repeat protein 5-like isoform X11.8e-14676.25Show/hide
Query:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK
        MCSEA ED FDKATAAVE+LY IRDTFFPVNPDDKTSKLRELSDLALKILDSIPP +       A       + Y      D+   + K   D     VK
Subjt:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK

Query:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
        LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQG              TENEAKLVEESIQHAKEAVTLDVKDGNSWYN
Subjt:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN

Query:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLK----AH
        LGNACLTSFFVTGAWDHSKLLQSLKAYQNA                  VNKYLENY RALSGFEAAALKDPSLSATREVHKMV LLDKLDNMLK    AH
Subjt:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLK----AH

Query:  AKIRRGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT
        AK R+GAS PSSVDA+SSNFSYKR T+DHLSEGLNK VAV GKV FFIKHDSLAPLYYL CDSNQTCFVLSLYGMRNDT
Subjt:  AKIRRGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDT

A0A5A7SZG5 Tetratricopeptide repeat protein 5-like isoform X37.3e-15177.51Show/hide
Query:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK
        MCSEA ED FDKATAAVE+LY IRDTFFPVNPDDKTSKLRELSDLALKILDSIPP +       A       + Y      D+   + K   D     VK
Subjt:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK

Query:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
        LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGK            CTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
Subjt:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN

Query:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR
        LGNACLTSFFVTGAWDHSKLLQSLKAYQNA                  VNKYLENY RALSGFEAAALKDPSLSATREVHKMV LLDKLDNMLKAHAK R
Subjt:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR

Query:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY
        +GAS PSSVDA+SSNFSYKR T+DHLSEGLNK VAV GKV FFIKHDSLAPLYYL CDSNQTCFVLSLYGMRNDT AY
Subjt:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY

A0A5D3CAY6 Tetratricopeptide repeat protein 5-like isoform X34.0e-14976.98Show/hide
Query:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK
        MCSEA ED FDKATAAVE+LY IRDTFFPVNPDDKTSKLRELSDLALKILDSIPP +       A       + Y      D+   + K   D     VK
Subjt:  MCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSIPPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCK--FDCFKLQVK

Query:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN
        LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQG              TENEAKLVEESIQHAKEAVTLDVKDGNSWYN
Subjt:  LNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYN

Query:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR
        LGNACLTSFFVTGAWDHSKLLQSLKAYQNA                  VNKYLENY RALSGFEAAALKDPSLSATREVHKMV LLDKLDNMLKAHAK R
Subjt:  LGNACLTSFFVTGAWDHSKLLQSLKAYQNAVG--------------VLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKLDNMLKAHAKIR

Query:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY
        +GAS PSSVDA+SSNFSYKR T+DHLSEGLNK VAV GKV FFIKHDSLAPLYYL CDSNQTCFVLSLYGMRNDT AY
Subjt:  RGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY

SwissProt top hitse value%identityAlignment
Q0P5H9 Tetratricopeptide repeat protein 59.2e-2629.84Show/hide
Query:  VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSW
        VKL P L +AW  LG   WKKGD+++A  CF+ AL     K  L  LSM+ R++                  +  ++ V +S++ AK AV +D+ DG SW
Subjt:  VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSW

Query:  YNLGNACLTSFFVTGAWDHSKL-LQSLKAYQNAVGV---------------LVNKYLENYDRALSGFEAAALKDPSL-SATREVHKMVNLLDKLDNMLKA
        Y LGNA L+ +F TG   + K+  Q+L AY  A  V                ++KY ENY  AL GF  AA  DP+     +   ++++ L +L + L++
Subjt:  YNLGNACLTSFFVTGAWDHSKL-LQSLKAYQNAVGV---------------LVNKYLENYDRALSGFEAAALKDPSL-SATREVHKMVNLLDKLDNMLKA

Query:  HAKIR-----------RGASLPSSVDAVSSNFSYKRATVDH-----LSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAYI
          K++           R A L    D    + S ++ T++      L  G+N    V+GKV F +  +   P  + + DS+  C+ + +Y M       I
Subjt:  HAKIR-----------RGASLPSSVDAVSSNFSYKRATVDH-----LSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAYI

Query:  KEAFA
         ++ A
Subjt:  KEAFA

Q5BK48 Tetratricopeptide repeat protein 52.3e-2429.51Show/hide
Query:  VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSW
        VKL P L +AW  LG   WKKGD+++A  CF+ AL     K  L  LSM+ R++                  +  ++ V +S++ AK AV +DV DG SW
Subjt:  VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSW

Query:  YNLGNACLTSFFVTGAWDHSKL-LQSLKAYQNAVGV---------------LVNKYLENYDRALSGFEAAALKDPSLSATRE-VHKMVNLLDKLDNMLKA
        Y LGNA L+ +F TG   + K+  Q+L AY  A  V                ++KY E+Y  AL GF  AA  DP+    ++   +++  L +L N+L +
Subjt:  YNLGNACLTSFFVTGAWDHSKL-LQSLKAYQNAVGV---------------LVNKYLENYDRALSGFEAAALKDPSLSATRE-VHKMVNLLDKLDNMLKA

Query:  HAKIR-----------RGASLPSSVD-----AVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAYI
          K +           R A L    D     A     + +   +  L  G+N    V+GKV F +  +   P  + + DS+  C+ + +Y +       I
Subjt:  HAKIR-----------RGASLPSSVD-----AVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAYI

Query:  KEAFA
         ++ A
Subjt:  KEAFA

Q8N0Z6 Tetratricopeptide repeat protein 51.8e-2630.72Show/hide
Query:  VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENE-AKLVEESIQHAKEAVTLDVKDGNS
        VKL P L +AW  LG   WKKGD+++A  CF+ AL     K  L  LSM+ R++               T TE+E +  V +S++ AK AV +DV DG S
Subjt:  VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENE-AKLVEESIQHAKEAVTLDVKDGNS

Query:  WYNLGNACLTSFFVTGAWDHSKL-LQSLKAYQNAVGV---------------LVNKYLENYDRALSGFEAAALKDPSLSATRE-VHKMVNLLDKLDNMLK
        WY LGN+ L+ +F TG   + K+  Q+L AY  A  V                ++KY E+Y  AL GF  AA  DP+    R+   +++  LD+L ++L+
Subjt:  WYNLGNACLTSFFVTGAWDHSKL-LQSLKAYQNAVGV---------------LVNKYLENYDRALSGFEAAALKDPSLSATRE-VHKMVNLLDKLDNMLK

Query:  AHAKIR-----------RGASLPSSVDAVSSNFSYKRATVD-----HLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY
        +  K++           R A L    D    + S ++ T++      L  G+N    ++GKV F +  +   P  + + DS+  C+ + +Y +       
Subjt:  AHAKIR-----------RGASLPSSVDAVSSNFSYKRATVD-----HLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAY

Query:  IKEAFA
        I ++ A
Subjt:  IKEAFA

Q99LG4 Tetratricopeptide repeat protein 51.7e-2429.84Show/hide
Query:  VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSW
        VKL P L +AW  LG   WKKGD++SA  CF+ AL     K  L  LSM+ R++                  +  ++ V +S++ AK AV +DV DG SW
Subjt:  VKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSYIVTCTENEAKLVEESIQHAKEAVTLDVKDGNSW

Query:  YNLGNACLTSFFVTGAWDHSKL-LQSLKAYQNAVGV---------------LVNKYLENYDRALSGFEAAALKDPSLSATRE-VHKMVNLLDKLDNMLKA
        Y LGNA L+ +F TG   + K+  Q+L AY  A  V                ++KY E+Y  AL GF  AA  DP     ++   +++  L +L ++L++
Subjt:  YNLGNACLTSFFVTGAWDHSKL-LQSLKAYQNAVGV---------------LVNKYLENYDRALSGFEAAALKDPSLSATRE-VHKMVNLLDKLDNMLKA

Query:  HAKIR-----------RGASLPSSVDAVSSNFSYKRATVD-----HLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAYI
          K +           R A L    D    + S ++ T++      L  G+N    V+GKV F +  +   P  + + DS+  C+ + +Y +       I
Subjt:  HAKIR-----------RGASLPSSVDAVSSNFSYKRATVD-----HLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAYI

Query:  KEAFA
         ++ A
Subjt:  KEAFA

Arabidopsis top hitse value%identityAlignment
AT2G01100.1 unknown protein2.2e-2742.68Show/hide
Query:  QFIQLVEEKKKRALEKKEAPLKWEQKLEAAAKAKADAEAKARKIKASKHKRRSISDSDTDSESDSHDGGRKAGKRSHKKHRKHSHSDSGDIEKRKDRKSK
        +FIQ+VEEKK+R LEK+EAPLKWEQKLEAAA AKAD E K ++ K  K K+R+ S+S   SESDS    R+  +RSH KHR+H+HSDS D ++RK++KS+
Subjt:  QFIQLVEEKKKRALEKKEAPLKWEQKLEAAAKAKADAEAKARKIKASKHKRRSISDSDTDSESDSHDGGRKAGKRSHKKHRKHSHSDSGDIEKRKDRKSK

Query:  RKLKRHRSSHDDSSDEFDHSSED--RRKKRNHRRHAHDNSNSDESYSSSSGDDVETTKRSHSRHHRHHRRIDYSSSDDSSSEDDTALRRKKRVRHHRPHH
        R+ +R  S  DDS+ +++  SED  R K ++HRRH        + +SS    D ++T+    +H +HHRR +  +S DS  E      R++R ++HR H+
Subjt:  RKLKRHRSSHDDSSDEFDHSSED--RRKKRNHRRHAHDNSNSDESYSSSSGDDVETTKRSHSRHHRHHRRIDYSSSDDSSSEDDTALRRKKRVRHHRPHH

Query:  RHVQSHRSCSIDSSDRNYWRCDARSQSSGKSSDDNHEESAILESRH
        R   S      DS      R   R + +  SS+   E+ A+  +RH
Subjt:  RHVQSHRSCSIDSSDRNYWRCDARSQSSGKSSDDNHEESAILESRH

AT2G01100.2 unknown protein2.2e-2742.68Show/hide
Query:  QFIQLVEEKKKRALEKKEAPLKWEQKLEAAAKAKADAEAKARKIKASKHKRRSISDSDTDSESDSHDGGRKAGKRSHKKHRKHSHSDSGDIEKRKDRKSK
        +FIQ+VEEKK+R LEK+EAPLKWEQKLEAAA AKAD E K ++ K  K K+R+ S+S   SESDS    R+  +RSH KHR+H+HSDS D ++RK++KS+
Subjt:  QFIQLVEEKKKRALEKKEAPLKWEQKLEAAAKAKADAEAKARKIKASKHKRRSISDSDTDSESDSHDGGRKAGKRSHKKHRKHSHSDSGDIEKRKDRKSK

Query:  RKLKRHRSSHDDSSDEFDHSSED--RRKKRNHRRHAHDNSNSDESYSSSSGDDVETTKRSHSRHHRHHRRIDYSSSDDSSSEDDTALRRKKRVRHHRPHH
        R+ +R  S  DDS+ +++  SED  R K ++HRRH        + +SS    D ++T+    +H +HHRR +  +S DS  E      R++R ++HR H+
Subjt:  RKLKRHRSSHDDSSDEFDHSSED--RRKKRNHRRHAHDNSNSDESYSSSSGDDVETTKRSHSRHHRHHRRIDYSSSDDSSSEDDTALRRKKRVRHHRPHH

Query:  RHVQSHRSCSIDSSDRNYWRCDARSQSSGKSSDDNHEESAILESRH
        R   S      DS      R   R + +  SS+   E+ A+  +RH
Subjt:  RHVQSHRSCSIDSSDRNYWRCDARSQSSGKSSDDNHEESAILESRH

AT2G01100.3 unknown protein2.2e-2742.68Show/hide
Query:  QFIQLVEEKKKRALEKKEAPLKWEQKLEAAAKAKADAEAKARKIKASKHKRRSISDSDTDSESDSHDGGRKAGKRSHKKHRKHSHSDSGDIEKRKDRKSK
        +FIQ+VEEKK+R LEK+EAPLKWEQKLEAAA AKAD E K ++ K  K K+R+ S+S   SESDS    R+  +RSH KHR+H+HSDS D ++RK++KS+
Subjt:  QFIQLVEEKKKRALEKKEAPLKWEQKLEAAAKAKADAEAKARKIKASKHKRRSISDSDTDSESDSHDGGRKAGKRSHKKHRKHSHSDSGDIEKRKDRKSK

Query:  RKLKRHRSSHDDSSDEFDHSSED--RRKKRNHRRHAHDNSNSDESYSSSSGDDVETTKRSHSRHHRHHRRIDYSSSDDSSSEDDTALRRKKRVRHHRPHH
        R+ +R  S  DDS+ +++  SED  R K ++HRRH        + +SS    D ++T+    +H +HHRR +  +S DS  E      R++R ++HR H+
Subjt:  RKLKRHRSSHDDSSDEFDHSSED--RRKKRNHRRHAHDNSNSDESYSSSSGDDVETTKRSHSRHHRHHRRIDYSSSDDSSSEDDTALRRKKRVRHHRPHH

Query:  RHVQSHRSCSIDSSDRNYWRCDARSQSSGKSSDDNHEESAILESRH
        R   S      DS      R   R + +  SS+   E+ A+  +RH
Subjt:  RHVQSHRSCSIDSSDRNYWRCDARSQSSGKSSDDNHEESAILESRH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCGCTCGTCTCCCGTCGTTCCATTTCCACTGTCCGTCTTCTCGGCGGCGAAGACCATCCGACCACCGTAGAGCTCCGAGATGCCATGGCGGCGTCGGCGATGA
AGAGAACCAGAAGGACATCGTGGAAGCGAATTTGATGGTGTTGAGAGCGAGAATGGAGGATTTGAGGAAGAAGGAGCGGCGGATTTTGCCTCCGGCACGGCGCGGAGGAC
TTGAATTCGATAATGGCTGGAGGTATTTGTCGATAAGCGACGACGCGAAGTTCGATCGCGATTTTGATTGCAAGATTTTGAAGAATTTTGCCCTAATTTCAGAGTGTTTG
GAGGTGGTGACCACGGTTGGCAGTGCAGTCGGGCTTGTTTTTGTTGGTGGATCTTTAGGCATCTGTTTGGTTTCTCTTTTGCTCAACGATCTGCCAACACAGTATTTGCC
TGCCTCCCGTCTGGTGGGCGTTGCTCCGAGAATTGTCGCCGCCGGCCGATTGACGGAGGAGATGTGCAGCGAAGCCATGGAAGACATATTCGACAAAGCCACAGCTGCGG
TCGAGAAGCTATATCACATCAGAGACACCTTTTTTCCTGTAAATCCCGATGACAAAACTTCCAAATTACGGGAGCTATCGGATCTTGCTCTCAAGATCCTCGATTCAATT
CCTCCAGGTCGATTCTGCTTTCATTTCTCTTACGCCTTTCATTTCAGCTTCTCCGAATTTGCTTACTGCAATAGCTCGGCTTGGGATTTAATCATTCTCTTCTGTAAATT
TGACTGTTTCAAGCTCCAGGTTAAGTTGAATCCCTCTCTTGCCGATGCCTGGCTATGTTTAGGCAACTGCATTTGGAAAAAAGGAGATCTATCTTCAGCAAAGAACTGCT
TTACTTTAGCATTAAACAAGCGCCCTGAGAAAAAGTTACTTTGTCAGTTATCGATGCTTGAAAGGAAAATGGCTCAAGGCAAGTGTTGTGCTTCTTCATCATCTTCTTAC
ATTGTAACCTGTACTGAGAATGAGGCAAAACTTGTAGAGGAAAGCATTCAACATGCAAAAGAAGCAGTTACTTTGGATGTGAAGGATGGAAACTCTTGGTATAATCTAGG
AAATGCATGCCTCACAAGTTTTTTTGTTACTGGAGCATGGGATCATAGTAAGCTTCTGCAATCTTTGAAGGCATATCAAAATGCGGTGGGTGTCTTAGTTAACAAATATC
TGGAGAACTATGACAGGGCCCTTAGTGGATTTGAGGCTGCTGCTTTGAAGGATCCTAGTCTTAGTGCCACTAGGGAGGTACACAAGATGGTGAATCTTCTTGACAAATTG
GATAACATGTTGAAGGCACATGCTAAAATACGGAGAGGTGCGTCTTTGCCATCATCAGTGGATGCTGTTAGTTCGAACTTCTCATACAAGAGAGCCACTGTAGACCATCT
GTCAGAAGGTCTGAACAAAGCAGTTGCAGTAATTGGGAAGGTGTCGTTCTTCATTAAGCATGATAGTCTTGCCCCACTATATTATTTGGTGTGTGATTCAAACCAAACAT
GCTTTGTTTTATCGTTGTATGGTATGCGGAATGATACGAACGCGTATATAAAAGAGGCATTTGCTCCCCAATTTGGCCTTCCGCTGGAGCTTTCTGGAAGAGTAGAAGTC
GGAGTCGACTCAAAGTCAAAGAAAAGCCCCAACCACTCTCTTCTTCTTCTAGGGCTCCTTCCTCTCATCCTTCCATCCTCCTTTTTCTATTGCAATCTTGTTTCTCCTCT
CCTTCAGATTCAGTTCATCCAATTGGTTGAGGAGAAGAAGAAGAGGGCTTTGGAGAAGAAGGAAGCCCCATTGAAATGGGAACAGAAACTTGAAGCTGCTGCCAAGGCGA
AGGCTGATGCTGAAGCCAAAGCAAGGAAGATCAAGGCTTCCAAGCATAAAAGAAGATCCATATCAGATTCTGATACTGATTCAGAAAGTGATAGCCATGATGGAGGAAGG
AAAGCAGGTAAGAGAAGTCATAAGAAGCACAGGAAACATAGCCACTCTGATTCGGGTGACATTGAGAAGAGAAAAGATAGAAAATCTAAGAGAAAACTGAAACGACATCG
CTCATCTCATGATGACAGCAGTGACGAATTTGACCACTCTAGTGAAGATAGGAGGAAGAAGAGAAACCATAGAAGACATGCTCATGACAATTCAAATTCGGATGAAAGTT
ACTCTAGTTCTAGTGGTGATGATGTTGAGACGACAAAAAGAAGTCATTCCAGGCATCACAGACATCATAGGCGAATTGACTATTCATCATCTGACGATTCTAGCAGTGAG
GATGATACTGCTTTGCGAAGGAAAAAGCGTGTCAGACACCACAGGCCCCACCATCGCCACGTGCAGTCTCATCGATCATGTAGTATTGATTCATCAGACCGTAATTACTG
GAGGTGTGATGCCAGAAGCCAATCCTCAGGGAAGTCATCTGATGATAATCATGAAGAATCAGCAATATTAGAATCCAGGCACAAGAGTAGCCACCATATCAAACCCCGGC
ACCATCATTCAGGTGCCAATGGTTTGACCCAGATCGATGAAAAACATGCTGACAATGATGCTGATGAAAATGACCATGATCGTGCTAAGGATTCTCACTAA
mRNA sequenceShow/hide mRNA sequence
AGCATGGTGCAAAATGGCTTCCGCTCGTCTCCCGTCGTTCCATTTCCACTGTCCGTCTTCTCGGCGGCGAAGACCATCCGACCACCGTAGAGCTCCGAGATGCCATGGCG
GCGTCGGCGATGAAGAGAACCAGAAGGACATCGTGGAAGCGAATTTGATGGTGTTGAGAGCGAGAATGGAGGATTTGAGGAAGAAGGAGCGGCGGATTTTGCCTCCGGCA
CGGCGCGGAGGACTTGAATTCGATAATGGCTGGAGGTATTTGTCGATAAGCGACGACGCGAAGTTCGATCGCGATTTTGATTGCAAGATTTTGAAGAATTTTGCCCTAAT
TTCAGAGTGTTTGGAGGTGGTGACCACGGTTGGCAGTGCAGTCGGGCTTGTTTTTGTTGGTGGATCTTTAGGCATCTGTTTGGTTTCTCTTTTGCTCAACGATCTGCCAA
CACAGTATTTGCCTGCCTCCCGTCTGGTGGGCGTTGCTCCGAGAATTGTCGCCGCCGGCCGATTGACGGAGGAGATGTGCAGCGAAGCCATGGAAGACATATTCGACAAA
GCCACAGCTGCGGTCGAGAAGCTATATCACATCAGAGACACCTTTTTTCCTGTAAATCCCGATGACAAAACTTCCAAATTACGGGAGCTATCGGATCTTGCTCTCAAGAT
CCTCGATTCAATTCCTCCAGGTCGATTCTGCTTTCATTTCTCTTACGCCTTTCATTTCAGCTTCTCCGAATTTGCTTACTGCAATAGCTCGGCTTGGGATTTAATCATTC
TCTTCTGTAAATTTGACTGTTTCAAGCTCCAGGTTAAGTTGAATCCCTCTCTTGCCGATGCCTGGCTATGTTTAGGCAACTGCATTTGGAAAAAAGGAGATCTATCTTCA
GCAAAGAACTGCTTTACTTTAGCATTAAACAAGCGCCCTGAGAAAAAGTTACTTTGTCAGTTATCGATGCTTGAAAGGAAAATGGCTCAAGGCAAGTGTTGTGCTTCTTC
ATCATCTTCTTACATTGTAACCTGTACTGAGAATGAGGCAAAACTTGTAGAGGAAAGCATTCAACATGCAAAAGAAGCAGTTACTTTGGATGTGAAGGATGGAAACTCTT
GGTATAATCTAGGAAATGCATGCCTCACAAGTTTTTTTGTTACTGGAGCATGGGATCATAGTAAGCTTCTGCAATCTTTGAAGGCATATCAAAATGCGGTGGGTGTCTTA
GTTAACAAATATCTGGAGAACTATGACAGGGCCCTTAGTGGATTTGAGGCTGCTGCTTTGAAGGATCCTAGTCTTAGTGCCACTAGGGAGGTACACAAGATGGTGAATCT
TCTTGACAAATTGGATAACATGTTGAAGGCACATGCTAAAATACGGAGAGGTGCGTCTTTGCCATCATCAGTGGATGCTGTTAGTTCGAACTTCTCATACAAGAGAGCCA
CTGTAGACCATCTGTCAGAAGGTCTGAACAAAGCAGTTGCAGTAATTGGGAAGGTGTCGTTCTTCATTAAGCATGATAGTCTTGCCCCACTATATTATTTGGTGTGTGAT
TCAAACCAAACATGCTTTGTTTTATCGTTGTATGGTATGCGGAATGATACGAACGCGTATATAAAAGAGGCATTTGCTCCCCAATTTGGCCTTCCGCTGGAGCTTTCTGG
AAGAGTAGAAGTCGGAGTCGACTCAAAGTCAAAGAAAAGCCCCAACCACTCTCTTCTTCTTCTAGGGCTCCTTCCTCTCATCCTTCCATCCTCCTTTTTCTATTGCAATC
TTGTTTCTCCTCTCCTTCAGATTCAGTTCATCCAATTGGTTGAGGAGAAGAAGAAGAGGGCTTTGGAGAAGAAGGAAGCCCCATTGAAATGGGAACAGAAACTTGAAGCT
GCTGCCAAGGCGAAGGCTGATGCTGAAGCCAAAGCAAGGAAGATCAAGGCTTCCAAGCATAAAAGAAGATCCATATCAGATTCTGATACTGATTCAGAAAGTGATAGCCA
TGATGGAGGAAGGAAAGCAGGTAAGAGAAGTCATAAGAAGCACAGGAAACATAGCCACTCTGATTCGGGTGACATTGAGAAGAGAAAAGATAGAAAATCTAAGAGAAAAC
TGAAACGACATCGCTCATCTCATGATGACAGCAGTGACGAATTTGACCACTCTAGTGAAGATAGGAGGAAGAAGAGAAACCATAGAAGACATGCTCATGACAATTCAAAT
TCGGATGAAAGTTACTCTAGTTCTAGTGGTGATGATGTTGAGACGACAAAAAGAAGTCATTCCAGGCATCACAGACATCATAGGCGAATTGACTATTCATCATCTGACGA
TTCTAGCAGTGAGGATGATACTGCTTTGCGAAGGAAAAAGCGTGTCAGACACCACAGGCCCCACCATCGCCACGTGCAGTCTCATCGATCATGTAGTATTGATTCATCAG
ACCGTAATTACTGGAGGTGTGATGCCAGAAGCCAATCCTCAGGGAAGTCATCTGATGATAATCATGAAGAATCAGCAATATTAGAATCCAGGCACAAGAGTAGCCACCAT
ATCAAACCCCGGCACCATCATTCAGGTGCCAATGGTTTGACCCAGATCGATGAAAAACATGCTGACAATGATGCTGATGAAAATGACCATGATCGTGCTAAGGATTCTCA
CTAA
Protein sequenceShow/hide protein sequence
MASARLPSFHFHCPSSRRRRPSDHRRAPRCHGGVGDEENQKDIVEANLMVLRARMEDLRKKERRILPPARRGGLEFDNGWRYLSISDDAKFDRDFDCKILKNFALISECL
EVVTTVGSAVGLVFVGGSLGICLVSLLLNDLPTQYLPASRLVGVAPRIVAAGRLTEEMCSEAMEDIFDKATAAVEKLYHIRDTFFPVNPDDKTSKLRELSDLALKILDSI
PPGRFCFHFSYAFHFSFSEFAYCNSSAWDLIILFCKFDCFKLQVKLNPSLADAWLCLGNCIWKKGDLSSAKNCFTLALNKRPEKKLLCQLSMLERKMAQGKCCASSSSSY
IVTCTENEAKLVEESIQHAKEAVTLDVKDGNSWYNLGNACLTSFFVTGAWDHSKLLQSLKAYQNAVGVLVNKYLENYDRALSGFEAAALKDPSLSATREVHKMVNLLDKL
DNMLKAHAKIRRGASLPSSVDAVSSNFSYKRATVDHLSEGLNKAVAVIGKVSFFIKHDSLAPLYYLVCDSNQTCFVLSLYGMRNDTNAYIKEAFAPQFGLPLELSGRVEV
GVDSKSKKSPNHSLLLLGLLPLILPSSFFYCNLVSPLLQIQFIQLVEEKKKRALEKKEAPLKWEQKLEAAAKAKADAEAKARKIKASKHKRRSISDSDTDSESDSHDGGR
KAGKRSHKKHRKHSHSDSGDIEKRKDRKSKRKLKRHRSSHDDSSDEFDHSSEDRRKKRNHRRHAHDNSNSDESYSSSSGDDVETTKRSHSRHHRHHRRIDYSSSDDSSSE
DDTALRRKKRVRHHRPHHRHVQSHRSCSIDSSDRNYWRCDARSQSSGKSSDDNHEESAILESRHKSSHHIKPRHHHSGANGLTQIDEKHADNDADENDHDRAKDSH