; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005058 (gene) of Snake gourd v1 genome

Gene IDTan0005058
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionHomeodomain-like superfamily protein
Genome locationLG03:73404400..73407272
RNA-Seq ExpressionTan0005058
SyntenyTan0005058
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004143433.1 uncharacterized protein LOC101221631 [Cucumis sativus]4.4e-21274.35Show/hide
Query:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS
        MP  NSIASSVDIKTLRRSPRFL PST   QQ+F +T RSLRFL++NEIS PT P   R+   IRQVHSS A L+PSKNVSLKTPKSV VNTP+++SK  
Subjt:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS

Query:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLS------NALEGRNANVSKSSIASGGRSRDLKNPSPKVRRSPRLSNGARGHQSIGKSQSFSDQQD
         VSSK+KDS TGSKK S FENGF+  R PRRSPRLS      NALEGRNA VSKSSI+SGG S DLKNPSP VRRSPR SNG  G++S G S SFS QQ 
Subjt:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLS------NALEGRNANVSKSSIASGGRSRDLKNPSPKVRRSPRLSNGARGHQSIGKSQSFSDQQD

Query:  ALEKSSRKREKSSGSDKKAGLSHVQNVVTRESSHGENVAEEERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAALQ
         LEKSSRKRE  SGS +  G     NV    SSHG+ VA  ER+KGNSADHEDIA +    QVVDGE+EKKSV  RKRKREDGVVGIRQGWTKEQE +LQ
Subjt:  ALEKSSRKREKSSGSDKKAGLSHVQNVVTRESSHGENVAEEERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAALQ

Query:  RAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQ
        RAYYAAKPTP FWKKVSKLVPGKSAQDCFDKVHSDH+TPPQPRPR RT+STK S +ELL  SE +LLN+DGAKSRKP  KSQKSHNAQK VR+LLEKNFQ
Subjt:  RAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQ

Query:  GALSCEADLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSK
        GA++ EADLFSQLEPNINLSN +PLPSKQLSS MDLQGNQGFLH RSLSNHKKPLSRFS+SVE+ VVSPPVLKQVKNR LHEKYIDQLH REAKRKSMSK
Subjt:  GALSCEADLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSK

Query:  CTKNCISEERG-LKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENEDE
        C K+CIS+E G  KE+HA RT+DLRAAKNALISDARDAI QLQHL+ + M++ P F+DD +  DNVD ++ED+
Subjt:  CTKNCISEERG-LKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENEDE

XP_008440503.1 PREDICTED: uncharacterized protein LOC103484910 [Cucumis melo]8.1e-21474.48Show/hide
Query:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS
        MP ANSIASSVDIKTLRRSPRFL  +    QQ+ P+TRRSLRFL+KNEIS PT P  RR+ S IRQVHSS A L PS +VSLKTPKSV VNTP+++SK  
Subjt:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS

Query:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLS------NALEGRNANVSKSSIASGGRSRDLKNPSPKVRRSPRLSNGARGHQSIGKSQSFSDQQD
         VSSK+K S TGSKK S FEN FE    PRRSPRLS      NALEGRN  VSKSSI+SGG   DLKNPSP VRRSPR SNG  G++SIGKS SFS QQ 
Subjt:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLS------NALEGRNANVSKSSIASGGRSRDLKNPSPKVRRSPRLSNGARGHQSIGKSQSFSDQQD

Query:  ALEKSSRKREKSSGSDKKAGLSHVQNVVTRESSHGENVAEEERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAALQ
         LEKSSRKRE  +GS +  G     NV    SSHGE VA  ER++GNSAD EDIA +  GTQVVDGE+EKKSV  RKRKREDGVVGIRQGWTKEQE ALQ
Subjt:  ALEKSSRKREKSSGSDKKAGLSHVQNVVTRESSHGENVAEEERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAALQ

Query:  RAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQ
        RAYY AKPTPQFWKKVSKLVPGKSAQDCFDKVHSDH+TPPQPRPR RT+STKSS  ELL  SE +LLNLDGAKSRKPSRKSQKSHNAQK VR+LLEKNFQ
Subjt:  RAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQ

Query:  GALSCEADLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSK
        GA++ EADLFSQLEPNINLSN++PLPSKQLSS +DLQGNQGFLH RSLSNHKKPLSRFS+SVE+ VVSPPVLKQVKNR LHEKYIDQLHCREAKRKSMSK
Subjt:  GALSCEADLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSK

Query:  CTKNCISEERGLKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENEDE
        C K+CIS+E G K +H  RT+DLRAAKNALISDARDAI Q QHL+A+  ++ PDF+D  +   NVD +NED+
Subjt:  CTKNCISEERGLKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENEDE

XP_022132864.1 uncharacterized protein LOC111005607 [Momordica charantia]2.7e-21775.52Show/hide
Query:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS
        MPS NSI SSV+IKTLRRSPRF T + P  Q++FP TRRSLRFLQKN+IS PTLPE+RRSHSAIRQVHSS AC+RP +NVSLKTPKSVL NT  KSSK  
Subjt:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS

Query:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLS------NALEGRNANVS-KSSIASGGRSRDLKNPSPKVRRSPRLSNGARGHQSIGKSQSFSDQQ
         VSSK++ S TGSKKS+ FENGFEGIR PRRSPRLS      NALEGRNA VS  SSI SG RS DL +PSP VRRSPRL+NG   HQS GKS+ FS QQ
Subjt:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLS------NALEGRNANVS-KSSIASGGRSRDLKNPSPKVRRSPRLSNGARGHQSIGKSQSFSDQQ

Query:  DALEKSSRKR-EKSSGSDKKAGLSHVQNVVTRESSHGENVAEEERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAA
        DALE+  R R +KSSGSDKK GL HV+N+ T  SS G+NVAE ERRKGNSAD E    + GGTQVVDGE++KKSVA+RKRKRE+ VVGIRQGWT+EQEAA
Subjt:  DALEKSSRKR-EKSSGSDKKAGLSHVQNVVTRESSHGENVAEEERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAA

Query:  LQRAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKN
        L RAYYAAKPTP+FWKKVSKLVPGKSAQDCFDKVHS+H+TPPQPRPRSR +STKSSQIELLS SE KLLNLDGAK+RK SRK+QKSHNAQKTVR LLEKN
Subjt:  LQRAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKN

Query:  FQGALSCEADLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSM
        +QGALSCEAD FS LEPNINLS+ SP PSK+L ST  L GNQ FLHERSL NHKKP SRFSSSVE VVVSPPVLKQVKNR+LHEKYIDQLH REAKRKS+
Subjt:  FQGALSCEADLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSM

Query:  SKCTKNCISEERGLKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENE
        S+C +NC  EE+ LKE HAART+DLRAAKNALISDAR+AIHQLQ L AS+  +  DFDD ++  DN+D E+E
Subjt:  SKCTKNCISEERGLKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENE

XP_023517243.1 uncharacterized protein LOC111781067 [Cucurbita pepo subsp. pepo]1.5e-19953.46Show/hide
Query:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS
        MPS+NSIASSVDIK LRRSPR L P+ P  + + PSTRRSLRFLQK +ISPPTLPE  RSHSAIRQVH S  CL PSKNVS KTPK VLVNTP+KS KPS
Subjt:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS

Query:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLS----------------------------------------------------------------
         VSS++KDS +GSKKSSTFENGFEGI+ PRRS RLS                                                                
Subjt:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLS----------------------------------------------------------------

Query:  --NALEGRNANVSKSSIASGGRSRDLKNP-----------------------------------------------------------------------
          NALE +NA VSKSSI  GGRSRDLK+                                                                        
Subjt:  --NALEGRNANVSKSSIASGGRSRDLKNP-----------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------SPKV-------------------
                                                                                     +PK+                   
Subjt:  -----------------------------------------------------------------------------SPKV-------------------

Query:  -----------------------------RRS------------------------------------PRLSNGARGHQSIGKSQSFSDQQDALEKSSRK
                                     RRS                                    PR++N   GHQSI K Q    QQDALEKSSRK
Subjt:  -----------------------------RRS------------------------------------PRLSNGARGHQSIGKSQSFSDQQDALEKSSRK

Query:  REKSSGSDKKAGLSHVQNVVTRESSHGENVAE-EERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAALQRAYYAAK
        RE+S  SDKK  L +VQNVVTRESSH ENV E  ERRKGNSADHE IA EGGGT+VV GE+EKKSVA RKRKREDGVVGIR GWTKEQEAALQRAYYAAK
Subjt:  REKSSGSDKKAGLSHVQNVVTRESSHGENVAE-EERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAALQRAYYAAK

Query:  PTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQGALSCEA
        PTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQS+KS QIEL SLSEDKLLN +GAKSRKP RK+Q+S NAQKTVR+LLEK FQ A+S EA
Subjt:  PTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQGALSCEA

Query:  DLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCTKNCIS
        DLFSQLEPN N SN+SPLPSKQLS T DLQGNQGFLHERSLSNHKKPLSRFSSSVE+ VVSPPVLKQVKN+ALHEKYIDQLHCREAKRKSM+KCTK CIS
Subjt:  DLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCTKNCIS

Query:  EERGLKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENEDEI
        EE+GLKEVHA RT+DLRAAKNALISDARDAIHQLQHLQA+ M+D+P+FDD  NLYDNVD ENEDEI
Subjt:  EERGLKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENEDEI

XP_038882568.1 uncharacterized protein LOC120073795 [Benincasa hispida]5.4e-22677.29Show/hide
Query:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS
        MPS +S ASSVDIKTLRRSPRFL  STP  QQ FP+TRRSLRFLQKNEIS PT P  R + S IRQVHSS A L PSK+VSLKTPKS+LVNTP+++SKP 
Subjt:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS

Query:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLSNA--LEGRNANVSKSSIASGGRSRDLKNPSPKVRRSPRLSNGARGHQSIGKSQSFSDQQDALEK
         VSSK++ S TGSKKSSTFENGFEG R PRRSPRLS A  ++     VSKSSI+SG  S DLKNPSPKVRRSPR SNG  G+Q+IGKSQSFS QQD +EK
Subjt:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLSNA--LEGRNANVSKSSIASGGRSRDLKNPSPKVRRSPRLSNGARGHQSIGKSQSFSDQQDALEK

Query:  SSRKREKSSGSDKKAGLSHVQNVVTRESSHGENVAEEERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAALQRAYY
        SSRKR+KSSG  +K    H  NV    +SHGE VAEEE+RKGNS DHE IA +  GT+VVDGE+EKKSVA+RKRKREDGVV IRQGWTKEQE ALQRAYY
Subjt:  SSRKREKSSGSDKKAGLSHVQNVVTRESSHGENVAEEERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAALQRAYY

Query:  AAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQGALS
        AAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDH+TPPQPRPRSRT+ TKSS IELLSLSE KLLNLDG KSRKPSRKSQK+HNAQK VR+LLEKNF+GAL+
Subjt:  AAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQGALS

Query:  CEADLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCTKN
        CEADLFSQLEPNINLSN++PLPS+QLSS  DL G+QGFLHERSLSNHKKPLSRFSSS ++VV+SPPVLKQVKNRALHEKYIDQLHCREAKRKS+SKC K+
Subjt:  CEADLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCTKN

Query:  CISEERGLKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENEDE
        CISEE+ LKE HA RT+DLRAAKNALISDARDAIHQL+HL+A+   +  DFD D +LYDN D +NED+
Subjt:  CISEERGLKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENEDE

TrEMBL top hitse value%identityAlignment
A0A0A0KIP3 Uncharacterized protein2.1e-21274.35Show/hide
Query:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS
        MP  NSIASSVDIKTLRRSPRFL PST   QQ+F +T RSLRFL++NEIS PT P   R+   IRQVHSS A L+PSKNVSLKTPKSV VNTP+++SK  
Subjt:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS

Query:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLS------NALEGRNANVSKSSIASGGRSRDLKNPSPKVRRSPRLSNGARGHQSIGKSQSFSDQQD
         VSSK+KDS TGSKK S FENGF+  R PRRSPRLS      NALEGRNA VSKSSI+SGG S DLKNPSP VRRSPR SNG  G++S G S SFS QQ 
Subjt:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLS------NALEGRNANVSKSSIASGGRSRDLKNPSPKVRRSPRLSNGARGHQSIGKSQSFSDQQD

Query:  ALEKSSRKREKSSGSDKKAGLSHVQNVVTRESSHGENVAEEERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAALQ
         LEKSSRKRE  SGS +  G     NV    SSHG+ VA  ER+KGNSADHEDIA +    QVVDGE+EKKSV  RKRKREDGVVGIRQGWTKEQE +LQ
Subjt:  ALEKSSRKREKSSGSDKKAGLSHVQNVVTRESSHGENVAEEERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAALQ

Query:  RAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQ
        RAYYAAKPTP FWKKVSKLVPGKSAQDCFDKVHSDH+TPPQPRPR RT+STK S +ELL  SE +LLN+DGAKSRKP  KSQKSHNAQK VR+LLEKNFQ
Subjt:  RAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQ

Query:  GALSCEADLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSK
        GA++ EADLFSQLEPNINLSN +PLPSKQLSS MDLQGNQGFLH RSLSNHKKPLSRFS+SVE+ VVSPPVLKQVKNR LHEKYIDQLH REAKRKSMSK
Subjt:  GALSCEADLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSK

Query:  CTKNCISEERG-LKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENEDE
        C K+CIS+E G  KE+HA RT+DLRAAKNALISDARDAI QLQHL+ + M++ P F+DD +  DNVD ++ED+
Subjt:  CTKNCISEERG-LKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENEDE

A0A1S3B194 uncharacterized protein LOC1034849103.9e-21474.48Show/hide
Query:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS
        MP ANSIASSVDIKTLRRSPRFL  +    QQ+ P+TRRSLRFL+KNEIS PT P  RR+ S IRQVHSS A L PS +VSLKTPKSV VNTP+++SK  
Subjt:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS

Query:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLS------NALEGRNANVSKSSIASGGRSRDLKNPSPKVRRSPRLSNGARGHQSIGKSQSFSDQQD
         VSSK+K S TGSKK S FEN FE    PRRSPRLS      NALEGRN  VSKSSI+SGG   DLKNPSP VRRSPR SNG  G++SIGKS SFS QQ 
Subjt:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLS------NALEGRNANVSKSSIASGGRSRDLKNPSPKVRRSPRLSNGARGHQSIGKSQSFSDQQD

Query:  ALEKSSRKREKSSGSDKKAGLSHVQNVVTRESSHGENVAEEERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAALQ
         LEKSSRKRE  +GS +  G     NV    SSHGE VA  ER++GNSAD EDIA +  GTQVVDGE+EKKSV  RKRKREDGVVGIRQGWTKEQE ALQ
Subjt:  ALEKSSRKREKSSGSDKKAGLSHVQNVVTRESSHGENVAEEERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAALQ

Query:  RAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQ
        RAYY AKPTPQFWKKVSKLVPGKSAQDCFDKVHSDH+TPPQPRPR RT+STKSS  ELL  SE +LLNLDGAKSRKPSRKSQKSHNAQK VR+LLEKNFQ
Subjt:  RAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQ

Query:  GALSCEADLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSK
        GA++ EADLFSQLEPNINLSN++PLPSKQLSS +DLQGNQGFLH RSLSNHKKPLSRFS+SVE+ VVSPPVLKQVKNR LHEKYIDQLHCREAKRKSMSK
Subjt:  GALSCEADLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSK

Query:  CTKNCISEERGLKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENEDE
        C K+CIS+E G K +H  RT+DLRAAKNALISDARDAI Q QHL+A+  ++ PDF+D  +   NVD +NED+
Subjt:  CTKNCISEERGLKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENEDE

A0A5D3CMV2 Uncharacterized protein3.9e-21474.48Show/hide
Query:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS
        MP ANSIASSVDIKTLRRSPRFL  +    QQ+ P+TRRSLRFL+KNEIS PT P  RR+ S IRQVHSS A L PS +VSLKTPKSV VNTP+++SK  
Subjt:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS

Query:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLS------NALEGRNANVSKSSIASGGRSRDLKNPSPKVRRSPRLSNGARGHQSIGKSQSFSDQQD
         VSSK+K S TGSKK S FEN FE    PRRSPRLS      NALEGRN  VSKSSI+SGG   DLKNPSP VRRSPR SNG  G++SIGKS SFS QQ 
Subjt:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLS------NALEGRNANVSKSSIASGGRSRDLKNPSPKVRRSPRLSNGARGHQSIGKSQSFSDQQD

Query:  ALEKSSRKREKSSGSDKKAGLSHVQNVVTRESSHGENVAEEERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAALQ
         LEKSSRKRE  +GS +  G     NV    SSHGE VA  ER++GNSAD EDIA +  GTQVVDGE+EKKSV  RKRKREDGVVGIRQGWTKEQE ALQ
Subjt:  ALEKSSRKREKSSGSDKKAGLSHVQNVVTRESSHGENVAEEERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAALQ

Query:  RAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQ
        RAYY AKPTPQFWKKVSKLVPGKSAQDCFDKVHSDH+TPPQPRPR RT+STKSS  ELL  SE +LLNLDGAKSRKPSRKSQKSHNAQK VR+LLEKNFQ
Subjt:  RAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQ

Query:  GALSCEADLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSK
        GA++ EADLFSQLEPNINLSN++PLPSKQLSS +DLQGNQGFLH RSLSNHKKPLSRFS+SVE+ VVSPPVLKQVKNR LHEKYIDQLHCREAKRKSMSK
Subjt:  GALSCEADLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSK

Query:  CTKNCISEERGLKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENEDE
        C K+CIS+E G K +H  RT+DLRAAKNALISDARDAI Q QHL+A+  ++ PDF+D  +   NVD +NED+
Subjt:  CTKNCISEERGLKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENEDE

A0A6J1BUA6 uncharacterized protein LOC1110056071.3e-21775.52Show/hide
Query:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS
        MPS NSI SSV+IKTLRRSPRF T + P  Q++FP TRRSLRFLQKN+IS PTLPE+RRSHSAIRQVHSS AC+RP +NVSLKTPKSVL NT  KSSK  
Subjt:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS

Query:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLS------NALEGRNANVS-KSSIASGGRSRDLKNPSPKVRRSPRLSNGARGHQSIGKSQSFSDQQ
         VSSK++ S TGSKKS+ FENGFEGIR PRRSPRLS      NALEGRNA VS  SSI SG RS DL +PSP VRRSPRL+NG   HQS GKS+ FS QQ
Subjt:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLS------NALEGRNANVS-KSSIASGGRSRDLKNPSPKVRRSPRLSNGARGHQSIGKSQSFSDQQ

Query:  DALEKSSRKR-EKSSGSDKKAGLSHVQNVVTRESSHGENVAEEERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAA
        DALE+  R R +KSSGSDKK GL HV+N+ T  SS G+NVAE ERRKGNSAD E    + GGTQVVDGE++KKSVA+RKRKRE+ VVGIRQGWT+EQEAA
Subjt:  DALEKSSRKR-EKSSGSDKKAGLSHVQNVVTRESSHGENVAEEERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAA

Query:  LQRAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKN
        L RAYYAAKPTP+FWKKVSKLVPGKSAQDCFDKVHS+H+TPPQPRPRSR +STKSSQIELLS SE KLLNLDGAK+RK SRK+QKSHNAQKTVR LLEKN
Subjt:  LQRAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKN

Query:  FQGALSCEADLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSM
        +QGALSCEAD FS LEPNINLS+ SP PSK+L ST  L GNQ FLHERSL NHKKP SRFSSSVE VVVSPPVLKQVKNR+LHEKYIDQLH REAKRKS+
Subjt:  FQGALSCEADLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSM

Query:  SKCTKNCISEERGLKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENE
        S+C +NC  EE+ LKE HAART+DLRAAKNALISDAR+AIHQLQ L AS+  +  DFDD ++  DN+D E+E
Subjt:  SKCTKNCISEERGLKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENE

A0A6J1KQ47 uncharacterized protein LOC1114972411.1e-19752.89Show/hide
Query:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS
        MPS+NSIASSVDIK LRRSPR L P+ P  Q + PSTRRSLRFLQK +ISPPTLPE RRSHSAIRQVH S  CL PSKNVS KTPK VLVNTP+KS KPS
Subjt:  MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPS

Query:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLS----------------------------------------------------------------
         VSS++KDS +G KKSSTF NGFEGI+ PRRS RLS                                                                
Subjt:  AVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLS----------------------------------------------------------------

Query:  --NALEGRNANVSKSSIASGGRSRDLKNP-----------------------------------------------------------------------
          NALE +NA VSKSSI  GGRSRDLK+                                                                        
Subjt:  --NALEGRNANVSKSSIASGGRSRDLKNP-----------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------SPKV-------------------
                                                                                     +PK+                   
Subjt:  -----------------------------------------------------------------------------SPKV-------------------

Query:  -----------------------------RRS------------------------------------PRLSNGARGHQSIGKSQSFSDQQDALEKSSRK
                                     RRS                                    PRL+N   GHQSI KSQ    QQDALEKSSRK
Subjt:  -----------------------------RRS------------------------------------PRLSNGARGHQSIGKSQSFSDQQDALEKSSRK

Query:  REKSSGSDKKAGLSHVQNVVTRESSHGENVAE-EERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAALQRAYYAAK
        RE+S   DKK  L +VQNVVTRE+SH EN+ E  ERRKGNSADHE IA EGGGT+VV GE+EKKSVA RKRKREDGVVGIRQGWTKEQEAALQRAYYAAK
Subjt:  REKSSGSDKKAGLSHVQNVVTRESSHGENVAE-EERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAALQRAYYAAK

Query:  PTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQGALSCEA
        PTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQ +KS QIEL SLSEDKLLN +GAKSRKP RK+Q+S NAQKTVR+LLEK FQ A+S EA
Subjt:  PTPQFWKKVSKLVPGKSAQDCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQGALSCEA

Query:  DLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCTKNCIS
        DLFSQLEPN+N SN+SPLPSKQLS T DLQGNQGFLHERSLSNHKKPLSRFS+SVE+ VVSP VLKQVKN+ALHEKYIDQLHCREAKRKSM+KCTK CIS
Subjt:  DLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCTKNCIS

Query:  EERGLKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENEDEI
        E++GLKEVHA RT+DLRAAKNALISDARDAIHQLQH+QA+ ++D+PDFDDD  LYDNVD ENEDEI
Subjt:  EERGLKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENEDEI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G53440.1 Homeodomain-like superfamily protein1.2e-4234.5Show/hide
Query:  SPPTLPELRRSHSAIRQVHSSDACLRPSKNVSL------KTPKSVLVNTPRKSSKPSAVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLSNALEGRN
        S   LP+  R  + +  + + D+ ++P K++        KT + VL    R  SK    SS      + SKK +  ++GF  +   RRS RLS+  E   
Subjt:  SPPTLPELRRSHSAIRQVHSSDACLRPSKNVSL------KTPKSVLVNTPRKSSKPSAVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLSNALEGRN

Query:  ANVSKSSIASGGRSRDLK----NPSPKVRRSPRLSNGA-----RGHQSIGKSQSFSDQQDALEKSSRKREKSSGSD----KKAGLSHVQNVVTRESSHGE
         N  K   AS   SR       + S  +RRSPR S+G      +   SIGK    S       KS    EK  G D    K    S V+    +  S  +
Subjt:  ANVSKSSIASGGRSRDLK----NPSPKVRRSPRLSNGA-----RGHQSIGKSQSFSDQQDALEKSSRKREKSSGSD----KKAGLSHVQNVVTRESSHGE

Query:  NVAEEERRKG---NSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQ-------GWTKEQEAALQRAYYAAKPTPQFWKKVSKLVPGKSAQ
        +  EEE              IAK            E K   K + + ED   G++        GWT+E E ALQ AY   KP+P FWKKV+K+VPGKSAQ
Subjt:  NVAEEERRKG---NSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQ-------GWTKEQEAALQRAYYAAKPTPQFWKKVSKLVPGKSAQ

Query:  DCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQGALSCEADLFSQLEPNINLSNYSPLP
        +CFD+V+S  +TP Q +PR R   T  S I   SLS  KLL  +  K++   R++  S   +K VRHLLEK          DLFS LEPN   SN+   P
Subjt:  DCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQGALSCEADLFSQLEPNINLSNYSPLP

Query:  SKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCTKNCISEERGLKEVHAARTHDLRAA
         ++                RSL    +     SS     +VSPPVLKQVKN+ALHEKYID LH R+AKRK+ S      ++ +  ++ +   +   +RAA
Subjt:  SKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCTKNCISEERGLKEVHAARTHDLRAA

Query:  KNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENED
        K+AL  D +DAI +L+ L+A     + +F      YD+V+ + ED
Subjt:  KNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENED

AT3G53440.2 Homeodomain-like superfamily protein1.2e-4234.5Show/hide
Query:  SPPTLPELRRSHSAIRQVHSSDACLRPSKNVSL------KTPKSVLVNTPRKSSKPSAVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLSNALEGRN
        S   LP+  R  + +  + + D+ ++P K++        KT + VL    R  SK    SS      + SKK +  ++GF  +   RRS RLS+  E   
Subjt:  SPPTLPELRRSHSAIRQVHSSDACLRPSKNVSL------KTPKSVLVNTPRKSSKPSAVSSKDKDSITGSKKSSTFENGFEGIRGPRRSPRLSNALEGRN

Query:  ANVSKSSIASGGRSRDLK----NPSPKVRRSPRLSNGA-----RGHQSIGKSQSFSDQQDALEKSSRKREKSSGSD----KKAGLSHVQNVVTRESSHGE
         N  K   AS   SR       + S  +RRSPR S+G      +   SIGK    S       KS    EK  G D    K    S V+    +  S  +
Subjt:  ANVSKSSIASGGRSRDLK----NPSPKVRRSPRLSNGA-----RGHQSIGKSQSFSDQQDALEKSSRKREKSSGSD----KKAGLSHVQNVVTRESSHGE

Query:  NVAEEERRKG---NSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQ-------GWTKEQEAALQRAYYAAKPTPQFWKKVSKLVPGKSAQ
        +  EEE              IAK            E K   K + + ED   G++        GWT+E E ALQ AY   KP+P FWKKV+K+VPGKSAQ
Subjt:  NVAEEERRKG---NSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQ-------GWTKEQEAALQRAYYAAKPTPQFWKKVSKLVPGKSAQ

Query:  DCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQGALSCEADLFSQLEPNINLSNYSPLP
        +CFD+V+S  +TP Q +PR R   T  S I   SLS  KLL  +  K++   R++  S   +K VRHLLEK          DLFS LEPN   SN+   P
Subjt:  DCFDKVHSDHLTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQGALSCEADLFSQLEPNINLSNYSPLP

Query:  SKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCTKNCISEERGLKEVHAARTHDLRAA
         ++                RSL    +     SS     +VSPPVLKQVKN+ALHEKYID LH R+AKRK+ S      ++ +  ++ +   +   +RAA
Subjt:  SKQLSSTMDLQGNQGFLHERSLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCTKNCISEERGLKEVHAARTHDLRAA

Query:  KNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENED
        K+AL  D +DAI +L+ L+A     + +F      YD+V+ + ED
Subjt:  KNALISDARDAIHQLQHLQASTMHDAPDFDDDSNLYDNVDCENED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCAGCGCAAACTCCATCGCCAGCTCCGTTGACATCAAAACTCTTCGAAGATCTCCAAGATTTCTTACTCCTTCTACACCATTAGCACAACAGAAGTTCCCCTCTAC
ACGAAGATCTCTCAGATTTCTTCAAAAGAATGAAATTTCACCTCCTACACTCCCGGAGCTCCGCCGCTCTCACTCTGCAATTCGCCAGGTACATTCTTCTGACGCTTGTC
TCCGACCTTCAAAGAATGTTTCTCTTAAAACCCCTAAATCTGTTCTTGTAAATACCCCCCGAAAATCGAGTAAACCTAGTGCTGTCTCGAGTAAAGACAAGGATTCAATT
ACAGGGTCGAAAAAGTCTTCAACATTCGAAAATGGGTTTGAGGGAATACGAGGTCCGAGACGGTCCCCTAGGTTATCCAATGCGCTCGAGGGTCGTAACGCGAATGTTTC
TAAGAGTTCAATTGCTTCCGGAGGGCGGTCGAGGGATTTGAAGAATCCAAGCCCTAAGGTGAGAAGATCTCCAAGACTCAGTAATGGAGCTCGGGGGCATCAAAGTATCG
GCAAATCTCAAAGTTTTTCGGACCAGCAAGATGCTCTCGAGAAAAGTAGCCGGAAACGGGAGAAGTCCAGTGGTTCGGATAAGAAAGCGGGGTTATCACATGTTCAGAAT
GTTGTTACCAGGGAGAGCTCTCATGGGGAAAATGTGGCAGAAGAAGAGAGGAGAAAAGGAAATTCTGCTGATCACGAGGATATTGCAAAAGAAGGTGGTGGAACACAAGT
AGTTGATGGAGAAGTAGAAAAGAAGTCAGTGGCTAAAAGGAAAAGAAAGCGAGAGGATGGTGTGGTTGGGATTAGACAAGGGTGGACTAAAGAACAGGAAGCCGCATTAC
AGAGAGCTTATTATGCTGCAAAGCCCACTCCCCAATTTTGGAAGAAGGTTTCCAAACTGGTGCCTGGAAAGTCTGCCCAAGATTGCTTTGATAAAGTTCATTCCGACCAT
TTGACCCCTCCTCAACCTCGACCTCGATCCAGAACACAGAGCACAAAATCATCTCAAATTGAACTTTTGTCTCTTTCGGAGGATAAACTTCTAAACCTTGATGGTGCCAA
GTCTAGAAAGCCTAGTCGCAAGAGTCAGAAAAGCCATAATGCGCAGAAAACTGTGAGACATTTGTTAGAGAAGAACTTTCAGGGGGCCCTTAGCTGTGAGGCCGATTTGT
TCTCACAACTCGAGCCAAATATTAATCTCTCTAACTATTCTCCTCTACCCAGTAAACAACTCTCTAGCACTATGGATTTACAGGGAAACCAAGGATTTCTTCATGAGAGA
TCCTTGTCGAATCACAAGAAGCCCCTTTCAAGATTTAGCAGCTCAGTTGAGAAAGTTGTCGTAAGTCCACCAGTACTGAAACAGGTGAAGAACAGGGCCTTGCATGAGAA
GTATATCGACCAGTTACATTGTAGGGAAGCAAAGAGAAAATCAATGTCAAAATGCACAAAAAACTGCATCTCCGAAGAGAGAGGTTTAAAGGAAGTCCATGCTGCAAGAA
CTCATGATCTTAGAGCTGCTAAAAATGCCTTGATTTCTGATGCAAGGGATGCCATTCATCAGTTACAACACTTGCAAGCCAGTACCATGCACGATGCTCCTGATTTCGAT
GACGACAGCAATTTATATGACAATGTCGATTGTGAAAATGAAGATGAAATATGA
mRNA sequenceShow/hide mRNA sequence
TGTCTACAAACTGTTCGTGAGAATGCCCAGCGCAAACTCCATCGCCAGCTCCGTTGACATCAAAACTCTTCGAAGATCTCCAAGATTTCTTACTCCTTCTACACCATTAG
CACAACAGAAGTTCCCCTCTACACGAAGATCTCTCAGATTTCTTCAAAAGAATGAAATTTCACCTCCTACACTCCCGGAGCTCCGCCGCTCTCACTCTGCAATTCGCCAG
GTACATTCTTCTGACGCTTGTCTCCGACCTTCAAAGAATGTTTCTCTTAAAACCCCTAAATCTGTTCTTGTAAATACCCCCCGAAAATCGAGTAAACCTAGTGCTGTCTC
GAGTAAAGACAAGGATTCAATTACAGGGTCGAAAAAGTCTTCAACATTCGAAAATGGGTTTGAGGGAATACGAGGTCCGAGACGGTCCCCTAGGTTATCCAATGCGCTCG
AGGGTCGTAACGCGAATGTTTCTAAGAGTTCAATTGCTTCCGGAGGGCGGTCGAGGGATTTGAAGAATCCAAGCCCTAAGGTGAGAAGATCTCCAAGACTCAGTAATGGA
GCTCGGGGGCATCAAAGTATCGGCAAATCTCAAAGTTTTTCGGACCAGCAAGATGCTCTCGAGAAAAGTAGCCGGAAACGGGAGAAGTCCAGTGGTTCGGATAAGAAAGC
GGGGTTATCACATGTTCAGAATGTTGTTACCAGGGAGAGCTCTCATGGGGAAAATGTGGCAGAAGAAGAGAGGAGAAAAGGAAATTCTGCTGATCACGAGGATATTGCAA
AAGAAGGTGGTGGAACACAAGTAGTTGATGGAGAAGTAGAAAAGAAGTCAGTGGCTAAAAGGAAAAGAAAGCGAGAGGATGGTGTGGTTGGGATTAGACAAGGGTGGACT
AAAGAACAGGAAGCCGCATTACAGAGAGCTTATTATGCTGCAAAGCCCACTCCCCAATTTTGGAAGAAGGTTTCCAAACTGGTGCCTGGAAAGTCTGCCCAAGATTGCTT
TGATAAAGTTCATTCCGACCATTTGACCCCTCCTCAACCTCGACCTCGATCCAGAACACAGAGCACAAAATCATCTCAAATTGAACTTTTGTCTCTTTCGGAGGATAAAC
TTCTAAACCTTGATGGTGCCAAGTCTAGAAAGCCTAGTCGCAAGAGTCAGAAAAGCCATAATGCGCAGAAAACTGTGAGACATTTGTTAGAGAAGAACTTTCAGGGGGCC
CTTAGCTGTGAGGCCGATTTGTTCTCACAACTCGAGCCAAATATTAATCTCTCTAACTATTCTCCTCTACCCAGTAAACAACTCTCTAGCACTATGGATTTACAGGGAAA
CCAAGGATTTCTTCATGAGAGATCCTTGTCGAATCACAAGAAGCCCCTTTCAAGATTTAGCAGCTCAGTTGAGAAAGTTGTCGTAAGTCCACCAGTACTGAAACAGGTGA
AGAACAGGGCCTTGCATGAGAAGTATATCGACCAGTTACATTGTAGGGAAGCAAAGAGAAAATCAATGTCAAAATGCACAAAAAACTGCATCTCCGAAGAGAGAGGTTTA
AAGGAAGTCCATGCTGCAAGAACTCATGATCTTAGAGCTGCTAAAAATGCCTTGATTTCTGATGCAAGGGATGCCATTCATCAGTTACAACACTTGCAAGCCAGTACCAT
GCACGATGCTCCTGATTTCGATGACGACAGCAATTTATATGACAATGTCGATTGTGAAAATGAAGATGAAATATGATACATTCCATTCTTTTATCTTCAGCTAATTGCCT
TAGATCGCATGTATCTGTGTATAATGAGAATTTAGAAGTTGTAGTCTATGTTAACCGTGGTTCTTGAAACCTCTTTGTCTTTTCTAATTGCTTGTACAGTCAAGC
Protein sequenceShow/hide protein sequence
MPSANSIASSVDIKTLRRSPRFLTPSTPLAQQKFPSTRRSLRFLQKNEISPPTLPELRRSHSAIRQVHSSDACLRPSKNVSLKTPKSVLVNTPRKSSKPSAVSSKDKDSI
TGSKKSSTFENGFEGIRGPRRSPRLSNALEGRNANVSKSSIASGGRSRDLKNPSPKVRRSPRLSNGARGHQSIGKSQSFSDQQDALEKSSRKREKSSGSDKKAGLSHVQN
VVTRESSHGENVAEEERRKGNSADHEDIAKEGGGTQVVDGEVEKKSVAKRKRKREDGVVGIRQGWTKEQEAALQRAYYAAKPTPQFWKKVSKLVPGKSAQDCFDKVHSDH
LTPPQPRPRSRTQSTKSSQIELLSLSEDKLLNLDGAKSRKPSRKSQKSHNAQKTVRHLLEKNFQGALSCEADLFSQLEPNINLSNYSPLPSKQLSSTMDLQGNQGFLHER
SLSNHKKPLSRFSSSVEKVVVSPPVLKQVKNRALHEKYIDQLHCREAKRKSMSKCTKNCISEERGLKEVHAARTHDLRAAKNALISDARDAIHQLQHLQASTMHDAPDFD
DDSNLYDNVDCENEDEI