; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001067 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001067
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr4:23472862..23477747
RNA-Seq ExpressionLag0001067
SyntenyLag0001067
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008447174.1 PREDICTED: uncharacterized protein LOC103489686 isoform X3 [Cucumis melo]1.8e-24782.82Show/hide
Query:  MESTIQIVKKEHKTFVRRRLPPTPPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFSLV
        MES ++IVKKEHKTFVRRRLPPTPPPPPSLSSSSS PLNPTL   PS PCP+KKTRDLPNFSECH+CGFR+D VDGRSRLNSL+SEWRIVLLCKKCFSLV
Subjt:  MESTIQIVKKEHKTFVRRRLPPTPPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFSLV

Query:  ESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSLSA
        ESSQVCSYCFADS GD F CCECNRRVHRECF+QYSRVAPWSYSSSGS FSVCIDCWVPKPIVTARAVLRSRK+RRKN++VSDLRSSKVSTSG CKSLSA
Subjt:  ESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSLSA

Query:  VVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMRVDDG
        +VKDANCLVEKKVDAAVRARE ALKKAA ARRAS LASDALNLVAQRDESAAKESG+SA+DAELAIQLHR MN+SPRFSKNLCS NSNY+ F+N RVDDG
Subjt:  VVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMRVDDG

Query:  DPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGAGSN
        + S GA++SG+FDFFK P VLV+NN+CNS DN  SEPSVTAKDH +PLE N  E +GKD + VK N C VKCD+ES NVEL P+ +++S SIKLT AG N
Subjt:  DPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGAGSN

Query:  HDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNASFGC
        +D  CSESQ +  QDKY LPRDERC+AKPYHYFFKYRRRDTTKRY LKYSKRNS+LKRMPDC P I VDGMC+GVPSSSAAI+ISSTE FPVISNASFGC
Subjt:  HDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNASFGC

Query:  CAVPLQASGLGVNAVQEISNHGGR
        CAVPLQASGLGVNAVQEISN GGR
Subjt:  CAVPLQASGLGVNAVQEISNHGGR

XP_022952090.1 uncharacterized protein LOC111454852 [Cucurbita moschata]3.1e-24784.25Show/hide
Query:  MESTIQIVKKEHKTFVRRRLPPT--PPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFS
        M+S  QI+KKEHKTFVRRRLPPT  PPPPPSLSSSSS PLNPTL  KPS PCP KKTRDLPNFSECHACGFRVDAVDGRSRLNSL+SEWRIVLLCKKCFS
Subjt:  MESTIQIVKKEHKTFVRRRLPPT--PPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFS

Query:  LVESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSL
        LVESSQVCSYCFADSRGD FNC ECNRRVHRECF+ YSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRK+RRKNI+VSDLRSSKVSTSG CKS 
Subjt:  LVESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSL

Query:  SAVVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMR-V
        S++VKDANCL +K VDAAVR RE ALKKAA  RRASELASDALNLVAQRDE+AAKESGESADDAELAIQLHR MN+SPR SKN CS NSNY+AFEN R V
Subjt:  SAVVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMR-V

Query:  DDGDPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGA
        DDGD S+G + S +FDFFKTPQVL++N++CNS DNA SEPSVTAKDH APLEINR ESMG +PIP K N C+VKCDTESENVEL PK+DLRSRSI+L  A
Subjt:  DDGDPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGA

Query:  GSNHDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNAS
        G NHDR CSE       DKYTLPRDERC+AKPYHYFFKY+RRDTTKRY LKYSKR SRLKRMPDCKP ILVDGMCLGVPSSSAAILI STEKFPVISNAS
Subjt:  GSNHDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNAS

Query:  FGCCAVPLQASGLGVNAVQEISNHGGR
        FGCCAVPLQAS LGV+AVQEIS+ GGR
Subjt:  FGCCAVPLQASGLGVNAVQEISNHGGR

XP_023554148.1 uncharacterized protein LOC111811499 isoform X2 [Cucurbita pepo subsp. pepo]4.1e-24784.06Show/hide
Query:  MESTIQIVKKEHKTFVRRRLPPT--PPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFS
        M+S  QI+KKEHKTFVRRRLPPT  PPPPPSLSSSSS PLNPTL AKPS PCP KKTRDLPNFSECHACGFRVDAVDGRSRLNSL+SEWRIVLLCKKCFS
Subjt:  MESTIQIVKKEHKTFVRRRLPPT--PPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFS

Query:  LVESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSL
        LVESSQVCSYCFADSRGD FNC ECNRRVHRECF+ YSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRK+RRKNI+VSDLRSSKVSTSG CKSL
Subjt:  LVESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSL

Query:  SAVVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMR-V
        S++VKDANCL +K VDAAVR RE ALKKAA  RRASELASDALNLVAQRDE+AAKESGESADDAELAIQLHR MN+SPR SKN CS NSNY+AFEN R V
Subjt:  SAVVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMR-V

Query:  DDGDPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGA
        DDGD S+G + S +FDFFKTPQV ++N++CNS DNA SEPSVTAKDH APLEINR ESMG +P+P K N C+VKCDTESENVEL PK+DLRSRSI+L  A
Subjt:  DDGDPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGA

Query:  GSNHDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNAS
        G NHDR CSE       DKY+LPRDERC+AKPYHYFFKY+RRDTTKRY LKYSKR SRLKRMPDCKP ILVDGMCLGVPSSSAAILI STEKFPVISNAS
Subjt:  GSNHDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNAS

Query:  FGCCAVPLQASGLGVNAVQEISNHGGR
        FGCCAVPLQAS LGV+AVQEIS+ GGR
Subjt:  FGCCAVPLQASGLGVNAVQEISNHGGR

XP_038887852.1 uncharacterized protein LOC120077849 isoform X1 [Benincasa hispida]1.5e-25284.95Show/hide
Query:  MESTIQIVKKEHKTFVRRRLPPTPPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFSLV
        MES I IVKKEHKTFVRRRLPPTPPPPPSLSSSSS PLNPTL AK     P+KKTRDLPNFSECHACGFR+DAVDG SRLNSL+SEWRIVLLCKKCFSLV
Subjt:  MESTIQIVKKEHKTFVRRRLPPTPPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFSLV

Query:  ESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSLSA
        ESSQVCSYCFADS GD FNCCECNRRVHRECF+QYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRK+RRKNIHVSDLRSSKVSTSG CKSLSA
Subjt:  ESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSLSA

Query:  VVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMRVDDG
        +VKDANCLVEKKVDAAVRAREQALKKAA ARRAS LASDALNLVAQRDESAAKESG+SA+DAELAIQLHR MN+SPRFSKNLCS NSNY+ FENMRVDDG
Subjt:  VVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMRVDDG

Query:  DPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGAGSN
        + SVGA++SG+FDFFK P VLV++N+CNS DN  SEPSVTAKDH +PLEIN  ES GKDP+ VK N C VKCD+ES NVEL  K ++RS SIKLT AG N
Subjt:  DPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGAGSN

Query:  HDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNASFGC
        HDR CSESQ +L Q+KY LPRDERC+AKPYHYFFKYRRRDTTKRY LKYSKRNSRLKRMPDC P ILVDGMC+GVPSSSAAI+ISSTE FPVISNASFGC
Subjt:  HDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNASFGC

Query:  CAVPLQASGLGVNAVQEISNHGGRQ
        CAVPLQASGLGVNAVQE+SN G  +
Subjt:  CAVPLQASGLGVNAVQEISNHGGRQ

XP_038887853.1 uncharacterized protein LOC120077849 isoform X2 [Benincasa hispida]3.8e-25385.31Show/hide
Query:  MESTIQIVKKEHKTFVRRRLPPTPPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFSLV
        MES I IVKKEHKTFVRRRLPPTPPPPPSLSSSSS PLNPTL AK     P+KKTRDLPNFSECHACGFR+DAVDG SRLNSL+SEWRIVLLCKKCFSLV
Subjt:  MESTIQIVKKEHKTFVRRRLPPTPPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFSLV

Query:  ESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSLSA
        ESSQVCSYCFADS GD FNCCECNRRVHRECF+QYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRK+RRKNIHVSDLRSSKVSTSG CKSLSA
Subjt:  ESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSLSA

Query:  VVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMRVDDG
        +VKDANCLVEKKVDAAVRAREQALKKAA ARRAS LASDALNLVAQRDESAAKESG+SA+DAELAIQLHR MN+SPRFSKNLCS NSNY+ FENMRVDDG
Subjt:  VVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMRVDDG

Query:  DPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGAGSN
        + SVGA++SG+FDFFK P VLV++N+CNS DN  SEPSVTAKDH +PLEIN  ES GKDP+ VK N C VKCD+ES NVEL  K ++RS SIKLT AG N
Subjt:  DPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGAGSN

Query:  HDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNASFGC
        HDR CSESQ +L Q+KY LPRDERC+AKPYHYFFKYRRRDTTKRY LKYSKRNSRLKRMPDC P ILVDGMC+GVPSSSAAI+ISSTE FPVISNASFGC
Subjt:  HDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNASFGC

Query:  CAVPLQASGLGVNAVQEISNHGGR
        CAVPLQASGLGVNAVQE+SN G R
Subjt:  CAVPLQASGLGVNAVQEISNHGGR

TrEMBL top hitse value%identityAlignment
A0A1S3BG91 uncharacterized protein LOC103489686 isoform X38.9e-24882.82Show/hide
Query:  MESTIQIVKKEHKTFVRRRLPPTPPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFSLV
        MES ++IVKKEHKTFVRRRLPPTPPPPPSLSSSSS PLNPTL   PS PCP+KKTRDLPNFSECH+CGFR+D VDGRSRLNSL+SEWRIVLLCKKCFSLV
Subjt:  MESTIQIVKKEHKTFVRRRLPPTPPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFSLV

Query:  ESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSLSA
        ESSQVCSYCFADS GD F CCECNRRVHRECF+QYSRVAPWSYSSSGS FSVCIDCWVPKPIVTARAVLRSRK+RRKN++VSDLRSSKVSTSG CKSLSA
Subjt:  ESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSLSA

Query:  VVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMRVDDG
        +VKDANCLVEKKVDAAVRARE ALKKAA ARRAS LASDALNLVAQRDESAAKESG+SA+DAELAIQLHR MN+SPRFSKNLCS NSNY+ F+N RVDDG
Subjt:  VVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMRVDDG

Query:  DPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGAGSN
        + S GA++SG+FDFFK P VLV+NN+CNS DN  SEPSVTAKDH +PLE N  E +GKD + VK N C VKCD+ES NVEL P+ +++S SIKLT AG N
Subjt:  DPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGAGSN

Query:  HDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNASFGC
        +D  CSESQ +  QDKY LPRDERC+AKPYHYFFKYRRRDTTKRY LKYSKRNS+LKRMPDC P I VDGMC+GVPSSSAAI+ISSTE FPVISNASFGC
Subjt:  HDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNASFGC

Query:  CAVPLQASGLGVNAVQEISNHGGR
        CAVPLQASGLGVNAVQEISN GGR
Subjt:  CAVPLQASGLGVNAVQEISNHGGR

A0A1S3BHF1 uncharacterized protein LOC103489686 isoform X12.0e-24782.63Show/hide
Query:  MESTIQIVKKEHKTFVRRRLPPTPPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFSLV
        MES ++IVKKEHKTFVRRRLPPTPPPPPSLSSSSS PLNPTL   PS PCP+KKTRDLPNFSECH+CGFR+D VDGRSRLNSL+SEWRIVLLCKKCFSLV
Subjt:  MESTIQIVKKEHKTFVRRRLPPTPPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFSLV

Query:  ESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSLSA
        ESSQVCSYCFADS GD F CCECNRRVHRECF+QYSRVAPWSYSSSGS FSVCIDCWVPKPIVTARAVLRSRK+RRKN++VSDLRSSKVSTSG CKSLSA
Subjt:  ESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSLSA

Query:  VVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMRVDDG
        +VKDANCLVEKKVDAAVRARE ALKKAA ARRAS LASDALNLVAQRDESAAKESG+SA+DAELAIQLHR MN+SPRFSKNLCS NSNY+ F+N RVDDG
Subjt:  VVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMRVDDG

Query:  DPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGAGSN
        + S GA++SG+FDFFK P VLV+NN+CNS DN  SEPSVTAKDH +PLE N  E +GKD + VK N C VKCD+ES NVEL P+ +++S SIKLT AG N
Subjt:  DPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGAGSN

Query:  HDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNASFGC
        +D  CSESQ +  QDKY LPRDERC+AKPYHYFFKYRRRDTTKRY LKYSKRNS+LKRMPDC P I VDGMC+GVPSSSAAI+ISSTE FPVISNASFGC
Subjt:  HDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNASFGC

Query:  CAVPLQASGLGVNAVQEISNHGGR
        CAVPLQASGLGVNAVQEISN GG+
Subjt:  CAVPLQASGLGVNAVQEISNHGGR

A0A1S4DWI3 uncharacterized protein LOC103489686 isoform X23.4e-24782.79Show/hide
Query:  MESTIQIVKKEHKTFVRRRLPPTPPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFSLV
        MES ++IVKKEHKTFVRRRLPPTPPPPPSLSSSSS PLNPTL   PS PCP+KKTRDLPNFSECH+CGFR+D VDGRSRLNSL+SEWRIVLLCKKCFSLV
Subjt:  MESTIQIVKKEHKTFVRRRLPPTPPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFSLV

Query:  ESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSLSA
        ESSQVCSYCFADS GD F CCECNRRVHRECF+QYSRVAPWSYSSSGS FSVCIDCWVPKPIVTARAVLRSRK+RRKN++VSDLRSSKVSTSG CKSLSA
Subjt:  ESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSLSA

Query:  VVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMRVDDG
        +VKDANCLVEKKVDAAVRARE ALKKAA ARRAS LASDALNLVAQRDESAAKESG+SA+DAELAIQLHR MN+SPRFSKNLCS NSNY+ F+N RVDDG
Subjt:  VVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMRVDDG

Query:  DPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGAGSN
        + S GA++SG+FDFFK P VLV+NN+CNS DN  SEPSVTAKDH +PLE N  E +GKD + VK N C VKCD+ES NVEL P+ +++S SIKLT AG N
Subjt:  DPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGAGSN

Query:  HDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNASFGC
        +D  CSESQ +  QDKY LPRDERC+AKPYHYFFKYRRRDTTKRY LKYSKRNS+LKRMPDC P I VDGMC+GVPSSSAAI+ISSTE FPVISNASFGC
Subjt:  HDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNASFGC

Query:  CAVPLQASGLGVNAVQEISNHGG
        CAVPLQASGLGVNAVQEISN GG
Subjt:  CAVPLQASGLGVNAVQEISNHGG

A0A5D3D3Q5 Uncharacterized protein4.4e-24782.63Show/hide
Query:  MESTIQIVKKEHKTFVRRRLPPTPPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFSLV
        MES ++IVKKEHKTFVRRRLPPTPPPPPSLSSSSS PLNPTL   PS PCP+KKTRDLPNFSECH+CGFR+D VDGRSRLNSL+SEWRIVLLCKKCFSLV
Subjt:  MESTIQIVKKEHKTFVRRRLPPTPPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFSLV

Query:  ESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSLSA
        ESSQVCSYCFADS  D F CCECNRRVHRECF+QYSRVAPWSYSSSGS FSVCIDCWVPKPIVTARAVLRSRK+RRKN++VSDLRSSKVSTSG CKSLSA
Subjt:  ESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSLSA

Query:  VVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMRVDDG
        +VKDANCLVEKKVDAAVRARE ALKKAA ARRAS LASDALNLVAQRDESAAKESG+SA+DAELAIQLHR MN+SPRFSKNLCS NSNY+ F+N RVDDG
Subjt:  VVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMRVDDG

Query:  DPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGAGSN
        + S GA++SG+FDFFK P VLV+NN+CNS DN  SEPSVTAKDH +PLE N  E +GKD + VK N C VKCD+ES NVEL P+ +++S SIKLT AG N
Subjt:  DPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGAGSN

Query:  HDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNASFGC
        +D  CSESQ +  QDKY LPRDERC+AKPYHYFFKYRRRDTTKRY LKYSKRNS+LKRMPDC P I VDGMC+GVPSSSAAI+ISSTE FPVISNASFGC
Subjt:  HDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNASFGC

Query:  CAVPLQASGLGVNAVQEISNHGGR
        CAVPLQASGLGVNAVQEISN GGR
Subjt:  CAVPLQASGLGVNAVQEISNHGGR

A0A6J1GKS5 uncharacterized protein LOC1114548521.5e-24784.25Show/hide
Query:  MESTIQIVKKEHKTFVRRRLPPT--PPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFS
        M+S  QI+KKEHKTFVRRRLPPT  PPPPPSLSSSSS PLNPTL  KPS PCP KKTRDLPNFSECHACGFRVDAVDGRSRLNSL+SEWRIVLLCKKCFS
Subjt:  MESTIQIVKKEHKTFVRRRLPPT--PPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFS

Query:  LVESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSL
        LVESSQVCSYCFADSRGD FNC ECNRRVHRECF+ YSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRK+RRKNI+VSDLRSSKVSTSG CKS 
Subjt:  LVESSQVCSYCFADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSL

Query:  SAVVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMR-V
        S++VKDANCL +K VDAAVR RE ALKKAA  RRASELASDALNLVAQRDE+AAKESGESADDAELAIQLHR MN+SPR SKN CS NSNY+AFEN R V
Subjt:  SAVVKDANCLVEKKVDAAVRAREQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMR-V

Query:  DDGDPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGA
        DDGD S+G + S +FDFFKTPQVL++N++CNS DNA SEPSVTAKDH APLEINR ESMG +PIP K N C+VKCDTESENVEL PK+DLRSRSI+L  A
Subjt:  DDGDPSVGAMYSGDFDFFKTPQVLVHNNVCNSSDNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGA

Query:  GSNHDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNAS
        G NHDR CSE       DKYTLPRDERC+AKPYHYFFKY+RRDTTKRY LKYSKR SRLKRMPDCKP ILVDGMCLGVPSSSAAILI STEKFPVISNAS
Subjt:  GSNHDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRDTTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNAS

Query:  FGCCAVPLQASGLGVNAVQEISNHGGR
        FGCCAVPLQAS LGV+AVQEIS+ GGR
Subjt:  FGCCAVPLQASGLGVNAVQEISNHGGR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCCACCATTCAAATTGTAAAGAAAGAGCACAAAACCTTCGTTAGGCGACGATTACCACCAACCCCACCACCGCCGCCTTCTCTTTCTTCTTCCTCTTCGGTTCC
TCTCAACCCAACTCTAACCGCGAAGCCCTCAGACCCTTGTCCGAAGAAGAAAACTCGGGATTTGCCTAATTTTTCGGAATGTCATGCTTGTGGATTTCGTGTTGACGCCG
TTGACGGTCGTTCGAGGCTCAATTCCCTCTTCAGCGAGTGGCGGATTGTTCTTCTTTGTAAGAAGTGCTTCTCTCTTGTCGAATCTTCCCAGGTCTGTTCTTACTGTTTT
GCTGATTCGAGAGGCGATTGTTTTAATTGTTGTGAGTGTAACCGTCGGGTTCATAGGGAATGTTTTGCACAGTATAGTAGAGTTGCTCCTTGGTCGTATTCCTCTTCGGG
TTCTGAGTTTTCTGTGTGTATTGATTGTTGGGTTCCTAAACCAATTGTGACAGCGAGGGCTGTTTTGAGAAGTAGGAAAGTTAGGAGAAAGAATATCCATGTTTCGGATT
TGCGGAGTTCTAAGGTTTCAACTAGCGGAGGTTGCAAATCTTTGAGTGCAGTGGTGAAAGATGCTAATTGTTTGGTGGAGAAGAAAGTTGATGCTGCTGTTAGAGCTAGG
GAGCAGGCGCTGAAAAAGGCTGCTGAGGCTAGGAGGGCTTCTGAGTTGGCTAGCGATGCCTTAAATTTGGTTGCTCAAAGGGATGAAAGTGCTGCCAAAGAGTCTGGGGA
ATCTGCCGACGACGCCGAGTTGGCAATTCAGTTGCACCGAGTTATGAATACTTCTCCAAGATTTTCCAAGAACTTGTGTTCGGTGAATTCAAACTACATTGCCTTTGAGA
ATATGAGGGTGGATGATGGTGACCCATCAGTAGGAGCAATGTATAGCGGGGACTTTGATTTCTTTAAAACACCTCAAGTTTTGGTTCATAACAATGTATGCAACTCTTCT
GATAATGCTGTTTCTGAGCCTTCAGTCACGGCCAAAGATCATGCTGCACCATTAGAAATCAACCGTTCAGAATCCATGGGTAAAGATCCTATACCTGTCAAGAGCAATGC
TTGCACGGTAAAGTGTGATACTGAGTCTGAGAATGTGGAATTGCTACCAAAAGATGATCTGAGGAGCAGATCAATTAAATTAACGGGTGCCGGTTCCAATCATGATAGGT
CATGCTCTGAATCTCAACCGACACTCGAACAAGATAAGTACACTTTACCCAGGGATGAGAGATGTGTTGCAAAGCCCTATCACTATTTCTTTAAGTATCGGAGGAGAGAT
ACAACTAAACGCTATCAATTGAAGTATAGCAAGCGAAATTCTAGACTGAAGAGGATGCCAGATTGCAAACCTAACATTCTTGTTGATGGAATGTGCTTGGGTGTTCCGTC
TTCCTCTGCTGCAATTCTAATTAGTAGTACTGAAAAATTTCCAGTAATTTCAAATGCCTCATTTGGCTGCTGCGCTGTTCCTCTGCAAGCTTCTGGCCTGGGTGTGAATG
CAGTTCAAGAAATCAGTAACCATGGAGGCAGGCAAACATTACCAGTCCATCAATTTGTTGGTATATCCTCCCTTACTCAGTATAGGGGTAGTTGCTGGCTGCAAAAACTG
CCACAGTTTTGTTCATTGCATGGTGGGGACGGGAAAGACACATATTTTTGGGAGGATAAGTGGGTGGGGGATTGCCCCCTCTACTTGAAGTTCCCCCACCTTTACCACCT
TTCGTCCATGAGAAATCGTTCCATTGCTGATGTCTTGTTTCATTCAGGGAGCTCCCCTTCGCCTTTGTTTGGGTTCTGTCGTGCTTTGTCCAATAGGGAAATGATGGAAG
TCACGACTCTTCTTTCTTTGATTGAGGAGTTCAAGTTTAGGTTAGGGGGAAGAGATATTCGTTGTTGGAGCCCCAAGCCTTCTGAGGATTTTCATGTGAAAGTTCTGAAA
AAGATTAGATTCTTTATCTGGAAAATTATCCGTGGGAGAGTTAATACTTTCAATCGTCTCTTGAGAAGGAAGAATCCTTTGATGGGTCCATTCTGTTGCATTCTTTGTCG
GAAGGCAGAGGAAGACCTAGATCACTTATTCTGGAGTTGTGACTTTGCTCGTCATGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAATCCACCATTCAAATTGTAAAGAAAGAGCACAAAACCTTCGTTAGGCGACGATTACCACCAACCCCACCACCGCCGCCTTCTCTTTCTTCTTCCTCTTCGGTTCC
TCTCAACCCAACTCTAACCGCGAAGCCCTCAGACCCTTGTCCGAAGAAGAAAACTCGGGATTTGCCTAATTTTTCGGAATGTCATGCTTGTGGATTTCGTGTTGACGCCG
TTGACGGTCGTTCGAGGCTCAATTCCCTCTTCAGCGAGTGGCGGATTGTTCTTCTTTGTAAGAAGTGCTTCTCTCTTGTCGAATCTTCCCAGGTCTGTTCTTACTGTTTT
GCTGATTCGAGAGGCGATTGTTTTAATTGTTGTGAGTGTAACCGTCGGGTTCATAGGGAATGTTTTGCACAGTATAGTAGAGTTGCTCCTTGGTCGTATTCCTCTTCGGG
TTCTGAGTTTTCTGTGTGTATTGATTGTTGGGTTCCTAAACCAATTGTGACAGCGAGGGCTGTTTTGAGAAGTAGGAAAGTTAGGAGAAAGAATATCCATGTTTCGGATT
TGCGGAGTTCTAAGGTTTCAACTAGCGGAGGTTGCAAATCTTTGAGTGCAGTGGTGAAAGATGCTAATTGTTTGGTGGAGAAGAAAGTTGATGCTGCTGTTAGAGCTAGG
GAGCAGGCGCTGAAAAAGGCTGCTGAGGCTAGGAGGGCTTCTGAGTTGGCTAGCGATGCCTTAAATTTGGTTGCTCAAAGGGATGAAAGTGCTGCCAAAGAGTCTGGGGA
ATCTGCCGACGACGCCGAGTTGGCAATTCAGTTGCACCGAGTTATGAATACTTCTCCAAGATTTTCCAAGAACTTGTGTTCGGTGAATTCAAACTACATTGCCTTTGAGA
ATATGAGGGTGGATGATGGTGACCCATCAGTAGGAGCAATGTATAGCGGGGACTTTGATTTCTTTAAAACACCTCAAGTTTTGGTTCATAACAATGTATGCAACTCTTCT
GATAATGCTGTTTCTGAGCCTTCAGTCACGGCCAAAGATCATGCTGCACCATTAGAAATCAACCGTTCAGAATCCATGGGTAAAGATCCTATACCTGTCAAGAGCAATGC
TTGCACGGTAAAGTGTGATACTGAGTCTGAGAATGTGGAATTGCTACCAAAAGATGATCTGAGGAGCAGATCAATTAAATTAACGGGTGCCGGTTCCAATCATGATAGGT
CATGCTCTGAATCTCAACCGACACTCGAACAAGATAAGTACACTTTACCCAGGGATGAGAGATGTGTTGCAAAGCCCTATCACTATTTCTTTAAGTATCGGAGGAGAGAT
ACAACTAAACGCTATCAATTGAAGTATAGCAAGCGAAATTCTAGACTGAAGAGGATGCCAGATTGCAAACCTAACATTCTTGTTGATGGAATGTGCTTGGGTGTTCCGTC
TTCCTCTGCTGCAATTCTAATTAGTAGTACTGAAAAATTTCCAGTAATTTCAAATGCCTCATTTGGCTGCTGCGCTGTTCCTCTGCAAGCTTCTGGCCTGGGTGTGAATG
CAGTTCAAGAAATCAGTAACCATGGAGGCAGGCAAACATTACCAGTCCATCAATTTGTTGGTATATCCTCCCTTACTCAGTATAGGGGTAGTTGCTGGCTGCAAAAACTG
CCACAGTTTTGTTCATTGCATGGTGGGGACGGGAAAGACACATATTTTTGGGAGGATAAGTGGGTGGGGGATTGCCCCCTCTACTTGAAGTTCCCCCACCTTTACCACCT
TTCGTCCATGAGAAATCGTTCCATTGCTGATGTCTTGTTTCATTCAGGGAGCTCCCCTTCGCCTTTGTTTGGGTTCTGTCGTGCTTTGTCCAATAGGGAAATGATGGAAG
TCACGACTCTTCTTTCTTTGATTGAGGAGTTCAAGTTTAGGTTAGGGGGAAGAGATATTCGTTGTTGGAGCCCCAAGCCTTCTGAGGATTTTCATGTGAAAGTTCTGAAA
AAGATTAGATTCTTTATCTGGAAAATTATCCGTGGGAGAGTTAATACTTTCAATCGTCTCTTGAGAAGGAAGAATCCTTTGATGGGTCCATTCTGTTGCATTCTTTGTCG
GAAGGCAGAGGAAGACCTAGATCACTTATTCTGGAGTTGTGACTTTGCTCGTCATGTTTGA
Protein sequenceShow/hide protein sequence
MESTIQIVKKEHKTFVRRRLPPTPPPPPSLSSSSSVPLNPTLTAKPSDPCPKKKTRDLPNFSECHACGFRVDAVDGRSRLNSLFSEWRIVLLCKKCFSLVESSQVCSYCF
ADSRGDCFNCCECNRRVHRECFAQYSRVAPWSYSSSGSEFSVCIDCWVPKPIVTARAVLRSRKVRRKNIHVSDLRSSKVSTSGGCKSLSAVVKDANCLVEKKVDAAVRAR
EQALKKAAEARRASELASDALNLVAQRDESAAKESGESADDAELAIQLHRVMNTSPRFSKNLCSVNSNYIAFENMRVDDGDPSVGAMYSGDFDFFKTPQVLVHNNVCNSS
DNAVSEPSVTAKDHAAPLEINRSESMGKDPIPVKSNACTVKCDTESENVELLPKDDLRSRSIKLTGAGSNHDRSCSESQPTLEQDKYTLPRDERCVAKPYHYFFKYRRRD
TTKRYQLKYSKRNSRLKRMPDCKPNILVDGMCLGVPSSSAAILISSTEKFPVISNASFGCCAVPLQASGLGVNAVQEISNHGGRQTLPVHQFVGISSLTQYRGSCWLQKL
PQFCSLHGGDGKDTYFWEDKWVGDCPLYLKFPHLYHLSSMRNRSIADVLFHSGSSPSPLFGFCRALSNREMMEVTTLLSLIEEFKFRLGGRDIRCWSPKPSEDFHVKVLK
KIRFFIWKIIRGRVNTFNRLLRRKNPLMGPFCCILCRKAEEDLDHLFWSCDFARHV