; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g12170 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g12170
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr2:8798077..8803652
RNA-Seq ExpressionMoc02g12170
SyntenyMoc02g12170
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]5.7e-25693.44Show/hide
Query:  EFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCHAFQIALTGSARLW
        EFDQLRG+LDAQVEALKAKCEQKEG LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASDAIKC AF+IALTGSARLW
Subjt:  EFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCHAFQIALTGSARLW

Query:  YWRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFLTGLADEALTVKLGDEAPATFAEVLQK
        Y RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETL+EYVTRFQEEQLKVAHCSDDS MCYFLTGLADEALTVKLG+EAPATFAEVLQK
Subjt:  YWRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFLTGLADEALTVKLGDEAPATFAEVLQK

Query:  AKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAP
        AKKVIDGQELLRTKTGRPERKIG+GRSGKDIE ADPKSKDKGSFSSGRAEY+RAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAP
Subjt:  AKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAP

Query:  ERRSKDKYCRFHREHGHNTSDCWELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVC
        ERRSKDKYCRFHREHGHNTSD WELK QIE+LIQDGYFKKFVGKP+TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAARREVC
Subjt:  ERRSKDKYCRFHREHGHNTSDCWELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVC

Query:  IIREQRPTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIPSLLTYLALGWTRSQLKKSPTPLVTQMAEFVVIDG
        IIREQRPTCPITFDGADL+EVHLPHNDALVIAPLIDHVVV RVLVDGG SANI SL TYLALGWTRSQLKKSPTPLV    E V+ +G
Subjt:  IIREQRPTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIPSLLTYLALGWTRSQLKKSPTPLVTQMAEFVVIDG

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]5.4e-22293.36Show/hide
Query:  KEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCHAFQIALTGSARLWYWRLPARSISTYSQLRREFLAQ
        K+ SLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDF AASDAIKC AFQIALTGSARLWY RLPARSISTYSQLRREFLAQ
Subjt:  KEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCHAFQIALTGSARLWYWRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFLTGLADEALTVKLGDEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TL+EYVTRFQEEQLKVAHCSDDS MCYFLTGLADEALTVKLG++AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFLTGLADEALTVKLGDEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GQGRSGKDIEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        G+GRSGKD+E+ADPKSKDKGSFSSGRAEY+RAE+GPT+SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Subjt:  GQGRSGKDIEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLKEVH
        WELK QIEDLIQDGYFKKFVGKP+TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQ PTCPITFDGAD +EVH
Subjt:  WELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLKEVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]2.7e-25379.15Show/hide
Query:  EFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCHAFQIALTGSARLW
        EFDQLRGKL+AQVEALKAKCEQKEG LNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQAASDAIKC AFQIALTGSARLW
Subjt:  EFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCHAFQIALTGSARLW

Query:  YWRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFLTGLADEALTVKLGDEAPATFAEVLQK
                                                             FQE+QLKVA  SDDS MCYFLTGLADEALTVKLG EAPATFAEVLQK
Subjt:  YWRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFLTGLADEALTVKLGDEAPATFAEVLQK

Query:  AKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAP
        AKKVIDGQELLRTKTGRPER I +GRSGKD EKAD KSKDKGSFSSGRAE++RA NGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAP
Subjt:  AKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAP

Query:  ERRSKDKYCRFHREHGHNTSDCWELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVC
        ERR+KDKYCRFHREH HNTSD WELK QIEDLIQD YFKKFVGKP+TSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARREVC
Subjt:  ERRSKDKYCRFHREHGHNTSDCWELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVC

Query:  IIREQRPTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIPSLLTYLALGWTRSQLKKSPTPL------------------------
        IIREQRPTCPITFD ADL+EVHLPHNDALVIAPLIDHVVVRRVLVD G SANI SLLTYLALGWTRSQLKKS TPL                        
Subjt:  IIREQRPTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIPSLLTYLALGWTRSQLKKSPTPL------------------------

Query:  --VTQMAEFVVIDGRSAYNAIFGRRIIHSFRALPSTLHQVLKYSTPNGVDMVRGEQTASRECYASALKGSSVCALETLASRDGTLEFEADLPRREFAAPT
          VTQMAEFVVIDGRSAYNAIFGR IIHSFRA+PSTLHQVLKYSTPNGV MVRGEQ ASRECYASALKGSSVCALETL SRDGTLEF+A+LPRREFAAPT
Subjt:  --VTQMAEFVVIDGRSAYNAIFGRRIIHSFRALPSTLHQVLKYSTPNGVDMVRGEQTASRECYASALKGSSVCALETLASRDGTLEFEADLPRREFAAPT

Query:  KELELVPLL
        +ELELVPLL
Subjt:  KELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.0e-25665.92Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP+ SKA                                   
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPIEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQAEFDQLRGKLDAQVEALKAKC
              E  YN +                           G I  E                                  EFDQL+ K DAQVEALKA+C
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPIEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQAEFDQLRGKLDAQVEALKAKC

Query:  EQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCHAFQIALTGSARLWYWRLPARSISTYSQLRREFL
        E+KE S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWY RLPAR ISTYSQLR+EF+
Subjt:  EQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCHAFQIALTGSARLWYWRLPARSISTYSQLRREFL

Query:  AQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFLTGLADEALTVKLGDEAPATFAEVLQKAKKVIDGQELLRTKTGRPER
        +QFSSRHYD+KT THLATIRQKEGETL+EYVTRF EEQLKVAHCSDDS MCYFLTGLADE LTVKL +EAPATFAEVLQK KKVIDGQELLRTKTGRPE+
Subjt:  AQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFLTGLADEALTVKLGDEAPATFAEVLQKAKKVIDGQELLRTKTGRPER

Query:  KIGQGRSGKDIEKADPKSKDKG-SFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT
         I QGR+GKD  KAD KS+DKG S SS R +Y+R+ +   +SRPYE +TPTTIPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNT
Subjt:  KIGQGRSGKDIEKADPKSKDKG-SFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT

Query:  SDCWELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLK
        S+ WELK QIEDLIQDGYFKKFVGKP+++S EKKEERKR RTPPRR DRPAVIN             K+KELAR ARREVCIIREQRPT  I F+ ADL+
Subjt:  SDCWELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLK

Query:  EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIPSLLTYLALGWTRSQLKKSPTPL--------------------------VTQMAEFVVIDGRSAYN
         VHLPHNDALVIAPLID V+VRR+LVDGGASANI SL TYLALGWTRSQLKKSPTPL                          VTQMAEFVVIDGRSAYN
Subjt:  EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIPSLLTYLALGWTRSQLKKSPTPL--------------------------VTQMAEFVVIDGRSAYN

Query:  AIFGRRIIHSFRALPSTLHQVLKYSTPNGVDMVRGEQTASRECYASALKGSSVCALETLASRD
        AIFGR IIHSFRA+PSTLHQVLKYST NGV  VRGE   SRECYAS  K SSVCALE    RD
Subjt:  AIFGRRIIHSFRALPSTLHQVLKYSTPNGVDMVRGEQTASRECYASALKGSSVCALETLASRD

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]6.9e-21775.47Show/hide
Query:  MDFQAASDAIKCHAFQIALTGSARLWYWRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFL
        MDFQAA+DAIKC AFQIALTGSARLWY RLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETL+EYVTRFQEEQLKVAHCSDDS MCYFL
Subjt:  MDFQAASDAIKCHAFQIALTGSARLWYWRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFL

Query:  TGLADEALTVKLGDEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSKDKGSFSS-GRAEYQRAENGPTRSRPYERFTPTTIP
        T LADE LTVKLG+EAP TF EVLQKAKKVIDGQELLRTKTGRPE++I Q +  ++  KAD KS+DKGS SS  R EY+R E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGDEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSKDKGSFSS-GRAEYQRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESGMEKLLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELK QIEDLIQDGYFKKFVGKP+++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIPSLLTYLALGWTRSQLKKSP
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE +PTC ITF  ADL+ VHLPHNDALVIA LIDH +VRRVL+DGG   ++P     + +G   +Q     
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIPSLLTYLALGWTRSQLKKSP

Query:  TPLVTQMAEFVVIDGRSAYNAIFGRRIIHSFRALPSTLHQVLKYSTPNGVDMVRGEQTASRECYASALKGSSVCALETLASRDGTLEFEADLP---RREF
           VTQMAEFVVIDGRSAYNAIFGR IIHSFRA+PSTLHQVLKYSTPN V MVRGEQ  SRECYASALKGS+VCALE   +R    E EADLP   +R+F
Subjt:  TPLVTQMAEFVVIDGRSAYNAIFGRRIIHSFRALPSTLHQVLKYSTPNGVDMVRGEQTASRECYASALKGSSVCALETLASRDGTLEFEADLP---RREF

Query:  AAPTKELELVPLLSPEKQSD------LMEIDAPE
          PT+ELELVPLLSPE+Q++      ++E++AP+
Subjt:  AAPTKELELVPLLSPEKQSD------LMEIDAPE

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088132.8e-25693.44Show/hide
Query:  EFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCHAFQIALTGSARLW
        EFDQLRG+LDAQVEALKAKCEQKEG LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASDAIKC AF+IALTGSARLW
Subjt:  EFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCHAFQIALTGSARLW

Query:  YWRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFLTGLADEALTVKLGDEAPATFAEVLQK
        Y RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETL+EYVTRFQEEQLKVAHCSDDS MCYFLTGLADEALTVKLG+EAPATFAEVLQK
Subjt:  YWRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFLTGLADEALTVKLGDEAPATFAEVLQK

Query:  AKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAP
        AKKVIDGQELLRTKTGRPERKIG+GRSGKDIE ADPKSKDKGSFSSGRAEY+RAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAP
Subjt:  AKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAP

Query:  ERRSKDKYCRFHREHGHNTSDCWELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVC
        ERRSKDKYCRFHREHGHNTSD WELK QIE+LIQDGYFKKFVGKP+TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAARREVC
Subjt:  ERRSKDKYCRFHREHGHNTSDCWELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVC

Query:  IIREQRPTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIPSLLTYLALGWTRSQLKKSPTPLVTQMAEFVVIDG
        IIREQRPTCPITFDGADL+EVHLPHNDALVIAPLIDHVVV RVLVDGG SANI SL TYLALGWTRSQLKKSPTPLV    E V+ +G
Subjt:  IIREQRPTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIPSLLTYLALGWTRSQLKKSPTPLVTQMAEFVVIDG

A0A6J1D9E1 uncharacterized protein LOC1110188231.3e-25379.15Show/hide
Query:  EFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCHAFQIALTGSARLW
        EFDQLRGKL+AQVEALKAKCEQKEG LNDGDLGESPFTSDVLE        APTVK YDGSKDPKDYVEVFEGLMDFQAASDAIKC AFQIALTGSARLW
Subjt:  EFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCHAFQIALTGSARLW

Query:  YWRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFLTGLADEALTVKLGDEAPATFAEVLQK
                                                             FQE+QLKVA  SDDS MCYFLTGLADEALTVKLG EAPATFAEVLQK
Subjt:  YWRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFLTGLADEALTVKLGDEAPATFAEVLQK

Query:  AKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAP
        AKKVIDGQELLRTKTGRPER I +GRSGKD EKAD KSKDKGSFSSGRAE++RA NGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAP
Subjt:  AKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAP

Query:  ERRSKDKYCRFHREHGHNTSDCWELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVC
        ERR+KDKYCRFHREH HNTSD WELK QIEDLIQD YFKKFVGKP+TSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARREVC
Subjt:  ERRSKDKYCRFHREHGHNTSDCWELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVC

Query:  IIREQRPTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIPSLLTYLALGWTRSQLKKSPTPL------------------------
        IIREQRPTCPITFD ADL+EVHLPHNDALVIAPLIDHVVVRRVLVD G SANI SLLTYLALGWTRSQLKKS TPL                        
Subjt:  IIREQRPTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIPSLLTYLALGWTRSQLKKSPTPL------------------------

Query:  --VTQMAEFVVIDGRSAYNAIFGRRIIHSFRALPSTLHQVLKYSTPNGVDMVRGEQTASRECYASALKGSSVCALETLASRDGTLEFEADLPRREFAAPT
          VTQMAEFVVIDGRSAYNAIFGR IIHSFRA+PSTLHQVLKYSTPNGV MVRGEQ ASRECYASALKGSSVCALETL SRDGTLEF+A+LPRREFAAPT
Subjt:  --VTQMAEFVVIDGRSAYNAIFGRRIIHSFRALPSTLHQVLKYSTPNGVDMVRGEQTASRECYASALKGSSVCALETLASRDGTLEFEADLPRREFAAPT

Query:  KELELVPLL
        +ELELVPLL
Subjt:  KELELVPLL

A0A6J1D9W7 uncharacterized protein LOC1110187082.6e-22293.36Show/hide
Query:  KEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCHAFQIALTGSARLWYWRLPARSISTYSQLRREFLAQ
        K+ SLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDF AASDAIKC AFQIALTGSARLWY RLPARSISTYSQLRREFLAQ
Subjt:  KEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCHAFQIALTGSARLWYWRLPARSISTYSQLRREFLAQ

Query:  FSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFLTGLADEALTVKLGDEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
        FSSR Y KKT THLATIRQKEG TL+EYVTRFQEEQLKVAHCSDDS MCYFLTGLADEALTVKLG++AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKI
Subjt:  FSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFLTGLADEALTVKLGDEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI

Query:  GQGRSGKDIEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
        G+GRSGKD+E+ADPKSKDKGSFSSGRAEY+RAE+GPT+SRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Subjt:  GQGRSGKDIEKADPKSKDKGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC

Query:  WELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLKEVH
        WELK QIEDLIQDGYFKKFVGKP+TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQ PTCPITFDGAD +EVH
Subjt:  WELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLKEVH

Query:  LPHNDALVIAPLIDHVVVRRVL
        LPHNDA VIAPLIDHVVVRRVL
Subjt:  LPHNDALVIAPLIDHVVVRRVL

A0A6J1DHB3 uncharacterized protein LOC1110204799.6e-25765.92Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQPANSTNTADRR LAA+  HQREVGA VVEGQGH+ L TEPL RSARIT PVLPPAHP+ SKA                                   
Subjt:  MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPIEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQAEFDQLRGKLDAQVEALKAKC
              E  YN +                           G I  E                                  EFDQL+ K DAQVEALKA+C
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPIEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQAEFDQLRGKLDAQVEALKAKC

Query:  EQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCHAFQIALTGSARLWYWRLPARSISTYSQLRREFL
        E+KE S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDGSKDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWY RLPAR ISTYSQLR+EF+
Subjt:  EQKEGSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCHAFQIALTGSARLWYWRLPARSISTYSQLRREFL

Query:  AQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFLTGLADEALTVKLGDEAPATFAEVLQKAKKVIDGQELLRTKTGRPER
        +QFSSRHYD+KT THLATIRQKEGETL+EYVTRF EEQLKVAHCSDDS MCYFLTGLADE LTVKL +EAPATFAEVLQK KKVIDGQELLRTKTGRPE+
Subjt:  AQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFLTGLADEALTVKLGDEAPATFAEVLQKAKKVIDGQELLRTKTGRPER

Query:  KIGQGRSGKDIEKADPKSKDKG-SFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT
         I QGR+GKD  KAD KS+DKG S SS R +Y+R+ +   +SRPYE +TPTTIPI EILTNIEE+GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNT
Subjt:  KIGQGRSGKDIEKADPKSKDKG-SFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNT

Query:  SDCWELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLK
        S+ WELK QIEDLIQDGYFKKFVGKP+++S EKKEERKR RTPPRR DRPAVIN             K+KELAR ARREVCIIREQRPT  I F+ ADL+
Subjt:  SDCWELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLK

Query:  EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIPSLLTYLALGWTRSQLKKSPTPL--------------------------VTQMAEFVVIDGRSAYN
         VHLPHNDALVIAPLID V+VRR+LVDGGASANI SL TYLALGWTRSQLKKSPTPL                          VTQMAEFVVIDGRSAYN
Subjt:  EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIPSLLTYLALGWTRSQLKKSPTPL--------------------------VTQMAEFVVIDGRSAYN

Query:  AIFGRRIIHSFRALPSTLHQVLKYSTPNGVDMVRGEQTASRECYASALKGSSVCALETLASRD
        AIFGR IIHSFRA+PSTLHQVLKYST NGV  VRGE   SRECYAS  K SSVCALE    RD
Subjt:  AIFGRRIIHSFRALPSTLHQVLKYSTPNGVDMVRGEQTASRECYASALKGSSVCALETLASRD

A0A6J1DZB9 uncharacterized protein LOC1110249043.3e-21775.47Show/hide
Query:  MDFQAASDAIKCHAFQIALTGSARLWYWRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFL
        MDFQAA+DAIKC AFQIALTGSARLWY RLPARSISTYSQLR+EF++QFSS HYD+KTATHLATIRQKE ETL+EYVTRFQEEQLKVAHCSDDS MCYFL
Subjt:  MDFQAASDAIKCHAFQIALTGSARLWYWRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFL

Query:  TGLADEALTVKLGDEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSKDKGSFSS-GRAEYQRAENGPTRSRPYERFTPTTIP
        T LADE LTVKLG+EAP TF EVLQKAKKVIDGQELLRTKTGRPE++I Q +  ++  KAD KS+DKGS SS  R EY+R E+GP+RSRPYER+T +TIP
Subjt:  TGLADEALTVKLGDEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSKDKGSFSS-GRAEYQRAENGPTRSRPYERFTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVIN
        ISEILTNIEESGMEKLLKRPEKLRG  E+R+K+KYCRFHR+HGHNT+ CWELK QIEDLIQDGYFKKFVGKP+++S EKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKGQIEDLIQDGYFKKFVGKPKTSSAEKKEERKRSRTPPRRTDRPAVIN

Query:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIPSLLTYLALGWTRSQLKKSP
        TIFGGP+GGQSG+KRKELAR ARREVCIIRE +PTC ITF  ADL+ VHLPHNDALVIA LIDH +VRRVL+DGG   ++P     + +G   +Q     
Subjt:  TIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANIPSLLTYLALGWTRSQLKKSP

Query:  TPLVTQMAEFVVIDGRSAYNAIFGRRIIHSFRALPSTLHQVLKYSTPNGVDMVRGEQTASRECYASALKGSSVCALETLASRDGTLEFEADLP---RREF
           VTQMAEFVVIDGRSAYNAIFGR IIHSFRA+PSTLHQVLKYSTPN V MVRGEQ  SRECYASALKGS+VCALE   +R    E EADLP   +R+F
Subjt:  TPLVTQMAEFVVIDGRSAYNAIFGRRIIHSFRALPSTLHQVLKYSTPNGVDMVRGEQTASRECYASALKGSSVCALETLASRDGTLEFEADLP---RREF

Query:  AAPTKELELVPLLSPEKQSD------LMEIDAPE
          PT+ELELVPLLSPE+Q++      ++E++AP+
Subjt:  AAPTKELELVPLLSPEKQSD------LMEIDAPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAAAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCC
ATGGAGGAAATGTATAACGAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTC
GGTCCAATCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGTCGGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCATCTCTC
CGAAAAGGACAGGCGGAGTTCGACCAGTTGAGGGGCAAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCTAAATGTGAGCAGAAAGAAGGTTCACTGAACGATGGC
GACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAG
GATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCACGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGG
TATTGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGTTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCAT
CTCGCCACCATCAGACAGAAGGAGGGTGAAACGCTGCAAGAATATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGACCATG
TGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGATGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATC
GATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCAAGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGAC
AAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCAAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCC
GAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGC
TTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGGGTCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCC
AAGACCAGCTCGGCAGAGAAAAAGGAAGAGCGAAAGCGTTCGAGGACGCCACCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGC
GGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACGGT
GCAGACTTGAAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCCCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCT
GCTAACATCCCGTCCTTACTGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTCACCCAAATGGCCGAGTTCGTGGTA
ATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACGCATCATCCACTCATTTCGGGCCCTTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCC
AATGGCGTGGACATGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCGCCAGTAGG
GATGGGACGCTCGAGTTCGAGGCTGACCTGCCAAGGAGGGAGTTTGCCGCACCCACTAAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATCAGAT
CTGATGGAGATCGACGCTCCAGAGTGTTCATGGATGGACCCGATTGTGGACTTTATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAGGTTGGCGAGG
CAAGCAGCTCGGTTCGTGGTCCAAACCCATGTGGGTGCCCTTGACCCGACCTGGGAGGGGCCGTTTGAGGTCAACGACATAGTCCGACCTGGGACGTACATATTG
GCCGATCTGAAAGGAGACGTCCTGAAGCGTTATTATCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAAAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGT
CACGACGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCC
ATGGAGGAAATGTATAACGAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTC
GGTCCAATCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGTCGGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCATCTCTC
CGAAAAGGACAGGCGGAGTTCGACCAGTTGAGGGGCAAGCTCGACGCTCAGGTGGAGGCCTTAAAGGCTAAATGTGAGCAGAAAGAAGGTTCACTGAACGATGGC
GACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGACCCCAAG
GATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCACGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGG
TATTGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGTTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCAT
CTCGCCACCATCAGACAGAAGGAGGGTGAAACGCTGCAAGAATATGTCACCAGATTCCAGGAGGAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGACCATG
TGCTATTTTCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGATGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATC
GATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCAAGGCAGAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGAC
AAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCAAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCC
GAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGC
TTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGAAGGGTCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGGGAAAGCCC
AAGACCAGCTCGGCAGAGAAAAAGGAAGAGCGAAAGCGTTCGAGGACGCCACCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGC
GGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGGGAGCAGAGGCCGACCTGCCCAATCACCTTCGACGGT
GCAGACTTGAAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATCGCCCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGGCGCATCT
GCTAACATCCCGTCCTTACTGACCTACCTCGCCTTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTCACCCAAATGGCCGAGTTCGTGGTA
ATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACGCATCATCCACTCATTTCGGGCCCTTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCC
AATGGCGTGGACATGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGCTATGCCTCCGCACTCAAAGGCTCATCGGTCTGCGCCCTTGAAACTCTCGCCAGTAGG
GATGGGACGCTCGAGTTCGAGGCTGACCTGCCAAGGAGGGAGTTTGCCGCACCCACTAAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAATCAGAT
CTGATGGAGATCGACGCTCCAGAGTGTTCATGGATGGACCCGATTGTGGACTTTATTAGGGGCAATTCACCACAAGACCCCAAGGAGCGCAGAAGGTTGGCGAGG
CAAGCAGCTCGGTTCGTGGTCCAAACCCATGTGGGTGCCCTTGACCCGACCTGGGAGGGGCCGTTTGAGGTCAACGACATAGTCCGACCTGGGACGTACATATTG
GCCGATCTGAAAGGAGACGTCCTGAAGCGTTATTATCCCTGA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRS
MEEMYNEMILAAGAGSRSENRMTRIDIREQRGSHLGPIEEEHPEDNESEGHTRRRGDLREHLNRKRGSSLRKGQAEFDQLRGKLDAQVEALKAKCEQKEGSLNDG
DLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCHAFQIALTGSARLWYWRLPARSISTYSQLRREFLAQFSSRHYDKKTATH
LATIRQKEGETLQEYVTRFQEEQLKVAHCSDDSTMCYFLTGLADEALTVKLGDEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGQGRSGKDIEKADPKSKD
KGSFSSGRAEYQRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKGQIEDLIQDGYFKKFVGKP
KTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCPITFDGADLKEVHLPHNDALVIAPLIDHVVVRRVLVDGGAS
ANIPSLLTYLALGWTRSQLKKSPTPLVTQMAEFVVIDGRSAYNAIFGRRIIHSFRALPSTLHQVLKYSTPNGVDMVRGEQTASRECYASALKGSSVCALETLASR
DGTLEFEADLPRREFAAPTKELELVPLLSPEKQSDLMEIDAPECSWMDPIVDFIRGNSPQDPKERRRLARQAARFVVQTHVGALDPTWEGPFEVNDIVRPGTYIL
ADLKGDVLKRYYP