CuGenDBv2

Sgr024674 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr024674
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: AT-hook motif nuclear-localized protein 20
Genome location: tig00002486:1753339..1757971
RNA-Seq Expression: Sgr024674
Synteny: Sgr024674
Gene Ontology terms: GO:0045927 - positive regulation of growth (biological process)
InterPro domains: IPR005175 - PPC domain
                  IPR007700 - Domain of unknown function DUF668
                  IPR021864 - Domain of unknown function DUF3475


Homology
GenBank top hits (columns: hit | e-value | % identity | alignment)
KAF7837506.1 uncharacterized protein G2W53_005988 [Senna tora] | e-value: 4.9e-103 | % identity: 46.51
Query:  MGNRVSSNL-------------CHSLKPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQV
        MGN+VSSNL              H+ K  E    IGILSFE+AN++S+  HLHK+L D EIS L+ EI  +EG+ NLVSSDE+YLL L LAEK+++LN+V
Subjt:  MGNRVSSNL-------------CHSLKPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQV

Query:  AGTVSRLGKRCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSN-----------NEHK-----------WQTKNN
        A  VSRLGK+CS PALQGFEH+Y DIVSG +DVK+LG L K M+ +V+KM  YV  T NLY  +EV N           N+H+           WQ K +
Subjt:  AGTVSRLGKRCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSN-----------NEHK-----------WQTKNN

Query:  VKHLKHISLWNKSYNRVVELLGRMVVLLYARISLVFHD---------------HLA-------------QNDMKSSLLR---------------------
        V+HLK ISLWN++Y++VVELL R V  +Y RIS+VF D               HL              Q DM+ S L                      
Subjt:  VKHLKHISLWNKSYNRVVELLGRMVVLLYARISLVFHD---------------HLA-------------QNDMKSSLLR---------------------

Query:  ----------------------------------------------AASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLRT
                                                      A  ST+GGSALA+ YAN+IIV EK LR+PHLVG++ARDDLY+MLPTSL+ SLR 
Subjt:  ----------------------------------------------AASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLRT

Query:  HLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAHDTIRWQNERGFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDALLD
         LK   ++ AIYDA +   WK   D IL WL+P+AH+ IRWQ+ER FEQ+ QI  RS NI L+QTL+FADR K E A+C +LVGLNY+CRYEHQQ+ALLD
Subjt:  HLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAHDTIRWQNERGFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDALLD

Query:  C
        C
Subjt:  C

KAF8098691.1 hypothetical protein N665_0260s0001 [Sinapis alba] | e-value: 2.2e-103 | % identity: 45.59
Query:  MGNRVSSNLCHSL--------KPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVAGTVS
        M N+VS NL H+L        KP     TIGILSFE+ANV+S+  HLH++L D E+S L+ E+F ++G+  LVSSDEN+LL L ++EK+DDL++VA  VS
Subjt:  MGNRVSSNLCHSL--------KPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVAGTVS

Query:  RLGKRCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEV------------SNNEHK-----------WQTKNNVKHL
        RLGK+C+ PALQGFEH+Y DIV+G +D +KLG L KDM+++VKKM+ +V  T NLY  +EV             + +H+           WQ + +VK L
Subjt:  RLGKRCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEV------------SNNEHK-----------WQTKNNVKHL

Query:  KHISLWNKSYNRVVELLGRMVVLLYARISLVFHDHLAQNDMKSSLLR-----------------------------------------------------
        +  SLWN++Y++VVE+L R V  +Y RI  VF     +    S+L R                                                     
Subjt:  KHISLWNKSYNRVVELLGRMVVLLYARISLVFHDHLAQNDMKSSLLR-----------------------------------------------------

Query:  ----------AASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAH
                  A++ST+GGSAL++ YAN++IV EK L++PHL+G++ARDDLY+MLPTSLK+SL+ +L+   ++ +IYDA +   WK   D IL WL+P+AH
Subjt:  ----------AASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAH

Query:  DTIRWQNERGFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDALLDC
        + IRWQ+ER FEQ+ QI KR TN+ L+QTL+FADREKTEAAICK+LVGLNY+C YE QQ+ALLDC
Subjt:  DTIRWQNERGFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDALLDC

KAG2306699.1 hypothetical protein Bca52824_026447 [Brassica carinata] | e-value: 4.4e-104 | % identity: 46.7
Query:  RVSSNLCHSL------KPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVAGTVSRLGKR
        ++S+NL H+L      K P    TIGILSFE+AN++S+  HLH++L D E+S L+ E+F ++G+T LVSSDEN+LL L ++EK+DDL++VA  VSRLGK+
Subjt:  RVSSNLCHSL------KPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVAGTVSRLGKR

Query:  CSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSN------------NEHK-----------WQTKNNVKHLKHISL
        C+ PALQGFEH+Y DIV+G +D +KLG L KDM+++VKKM+ +V  T NLY  +EV N             +H+           WQ + +VK L+  SL
Subjt:  CSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSN------------NEHK-----------WQTKNNVKHLKHISL

Query:  WNKSYNRVVELLGRMVVLLYARISLV-----------------------------FHDHLAQND---------------------------MKSSLLR-A
        WN++Y++VVE+L R V  +Y RI  V                             F + L  N+                            KS L + A
Subjt:  WNKSYNRVVELLGRMVVLLYARISLV-----------------------------FHDHLAQND---------------------------MKSSLLR-A

Query:  ASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAHDTIRWQNERGF
        ++ST+GGSAL++ YAN++IV EK L++PHL+G++ARDDLY+MLPTSLK+SL+ +L+   ++ +IYDA +   WK   D IL WL+P+AH+ IRWQ+ER F
Subjt:  ASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAHDTIRWQNERGF

Query:  EQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDALLDC
        EQ+ QI KR TN+ L+QTL+FADREKTEAAICK+LVGLNY+C YE QQ+ALLDC
Subjt:  EQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDALLDC

KDO42748.1 hypothetical protein CISIN_1g047107mg [Citrus sinensis] | e-value: 7.5e-104 | % identity: 45.17
Query:  MGNRVSSNLCHSL---------KPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVAGTV
        MGN+VS+NL H+L         K PE    IGILSFE+AN +S+  HLHK+L D EIS L+ EI  +EGI  LVS D++YLL LVLAEK+DDLN+V   V
Subjt:  MGNRVSSNLCHSL---------KPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVAGTV

Query:  SRLGKRCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSN-----------NEHK-----------WQTKNNVKHL
        SRLGK+CS PAL+GFEH+Y D+VSG +DVK+LG L KDMD++V+KM+ +V  T+NLY  +EV N           N+H+           WQ K +V+HL
Subjt:  SRLGKRCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSN-----------NEHK-----------WQTKNNVKHL

Query:  KHISLWNKSYNRVVELLGRMVVLLYARISLVFHDHLAQND-------------------------------------------MKSSLLR----------
        K ISLWN++Y++VVELL R V  +YA+I + F D   + D                                           + SS+ +          
Subjt:  KHISLWNKSYNRVVELLGRMVVLLYARISLVFHDHLAQND-------------------------------------------MKSSLLR----------

Query:  ----------------------------------------------------AASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSL
                                                            A+ STVGGSALA+ YAN+IIV EK LR+PHLVG++AR+DLY+MLP SL
Subjt:  ----------------------------------------------------AASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSL

Query:  KSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAHDTIRWQNERGFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQ
        + SL+T+LK   ++ AIYDA +   WK   D IL WL+P+AH+ IRWQ+ER FEQ  QI  R TN+ L+QTL+FADREKTEAAIC++LVGLNY+CRYEHQ
Subjt:  KSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAHDTIRWQNERGFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQ

Query:  QDALLDC
        Q+ALLDC
Subjt:  QDALLDC

XP_018486131.1 PREDICTED: uncharacterized protein LOC108856751 [Raphanus sativus] | e-value: 2.0e-101 | % identity: 45.38
Query:  RVSSNLCHSL---------KPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVAGTVSRL
        ++S+NL H+L         K P    TIGILSFE+AN++S+  HLH++L D ++SNL+  +F ++G+T LVSSD N+LL L ++EK+DDL +VA  VSRL
Subjt:  RVSSNLCHSL---------KPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVAGTVSRL

Query:  GKRCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSN---------------NEHK-----------WQTKNNVKH
        GK+C+ PALQGFEH+Y DIV+G +D +KLG L KDM+++VKKM+ +V  T +LY  +EV N                +H+           WQ + +VK 
Subjt:  GKRCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSN---------------NEHK-----------WQTKNNVKH

Query:  LKHISLWNKSYNRVVELLGRMVVLLYARISLV--------------------------------------FHDHLAQND----------------MKSSL
        L+  SLWN++Y++VVE+L R V  +Y RI  V                                      F + LA N                 MKS+ 
Subjt:  LKHISLWNKSYNRVVELLGRMVVLLYARISLV--------------------------------------FHDHLAQND----------------MKSSL

Query:  L--------RAASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAH
                  A++STVGGSAL++ YAN++IV EK L++PHL+G++ARDDLY+MLPTSLK+SL+ +L+   ++ +IYDA +   WK   D IL WL+P+AH
Subjt:  L--------RAASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAH

Query:  DTIRWQNERGFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDALLDC
        + IRWQ+ER FEQ+ QI KR TN+ L+QTL+FADREKTEAAICK+LVGLNY+C YE QQ+ALLDC
Subjt:  DTIRWQNERGFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDALLDC

TrEMBL top hits (columns: hit | e-value | % identity | alignment)
A0A067DIT3 Uncharacterized protein | e-value: 3.6e-104 | % identity: 45.17
Query:  MGNRVSSNLCHSL---------KPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVAGTV
        MGN+VS+NL H+L         K PE    IGILSFE+AN +S+  HLHK+L D EIS L+ EI  +EGI  LVS D++YLL LVLAEK+DDLN+V   V
Subjt:  MGNRVSSNLCHSL---------KPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVAGTV

Query:  SRLGKRCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSN-----------NEHK-----------WQTKNNVKHL
        SRLGK+CS PAL+GFEH+Y D+VSG +DVK+LG L KDMD++V+KM+ +V  T+NLY  +EV N           N+H+           WQ K +V+HL
Subjt:  SRLGKRCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSN-----------NEHK-----------WQTKNNVKHL

Query:  KHISLWNKSYNRVVELLGRMVVLLYARISLVFHDHLAQND-------------------------------------------MKSSLLR----------
        K ISLWN++Y++VVELL R V  +YA+I + F D   + D                                           + SS+ +          
Subjt:  KHISLWNKSYNRVVELLGRMVVLLYARISLVFHDHLAQND-------------------------------------------MKSSLLR----------

Query:  ----------------------------------------------------AASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSL
                                                            A+ STVGGSALA+ YAN+IIV EK LR+PHLVG++AR+DLY+MLP SL
Subjt:  ----------------------------------------------------AASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSL

Query:  KSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAHDTIRWQNERGFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQ
        + SL+T+LK   ++ AIYDA +   WK   D IL WL+P+AH+ IRWQ+ER FEQ  QI  R TN+ L+QTL+FADREKTEAAIC++LVGLNY+CRYEHQ
Subjt:  KSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAHDTIRWQNERGFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQ

Query:  QDALLDC
        Q+ALLDC
Subjt:  QDALLDC

A0A151QRW7 Uncharacterized protein | e-value: 1.7e-101 | % identity: 44.77
Query:  MGNRVSSNLCHSL------------KPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVA
        MGN+VSSNL H+L            K  +   TIGILSFE+ANV+S+  HLH++L + EIS LR EI  +EG+ NLVSSDE+YLL L LAEK+++LN+VA
Subjt:  MGNRVSSNLCHSL------------KPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVA

Query:  GTVSRLGKRCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSN-----------NEHK-----------WQTKNNV
          VSRLGK+CS PALQGFEH+Y DIV G +DVK+LG L K M+ +V+KM  YV  T NLY+ +EV N           N+H+           WQ K +V
Subjt:  GTVSRLGKRCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSN-----------NEHK-----------WQTKNNV

Query:  KHLKHISLWNKSYNRVVELLGRMVVLLYARISLVFHDH--------------LAQNDM------------------------------------------
        +HLK +SLWN+++++VVELL R V  +YARIS +F +               +AQN+                                           
Subjt:  KHLKHISLWNKSYNRVVELLGRMVVLLYARISLVFHDH--------------LAQNDM------------------------------------------

Query:  ---------------------------------------------KSSL-LRAASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSL
                                                     KS L + A  ST+GG ALA+ YAN+IIV EK LR+PHLVG++ARDDLY+MLP+SL
Subjt:  ---------------------------------------------KSSL-LRAASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSL

Query:  KSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAHDTIRWQNERGFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQ
        + SL+  LK   ++ AIYDA +   WK   D IL WL+P+AH+ IRWQ+ER FEQ+ QI  R TN+ L+QTL+FADREKTE +IC++LVGLNY+CRYEHQ
Subjt:  KSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAHDTIRWQNERGFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQ

Query:  QDALLDC
        Q+ALLDC
Subjt:  QDALLDC

A0A3N6SKZ6 Uncharacterized protein | e-value: 3.8e-101 | % identity: 45.28
Query:  RVSSNLCHSL-------KPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVAGTVSRLGK
        ++S+NL H+L       K P    TIGILSFE+AN++S+  HLH++L D E+S L+ ++F ++G+T LVSSD N+LL L ++EK+DDL++VA  VSRLGK
Subjt:  RVSSNLCHSL-------KPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVAGTVSRLGK

Query:  RCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSNN---------------------EHK--WQTKNNVKHLKHIS
        +C+ PALQGFEH+Y DIV+G +D +KLG L KDM+++VKKM+ +V  T  LY  +EV                        E K  WQ + +VK L+  S
Subjt:  RCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSNN---------------------EHK--WQTKNNVKHLKHIS

Query:  LWNKSYNRVVELLGRMVVLLYARISLVF-----------------------------------------------------HD--------------HLA
        LWN++Y++VVE+L R V  +Y RI  VF                                                     HD                +
Subjt:  LWNKSYNRVVELLGRMVVLLYARISLVF-----------------------------------------------------HD--------------HLA

Query:  QNDMKSSLLRAAS-STVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVA
        +   KS L + AS ST+GGSAL++ YAN++IV EK L++PHL+G++ARDDLY+MLPTSLK+SL+ +L+   ++ +IYDA +   WK A D IL WL+P+A
Subjt:  QNDMKSSLLRAAS-STVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVA

Query:  HDTIRWQNERGFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDALLDC
        H+ IRWQ+ER FEQ+ QI KR TN+ L+QTL+FADREKTEAAICK+LVGLNY+C YE QQ+ALLDC
Subjt:  HDTIRWQNERGFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDALLDC

A0A6J0NPR6 uncharacterized protein LOC108856751 | e-value: 9.9e-102 | % identity: 45.38
Query:  RVSSNLCHSL---------KPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVAGTVSRL
        ++S+NL H+L         K P    TIGILSFE+AN++S+  HLH++L D ++SNL+  +F ++G+T LVSSD N+LL L ++EK+DDL +VA  VSRL
Subjt:  RVSSNLCHSL---------KPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVAGTVSRL

Query:  GKRCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSN---------------NEHK-----------WQTKNNVKH
        GK+C+ PALQGFEH+Y DIV+G +D +KLG L KDM+++VKKM+ +V  T +LY  +EV N                +H+           WQ + +VK 
Subjt:  GKRCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSN---------------NEHK-----------WQTKNNVKH

Query:  LKHISLWNKSYNRVVELLGRMVVLLYARISLV--------------------------------------FHDHLAQND----------------MKSSL
        L+  SLWN++Y++VVE+L R V  +Y RI  V                                      F + LA N                 MKS+ 
Subjt:  LKHISLWNKSYNRVVELLGRMVVLLYARISLV--------------------------------------FHDHLAQND----------------MKSSL

Query:  L--------RAASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAH
                  A++STVGGSAL++ YAN++IV EK L++PHL+G++ARDDLY+MLPTSLK+SL+ +L+   ++ +IYDA +   WK   D IL WL+P+AH
Subjt:  L--------RAASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAH

Query:  DTIRWQNERGFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDALLDC
        + IRWQ+ER FEQ+ QI KR TN+ L+QTL+FADREKTEAAICK+LVGLNY+C YE QQ+ALLDC
Subjt:  DTIRWQNERGFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDALLDC

A0A6J1CAM1 uncharacterized protein LOC111009906 | e-value: 1.3e-101 | % identity: 42.65
Query:  MGNRVSSNLCHSL----------KPPEATA----TIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQ
        MGN+VSSNL H+L          K PE ++    TIGILSFE+ANV+S+  +LHK+L    IS L+ EI +++G+ NLVSSDE +LL L +AEK++DLN+
Subjt:  MGNRVSSNLCHSL----------KPPEATA----TIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQ

Query:  VAGTVSRLGKRCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEV-----------SNNEHK-----------WQTKN
        VA  VSRLGK+CS PALQGF+H+Y DIV+G ++VK+LG L KDM+ +++KM+ YV  TANLY  +EV            NN+H+           WQ K 
Subjt:  VAGTVSRLGKRCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEV-----------SNNEHK-----------WQTKN

Query:  NVKHLKHISLWNKSYNRVVELLGRMVVLLYARISLVFHDHLAQNDMKSSLLR------------------------------------------------
         V HLK ISLWN++Y++VVELL R V  +YARI LVF D   + D+  +++                                                 
Subjt:  NVKHLKHISLWNKSYNRVVELLGRMVVLLYARISLVFHDHLAQNDMKSSLLR------------------------------------------------

Query:  -----------------------------------------------------------------------------------------AASSTVGGSAL
                                                                                                 A  STVGGSAL
Subjt:  -----------------------------------------------------------------------------------------AASSTVGGSAL

Query:  AVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAHDTIRWQNERGFEQYCQIGKRS
        A+ YANIIIV EK LR+PHLVGD+ARDDLY+MLPTSL+SSL+THLK   +  AIYDA V   WK   D IL WL+P+AH+ IRWQ+ER FEQ+ QI  R 
Subjt:  AVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAHDTIRWQNERGFEQYCQIGKRS

Query:  TNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDALLDC
        TN+ L+QTL+FADR+KTE AIC++LVGLNY+CRYEHQQ+ALLDC
Subjt:  TNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDALLDC

SwissProt top hits (columns: hit | e-value | % identity | alignment)
O22130 AT-hook motif nuclear-localized protein 22 | e-value: 2.8e-53 | % identity: 63.79
Query:  ENSGGGSG---------SRPPGSKNKPKPPIFVTRDSPNALRSYVLEVAGGSDVAECIAQFARRRQRGVCVLSASGLVANVTLRQPAA-PG---SVMPLQ
        E+ GGG G          RP GSKNKPKPPI +TRDS NAL+S+V+EVA G DV E +  FARRRQRG+CVLS +G V NVT+RQPA+ PG   SV+ L 
Subjt:  ENSGGGSG---------SRPPGSKNKPKPPIFVTRDSPNALRSYVLEVAGGSDVAECIAQFARRRQRGVCVLSASGLVANVTLRQPAA-PG---SVMPLQ

Query:  GRFEILSLTGAFLPGPAPPGSTGLTVYLSGGQGQVVGGSVVGSLVAAGPIMVIAATFANATYERLPLEDPDDHE
        GRFEILSL+G+FLP PAPP ++GLT+YL+GGQGQVVGGSVVG L+A+GP++++AA+F NA YERLPLE+ D  E
Subjt:  GRFEILSLTGAFLPGPAPPGSTGLTVYLSGGQGQVVGGSVVGSLVAAGPIMVIAATFANATYERLPLEDPDDHE

O23620 AT-hook motif nuclear-localized protein 23 | e-value: 2.5e-54 | % identity: 61.27
Query:  NSGGGSG-------------SRPPGSKNKPKPPIFVTRDSPNALRSYVLEVAGGSDVAECIAQFARRRQRGVCVLSASGLVANVTLRQPAAPGSVMPLQG
        +SGGG G              RPPGSKNKPKPP+ +TR+S N LR+++LEV  G DV +C+A +ARRRQRG+CVLS SG V NV++RQP+A G+V+ LQG
Subjt:  NSGGGSG-------------SRPPGSKNKPKPPIFVTRDSPNALRSYVLEVAGGSDVAECIAQFARRRQRGVCVLSASGLVANVTLRQPAAPGSVMPLQG

Query:  RFEILSLTGAFLPGPAPPGSTGLTVYLSGGQGQVVGGSVVGSLVAAGPIMVIAATFANATYERLPLEDPDDHE
         FEILSL+G+FLP PAPPG+T LT++L+GGQGQVVGGSVVG L AAGP++VIAA+F N  YERLPLE+ +  +
Subjt:  RFEILSLTGAFLPGPAPPGSTGLTVYLSGGQGQVVGGSVVGSLVAAGPIMVIAATFANATYERLPLEDPDDHE

O82166 AT-hook motif nuclear-localized protein 21 | e-value: 4.8e-53 | % identity: 62.42
Query:  GGGSG--------SRPPGSKNKPKPPIFVTRDSPNALRSYVLEVAGGSDVAECIAQFARRRQRGVCVLSASGLVANVTLRQPAAPGSVMPLQGRFEILSL
        GGGSG         RP GSKNKPKPP+ VTR+S N LR+++LEV  G DV ECI+ +ARRRQRG+CVLS +G V NV++RQP A G+V+ L+G FEILSL
Subjt:  GGGSG--------SRPPGSKNKPKPPIFVTRDSPNALRSYVLEVAGGSDVAECIAQFARRRQRGVCVLSASGLVANVTLRQPAAPGSVMPLQGRFEILSL

Query:  TGAFLPGPAPPGSTGLTVYLSGGQGQVVGGSVVGSLVAAGPIMVIAATFANATYERLPLEDPDDH
        +G+FLP PAPPG+T LT++L+G QGQVVGG+VVG L+AAGP+MV+AA+F N  YERLPL++ ++H
Subjt:  TGAFLPGPAPPGSTGLTVYLSGGQGQVVGGSVVGSLVAAGPIMVIAATFANATYERLPLEDPDDH

Q8GWQ2 AT-hook motif nuclear-localized protein 20 | e-value: 1.6e-61 | % identity: 80.67
Query:  RPPGSKNKPKPPIFVTRDSPNALRSYVLEVAGGSDVAECIAQFARRRQRGVCVLSASGLVANVTLRQPAAPGSVMPLQGRFEILSLTGAFLPGPAPPGST
        RPPGSKNKPK PIFVTRDSPNALRS+VLE++ GSDVA+ IA F+RRRQRGVCVLS +G VANVTLRQ AAPG V+ LQGRFEILSLTGAFLPGP+PPGST
Subjt:  RPPGSKNKPKPPIFVTRDSPNALRSYVLEVAGGSDVAECIAQFARRRQRGVCVLSASGLVANVTLRQPAAPGSVMPLQGRFEILSLTGAFLPGPAPPGST

Query:  GLTVYLSGGQGQVVGGSVVGSLVAAGPIMVIAATFANATYERLPLEDPDD
        GLTVYL+G QGQVVGGSVVG L+A G +MVIAATF+NATYERLP+E+ +D
Subjt:  GLTVYLSGGQGQVVGGSVVGSLVAAGPIMVIAATFANATYERLPLEDPDD

Q9SR17 AT-hook motif nuclear-localized protein 19 | e-value: 1.1e-60 | % identity: 64.97
Query:  LPGVDHPAVNSPMFKQSDRPEENSGGGSGSRPPGSKNKPKPPIFVTRDSPNALRSYVLEVAGGSDVAECIAQFARRRQRGVCVLSASGLVANVTLRQP--
        L G DH      +   + RP          RP GSKNKPKPPIFVTRDSPNAL+S+V+E+A G+DV E +A FARRRQRG+C+LS +G VANVTLRQP  
Subjt:  LPGVDHPAVNSPMFKQSDRPEENSGGGSGSRPPGSKNKPKPPIFVTRDSPNALRSYVLEVAGGSDVAECIAQFARRRQRGVCVLSASGLVANVTLRQP--

Query:  ----AAPG--SVMPLQGRFEILSLTGAFLPGPAPPGSTGLTVYLSGGQGQVVGGSVVGSLVAAGPIMVIAATFANATYERLPLEDPDDHEVGSVSAS
            AAPG  +V+ LQGRFEILSLTG+FLPGPAPPGSTGLT+YL+GGQGQVVGGSVVG L+AAGP+M+IAATF+NATYERLPLE+ +  E G    S
Subjt:  ----AAPG--SVMPLQGRFEILSLTGAFLPGPAPPGSTGLTVYLSGGQGQVVGGSVVGSLVAAGPIMVIAATFANATYERLPLEDPDDHEVGSVSAS

Arabidopsis top hits (columns: hit | e-value | % identity | alignment)
AT3G04570.1 AT-hook motif nuclear-localized protein 19 | e-value: 7.6e-62 | % identity: 64.97
Query:  LPGVDHPAVNSPMFKQSDRPEENSGGGSGSRPPGSKNKPKPPIFVTRDSPNALRSYVLEVAGGSDVAECIAQFARRRQRGVCVLSASGLVANVTLRQP--
        L G DH      +   + RP          RP GSKNKPKPPIFVTRDSPNAL+S+V+E+A G+DV E +A FARRRQRG+C+LS +G VANVTLRQP  
Subjt:  LPGVDHPAVNSPMFKQSDRPEENSGGGSGSRPPGSKNKPKPPIFVTRDSPNALRSYVLEVAGGSDVAECIAQFARRRQRGVCVLSASGLVANVTLRQP--

Query:  ----AAPG--SVMPLQGRFEILSLTGAFLPGPAPPGSTGLTVYLSGGQGQVVGGSVVGSLVAAGPIMVIAATFANATYERLPLEDPDDHEVGSVSAS
            AAPG  +V+ LQGRFEILSLTG+FLPGPAPPGSTGLT+YL+GGQGQVVGGSVVG L+AAGP+M+IAATF+NATYERLPLE+ +  E G    S
Subjt:  ----AAPG--SVMPLQGRFEILSLTGAFLPGPAPPGSTGLTVYLSGGQGQVVGGSVVGSLVAAGPIMVIAATFANATYERLPLEDPDDHEVGSVSAS

AT3G23160.1 Protein of unknown function (DUF668) | e-value: 4.4e-102 | % identity: 43.23
Query:  MGNRVSSNLCHSL--------KPPEATA----TIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVA
        M N+VSSNL H+L        K P+  +    TIGILSFE+ANV+S+  HLH++L D EIS L+ E+F +EG+  LVSSDEN+LL L ++EK+DDL++VA
Subjt:  MGNRVSSNLCHSL--------KPPEATA----TIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVA

Query:  GTVSRLGKRCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSN------------NEHK-----------WQTKNN
          VSRLGK+C+ PALQGFEH+Y DIV+G +D +KLG L KDM+++VKKM+ +V  T +LY  +EV N             +H+           WQ + +
Subjt:  GTVSRLGKRCSIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSN------------NEHK-----------WQTKNN

Query:  VKHLKHISLWNKSYNRVVELLGRMVVLLYARISLV---------------------------------------------------------------FH
        VK L+  SLWN++Y++VVE+L R V  +Y RI  V                                                               F 
Subjt:  VKHLKHISLWNKSYNRVVELLGRMVVLLYARISLV---------------------------------------------------------------FH

Query:  DHLAQN--------------------------------DMKSSLLR-AASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLR
        + LA N                                  KS L + A++ST+GGSAL++ YAN++IV EK L++PHL+G++ARDDLY+MLPTSLK++L+
Subjt:  DHLAQN--------------------------------DMKSSLLR-AASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLR

Query:  THLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAHDTIRWQNERGFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDALL
          L+   ++ +IYDA +   WK   D IL WL+P+AH+ IRWQ+ER FEQ  QI KR TN+ L+QTL+FADREKTEAAICK+LVGLNY+C YE QQ+ALL
Subjt:  THLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAHDTIRWQNERGFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDALL

Query:  DC
        DC
Subjt:  DC

AT4G14465.1 AT-hook motif nuclear-localized protein 20 | e-value: 1.2e-62 | % identity: 80.67
Query:  RPPGSKNKPKPPIFVTRDSPNALRSYVLEVAGGSDVAECIAQFARRRQRGVCVLSASGLVANVTLRQPAAPGSVMPLQGRFEILSLTGAFLPGPAPPGST
        RPPGSKNKPK PIFVTRDSPNALRS+VLE++ GSDVA+ IA F+RRRQRGVCVLS +G VANVTLRQ AAPG V+ LQGRFEILSLTGAFLPGP+PPGST
Subjt:  RPPGSKNKPKPPIFVTRDSPNALRSYVLEVAGGSDVAECIAQFARRRQRGVCVLSASGLVANVTLRQPAAPGSVMPLQGRFEILSLTGAFLPGPAPPGST

Query:  GLTVYLSGGQGQVVGGSVVGSLVAAGPIMVIAATFANATYERLPLEDPDD
        GLTVYL+G QGQVVGGSVVG L+A G +MVIAATF+NATYERLP+E+ +D
Subjt:  GLTVYLSGGQGQVVGGSVVGSLVAAGPIMVIAATFANATYERLPLEDPDD

AT4G17800.1 Predicted AT-hook DNA-binding family protein | e-value: 1.8e-55 | % identity: 61.27
Query:  NSGGGSG-------------SRPPGSKNKPKPPIFVTRDSPNALRSYVLEVAGGSDVAECIAQFARRRQRGVCVLSASGLVANVTLRQPAAPGSVMPLQG
        +SGGG G              RPPGSKNKPKPP+ +TR+S N LR+++LEV  G DV +C+A +ARRRQRG+CVLS SG V NV++RQP+A G+V+ LQG
Subjt:  NSGGGSG-------------SRPPGSKNKPKPPIFVTRDSPNALRSYVLEVAGGSDVAECIAQFARRRQRGVCVLSASGLVANVTLRQPAAPGSVMPLQG

Query:  RFEILSLTGAFLPGPAPPGSTGLTVYLSGGQGQVVGGSVVGSLVAAGPIMVIAATFANATYERLPLEDPDDHE
         FEILSL+G+FLP PAPPG+T LT++L+GGQGQVVGGSVVG L AAGP++VIAA+F N  YERLPLE+ +  +
Subjt:  RFEILSLTGAFLPGPAPPGSTGLTVYLSGGQGQVVGGSVVGSLVAAGPIMVIAATFANATYERLPLEDPDDHE

AT5G51670.1 Protein of unknown function (DUF668) | e-value: 2.9e-61 | % identity: 34.97
Query:  MGNRVSSNLCHSLKPPEATAT--IGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVAGTVSRLGKRC
        + ++ +S   H   PP +T T  +G+LSFE+A V+++  HL  +L D  +   R    + EG+T +V+ DE + L LV AE  D L   A +VSRL  RC
Subjt:  MGNRVSSNLCHSLKPPEATAT--IGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVAGTVSRLGKRC

Query:  SIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIE---VSNNEHKWQT--------------------------------K
        +  +L+ F  ++ +      D     +  KD +A  KK++ YV  T  LY  +E   +  N  + Q+                                K
Subjt:  SIPALQGFEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIE---VSNNEHKWQT--------------------------------K

Query:  NNVKHLKHISLWNKSYNRVVELLGRMVVLLYARISLVFHD-----------------------------HLAQNDMK--------------SSLLRAASS
         +VK+LK  SLWNKS++ VV +L R V    AR+  VF                               H + ND +              S LL+   +
Subjt:  NNVKHLKHISLWNKSYNRVVELLGRMVVLLYARISLVFHD-----------------------------HLAQNDMK--------------SSLLRAASS

Query:  TVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAHDTIRWQNERGFE-Q
        T+GG+ +A+ YAN+I+V EK ++ P LVG DARDDLY MLP S++SSLR+ LK         D  + + WK A  RIL WL P+A + IRWQ+ER FE Q
Subjt:  TVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAHDTIRWQNERGFE-Q

Query:  YCQIGKRSTN-IALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDA
        +      S N + LVQTL FAD+ KTEAAI ++LVGLNY+ R+E +  A
Subjt:  YCQIGKRSTN-IALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDA


Sequences
CDS sequence
ATGGGGAACCGGGTAAGTTCCAATCTCTGCCATTCTCTCAAACCCCCCGAGGCCACCGCCACCATTGGCATTCTTTCCTTCGAGATTGCCAACGTGATTTCCAGA
GCCACACACCTCCACAAGACCCTCAAAGACCTCGAGATCTCAAATCTCCGGAGAGAAATTTTCACGGCGGAGGGAATCACAAACCTCGTCTCCTCCGATGAGAAT
TACCTTCTCGGACTGGTTTTGGCCGAGAAAGTTGACGATCTGAACCAGGTGGCCGGCACTGTCTCGAGACTCGGGAAGAGATGCTCCATTCCGGCCCTGCAAGGA
TTCGAGCATATATACACCGACATTGTTAGTGGGAATCTTGACGTGAAAAAGCTTGGAATTTTGACCAAGGACATGGATGCATTGGTGAAGAAAATGAAAACATAC
GTGAAACGCACGGCGAATCTGTACAACGCCATCGAGGTCTCGAATAATGAACACAAATGGCAGACAAAGAACAACGTCAAGCATCTCAAACACATCTCACTTTGG
AACAAAAGCTACAACAGAGTCGTCGAACTGTTGGGAAGAATGGTCGTTTTGCTTTACGCCAGAATCAGTCTAGTATTCCACGATCATCTCGCTCAAAACGACATG
AAATCATCTCTCCTCCGCGCCGCTTCTTCCACCGTCGGCGGCTCAGCTCTCGCCGTACGTTACGCGAACATCATCATCGTAACGGAGAAATTCCTCCGCCACCCC
CATCTGGTGGGCGACGACGCCAGAGACGACTTGTACGAGATGCTACCGACGAGCTTGAAATCGTCTCTAAGAACCCATTTGAAATGCAACGCGAGAAGCCAAGCG
ATCTACGACGCTACGGTTTGCAGTTACTGGAAAGGAGCGGCGGATCGGATACTGGGGTGGCTGTCGCCGGTGGCACACGACACGATCCGGTGGCAGAACGAGCGT
GGGTTTGAACAATATTGTCAAATTGGGAAGAGGTCGACGAACATTGCGCTGGTTCAGACGCTGCATTTTGCGGACCGGGAGAAGACAGAAGCAGCCATTTGCAAG
GTTCTTGTTGGTCTGAACTATCTGTGCCGCTACGAGCATCAGCAGGATGCATTGTTGGACTGTGTGGGGCTGCCGGGCGTCGATCACCCGGCGGTCAACTCACCT
ATGTTCAAACAAAGCGATCGCCCTGAGGAAAACAGCGGCGGCGGAAGTGGCAGCAGACCACCTGGATCTAAAAACAAACCAAAACCACCGATCTTTGTCACTCGT
GACAGCCCTAACGCTCTGCGGAGCTATGTGCTGGAGGTCGCCGGAGGATCCGACGTGGCGGAGTGCATAGCCCAATTCGCCCGGAGACGCCAGCGTGGCGTCTGC
GTGCTCAGTGCAAGCGGCTTGGTCGCCAACGTCACCCTAAGGCAGCCGGCAGCGCCCGGCTCCGTAATGCCACTTCAAGGAAGGTTCGAGATTCTGTCTTTAACC
GGGGCGTTTTTGCCGGGGCCAGCGCCGCCGGGATCCACAGGGCTAACTGTGTACTTATCCGGCGGTCAGGGTCAAGTTGTGGGTGGGAGCGTCGTGGGATCACTG
GTGGCGGCCGGGCCGATAATGGTCATAGCGGCAACTTTTGCTAATGCAACATATGAAAGATTGCCTCTGGAAGATCCCGACGACCACGAAGTTGGCAGCGTCTCA
GCCTCCACGGCGGAGCGAGAAGCCCACCGCCGGAAATCAGAGGAAATGGAGGGCAGATGCAGACTGGGATGCCTGAACCAACTTTGCCTTTGTACAATCTACTGC
CGGACATGCTGCCAAACGGCGTTCAGCTCGGGCACGACGGATATGCTTATGTCCGGCCGCCGTTCTGAAACAAAGGGAAATCATTAG
mRNA sequence
ATGGGGAACCGGGTAAGTTCCAATCTCTGCCATTCTCTCAAACCCCCCGAGGCCACCGCCACCATTGGCATTCTTTCCTTCGAGATTGCCAACGTGATTTCCAGA
GCCACACACCTCCACAAGACCCTCAAAGACCTCGAGATCTCAAATCTCCGGAGAGAAATTTTCACGGCGGAGGGAATCACAAACCTCGTCTCCTCCGATGAGAAT
TACCTTCTCGGACTGGTTTTGGCCGAGAAAGTTGACGATCTGAACCAGGTGGCCGGCACTGTCTCGAGACTCGGGAAGAGATGCTCCATTCCGGCCCTGCAAGGA
TTCGAGCATATATACACCGACATTGTTAGTGGGAATCTTGACGTGAAAAAGCTTGGAATTTTGACCAAGGACATGGATGCATTGGTGAAGAAAATGAAAACATAC
GTGAAACGCACGGCGAATCTGTACAACGCCATCGAGGTCTCGAATAATGAACACAAATGGCAGACAAAGAACAACGTCAAGCATCTCAAACACATCTCACTTTGG
AACAAAAGCTACAACAGAGTCGTCGAACTGTTGGGAAGAATGGTCGTTTTGCTTTACGCCAGAATCAGTCTAGTATTCCACGATCATCTCGCTCAAAACGACATG
AAATCATCTCTCCTCCGCGCCGCTTCTTCCACCGTCGGCGGCTCAGCTCTCGCCGTACGTTACGCGAACATCATCATCGTAACGGAGAAATTCCTCCGCCACCCC
CATCTGGTGGGCGACGACGCCAGAGACGACTTGTACGAGATGCTACCGACGAGCTTGAAATCGTCTCTAAGAACCCATTTGAAATGCAACGCGAGAAGCCAAGCG
ATCTACGACGCTACGGTTTGCAGTTACTGGAAAGGAGCGGCGGATCGGATACTGGGGTGGCTGTCGCCGGTGGCACACGACACGATCCGGTGGCAGAACGAGCGT
GGGTTTGAACAATATTGTCAAATTGGGAAGAGGTCGACGAACATTGCGCTGGTTCAGACGCTGCATTTTGCGGACCGGGAGAAGACAGAAGCAGCCATTTGCAAG
GTTCTTGTTGGTCTGAACTATCTGTGCCGCTACGAGCATCAGCAGGATGCATTGTTGGACTGTGTGGGGCTGCCGGGCGTCGATCACCCGGCGGTCAACTCACCT
ATGTTCAAACAAAGCGATCGCCCTGAGGAAAACAGCGGCGGCGGAAGTGGCAGCAGACCACCTGGATCTAAAAACAAACCAAAACCACCGATCTTTGTCACTCGT
GACAGCCCTAACGCTCTGCGGAGCTATGTGCTGGAGGTCGCCGGAGGATCCGACGTGGCGGAGTGCATAGCCCAATTCGCCCGGAGACGCCAGCGTGGCGTCTGC
GTGCTCAGTGCAAGCGGCTTGGTCGCCAACGTCACCCTAAGGCAGCCGGCAGCGCCCGGCTCCGTAATGCCACTTCAAGGAAGGTTCGAGATTCTGTCTTTAACC
GGGGCGTTTTTGCCGGGGCCAGCGCCGCCGGGATCCACAGGGCTAACTGTGTACTTATCCGGCGGTCAGGGTCAAGTTGTGGGTGGGAGCGTCGTGGGATCACTG
GTGGCGGCCGGGCCGATAATGGTCATAGCGGCAACTTTTGCTAATGCAACATATGAAAGATTGCCTCTGGAAGATCCCGACGACCACGAAGTTGGCAGCGTCTCA
GCCTCCACGGCGGAGCGAGAAGCCCACCGCCGGAAATCAGAGGAAATGGAGGGCAGATGCAGACTGGGATGCCTGAACCAACTTTGCCTTTGTACAATCTACTGC
CGGACATGCTGCCAAACGGCGTTCAGCTCGGGCACGACGGATATGCTTATGTCCGGCCGCCGTTCTGAAACAAAGGGAAATCATTAG
Protein sequence
MGNRVSSNLCHSLKPPEATATIGILSFEIANVISRATHLHKTLKDLEISNLRREIFTAEGITNLVSSDENYLLGLVLAEKVDDLNQVAGTVSRLGKRCSIPALQG
FEHIYTDIVSGNLDVKKLGILTKDMDALVKKMKTYVKRTANLYNAIEVSNNEHKWQTKNNVKHLKHISLWNKSYNRVVELLGRMVVLLYARISLVFHDHLAQNDM
KSSLLRAASSTVGGSALAVRYANIIIVTEKFLRHPHLVGDDARDDLYEMLPTSLKSSLRTHLKCNARSQAIYDATVCSYWKGAADRILGWLSPVAHDTIRWQNER
GFEQYCQIGKRSTNIALVQTLHFADREKTEAAICKVLVGLNYLCRYEHQQDALLDCVGLPGVDHPAVNSPMFKQSDRPEENSGGGSGSRPPGSKNKPKPPIFVTR
DSPNALRSYVLEVAGGSDVAECIAQFARRRQRGVCVLSASGLVANVTLRQPAAPGSVMPLQGRFEILSLTGAFLPGPAPPGSTGLTVYLSGGQGQVVGGSVVGSL
VAAGPIMVIAATFANATYERLPLEDPDDHEVGSVSASTAEREAHRRKSEEMEGRCRLGCLNQLCLCTIYCRTCCQTAFSSGTTDMLMSGRRSETKGNH