; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C02G033280 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C02G033280
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionUNC-50 family protein
Genome locationCla97Chr02:6663280..6678391
RNA-Seq ExpressionCla97C02G033280
SyntenyCla97C02G033280
Gene Ontology termsGO:0030173 - integral component of Golgi membrane (cellular component)
InterPro domainsIPR007881 - UNC-50
IPR022051 - Protein of unknown function DUF3611


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG2277965.1 hypothetical protein Bca52824_060520 [Brassica carinata]8.8e-15961.47Show/hide
Query:  MQSLLSPVARSGPFPAAAAVYSSDLHLSRFHRRPFSFCHSSN-LNSPPNSL--PSFSAIRSSSSSSSLLLLKAFGTLEHRRGQFCSETRASSSPITP-TV
        M SL  P A S    A A      L     HR P     S N L   PN L  P      SS  S S  LL  F  +   +      T ASS P +P ++
Subjt:  MQSLLSPVARSGPFPAAAAVYSSDLHLSRFHRRPFSFCHSSN-LNSPPNSL--PSFSAIRSSSSSSSLLLLKAFGTLEHRRGQFCSETRASSSPITP-TV

Query:  SPNDEAEKAKLAQVAKRLEKTSKYFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKITSPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTANQPSKA
        S + E +KAKLAQVAKRLEKTS+YFKRLGS+GFWGQLV T+VAAVILSFSV +TGK TSPATFYATA GI AAF+SVFWSFGYIRLS++L+RTA  P+KA
Subjt:  SPNDEAEKAKLAQVAKRLEKTSKYFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKITSPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTANQPSKA

Query:  PPRADVVNSLRNGIVVNLLGMGAAILGMQATVGLLVAKALTSSAVPYYQAASPGNSPVLALDVFLVQISEQRSGYNNSTVAGDRRTRNLKQTEMLPTASR
        PPRADVV  LR+GI+VNL+GMGAAILGMQATVG LVAKALT+SA P+YQ  S G SPVLALDVFLVQ                                 
Subjt:  PPRADVVNSLRNGIVVNLLGMGAAILGMQATVGLLVAKALTSSAVPYYQAASPGNSPVLALDVFLVQISEQRSGYNNSTVAGDRRTRNLKQTEMLPTASR

Query:  GRSSSSSSRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHASFVVVSV
                               WQQMD+EYTFWQML+LCTSPKVVYQHTKYHKQTKNQWARDDPAF+VICSLLL VATLAYCA YDHS +HA  VVVSV
Subjt:  GRSSSSSSRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHASFVVVSV

Query:  LLFHLLITGAILATCC-----------------------WLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHYVNFLGYD
         L H LITGA++ATCC                       WLY FDVHCNSFFPMFVLLYV+HYF+SPLL+ HGFI LLLSNLLFM+GASYYHY+NFLGYD
Subjt:  LLFHLLITGAILATCC-----------------------WLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHYVNFLGYD

Query:  VLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL
        VLPFLE+TTFFLYPIG+V VL+PI IL GFNPSRYFMNMYFSQ L
Subjt:  VLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL

KAG7022476.1 Protein unc-50-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.6e-12091.9Show/hide
Query:  NNSTVAGDRRTRNLKQTEMLPTASRGRSSSSSSRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLL
        N+S     + TR    T+MLPTASRGRSSSS+SRANPMYLQYLRRIVKWQQMDIEYTFWQML LCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLL
Subjt:  NNSTVAGDRRTRNLKQTEMLPTASRGRSSSSSSRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLL

Query:  AVATLAYCAAYDHSAAHASFVVVSVLLFHLLITGAILATCCWLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHYVNFLG
        AVATLAYCAAYDHSA+HA+FVV+SVLL HLLITGAILATCCWLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHYVNFLG
Subjt:  AVATLAYCAAYDHSAAHASFVVVSVLLFHLLITGAILATCCWLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHYVNFLG

Query:  YDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL
        YDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFS+NL
Subjt:  YDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL

XP_004136957.1 protein TIC 21, chloroplastic [Cucumis sativus]5.2e-11991.32Show/hide
Query:  MQSLLSPVARSGPFPAAAAVYSSDLHLSRFHRRPFSFCHSSNLNSPPNSLPSFSAIRSSSSSSSLLLLKAFGTLEHRRGQFCSETRASSSPITPTVSPND
        MQSLLSPVAR+GPFPA AAVYSSDLHLSRFHRRP S  HS       NSLP FSAI   SSSSSLLLLK FGTL+H RGQ CSETRASSSPI+PTVSPND
Subjt:  MQSLLSPVARSGPFPAAAAVYSSDLHLSRFHRRPFSFCHSSNLNSPPNSLPSFSAIRSSSSSSSLLLLKAFGTLEHRRGQFCSETRASSSPITPTVSPND

Query:  EAEKAKLAQVAKRLEKTSKYFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKITSPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTANQPSKAPPRA
        EAEKAKLAQVAKRLEKTSKYFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKITSPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTANQPSKAPPRA
Subjt:  EAEKAKLAQVAKRLEKTSKYFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKITSPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTANQPSKAPPRA

Query:  DVVNSLRNGIVVNLLGMGAAILGMQATVGLLVAKALTSSAVPYYQAASPGNSPVLALDVFLVQIS
        DVVNSLRNGIVVNLLGMGAAILGMQATVGLLVAKALTSSA+PYYQAASPGNSPVLALDVFLVQ S
Subjt:  DVVNSLRNGIVVNLLGMGAAILGMQATVGLLVAKALTSSAVPYYQAASPGNSPVLALDVFLVQIS

XP_004136958.1 protein unc-50 homolog [Cucumis sativus]1.2e-11889.68Show/hide
Query:  MLPTASRGRSSSSSSRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHA
        MLPTASRGRSSSS+SRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAF+VICSLLLAVATLAYCAAYDHSAAHA
Subjt:  MLPTASRGRSSSSSSRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHA

Query:  SFVVVSVLLFHLLITGAILATCC-----------------------WLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHY
        SFVV+SVLLFHLLITGAILATCC                       WLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHY
Subjt:  SFVVVSVLLFHLLITGAILATCC-----------------------WLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHY

Query:  VNFLGYDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL
        VNFLGYDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL
Subjt:  VNFLGYDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL

XP_038888764.1 protein TIC 21, chloroplastic [Benincasa hispida]2.2e-12593.96Show/hide
Query:  MQSLLSPVARSGPFPAAAAVYSSDLHLSRFHRRPFSFCHSSNLNSPPNSLPSFSAIRSSSSSSSLLLLKAFGTLEHRRGQFCSETRASSSPITPTVSPND
        MQSLLSPVARSGPFPAAAAVYSSDLHLSRFHRRPFS  HSSNLNSPPNSL SF+A+R SSSSSSLLLLK FGTLEHRRGQF SET ASSSPI+P V PND
Subjt:  MQSLLSPVARSGPFPAAAAVYSSDLHLSRFHRRPFSFCHSSNLNSPPNSLPSFSAIRSSSSSSSLLLLKAFGTLEHRRGQFCSETRASSSPITPTVSPND

Query:  EAEKAKLAQVAKRLEKTSKYFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKITSPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTANQPSKAPPRA
        EA+KAKLAQVAKRLEKTSKYFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKITSPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTANQPSKAPPRA
Subjt:  EAEKAKLAQVAKRLEKTSKYFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKITSPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTANQPSKAPPRA

Query:  DVVNSLRNGIVVNLLGMGAAILGMQATVGLLVAKALTSSAVPYYQAASPGNSPVLALDVFLVQIS
        DVVNSLRNGIVVNLLGMGAAILGMQATVGLLVAKALTSSA+PYYQ ASPGNSPVLALDVFLVQ S
Subjt:  DVVNSLRNGIVVNLLGMGAAILGMQATVGLLVAKALTSSAVPYYQAASPGNSPVLALDVFLVQIS

TrEMBL top hitse value%identityAlignment
A0A0A0K2I1 Uncharacterized protein5.7e-11989.68Show/hide
Query:  MLPTASRGRSSSSSSRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHA
        MLPTASRGRSSSS+SRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAF+VICSLLLAVATLAYCAAYDHSAAHA
Subjt:  MLPTASRGRSSSSSSRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHA

Query:  SFVVVSVLLFHLLITGAILATCC-----------------------WLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHY
        SFVV+SVLLFHLLITGAILATCC                       WLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHY
Subjt:  SFVVVSVLLFHLLITGAILATCC-----------------------WLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHY

Query:  VNFLGYDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL
        VNFLGYDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL
Subjt:  VNFLGYDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL

A0A0A0K4J8 Uncharacterized protein2.5e-11991.32Show/hide
Query:  MQSLLSPVARSGPFPAAAAVYSSDLHLSRFHRRPFSFCHSSNLNSPPNSLPSFSAIRSSSSSSSLLLLKAFGTLEHRRGQFCSETRASSSPITPTVSPND
        MQSLLSPVAR+GPFPA AAVYSSDLHLSRFHRRP S  HS       NSLP FSAI   SSSSSLLLLK FGTL+H RGQ CSETRASSSPI+PTVSPND
Subjt:  MQSLLSPVARSGPFPAAAAVYSSDLHLSRFHRRPFSFCHSSNLNSPPNSLPSFSAIRSSSSSSSLLLLKAFGTLEHRRGQFCSETRASSSPITPTVSPND

Query:  EAEKAKLAQVAKRLEKTSKYFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKITSPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTANQPSKAPPRA
        EAEKAKLAQVAKRLEKTSKYFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKITSPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTANQPSKAPPRA
Subjt:  EAEKAKLAQVAKRLEKTSKYFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKITSPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTANQPSKAPPRA

Query:  DVVNSLRNGIVVNLLGMGAAILGMQATVGLLVAKALTSSAVPYYQAASPGNSPVLALDVFLVQIS
        DVVNSLRNGIVVNLLGMGAAILGMQATVGLLVAKALTSSA+PYYQAASPGNSPVLALDVFLVQ S
Subjt:  DVVNSLRNGIVVNLLGMGAAILGMQATVGLLVAKALTSSAVPYYQAASPGNSPVLALDVFLVQIS

A0A5A7SP50 Protein TIC 212.1e-11888.93Show/hide
Query:  MQSLLSPVARSGPFPAAAAVYSSDLHLSRFHRRPFSFCHSSNLNSPPNSLPSFSAI------RSSSSSSSLLLLKAFGTLEHRRGQFCSETRASSSPITP
        MQSLLSPVAR+GPFP  AAVYSSD HLSR HRRPFS  HS       NSLP FSAI       SSSSSSSLLLLK FGTLEHRRGQ CSETRASSSPI+P
Subjt:  MQSLLSPVARSGPFPAAAAVYSSDLHLSRFHRRPFSFCHSSNLNSPPNSLPSFSAI------RSSSSSSSLLLLKAFGTLEHRRGQFCSETRASSSPITP

Query:  TVSPNDEAEKAKLAQVAKRLEKTSKYFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKITSPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTANQPS
        TV PNDEAEKAKLAQVAKRLEKTS+YFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKI SPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTANQPS
Subjt:  TVSPNDEAEKAKLAQVAKRLEKTSKYFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKITSPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTANQPS

Query:  KAPPRADVVNSLRNGIVVNLLGMGAAILGMQATVGLLVAKALTSSAVPYYQAASPGNSPVLALDVFLVQIS
        KAPPRADVVNSLRNGIVVNLLGMGAAILGMQATVGLLVAKALTSSA+PYYQAASPGNSPVLALDVFLVQ S
Subjt:  KAPPRADVVNSLRNGIVVNLLGMGAAILGMQATVGLLVAKALTSSAVPYYQAASPGNSPVLALDVFLVQIS

A0A5D3C4E1 Protein unc-50-like protein2.1e-11889.29Show/hide
Query:  MLPTASRGRSSSSSSRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHA
        MLPTASRGRSSSS+SR NPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHA
Subjt:  MLPTASRGRSSSSSSRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHA

Query:  SFVVVSVLLFHLLITGAILATCC-----------------------WLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHY
        SFVV+SVL+FHLLITGAILATCC                       WLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHY
Subjt:  SFVVVSVLLFHLLITGAILATCC-----------------------WLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHY

Query:  VNFLGYDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL
        VNFLGYDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL
Subjt:  VNFLGYDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL

A0A6J1D6A1 protein unc-50 homolog2.1e-11889.29Show/hide
Query:  MLPTASRGRSSSSSSRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHA
        MLPTASRGRSSSSSSRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLL VATLAYCAAYDHSAAHA
Subjt:  MLPTASRGRSSSSSSRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHA

Query:  SFVVVSVLLFHLLITGAILATCC-----------------------WLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHY
        +FVV+SVLLFHLLITGAILATCC                       WLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSN+LFMIGASYYHY
Subjt:  SFVVVSVLLFHLLITGAILATCC-----------------------WLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHY

Query:  VNFLGYDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL
        VNFLGYDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL
Subjt:  VNFLGYDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL

SwissProt top hitse value%identityAlignment
O55227 Protein unc-50 homolog3.0e-3237.39Show/hide
Query:  QYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHASFVVVSVLLFHLLITGAILATC
        +YLRR+ +++QMD E+  WQML+L TSP+ VY++  Y KQTK+QWARDDPAF+V+ S+ L V+T+ +    D        +++ V+    +  G +++T 
Subjt:  QYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHASFVVVSVLLFHLLITGAILATC

Query:  CWL-------------------YAFDVHCNSFFPMFVLLYVIH-YFISPLLVAHGFIPLLLSNLLFMIGASYYHYVNFLGYDVLPFLEKTTFFLYPIGVV
         W                    YAFDVH N+F+P+ V+L+ I  +FI+ +++   FI  L+ N L++I   YY YV FLGY  LPFL+ T   LYP   +
Subjt:  CWL-------------------YAFDVHCNSFFPMFVLLYVIH-YFISPLLVAHGFIPLLLSNLLFMIGASYYHYVNFLGYDVLPFLEKTTFFLYPIGVV

Query:  FVLTPIFILIGFNPSRYFMNMY
         VL  + + +G+N +    + Y
Subjt:  FVLTPIFILIGFNPSRYFMNMY

Q53HI1 Protein unc-50 homolog1.0e-3237.39Show/hide
Query:  QYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHASFVVVSVLLFHLLITGAILATC
        +YLRR+ +++QMD E+  WQML+L TSP+ VY++  Y KQTK+QWARDDPAF+V+ S+ L V+T+ +    D        +++ V+L   +  G ++AT 
Subjt:  QYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHASFVVVSVLLFHLLITGAILATC

Query:  CWL-------------------YAFDVHCNSFFPMFVLLYVIH-YFISPLLVAHGFIPLLLSNLLFMIGASYYHYVNFLGYDVLPFLEKTTFFLYPIGVV
         W                    YAFDVH N+F+P+ V+L+ I  +FI+ +++   FI  L+ N L+++   YY YV FLGY  LPFL+ T   LYP   +
Subjt:  CWL-------------------YAFDVHCNSFFPMFVLLYVIH-YFISPLLVAHGFIPLLLSNLLFMIGASYYHYVNFLGYDVLPFLEKTTFFLYPIGVV

Query:  FVLTPIFILIGFNPSRYFMNMY
         +L  + + +G+N +    + Y
Subjt:  FVLTPIFILIGFNPSRYFMNMY

Q54DD7 Protein unc-50 homolog5.6e-4742.51Show/hide
Query:  SRGRSSSSSSRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHS---------
        +R  ++SS+SR   +  +Y RRI  + QMDIEYTFW M +LC +P  VY+ T +HKQTKNQWARDDPAF VI    +A+A+++Y   +            
Subjt:  SRGRSSSSSSRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHS---------

Query:  --AAHASFVVVSVLLFHL--LITGAILATCC----------WLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHYVNFLG
          A    F+ V +L+  +   +T   L              WLYAFD+HCNSFFP+F++LYV+ +F+ P+L+++     +LSN L++IG SYY+YV FLG
Subjt:  --AAHASFVVVSVLLFHL--LITGAILATCC----------WLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHYVNFLG

Query:  YDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL
        Y+ LPFL+ T  FLYPIG++F L  + +++G N +   +N YF   L
Subjt:  YDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL

Q9CQ61 Protein unc-50 homolog2.3e-3237.39Show/hide
Query:  QYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHASFVVVSVLLFHLLITGAILATC
        +YLRR+ +++QMD E+  WQML+L TSP+ VY++  Y KQTK+QWARDDPAF+V+ S+ L V+T+ +    D        +++ V+    +  G +++T 
Subjt:  QYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHASFVVVSVLLFHLLITGAILATC

Query:  CWL-------------------YAFDVHCNSFFPMFVLLYVIH-YFISPLLVAHGFIPLLLSNLLFMIGASYYHYVNFLGYDVLPFLEKTTFFLYPIGVV
         W                    YAFDVH N+F+P+ V+L+ I  +FI+ +++   FI  L+ N L++I   YY YV FLGY  LPFL+ T   LYP   +
Subjt:  CWL-------------------YAFDVHCNSFFPMFVLLYVIH-YFISPLLVAHGFIPLLLSNLLFMIGASYYHYVNFLGYDVLPFLEKTTFFLYPIGVV

Query:  FVLTPIFILIGFNPSRYFMNMY
         VL  + + +G+N +    + Y
Subjt:  FVLTPIFILIGFNPSRYFMNMY

Q9SHU7 Protein TIC 21, chloroplastic1.3e-6759.64Show/hide
Query:  MQSLLSPVARSGPFPAAAAVYSSDLHLSRFHRRPFSFCHSSNLNS-PPNSLPSFSAI-------RSSSSSSSLLLLKAFG-TLEHRRGQFCSETRASSSP
        MQSLL P A S    A A              RP  F HS N  S    SLP F+ +        + SS  S   L  +G  +   +  F   T A S P
Subjt:  MQSLLSPVARSGPFPAAAAVYSSDLHLSRFHRRPFSFCHSSNLNS-PPNSLPSFSAI-------RSSSSSSSLLLLKAFG-TLEHRRGQFCSETRASSSP

Query:  ITPTVSPND-EAEKAKLAQVAKRLEKTSKYFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKITSPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTA
         +P+  P D E +KAKLAQVAKRLEKTS+YFKRLGS+GFWGQLV T+VAAVILSFS+V+TGK TSPATFYATA GI AAF+SVFWSFGYIRLS++L+RT+
Subjt:  ITPTVSPND-EAEKAKLAQVAKRLEKTSKYFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKITSPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTA

Query:  NQPSKAPPRADVVNSLRNGIVVNLLGMGAAILGMQATVGLLVAKALTSSAVPYYQAASPGNSPVLALDVFLVQIS
          P+KAPPRADVV  LR+GI+VN+LGMG+A+LGMQATVG LVAKALT+SA P+YQ  S G SPVLALDVFLVQ S
Subjt:  NQPSKAPPRADVVNSLRNGIVVNLLGMGAAILGMQATVGLLVAKALTSSAVPYYQAASPGNSPVLALDVFLVQIS

Arabidopsis top hitse value%identityAlignment
AT2G15240.1 UNC-50 family protein5.2e-10978.17Show/hide
Query:  MLPTASRGRSSSSSSRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHA
        MLPT SR RSSSSSSRANPM+LQY RRIVKWQQMD+EYTFWQML+LCTSPKVVYQHTKYHKQTKNQWARDDPAF+VICSLLL VAT+AYC  YDHS++HA
Subjt:  MLPTASRGRSSSSSSRANPMYLQYLRRIVKWQQMDIEYTFWQMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHA

Query:  SFVVVSVLLFHLLITGAILATCC-----------------------WLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHY
          VVVSVL  H LITGA++ATCC                       WLY FDVHCNSFFPMFVLLYV+HYF+SPLL+AHGFIPLLLSNLLFM+GASYYHY
Subjt:  SFVVVSVLLFHLLITGAILATCC-----------------------WLYAFDVHCNSFFPMFVLLYVIHYFISPLLVAHGFIPLLLSNLLFMIGASYYHY

Query:  VNFLGYDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL
        +NFLGYDVLPFLE+TTFFLYPIGVV VL+PI IL GFNPSRYFMNMYFSQ L
Subjt:  VNFLGYDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL

AT2G15290.1 translocon at inner membrane of chloroplasts 219.0e-6959.64Show/hide
Query:  MQSLLSPVARSGPFPAAAAVYSSDLHLSRFHRRPFSFCHSSNLNS-PPNSLPSFSAI-------RSSSSSSSLLLLKAFG-TLEHRRGQFCSETRASSSP
        MQSLL P A S    A A              RP  F HS N  S    SLP F+ +        + SS  S   L  +G  +   +  F   T A S P
Subjt:  MQSLLSPVARSGPFPAAAAVYSSDLHLSRFHRRPFSFCHSSNLNS-PPNSLPSFSAI-------RSSSSSSSLLLLKAFG-TLEHRRGQFCSETRASSSP

Query:  ITPTVSPND-EAEKAKLAQVAKRLEKTSKYFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKITSPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTA
         +P+  P D E +KAKLAQVAKRLEKTS+YFKRLGS+GFWGQLV T+VAAVILSFS+V+TGK TSPATFYATA GI AAF+SVFWSFGYIRLS++L+RT+
Subjt:  ITPTVSPND-EAEKAKLAQVAKRLEKTSKYFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKITSPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTA

Query:  NQPSKAPPRADVVNSLRNGIVVNLLGMGAAILGMQATVGLLVAKALTSSAVPYYQAASPGNSPVLALDVFLVQIS
          P+KAPPRADVV  LR+GI+VN+LGMG+A+LGMQATVG LVAKALT+SA P+YQ  S G SPVLALDVFLVQ S
Subjt:  NQPSKAPPRADVVNSLRNGIVVNLLGMGAAILGMQATVGLLVAKALTSSAVPYYQAASPGNSPVLALDVFLVQIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGTCGCTACTCTCGCCGGTAGCTCGCAGCGGTCCCTTCCCGGCCGCCGCCGCAGTTTATTCATCGGATTTACATCTCAGCCGCTTCCATCGCCGGCCCTTTTCTTT
CTGCCACTCGTCCAATCTCAATTCTCCTCCCAATTCCCTCCCTTCATTTAGTGCCATTCGATCTTCTTCTTCTTCTTCTTCGTTGTTGCTTTTGAAGGCTTTTGGTACTC
TGGAGCATAGAAGAGGACAGTTTTGCTCGGAAACACGCGCTTCTTCTAGTCCCATTACCCCGACGGTTTCCCCTAATGATGAGGCTGAAAAGGCAAAACTGGCTCAGGTG
GCAAAGAGATTAGAGAAGACATCGAAGTATTTCAAGAGGTTAGGTAGTCTTGGATTTTGGGGACAGTTGGTGTGCACAATTGTCGCTGCCGTTATTCTTTCATTCTCTGT
TGTCATTACCGGGAAAATTACATCACCTGCCACTTTTTATGCTACTGCTGGTGGTATTGTGGCAGCCTTCATTTCCGTATTCTGGTCCTTTGGTTACATTCGGCTTTCTG
ATAAACTTCAACGGACTGCAAATCAACCCTCCAAGGCTCCTCCCCGAGCTGATGTTGTGAATAGCTTAAGAAATGGGATAGTAGTGAACCTACTGGGCATGGGTGCTGCC
ATTCTGGGGATGCAGGCAACGGTGGGATTGTTAGTTGCCAAGGCTCTTACTTCTTCAGCAGTTCCATATTACCAGGCAGCCTCCCCTGGGAATAGCCCTGTTCTTGCTTT
GGATGTATTCTTAGTACAGATCTCAGAGCAAAGAAGCGGCTACAACAATTCAACTGTCGCCGGAGATAGAAGGACGCGCAATTTGAAGCAAACGGAGATGTTGCCCACAG
CTTCCAGAGGGCGCTCCTCGTCCTCTTCATCTCGAGCTAATCCCATGTATCTACAGTACCTCCGTCGAATTGTCAAGTGGCAACAAATGGACATCGAATATACATTCTGG
CAAATGCTTCATCTATGCACTTCACCAAAAGTTGTCTATCAGCATACCAAGTATCACAAACAAACTAAGAATCAGTGGGCACGTGACGATCCTGCTTTTGTTGTGATCTG
CAGTCTTCTTTTGGCAGTTGCGACTCTGGCTTACTGTGCTGCATATGATCATAGTGCTGCACATGCCTCTTTTGTAGTTGTGTCTGTACTGCTCTTCCACTTGCTAATAA
CTGGGGCGATTTTGGCAACTTGTTGTTGGCTCTATGCATTTGATGTGCACTGCAATTCTTTCTTCCCGATGTTTGTTTTGCTGTACGTGATCCATTACTTTATATCACCG
CTTCTGGTGGCCCATGGCTTCATTCCATTATTGCTATCAAATTTGCTTTTCATGATTGGGGCTTCGTATTACCATTATGTCAACTTTTTAGGTTATGACGTACTGCCATT
TTTGGAGAAAACTACTTTCTTCCTCTATCCAATTGGTGTGGTCTTTGTCCTTACTCCAATCTTTATCTTGATCGGCTTCAACCCTTCAAGATATTTCATGAACATGTACT
TTAGTCAGAACTTATAA
mRNA sequenceShow/hide mRNA sequence
CCAAAACCCAGCAAAACGCTAACTACTACAATATCCACTGTAGGCTCTCTGCCTGCAGCGCCATCAAAATTGTATTTGATCGTCAATCATGCAGTCGCTACTCTCGCCGG
TAGCTCGCAGCGGTCCCTTCCCGGCCGCCGCCGCAGTTTATTCATCGGATTTACATCTCAGCCGCTTCCATCGCCGGCCCTTTTCTTTCTGCCACTCGTCCAATCTCAAT
TCTCCTCCCAATTCCCTCCCTTCATTTAGTGCCATTCGATCTTCTTCTTCTTCTTCTTCGTTGTTGCTTTTGAAGGCTTTTGGTACTCTGGAGCATAGAAGAGGACAGTT
TTGCTCGGAAACACGCGCTTCTTCTAGTCCCATTACCCCGACGGTTTCCCCTAATGATGAGGCTGAAAAGGCAAAACTGGCTCAGGTGGCAAAGAGATTAGAGAAGACAT
CGAAGTATTTCAAGAGGTTAGGTAGTCTTGGATTTTGGGGACAGTTGGTGTGCACAATTGTCGCTGCCGTTATTCTTTCATTCTCTGTTGTCATTACCGGGAAAATTACA
TCACCTGCCACTTTTTATGCTACTGCTGGTGGTATTGTGGCAGCCTTCATTTCCGTATTCTGGTCCTTTGGTTACATTCGGCTTTCTGATAAACTTCAACGGACTGCAAA
TCAACCCTCCAAGGCTCCTCCCCGAGCTGATGTTGTGAATAGCTTAAGAAATGGGATAGTAGTGAACCTACTGGGCATGGGTGCTGCCATTCTGGGGATGCAGGCAACGG
TGGGATTGTTAGTTGCCAAGGCTCTTACTTCTTCAGCAGTTCCATATTACCAGGCAGCCTCCCCTGGGAATAGCCCTGTTCTTGCTTTGGATGTATTCTTAGTACAGATC
TCAGAGCAAAGAAGCGGCTACAACAATTCAACTGTCGCCGGAGATAGAAGGACGCGCAATTTGAAGCAAACGGAGATGTTGCCCACAGCTTCCAGAGGGCGCTCCTCGTC
CTCTTCATCTCGAGCTAATCCCATGTATCTACAGTACCTCCGTCGAATTGTCAAGTGGCAACAAATGGACATCGAATATACATTCTGGCAAATGCTTCATCTATGCACTT
CACCAAAAGTTGTCTATCAGCATACCAAGTATCACAAACAAACTAAGAATCAGTGGGCACGTGACGATCCTGCTTTTGTTGTGATCTGCAGTCTTCTTTTGGCAGTTGCG
ACTCTGGCTTACTGTGCTGCATATGATCATAGTGCTGCACATGCCTCTTTTGTAGTTGTGTCTGTACTGCTCTTCCACTTGCTAATAACTGGGGCGATTTTGGCAACTTG
TTGTTGGCTCTATGCATTTGATGTGCACTGCAATTCTTTCTTCCCGATGTTTGTTTTGCTGTACGTGATCCATTACTTTATATCACCGCTTCTGGTGGCCCATGGCTTCA
TTCCATTATTGCTATCAAATTTGCTTTTCATGATTGGGGCTTCGTATTACCATTATGTCAACTTTTTAGGTTATGACGTACTGCCATTTTTGGAGAAAACTACTTTCTTC
CTCTATCCAATTGGTGTGGTCTTTGTCCTTACTCCAATCTTTATCTTGATCGGCTTCAACCCTTCAAGATATTTCATGAACATGTACTTTAGTCAGAACTTATAATCTCG
ACTTAAGGCTTCGGCTACAGCTTGAGGCAGGATGATTGATAGAACAAAAGATTTCATCGACACAGGTCACCAATGACTGCATTTTATTTTTCACTTGCATCTTGAAAGAA
ATGAGGAAGTCCAGGTCGTTGGAGCCTCAGCTGCTAATTTTTCAGGCCTTCGAGCGATATCAGGAATTGTAGGGTAGGATCTTGGTATCTTGTGCATCACTGCTGCAATT
GAGGTGTGATGCGGCTGAATTTTTGATGAAAATAGATGTGTTGAATGTAATGATTTTTTACCTTTATACAATTTGTTTTTGCAGTTTACACATATACACTTCAAAAGAAA
TTTAATTCCATCAGGCTTTCTGAATATGTAGAGACCATTTATTTGGCTCGCTTAACACTGCTTTCCAGATTCCGTTTCCCATCTTTGTTCAATGCTGGAAAGAAGTATTT
CTTATGAAGTGCAAATGACACAGGAAAATGTGGAGGTAATGGTATGATTCACCTAAATTTTGGTCCCACTTTTTTTATACTTTTTTTTTTTTCTTTTTTTTCTTTTTTTT
TTTTATTTTTATTTTTAATTTTGATTTTTTTGCTTGTCATTATTAGCCGATGTTATATAAATGATCTTATTGTGCACTCATGCTCTTGCTTTTACTCTTGTTATGATTCA
TCCTCACTTTCAATCTTTACTTTTGACTCTCTTAACTCTTGTTTCTTCTTCTCGACACTCTTGCTTCACTTTTAGTCCTTACTCTTGACACTTCCTACTCTTGCTATTCA
CTCTCTTTTTATAAAAAATTTGAGAGCAAAAAGTATAGAGTAGAGTGAGAACGATAACAAGTAGCAAGAAAATATGAAGACGGCAGGAAAACTCACACTCTAAGGAGTGA
GAGGAGTGAGTAAGATCTCAACTTCTATGCACAATATTTTATTTTGGTAATTTCACGTTGATATGAGTAAGCGTGAGATCAAAAAAAAATATTGTTTTCGAAAATAGGTC
ATCTGGTTAAATGACCCATAAATGTGTGACAAACCGGCCCCAACCAGACTTTTCATTGCAAT
Protein sequenceShow/hide protein sequence
MQSLLSPVARSGPFPAAAAVYSSDLHLSRFHRRPFSFCHSSNLNSPPNSLPSFSAIRSSSSSSSLLLLKAFGTLEHRRGQFCSETRASSSPITPTVSPNDEAEKAKLAQV
AKRLEKTSKYFKRLGSLGFWGQLVCTIVAAVILSFSVVITGKITSPATFYATAGGIVAAFISVFWSFGYIRLSDKLQRTANQPSKAPPRADVVNSLRNGIVVNLLGMGAA
ILGMQATVGLLVAKALTSSAVPYYQAASPGNSPVLALDVFLVQISEQRSGYNNSTVAGDRRTRNLKQTEMLPTASRGRSSSSSSRANPMYLQYLRRIVKWQQMDIEYTFW
QMLHLCTSPKVVYQHTKYHKQTKNQWARDDPAFVVICSLLLAVATLAYCAAYDHSAAHASFVVVSVLLFHLLITGAILATCCWLYAFDVHCNSFFPMFVLLYVIHYFISP
LLVAHGFIPLLLSNLLFMIGASYYHYVNFLGYDVLPFLEKTTFFLYPIGVVFVLTPIFILIGFNPSRYFMNMYFSQNL