Protein

MIA_03084_1

Length
563 amino acids


Browser: contig04:198900-200639-

Protein function

EGGNOG:0PFEVFG01399.1splicing factor
SGD closest match:S000002772CDC40Pre-mRNA-processing factor 17
CGD closest match:CAL0000174348orf19.6347Uncharacterized protein

Protein alignments

%idAln lengthE-value
MCA_00540_155.962%6290.0MCA_00540_1
A0A1E3PHZ4_9ASCO48.475%5900.0WD40 repeat-like protein OS=Nadsonia fulvescens var. elongata DSM 6958 GN=NADFUDRAFT_51634 PE=4 SV=1
A0A060T5R6_BLAAD46.309%5966.09e-160ARAD1C14630p OS=Blastobotrys adeninivorans GN=GNLVRS02_ARAD1C14630g PE=4 SV=1
Q6C7K4_YARLI49.900%4991.68e-158YALI0D27346p OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=YALI0_D27346g PE=4 SV=1
UniRef50_Q6C7K449.900%4993.89e-155YALI0D27346p n=2 Tax=Yarrowia lipolytica TaxID=4952 RepID=Q6C7K4_YARLI
A0A167D928_9ASCO59.409%3722.21e-155Cdc40p OS=Sugiyamaella lignohabitans GN=CDC40 PE=4 SV=1
A0A1E4TKN0_9ASCO42.795%4587.39e-127Uncharacterized protein OS=Tortispora caseinolytica NRRL Y-17796 GN=CANCADRAFT_899 PE=4 SV=1
A0A0J9X9T6_GEOCN48.670%3763.02e-113Similar to Saccharomyces cerevisiae YDR364C CDC40 Pre-mRNA splicing factor OS=Geotrichum candidum GN=BN980_GECA07s00538g PE=4 SV=1
A0A1D8PFH7_CANAL43.544%3954.69e-107Uncharacterized protein OS=Candida albicans (strain SC5314 / ATCC MYA-2876) GN=orf19.6347 PE=4 SV=1
PRP17_YEAST40.327%3673.21e-87Pre-mRNA-processing factor 17 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=CDC40 PE=1 SV=2

Mitochondrial localization by mitoprotII

Probability of mitochondrial location: 0.0208

Protein family membership

None predicted.

Domains and repeats

  1. Domain
  2. Repeat
  3. Repeat
1 100 200 300 400 500 563

Detailed signature matches

    1. SSF50978 (WD40 repe...)
    2. PS50294 (WD_REPEATS...)
    1. PF00400 (WD40)
    2. PS50082 (WD_REPEATS_2)
    3. SM00320 (WD40_4)
    1. PR00320 (GPROTEINBRPT)
    1. PS00678 (WD_REPEATS_1)
Unintegrated signatures no IPR
Unintegrated signatures
  1. cd00200 (WD40)
  2. mobidb-lite (disord...)

Residue annotation

  1. structural tetrad ...

Protein sequence

>MIA_03084_1
MSLVPDYQSESDSEPDGLALVQKPTVSIVGVPTSQTALATLQEYRKTQKTYTKPAGPANPFDPLGPERVRKNIITGRMDK
EAFDNATFEDNYRMFRRFGETTGPSEDNGDLKRKAQREQEKAVATKLKRKRLERGDSSVLDGEAAYKGPWAGYQKEQDES
SSSESSEEEQAVVAPAVVAGPPQPTIPKEWTEFVGTQEYDYLGRTYMHIPQDLDVNLRKEPGSQETFVPKRKIHTWAGHS
GGVNALRFFPYSGHLLLSCGNDTAIKLWDCHRNNREQLRVYHGHAKAVKDVAFNGVRGQLGGGTRFLSASYDKTIKLWDT
ETGKVVQRYANLGRKSTGARAARAMANCVKFSPEADKQTEFLSGMSDNTILQWDTRLPPAEAVVQTYDHHLGPVNSITFV
DEDRRFMTTSDDKSVRVWDWQINVPIKFIADPTQHAMPTVALHPSGKFVAAQSMDNRVLVFGALDKSKFRQNRRKEFHGH
ACAGYPIQVAFSPDGKYLMSGDAHGYAFFWDWKTCALKAKLQVSGGGAGGSTEGSVVTCIAAHPQESSKVAVAGKDSVIT
YWD

GO term prediction

Biological Process

None predicted.

Molecular Function

GO:0005515 protein binding

Cellular Component

None predicted.